Research Topics:
Research Topics Context
What makes these clustering engines work?
What knowledge-base techniques could apply here?
Natural language processing?
Research directions
On what basis do we create clusters?
Dictionary definitions → document classification using classic IR techniques
Fixed number (easier for user to visualize?) → minimize some sort of distance metric
Neural-net classifiers…?
How do we refine the query?
assuming we’ve narrowed it down to cars, we may want to cluster on “cheap” or “beautiful”
unstructured, semi-structured → structured
How do we present the results?
How do we describe a cluster?
What document best represents a cluster?