Thursday,
December 6, 2012
9:00 a.m. - 11:00 a.m.
WPI, Campus Center, Taylor Room
Committee Members:
Prof. Matthew O. Ward, Advisor, WPI Computer Science
Prof. Elke A. Rundensteiner, . Co-advisor, WPI Computer Science
Prof. Carolina Ruiz, WPI Computer
Science
Prof. Georges Grinstein, University of
Massachusetts Lowell, External member
Abstract:
Data
mining for patterns and knowledge discovery in multivariate datasets are very
important processes and tasks to help analysts understand the dataset,
describe the dataset, and predict unknown data values. However, conventional
computer-supported data mining approaches often limit the user from getting
involved in the mining process and performing interactions during the pattern
discovery. Besides, without the visual representation of the extracted
knowledge, the analysts can have difficulty explaining and understanding the
patterns. Therefore, instead of directly applying automatic data mining techniques,
it is necessary to develop appropriate techniques and visualization systems
that allow users to interactively perform knowledge discovery, visually
examine the patterns, adjust the parameters, and discover more interesting
patterns based on their requirements.
In the
dissertation, I will discuss different proposed visualization systems to
assist analysts in mining patterns and discovering knowledge in multivariate
datasets, including the design, implementation, and the evaluation. Three
types of different patterns are proposed and discussed, including trends,
clusters of subgroups, and local patterns. For trend discovery, the parameter
space is visualized to allow the user to visually examine the space and find
where good linear patterns exist. For cluster discovery, the user is able to
interactively set the query range on a target attribute, and retrieve all the
sub-regions that satisfy the users requirements. The sub-regions that
satisfy the same query and are near each other are grouped and aggregated to
form clusters. For local pattern discovery, the patterns for the local
sub-region with a focal point and its neighbors are computationally extracted
and visually represented. To discover interesting local neighbors, the
extracted local patterns are integrated and visually shown to the analysts.
Evaluations of the three visualization systems using formal user studies are
also performed and discussed."
|