Computer Science Department

KNOWLEDGE DISCOVERY AND DATA MINING
RESEARCH GROUP
KDDRG

DESCRIPTION

The common themes of the research projects in our group are data mining and knowledge discovery in databases. Knowledge discovery is the process of finding general patterns/principles that summarize/explain a set of "observations". Very large databases have become the standard, making it impossible for human beings to mine the data "by hand" looking for interesting patterns. Automated tools are therefore needed to help during the extraction of these patterns. Examples of application domains include astronomical data from the Hubble telescope, data on consumer preferences obtained by credit card companies, medical histories, genomic data, web usage data, etc.
The knowledge discovery process in databases consists of several steps that can be grouped as follows:

Data Integration: Collecting the target data observations from the different data sources, removing noise from the observations, and integrating them into an appropriate format.

Data Mining: Applying a concrete algorithm to find useful and novel patterns in the integrated data.

Evaluation: Interpreting mined patterns, evaluating them according to usefulness/interestingness criteria, and possibly using visualization tools to aid in understanding the patterns graphically.

PROJECTS

(This list of project is out of date. It will be updated soon. To see current projects, see publications list below.)

Our research projects concentrate mainly on the data mining stage of the knowledge discovery process, though some of them address also the data integration and pattern evaluation stages.

Our Novel Data and Sequence Mining Algorithms and Tools

Association Rule Mining

Efficient Mining of Association Rules

Adaptive-Support Association Rule Mining

Mining Association Rules over Complex Data

Mining Association Rules from Set-Valued Data

An Association Rule Mining System for Set-Valued Data

Mining Association Rules from Sequence-Valued Data

Exploring Temporal Associations in the Stock Market
Exploratory Analysis of Sleep Data

Mining Distance-Based Association Rules.

Sequence Mining

Mining Patterns from Numeric and Categorical Sequences
Mining Association Rules from Sequence-Valued Data

Exploring Temporal Associations in the Stock Market
Exploratory Analysis of Sleep Data

Using Background Knowledge in Data Mining

Data Mining for Genetic Analysis

Motif- and Expression-Based Classification of DNA
Mining Genetic Polymorphisms for patterns in Human Diseases
Mining Distance-Based Association Rules for Gene Expression

Data Mining for Medical Data Analysis

Exploratory Analysis of Sleep Data
Pancreatic Cancer Data Mining

Data Mining for Electronic Commerce

Association Rules for Recommender Systems
Collaborative and Content-Based Filtering using Association Rules
Collaborative and Content-Based Filtering using Neural Networks

Fully Connected Architecture
Mixture of Experts vs. Fully Connected Architectures
Multi-Expert Neural Networks

Data Mining on other Application Domains

Data Mining for Software Engineering
Data Mining for Database Systems

Web Metasearch

Evaluation of Data Mining Tools

MEMBERS

Faculty

Prof. Carolina Ruiz
ruiz@cs.wpi.edu
Office: FL 232
Phone Number: (508) 831-5640

Graduate Students (Current and Former)

Undergraduate Students (Current and Former)

PUBLICATIONS

Publications

COURSES

Courses

RESOURCES

Resources

ruiz@cs.wpi.edu

KNOWLEDGE DISCOVERY AND DATA MINING RESEARCH GROUP KDDRG

DESCRIPTION

PROJECTS (This list of project is out of date. It will be updated soon. To see current projects, see publications list below.)

MEMBERS

Faculty

PUBLICATIONS

COURSES

RESOURCES

KNOWLEDGE DISCOVERY AND DATA MINING
RESEARCH GROUP
KDDRG

PROJECTS

(This list of project is out of date. It will be updated soon. To see current projects, see publications list below.)