This project consists of two parts:
Consider the reduced_userprofile.arff dataset. This dataset was constructed from the userprofile.csv file of the Restaurant & consumer data available at the UCI Data Repository.
*** The written report for your group project should be at most 10 pages long (including all graphs, tables, figures, appendices, ...) and the font size should be no smaller than 11 pts. ***
Construct classification rules using Weka's JRip implementation using the default parameters. Use AYP2012 as the target attribute. Use 10-fold cross-validation to perform an analysis of the classification accuracy.
Construct association rules using Weka's Apriori implementation using the default parameters.
Construct classification association rules using Weka's Apriori implementation using the default parameters, except for car=true. (Lower the min. confidence threshold as needed to obtain at least 10 rules.)
Examine your three sets of rules. Compare and contrast them. Answer the following questions in your description about this experiment:
Use modified parameters and preprocessing techniques to generate a set of JRip classification rules that classifies AYP2012. Provide detailed descriptions about the parameters used to develop your model and/or preprocessing techniques used. One should be able to repeat the experiment from your description. In particular, make sure to experiment with and without rule pruning.
Examine the model. Compare and contrast this model against a ZeroR model, a OneR model, models generated in Projects 2 and 3, and models generated in the challenge above. Answer the following questions in your description about this experiment:
Use preprocessing and modified parameters to generate association rules and classification association rules (CARs) that have high values for goodness metrics. Provide detailed descriptions about the parameters used to develop your model and/or preprocessing techniques used. One should be able to repeat the experiment from your description.
Design another experiment with a different goal other than the ones that have appeared previously in this assignment. Provide detailed descriptions about the parameters used to develop your model and/or preprocessing techniques used. One should be able to repeat the experiment from your description.