This project consists of two parts:
See
Consider the following dataset.
@relation movie-preferences @attribute genre {comedy, drama, action} @attribute critics-reviews {thumbs-up, neutral, thumbs-down} @attribute rating {R, PG-13} @attribute IMAX {true, false} @attribute likes {yes, no} @data ( 1) comedy, thumbs-up, R, false, no ( 2) comedy, thumbs-up, R, true, no ( 3) comedy, neutral, R, false, no ( 4) comedy, thumbs-down, PG-13, false, yes ( 5) comedy, neutral, PG-13, true, yes ( 6) drama, thumbs-up, R, false, yes ( 7) drama, thumbs-down, PG-13, true, yes ( 8) drama, neutral, R, true, yes ( 9) drama, thumbs-up, PG-13, false, yes (10) action, neutral, R, false, yes (11) action, thumbs-down, PG-13, false, yes (12) action, thumbs-down, PG-13, true, no (13) action, neutral, PG-13, false, yes (14) action, neutral, R, true, nowhere the likes attribute is the classification target.
genre = action, critics-reviews = ?, rating = R, IMAX = ?
*** NEW FOR THIS PROJECT: The written report for your group project should be at most 10 pages long (including all graphs, tables, figures, appendices, ...) and the font size should be no smaller than 11 pts. ***
Create a Bayesian network model using Weka's implemenation of NaiveBayes using the default parameters. Use AYP2012 as the target attribute.
Create a Bayesian network model using Weka's implemenation of BayesNet using the default parameters. Use AYP2012 as the target attribute.
Examine your two models. Compare and contrast them. Use 10-fold cross-validation to perform an analysis of the classification accuracy. Answer the following questions in your description about this experiment:
Use modified parameters and preprocessing techniques to generate a NaiveBayes that classifies AYP2012. Provide detailed descriptions about the parameters used to develop your model and/or preprocessing techniques used. One should be able to repeat the experiment from your description.
Examine the model. Compare and contrast this model against a ZeroR model, a OneR model, models generated in Project 2, and models generated in the challenge above. Answer the following questions in your description about this experiment:
Use preprocessing and modified parameters to generate a BayesNet that attempts to accurately classify AYP2012. Provide detailed descriptions about the parameters used to develop your model and/or preprocessing techniques used. One should be able to repeat the experiment from your description. Describe the topology of the model produced.
Examine the model. Compare and contrast this model against a ZeroR model, a OneR model, NaiveBayes, and models generated in Project 2.
Design another experiment with a different goal other than the ones that have appeared previously in this assignment. Provide detailed descriptions about the parameters used to develop your model and/or preprocessing techniques used. One should be able to repeat the experiment from your description. Describe the topology of the model produced. Compare the performance of your model against ZeroR, OneR, and J4.8 models.