Readings: Read Chapters 9, 10, and 12 of your textbook to learn more about the Weka system.
Written Report: Your written report should consist of your answers to the items marked with an asterisk "*" in the assignment description below.
Assignment:
You can find the Weka source code in a file called "weka-src.jar", which should be located in the directory where Weka was installed. A .jar file is a zip archive, so you can extract its contents with any zip tool (for example, WinZip or unzip). Inside, you will find the .java files that implement Weka.
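As an alternative to WinZip or unzip, the extraction step can be scripted. The following is a minimal sketch in Python using the standard zipfile module; the function names and the output directory are illustrative assumptions, not part of Weka or of the assignment.

```python
import zipfile
from pathlib import Path


def list_java_sources(jar_path):
    """Return the sorted names of all .java entries inside the archive."""
    with zipfile.ZipFile(jar_path) as jar:
        return sorted(name for name in jar.namelist() if name.endswith(".java"))


def extract_sources(jar_path, dest_dir):
    """Extract only the .java entries into dest_dir and return their count."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(jar_path) as jar:
        names = [name for name in jar.namelist() if name.endswith(".java")]
        # extractall recreates the package directory structure under dest
        jar.extractall(dest, members=names)
    return len(names)
```

For example, calling extract_sources("weka-src.jar", "weka-src") from the directory where Weka was installed would place the .java files under a new "weka-src" directory (a hypothetical name chosen here).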
Dataset: The assignment uses the census-income dataset from the US Census Bureau, which is available at the Univ. of California Irvine (UCI) Data Repository.
The census-income dataset contains census information for 48,842 people. It has 14 attributes for each person (age, workclass, fnlwgt, education, education-num, marital-status, occupation, relationship, race, sex, capital-gain, capital-loss, hours-per-week, and native-country) and a binary class attribute that classifies the income of the person as belonging to one of two categories: >50K or <=50K.
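Before loading the data into Weka, it can help to check that each record has the 14 attributes plus the class value described above. The sketch below is in Python and assumes the data file stores one person per line as comma-separated values, with the class value last; that layout is an assumption about the downloaded file, and the two sample rows are made up for illustration, not taken from the dataset.

```python
import csv
import io
from collections import Counter

# Attribute names as listed in the assignment, in the same order.
ATTRIBUTES = [
    "age", "workclass", "fnlwgt", "education", "education-num",
    "marital-status", "occupation", "relationship", "race", "sex",
    "capital-gain", "capital-loss", "hours-per-week", "native-country",
]
CLASS_VALUES = (">50K", "<=50K")


def read_records(text_stream):
    """Yield one dict per well-formed line: 14 attribute values plus the class."""
    reader = csv.reader(text_stream, skipinitialspace=True)
    for row in reader:
        if len(row) != len(ATTRIBUTES) + 1:
            continue  # skip blank or malformed lines
        *values, label = row
        if label not in CLASS_VALUES:
            continue  # skip rows whose class value is not one of the two categories
        record = dict(zip(ATTRIBUTES, values))
        record["class"] = label
        yield record


def class_distribution(records):
    """Count how many records fall into each income category."""
    return Counter(record["class"] for record in records)


# Two made-up rows, used only to exercise the functions.
SAMPLE = io.StringIO(
    "40, Private, 100000, Bachelors, 13, Married-civ-spouse, Sales, "
    "Husband, White, Male, 0, 0, 45, United-States, >50K\n"
    "25, Private, 200000, HS-grad, 9, Never-married, Craft-repair, "
    "Own-child, Black, Female, 0, 0, 40, United-States, <=50K\n"
)
sample_records = list(read_records(SAMPLE))
sample_counts = class_distribution(sample_records)
```

Running the same two functions on the real data file (opened with open(...) in text mode) gives the number of people in each income category, which is a useful sanity check against the 48,842 total stated above.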
Hand in a hard copy of your written report at the beginning of class on the day the project is due. We will discuss the results from the project during class.