|
I have previously downloaded the dataset into the following directory:
/cs/courses/cs539/f00/Projects/Census_Income_Data
You can access the dataset from there.
The census-income dataset contains census information for 48,842 people. It has 14 attributes for each person (age, workclass, fnlwgt, education, education-num, marital-status, occupation, relationship, race, sex, capital-gain, capital-loss, hours-per-week, and native-country) and a boolean attribute class classifying the input of the person as belonging to one of two categories >50K, <=50K.
If you use the Census-income data, note that this dataset has missing values. It is up to you how to fill in appropriate data for those missing values. Also, it is up to you to decide if it's a good idea to discretize continues attributes, and if so, how. YOU MUST USE AT LEAST THE FIRST 1000 TEST RECORDS FROM THE Test data IN YOUR EXPERIMENTS.