WPI Worcester Polytechnic Institute

Computer Science Department
------------------------------------------

CS539 Machine Learning 
Assignment 2 - Fall 2000

PROF. CAROLINA RUIZ 

Due: Thursday, September 21, 2000 at 6:00 pm. 
------------------------------------------


PROJECT DESCRIPTION

Construct the most accurate decision tree you can for predicting whether the income of a given person is >50K or <= 50K using the
census-income dataset from the US Census Bureau which is available at the Univ. of California Irvine Repository.

I have downloaded the dataset into the following directory: /cs/courses/cs539/f00/Projects/Census_Income_Data
You can access the dataset from there.

The census-income dataset contains census information for 48,842 people. It has 14 attributes for each person (age, workclass, fnlwgt, education, education-num, marital-status, occupation, relationship, race, sex, capital-gain, capital-loss, hours-per-week, and native-country) and a boolean attribute class classifying the input of the person as belonging to one of two categories >50K, <=50K.


PROJECT ASSIGNMENT

The following are guidelines for the construction of your decision tree:

REPORT AND DUE DATE