CS539 MACHINE LEARNING. SPRING 99
PROJECT 1
Decision Trees for Prediction Income Level

PROF. CAROLINA RUIZ
Department of Computer Science
Worcester Polytechnic Institute



PROJECT DESCRIPTION

Construct the most accurate decision tree you can for predicting whether the income of a given person is >50K or <= 50K using the
census-income dataset from the US Census Bureau which is available at the Univ. of California Irvine Repository.

The census-income dataset contains census information for 48,842 people. It has 14 attributes for each person (age, workclass, fnlwgt, education, education-num, marital-status, occupation, relationship, race, sex, capital-gain, capital-loss, hours-per-week, and native-country) and a boolean attribute class classifying the input of the person as belonging to one of two categories >50K, <=50K.


PROJECT ASSIGNMENT

The following are guidelines for the construction of your decision tree:

REPORT AND DUE DATE

Project 1 is due on Tuesday, February 16 at 5:30 pm. Your system should follow the
Departmental Documentation Standard.