WPI Worcester Polytechnic Institute

Computer Science Department
------------------------------------------

CS539 Machine Learning 
Assignment 3 - Fall 2000

PROF. CAROLINA RUIZ 

Due:
First Part: Thursday, September 28, 2000 at 6:00 pm. 
Second Part: Thursday, October 05, 2000 at 6:00 pm. 

------------------------------------------


PROJECT DESCRIPTION

Construct the most accurate neural network you can for predicting whether a given person's income is >50K or <=50K, using the
census-income dataset from the US Census Bureau, which is available from the University of California, Irvine (UCI) Machine Learning Repository.

I have downloaded the dataset into the following directory: /cs/courses/cs539/f00/Projects/Census_Income_Data
You can access the dataset from there.

The census-income dataset contains census information for 48,842 people. It has 14 attributes for each person (age, workclass, fnlwgt, education, education-num, marital-status, occupation, relationship, race, sex, capital-gain, capital-loss, hours-per-week, and native-country) and a boolean class attribute that classifies the person's income into one of two categories: >50K or <=50K.
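
As an illustration of how one record could be turned into neural network inputs, here is a minimal C sketch. It rests on several assumptions that are not stated above: that each record is a single comma-separated line with the 14 attributes in the order listed, followed by the class label; that only two numeric attributes (age and hours-per-week) are used; and that the scaling ranges are 0 to 100. The names split_record, scale_to_unit, class_to_target, and encode_record are hypothetical and are not part of the textbook code.

```c
#include <stdlib.h>
#include <string.h>

#define NUM_FIELDS 15      /* 14 attributes + class label (assumed layout) */
#define AGE_FIELD 0        /* position of age in the assumed field order */
#define HOURS_FIELD 12     /* position of hours-per-week */
#define CLASS_FIELD 14     /* position of the class label */

/* Split a comma-separated record in place. Leading spaces of each field
   are skipped. Returns the number of fields found (at most max_fields). */
int split_record(char *line, char *fields[], int max_fields)
{
    int n = 0;
    char *p = line;
    char *comma;

    while (n < max_fields) {
        while (*p == ' ')
            p++;
        fields[n++] = p;
        comma = strchr(p, ',');
        if (comma == NULL) {
            p[strcspn(p, "\r\n")] = '\0';   /* drop a trailing newline */
            break;
        }
        *comma = '\0';
        p = comma + 1;
    }
    return n;
}

/* Map x from the assumed range [lo, hi] into [0, 1], clamping outliers. */
double scale_to_unit(double x, double lo, double hi)
{
    double s = (x - lo) / (hi - lo);
    if (s < 0.0) return 0.0;
    if (s > 1.0) return 1.0;
    return s;
}

/* Return 1 for a label starting with ">50K", 0 otherwise. */
int class_to_target(const char *label)
{
    return strncmp(label, ">50K", 4) == 0 ? 1 : 0;
}

/* Encode one record into two scaled inputs and a 0/1 target.
   Returns 0 on success, -1 if the record does not have NUM_FIELDS fields.
   The 0..100 ranges are illustrative assumptions, not dataset facts. */
int encode_record(char *line, double inputs[2], int *target)
{
    char *f[NUM_FIELDS];

    if (split_record(line, f, NUM_FIELDS) != NUM_FIELDS)
        return -1;
    inputs[0] = scale_to_unit(atof(f[AGE_FIELD]), 0.0, 100.0);
    inputs[1] = scale_to_unit(atof(f[HOURS_FIELD]), 0.0, 100.0);
    *target = class_to_target(f[CLASS_FIELD]);
    return 0;
}
```

Scaling numeric attributes into a common range keeps large-valued inputs from dominating the weight updates. The nominal attributes (workclass, education, and so on) would need their own encoding, for example one input unit per possible value, which this sketch leaves out.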


PROJECT ASSIGNMENT

This project consists of two parts:
Part 1: Due September 28 at 6:00 pm.
Find all mistakes in the C code for the neural network error backpropagation algorithm provided with Chapter 4 of the textbook. The main files you need are backprop.c, backprop.h, and facetrain.c (note that the distribution also includes other files, used to run the facial-expression classification experiment described in the chapter). To the best of my knowledge, the code has ONE mistake. Submit a written report pointing out this mistake, as well as any other mistakes you find in the code.
Part 2: Due October 5 at 6:00 pm.
Construct, train (using error backpropagation), and test the most accurate neural network you can to predict the Salary attribute of the Census-Income data. The following are guidelines to construct and train your neural net:

REPORT AND DUE DATE