CS651.
Advanced Topics in Database Systems
Project Teams:
- Students will form teams of two to work on each project.
Hadoop Platform:
A virtual machine that includes the platform needed for the projects will be available for download. The virtual machine (around 7GB in size) consists of:
-- Ubuntu OS (Version 12.10)
-- Hadoop platform (Version 1.1.0)
-- Apache Pig (Version 0.10.0)
-- Mahout library (Version 0.7)
-- RHadoop
-- Other software, such as Java (Version 1.7), C (Version 4.7), Perl (Version 5.14.2), Python (Version 2.7.3), etc.
VirtualBox:
The virtual machine is named "ubuntu-Hadoop-VBoxVersion.ova.zip" and can be downloaded (Here). For this version you will need the "VirtualBox" software (free), available Here.
VMware:
The virtual machine is named "ubuntu-Hadoop-VMWareVersion.zip" and can be downloaded (Here). For this version you will need the "VMware" software (not free), available Here.
Note: All students are granted access to the VMware software available on the Zoo lab machines, so you can work either from your own PC or laptop or in the Zoo lab facility.
List of Projects:
ID | Project Description | Release Date | Due Date | Link
1 | Hadoop Java | 2/27/2014 | 3/7/2014 (11:59PM) | Project 1
2 | Hadoop Pig & Streaming | 3/17/2014 | 3/28/2014 (11:59PM) | Project 2
3 | Hadoop Input Formats | 4/20/2014 | 5/1/2014 (11:59PM) | Project 3
Homeworks
ID | HW | Release Date | Due Date | Link
1 | Object-Relational | 1/30/2014 | 2/7/2014 (11:59PM) | HW1, Oracle Access
2 | Data Mining | 2/11/2014 | 2/22/2014 (11:59PM) | HW2
3 | Parallel & Distributed DBs | 4/4/2014 | 4/14/2014 (11:59PM) | HW3