CS585/DS503: Big Data Management
Project Teams:
For most projects, the students will be assigned into teams
to work in a project. Team partners switch for
each project to experience different team partners, learning styles
and team roles, and to give everyone a chance at success.
There will be one opportunity to choose your own project partners
for the final course project.
Projects
Infrastructures:
The systems used are open-source; and thus available via download
to you via the internet. However,
a virtual machine will also be available for download that
includes the needed software for the projects. The virtual machine
requires around 20GB in size and will consist of software such as:
-- Ubuntu OS
-- Hadoop platform
-- Apache Pig, and Hive
-- Mahout library
-- RHadoop
-- MongoDB
-- In addition to other software
such as: Java , C , Python, etc.
Projects:
 
Project details will be released with each project.
They will involve work with infrastructures including
Hadoop MapReduce, MongoDB, Spark, Hive, and others.