CS585/DS503: Big Data Management
Project Teams:
- Students will be assigned into teams
to work in each project. We aim to have partners switch for
each project to experience different learning styles.
Projects
Infrastructures:
The systems used are open-source; and thus available via download
to you via the internet. However,
a virtual machine will also be available for download that
includes the needed software for the projects. The virtual machine
requires around 20GB in size and will basically consist of:
-- Ubuntu OS
-- Hadoop platform
-- Apache Pig, and Hive
-- Mahout library
-- RHadoop
-- MongoDB
-- In addition to other software
such as: Jave , C , Perl ,
Python, etc.
Projects:
Project details will be released with each project.
They will involve coding using different infrastructures,
such as, Hadoop MapReduce, MongoDB, Spark, Hive, and others.