CS585/DS503: Big Data Management
Project Teams:
- Students will form teams of two to
work in each project.
Projects
Platform:
A virtual machine will be available for download that
includes the needed platform for the projects. The virtual machine
requires around 20GB in size and will basically consist of:
-- Ubuntu OS
-- Hadoop platform
-- Apache Pig, and Hive
-- Mahout library
-- RHadoop
-- MongoDB
-- In addition to other software
such as: Jave , C , Perl ,
Python, etc.
Projects Overview:
Projects will involve coding against different platforms and engines
inclduing Hadoop MapReduce, MongoDB, Spark, Hive, Hadoop Streaming.