CS585/DS503: Big Data Management
- Students will form teams of two to
work in each project.
A virtual machine will be available for download that
includes the needed platform for the projects. The virtual machine
requires around 20GB in size and will basically consist of:
-- Ubuntu OS
-- Hadoop platform
-- Apache Pig, and Hive
-- Mahout library
-- In addition to other software
such as: Jave , C , Perl ,
Projects will involve coding against different platforms and engines
inclduing Hadoop MapReduce, MongoDB, Spark, Hive, Hadoop Streaming.