DS503. Big
Data Management
Project Teams:
- Students will form teams of two to
work on each project.
Software
Infrastructure:
A virtual machine will be available for download that
includes the needed infrastructure/softwaresplatform for the projects.
The virtual machine
needs around 15-20 GBs of free space (around 7GB in size). It will
consist of:
-- Ubuntu OS
-- Hadoop platform
-- Apache Pig
-- Apache Hive
-- Mahout library
-- RHadoop
-- MongoDB
-- In addition to other software
and programming languages.