CS585/DS503: Big Data Management

Project Teams:

Projects Platform:
A virtual machine that includes the platform needed for the projects will be available for download. The virtual machine requires around 20GB of disk space and consists of:
        -- Ubuntu OS
        -- Hadoop platform 
        -- Apache Pig and Hive
        -- Mahout library 
        -- RHadoop
        -- MongoDB
        -- Other software, such as Java, C, Perl, and Python



Projects Overview:
    Projects will involve coding against different platforms and engines, including Hadoop MapReduce, MongoDB, Spark, Hive, and Hadoop Streaming.
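
    To give a rough idea of what coding against Hadoop Streaming can look like, here is a minimal word-count sketch in Python. It is an illustration only, not an actual project assignment: the function names (mapper, reducer, run_local) and the sample input are hypothetical. It assumes the usual streaming convention of tab-separated key/value records, and it simulates the sort (shuffle) step in-process instead of running on a cluster.

```python
from itertools import groupby


def mapper(lines):
    """Emit one 'word<TAB>1' record per word, as a streaming mapper would write to stdout."""
    for line in lines:
        for word in line.strip().lower().split():
            yield f"{word}\t1"


def reducer(sorted_records):
    """Sum the counts for each key. Records must arrive sorted by key,
    which is what the framework's shuffle phase provides to a reducer."""
    parsed = (record.split("\t", 1) for record in sorted_records)
    for word, group in groupby(parsed, key=lambda kv: kv[0]):
        total = sum(int(count) for _, count in group)
        yield f"{word}\t{total}"


def run_local(lines):
    """Simulate map -> sort (shuffle) -> reduce locally (hypothetical helper)."""
    return list(reducer(sorted(mapper(lines))))


if __name__ == "__main__":
    # Hypothetical sample input, for illustration only.
    for record in run_local(["big data big ideas", "data management"]):
        print(record)
```

    In an actual streaming job, the mapper and reducer would typically be separate scripts that read from standard input and write to standard output; the local simulation above only mirrors that data flow.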