CS585/DS503: Big Data Management

Home Readings
Grading
Projects
Topics + Schedule Additional Resources

Project Teams:

Projects Infrastructures:
The systems used are open-source; and thus available via download to you via the internet. However, a virtual machine will also be available for download that includes the needed software for the projects. The virtual machine requires around 20GB in size and will basically consist of:
        -- Ubuntu OS
        -- Hadoop platform 
        -- Apache Pig, and Hive
        -- Mahout library 
        -- RHadoop
        -- MongoDB
        -- In addition to other software such as: Jave , C , Perl , Python, etc.



Projects:
    Project details will be released with each project. They will involve coding using different infrastructures, such as, Hadoop MapReduce, MongoDB, Spark, Hive, and others.