Big Data Lab
The delivery lab environment for the Data Science and Big Data Analytics class is designed to provide each student with his or her own dedicated VMware-based Virtual Machine (VM) running Linux. This VM host all of the layered software applications that are required to perform the lab exercises in the Student Lab Guide.
Each of these VMs runs a freely downloadable version of:
Linux – CentOS operating system (x86_64 architecture)
Greenplum Database (Single Node Edition)
MADlib extensions to Greenplum database
R, the analytics software package
RStudio Server, enabling a browser-based graphical front-end to R
RODBC extension for Greenplum database / R integration
Apache Hadoop
PuTTy – Provides SSH access to the Linux operating system on the VM