Big Data

Course Code: CPAN 361

Academic Year: 2024-2025

The current rapid growth of data acquisition and increased storage capacities create opportunities for collection, processing and analysis of structured and unstructured data. The course describes the Hadoop architecture, Hadoop Distributed File System (HDFS), how to load data into HDFS and query large amounts of data using Map Reduce. This course also will introduce students to additional tools such as Hive, Pig, Zeppelin, as well as the NoSQL database HBase.