Hadoop -Big Data

Program Content and Objectives:

Due to the advent of new technologies, devices, and communication means like social networking sites, the amount of data produced by mankind is growing rapidly every year. The amount of data produced by us from the beginning of time till 2003 was 5 billion gigabytes. If you pile up the data in the form of disks it may fill an entire football field. The same amount was created in every two days in 2011, and in every ten minutes in 2013. This rate is still growing enormously. Though all this information produced is meaningful and can be useful when processed, it is being neglected.

 

Course highlights  
  • Master the concepts of HDFS and MapReduce framework
  • Understand Hadoop¬† Architecture
  • Setup Hadoop Cluster and write Complex MapReduce programs
  • Learn data loading techniques using Sqoop and Flume
  • Perform data analytics using Pig, Hive and YARN
  • Implement HBase and MapReduce integration
  • Implement Advanced Usage and Indexing
  • Schedule jobs using Oozie
  • Implement best practices for Hadoop development
  • Work on a real life Project on Big Data Analytics
  • Understand Spark and its Ecosystem
  • Learn how to work in RDD in Spark
 

Course Duration

Entry Profile

Exit Profile

120 Hours

Students & Graduates in any stream (Engineering / Non-Engineering)

Big Data Developer

 
Certificate & Placement
Certificate & Placement assistance will be provided to successful candidates only