Apache Hadoop Online Training

Apache Hadoop is an open-source software framework that supports data-intensive distributed applications. It provides both reliability and data motion to applications. Organizations increasingly need to store, process, and analyze large volumes of data. This 3-day hands-on Apache Hadoop training is for system administrators and Java developers who want to learn and use Apache Hadoop to build data processing applications and manage Apache Hadoop clusters on development and production servers. The training covers Hadoop architecture, HBase and MapReduce, Pig, Hive, Cassandra, Chukwa, ZooKeeper, Avro, Mahout, Hadoop deployment, and Hadoop integration with existing data sets.

Day 1

  • Introducing Hadoop
  • Hadoop Components
  • Hadoop Distributed File System
  • MapReduce
  • MapReduce Programming
  • Hadoop Data I/O
  • Hadoop Cluster
  • Advanced MapReduce

Day 2

  • Hadoop on AWS Cloud
  • Managing Hadoop
  • Testing & Debugging
  • Hadoop Security
  • Sqoop
  • Big Data
  • HBase
  • HBase & MapReduce

Day 3

  • Hive
  • Pig
  • ZooKeeper
  • Avro
  • Cassandra
  • Mahout
  • Case Studies
  • Best Practices

© 2016 Laliwala IT. All rights reserved.