
Hadoop Developer Training

Hadoop Developer Training is for developers who want to learn to use Apache Hadoop to build powerful data processing applications.


You Will Learn

  • The core technologies of Hadoop
  • How HDFS and MapReduce work
  • How to develop MapReduce applications
  • How to unit test MapReduce applications
  • How to use MapReduce combiners, partitioners and the distributed cache
  • Best practices for developing and debugging MapReduce applications
  • Algorithms for common MapReduce tasks
  • How to join data sets in MapReduce
  • How Hadoop integrates into the data center
  • How to use Mahout’s machine learning algorithms
  • How Hive and Pig can be used for rapid application development
  • How to create large workflows using Oozie


Hadoop Developer Training is appropriate for developers who will be writing, maintaining, or optimizing Hadoop jobs. Participants should have programming experience; knowledge of Java is highly recommended. An understanding of common computer science concepts is a plus. Prior knowledge of Hadoop is not required.

Hands-On Exercises

Throughout the course, students write Hadoop code and perform other hands-on exercises to solidify their understanding of the concepts being presented.

Hadoop Developer Training: Outline

  • Introduction
  • The Motivation for Hadoop
  • Hadoop: Basic Concepts
  • Writing a MapReduce Program
  • Unit Testing MapReduce Programs
  • Delving Deeper into the Hadoop API
  • Practical Development Tips and Techniques
  • Data Input and Output
  • Common MapReduce Algorithms
  • Joining Data Sets in MapReduce Jobs
  • Integrating Hadoop into the Enterprise Workflow
  • Machine Learning and Mahout
  • An Introduction to Hive and Pig
  • An Introduction to Oozie
  • Conclusion