• No products in the cart.

ratings 

Introduction to Hadoop Administration is an introductory-level, hands-on lab-intensive course geared for the administrator (new to Hadoop) who is charged with maintaining a Hadoop cluster and its related components.

PRIVATE
Course Access

Unlimited Duration

Last Updated

March 11, 2021

Students Enrolled

Total Reviews

Posted by
Certification

In this course you will learn about:

· Understand the benefits of distributed computing

· Understand the Hadoop architecture (including HDFS and MapReduce)

· Define administrator participation in Big Data projects

· Plan, implement, and maintain Hadoop clusters

· Deploy and maintain additional Big Data tools (Pig, Hive, Flume, etc.)

· Plan, deploy and maintain HBase on a Hadoop cluster

· Monitor and maintain hundreds of servers

· Pinpoint performance bottlenecks and fix them

Course Curriculum

    • Hadoop history and concepts 00:00:00
    • Ecosystem 00:00:00
    • Distributions 00:00:00
    • High level architecture 00:00:00
    • Hadoop myths 00:00:00
    • Hadoop challenges (hardware / software) 00:00:00
    • Selecting software and Hadoop distributions 00:00:00
    • Sizing the cluster and planning for growth 00:00:00
    • Selecting hardware and network 00:00:00
    • Rack topology 00:00:00
    • Installation 00:00:00
    • Multi-tenancy 00:00:00
    • Directory structure and logs 00:00:00
    • Benchmarking 00:00:00
    • Concepts (horizontal scaling, replication, data locality, rack awareness) 00:00:00
    • Nodes and daemons (NameNode, Secondary NameNode, HA Standby NameNode, DataNode) 00:00:00
    • Health monitoring 00:00:00
    • Command-line and browser-based administration 00:00:00
    • Adding storage and replacing defective drives 00:00:00
    • Parallel computing before MapReduce: compare HPC versus Hadoop administration 00:00:00
    • MapReduce cluster loads 00:00:00
    • Nodes and Daemons (JobTracker, TaskTracker) 00:00:00
    • MapReduce UI walk through 00:00:00
    • MapReduce configuration 00:00:00
    • Job config 00:00:00
    • Job schedulers 00:00:00
    • Administrator view of MapReduce best practices 00:00:00
    • Optimizing MapReduce 00:00:00
    • Fool proofing MR: what to tell your programmers 00:00:00
    • YARN: architecture and use 00:00:00
    • Hardware monitoring 00:00:00
    • System software monitoring 00:00:00
    • Hadoop cluster monitoring 00:00:00
    • Adding and removing servers and upgrading Hadoop 00:00:00
    • Backup, recovery, and business continuity planning 00:00:00
    • Cluster configuration tweaks 00:00:00
    • Hardware maintenance schedule 00:00:00
    • Oozie scheduling for administrators 00:00:00
    • Securing your cluster with Kerberos 00:00:00
    • The future of Hadoop 00:00:00

Course Reviews

Profile Photo
ashar hafeez
0
62

Students

About Instructor

Pak

Course Events

[wplms_eventon_events]

More Courses by Insturctor

© 2021 Ernesto.  All rights reserved.  
X