
ratingsÂ
Working in a hands-on learning environment, led by our Hadoop expert instructor, students will learn about and explore: Organize a successful Hadoop rollout. Load, unload, and manage data in Hadoop. Integrate Hadoop with the existing information infrastructure.
PRIVATE
Course Access
Unlimited Duration
Last Updated
July 29, 2021
Students Enrolled
20
Total Reviews
Posted by
Certification
In today’s time, data with value is branched off into numerous databases across multiple companies. The challenge is bringing the data together. Integrating Hadoop shows how Hadoop is used to collect and load the data on physical devices and the cloud. The book begins with an introduction of Hadoop and the types of data fit for it. Next, it focuses on assembling the integration team and gives an overview of workloads in the organization. You will also identify data sources for Hadoop, such as No SQL Databases and Legacy/Relational Databases, distinguish between ETL and ELT, and learn how to load and unload data into Hadoop. You will also practice managing big data using methods such as Upserts and Use HBase, and discover the advantages of real-time computing and the basic structure of streaming data architecture. Finally, you will interact with the master data of an organization and learn the top 10 mistakes people commit while integrating Hadoop data and how to avoid them.
Course Curriculum
-
- Introducing Hadoop 00:00:00
- Hadoop Distributions 00:00:00
-
- Assembling the Integration Team 00:00:00
- Overview of Workloads for Hadoop in the Organization 00:00:00
- Identifying Data Sources for Hadoop 00:00:00
- Data Profiling 00:00:00
- Analyzing and Profiling Source Systems and Data 00:00:00
- Continued Need for More Speed 00:00:00
- Preference with Hadoop 00:00:00
- Is ETL Dead? 00:00:00
- Big Data ELT 00:00:00
- Importance of Data Quality in Hadoop 00:00:00
- Stewardship of Big Data 00:00:00
- Advantages of Real-Time Computing 00:00:00
- How and Where to Use Spark 00:00:00
- Hadoop and Master Data Management 00:00:00
- Integrating with Master Data 00:00:00
- Data Virtualization 00:00:00
- MDM and Hadoop Disconnects 00:00:00
- Case Studies in Big Data Integration 00:00:00
- Trends in Hadoop and Summary of Ideas 00:00:00
Course Reviews

4
4
1937
Students
About Instructor
Course Events
[wplms_eventon_events]
More Courses by Insturctor
{"title":"","show_title":"0","post_type":"course","taxonomy":"","term":"0","post_ids":"","course_style":"rated","featured_style":"generic","masonry":"","grid_columns":"clear1 col-md-12","column_width":"268","gutter":"30","grid_number":"2","infinite":"","pagination":"","grid_excerpt_length":"100","grid_link":"1","grid_search":"0","course_type":"","css_class":"","container_css":"","custom_css":""}