This course is applicable to users of Big Data version 10.2.1 and forward. Learn to accelerate Big Data Integration through mass ingestion, transformations, and processing of complex files. Optimize the Big Data system performance through monitoring, troubleshooting, and best practices.
OBJECTIVES
After successfully completing this course, students should be able to:
Mass ingest data to Hive and HDFS
Integrate with relational databases using SQOOP
Perform transformations across various engines
Perform initial load
Perform stateful computing and windowing
Process complex files
Monitor logs and troubleshoot
Tune performances of Spark and Blaze jobs
TARGET AUDIENCE
Architect
Developer
PREREQUISITES
Informatica Developer Tool for Big Data Developers
AGENDA SUMMARY
Module 1: Informatica Big Data Management Overview