Enterprise Data Catalog: Configuration and Maintenance
This course applies to software version 10.2.1. Gain the skills and knowledge necessary to install, configure, and maintain an Enterprise Data Catalog (EDC) environment. Using the Catalog Administrator, learn to manage and monitor resources, schedules, attributes, and connections for initial implementation and ongoing system maintenance.
OBJECTIVES
After successfully completing this course, students should be able to:
Install and configure EDC, taking the sizing requirements into account
Use the Catalog Administrator interface
Scan resources to obtain datasets
Manage resources, schedules, attributes, synonyms, and connections
Configure reusable settings
Manage Data Domains and Composite Data Domains
Extract metadata from data sources using the Universal Connectivity Framework
Create custom models and custom resource types
Monitor and troubleshoot EDC
Use REST APIs
TARGET AUDIENCE
Administrator
Architect
Developer
PREREQUISITES
None
AGENDA SUMMARY
Module 1: Overview of Enterprise Data Catalog
· Major Business Challenges
· EDC as a Solution
· Key capabilities of EDC
· Metadata and Metadata Management
· EDC architecture
· EDC features
· EDC concepts
· Catalog administration tasks
· Catalog Administrator workspaces
Module 2: EDC Pre-Installation
· Installation overview
· Installation Phases
· Perform Pre-installation steps
· Deployment Methods
Module 3: Installation
· Pre-installation checklist
· Installation Files
· Installation modes
· EDC Installation
· Installation in Silent Mode
· Post-installation phases
· Uninstallation steps
Module 4: Resource Creation and Security
· Overview of resources and scanners
· Creation of Users
· Create and scan resources:
o Oracle
o PowerCenter
o Hive
o Business Glossary
o Informatica Platform
· Supported File System and File Formats
· Resource Security
· Lab: Creating Users in Informatica Administrator
· Lab: Creating New Oracle Resources
· Lab: Creating a New PowerCenter Resource
· Lab: Creating Oracle Resources from a Different Schema
· Lab: Creating a Hive Resource
· Lab: Creating a New Business Glossary Resource
· Lab: Creating a BDM Resource Type
· Lab: Creating an Avro Resource
· Lab: Configuring Permissions for Resources
Module 5: Resource Management
· Connections Management
· Connection types
· Profile Configuration Management
· Data Similarity
· Reusable Data Integration Service (DIS) configuration
· Schedule Management
· Lab: Managing PowerCenter Connections
· Lab: Managing BDM Connections
· Lab: Profiling and Data Discovery
· Lab: Setting up a Reusable Data Integration Service Configuration
· Lab: Creating a Schedule
Module 6: Data Domains
· Data Domain Discovery
· Data Domain Discovery Types
· Supported Resource Types for Data Discovery
· Data Domains and Data Domain groups
· Data Domain Curation
· Data Domain Inference
· Data Domain Propagation
· Composite Data Domains
· Smart Domains
· Lab: Creating Rule-Based Data Domains
· Lab: Creating a Data Domain Group
· Lab: Creating Smart Domains
· Lab: Creating Composite Data Domains
· Lab: Curating a Data Domain
Module 7: Attribute Management and Synonyms
· System and Custom attributes
· Attribute properties
· Edit system attributes
· Create and use custom attributes
· Synonym definition files
· Upload the synonym definition file in Catalog Administrator
· Lab: Editing System Attributes
· Lab: Creating a Custom Attribute
· Lab: Loading Synonyms
Module 8: Universal Connectivity Framework
· Metadata Models
· Universal Connectivity Framework
· Supported metadata sources
· Creating resource types
· Creating a resource for the defined resource type
· Lab: Creating a Resource Based on a Universal Resource Type
Module 9: Custom Models and Resources
· Types of metadata models
· Custom scanner framework
· Custom metadata integration
· Create and manage a custom model
· Create the custom resource type
· Create the custom resource
· Custom Scanners
· Extracting metadata from custom scanners
· Metadata Ingestion
· Lab: Creating a Custom Scanner
Module 10: Performance Tuning
· Performance tuning stages and parameters
· EDC sizing recommendations
· Tuning performance based on the size of the data
· Tuning performance for similarity
· Tuning profile warehouse
· Data integration service system requirements for profiling
· Tuning for profiling performance
· Data integration service parameters
· Profile configuration in data integration service
· Data integration service profiling properties
· Lab: Tuning Performance Based on the Size of the Data
Module 11: Monitoring and Troubleshooting Enterprise Data Catalog