IBM WebSphere Data stage and QualityStage 9.1
Course Topics:
Unit -1: Data Warehouse Fundamentals
An introduction to Data Warehousing â?? purpose of Data Warehouse â?? Data Warehouse Architecture â?? Operational Data Store â?? OLTP Vs Warehouse Applications â?? Data Marts- Data marts Vs Data Warehouses â?? Data Warehouse Life cycle .
Unit -2: Data Modelling
Introduction to Data Modeling â?? Entity Relationship model (E-R model) â?? Data Modeling for Data Warehouse, Normalization process â?? Dimensions and fact tables â?? Star Schema and Snowflake Schema.
Unit -3: ETL Design Process
Introduction to Extraction, Transformation & Loading- Types of ETL Tools â?? Key tools in the market.
Unit â?? 4 : Introduction to Data stage Version 7.5x2 & 8.5&9.1
Data stage introduction â?? IBM information Server architecture â?? Data stage components â?? Data Stage main functions â?? Client components- Adding different Servers to our workspace.
Unit â?? 5 : Data stage Administrator
Data stage project Administration - Editing projects and Adding Projects â?? Deleting projects Cleansing up project files â?? Environmental Variablesâ??Environment management â?? Auto purging â?? Runtime Column Propagation(RCP) â?? Add checkpoints for sequencer â?? NLS configuration â??
Unit â?? 6: Data stage Director
Introduction to Data stage Director â?? Validating Data stage Jobs â?? Executing Data stage jobs â?? Job execution status â?? Monitoring a job â?? Job log view â?? job scheduling â?? Creating Batches â?? Scheduling batches.
Unit â?? 7 : Data stage Designer
Introduction to Data stage Designer â?? Importance of Parallelism â?? Pipeline Parallelism â?? Partition Parallelism â?? Partitioning and collecting(In depth coverage of partitioning and collective techniques) â?? Symmetric Multi Processing (SMP) Massively Parallel Processing (MPP)
Introduction to Configuration file- Editing a Configuration file
Partition techniques â?? Data stage Repository Palette â?? Passive and Active stages â?? Job design overview â?? Designer work area â?? Annotations â?? Creating jobs â?? Importing flat file definitions â?? Managing the Metadata environment â?? Dataset management â?? Deletion of Dataset â?? Routines â?? Arguments.
Unit â?? 8: Working with Parallel Job Stages
Database Stages
Oracle â?? Teradata â?? ODBC â?? dynamic RDBMS
File Stages
Sequential file â?? Dataset â?? File set â?? Lookup file set.
Processing Stages
Copy â?? Filter â?? Funnel â?? Sort Remove duplicate â?? Aggregator â?? Modify â?? Compress â?? Expand â?? Decode â?? Encode â?? Switch â?? Pivot stage â?? Lookup â?? Join â?? Merge â?? FTP â?? SCD I,II,III - difference between look up, join and merge â?? change capture â?? Change Apply â?? Compare â?? Difference - External Filter- Surrogate key generator
â?? Transformer.
Real time scenarios using different Processing Stages- Implementing different logics using Transformer.
Debug Stages
Head â?? Tail â?? Peek â?? Column generator â?? Row generator â??Write Range Map Stage - Sample.
Real Time Stages
XML input â?? XML output
Local and Shared containers
Routines creation
Extensive usage of Job parameters, Parameter Sets, Environmental variables in jobs.
Introduction to some of predefined Environmental variablesâ?? creating user defined Environmental variables and implementing the same in parallel jobs
Unit â?? 9: Advanced Stages in Parallel Jobs (Version 8.1)
Explanation of Type1 and Type 2 processes- Implementation of Type1 and Type2 logics using Change Capture stage and SCD Stage-Range Look process â?? Surrogate key generator stage â?? FTP stage â?? Job performance analysis â?? Resource estimation - Performance tuning.
Unit â?? 10:Job Sequencers
Arrange job activities in Sequencer â?? Triggers in Sequencer â?? Restablity â?? Recoverability â?? Notification activity
â?? Terminator activity â?? Wait for file activity â?? Start Loop activity â?? Execute Command activity â?? Nested
Condition activity â?? Exception handling activity â?? User Variable activity â?? End Loop activity â?? Adding
Checkpoints.
Jobs used in different real time scenarios.
Explanation of Sequence Job stages through different Jobs.
Unit â?? 11: IBM Information Server Administration Guide
IBM Web Sphere Data stage administration - Opening the IBM Information Server Web console â?? setting up a projection the console â?? Customizing the project dashboard â?? Setting up security â?? Creating users in the console â?? Assigning security roles to users and groups â?? Managing licenses â?? Managing active sessions â?? Managing logs â?? Managing schedules â?? Backing up and restoring IBM Information Server.
Additional Features
Data stage Certification Guidance.
Performance Tuning of Parallel Jobs.
Stage Concepts UnixCommands, Shall Scripts, Databases.
Unix Commands related to datastage.
Sql concepts related to datastage