Ab Initio Course Content
1.Data warehousing Concepts
2.Introduction to Ab Initio. Operating Systems & Hardware Platforms that it supports
3.Products of Ab Initio Software Corporation
- The Ab Initio Co-Operating System
- Graphical Development Environment (GDE)
- The Ab Initio Enterprise Meta>Environment (EME)
4.Salient Features of Ab Initio
5.Understanding the High Level Architecture of Ab Initio
6.What is the Anatomy of Running a Job? What happens when we push the ‘Run’ Button?
- Deployment Process
- Host Process Creation
- Agent Process Creation
- Component Process Creation
- Successful Component Termination
- Agent Termination
- Host Termination
- Abnormal Component Termination
7.How do we connect from GDE to the Server (RunSettings)
Host Profile File (.aih)
Connection, Co-Operating System, Shell & Host Directory
8.What is Graph Programming? What are the basic parts of an Ab Initio Graph?
9.Data Manipulation Language (DML)
- Record Formats
- DML Expressions
- Transform Functions
- Key Specifiers
10.How do we build a basic Ab Initio Graph? Using .dat(Data File) & .dml(Record Format File)
11.What is Parallelism - Types
- Component Parallelism
- Pipeline Parallelism
- Data Parallelism
12.Sandbox - Types
- Private Sandbox
- Public Sandbox
- 13.Editors in the GDE
- Record Format Editor
- Expression Editor
- Transform Editor
- Key Specifier Editor
- Package Editor
- Variables Editor
- Sequence Specifier Editor
14.Dataset Components
- INPUT FILE
- OUTPUT FILE
- INTERMEDIATE FILE
- LOOKUP FILE
- LOOKUP TEMPLATE
- BLOCK-COMPRESSED LOOKUP TEMPLATE
15.Transform Components - Multistage and Advanced Components
- DEDUP SORTED
- FILTER BY EXPRESSION
- JOIN
- MULTI REFORMAT
- NORMALIZE
- REFORMAT
- ROLLUP
- SCAN
- SCAN WITH ROLLUP
16.Partition Components
- BROADCAST
- PARTITION BY KEY
- PARTITION BY EXPRESSION
- PARTITION BY PERCENTAGE
- PARTITION BY RANGE
- PARTITION BY ROUND-ROBIN
17.De-partition Components
- CONCATENATE
- GATHER
- INTERLEAVE
- MERGE
18.Miscellaneous Components
- REDEFINE FORMAT
- GATHER LOGS
- REPLICATE
- RUN PROGRAM
- TRASH
19.Compress Components
- COMPRESS
- UNCOMPRESS
- DEFLATE
- INFLATE
20.Interactive Database Components (IDB)
- INPUT TABLE
- OUTPUT TABLE
- RUN SQL
- UPDATE TABLE
- JOIN WITH DB
21.Sort Components
- SORT
- SORT WITHIN GROUPS
- SAMPLE
- PARTITION BY KEY AND SORT
22.Validate Components
- CHECK ORDER
- COMPARE RECORDS
- CREATE DATA
- GENERATE RECORDS
- VALIDATE RECORDS
23.Working with Database’s. Introduction Database Configuration File (.dbc)
24.Lookup Functions
- Lookup Local
- Lookup Count
- Lookup First
- Lookup Last
- Lookup Next
25.StringFunctions
- String_char
- String_concat
- String_filter
- String_filter_out
- String_length
- String_lrepad
- String_rindex
- String_substring
- String_upcase
- String_split
- String_length
26.Multifiles and Multidirectories(MFS)
27.Phasing & Checkpointing
28.Reusability Features
- Sub graphs
- Graph Parameters
- Input parameters
- Sandbox Parameters
29.The Ab Initio Enterprise Meta>Environment(EME)
- High level anatomy of the EME
- Relation to sandbox environment
- Checking in your sandbox
- Checking out a graph or a sandbox
- Locking in order to change an object
- Tagging
- Common projects
- Dependency and Impact analysis
30.Advanced Features
- Debugging Techniques
- Plan>It
- Performance Tuning
- PSETS
- PDL Concepts