Ab Initio Course

 

Ab initio Course Overview

Ab Initio Software is an American multinational enterprise software corporation based in Lexington, Massachusetts. The company specializes in high-volume data processing applications and enterprise application integration.

The Ab Initio products are provided on a user-friendly homogeneous and heterogeneous platform for parallel data processing applications. These applications perform functions relating to fourth generation data analysis, batch processing, complex events, quantitative and qualitative data processing, data manipulation graphical user interface (GUI)-based parallel processing software which is commonly used to extract, transform, and load (ETL) data.

It is a flexible and extensible tool which adapts at runtime to the formats of records entered. It allows creation and incorporation of new components obtained from any program that permits integration and reuse of external legacy codes and storage engines.

Ab Initio is a Business Intelligence platform comprised of six data processing products Co operating System, The Component Library, Graphical Development Environment, and Enterprise Meta Environment, Data Profiler, and Conduct It. It is a powerful GUI-based parallel processing tool for ETL data management and analysis.

Anyone who is working on Maintenance project where the underlying ETL tool is Ab Initio.  Anyone who wanted to better their ETL skill sets by learning the happening tool in the Market. Knowledge in the database and/or data warehouse.

Knowledge about different components and its purpose. Build a graph adhering the business requirement. How to blend different tools for different requirement.  Complete understanding of Data Warehousing and ETL terminologies.

You will be hired as data analyst, ETL developer in the companies like wells Fargo, Bank of America, DBS, commonwealth bank and more.

The main topics covered Introduction to Ab initio software, Understanding Host connection settings, Understanding components of a graph, Component Overview-Dataset components, Component Overview-Dataset components, Lookup files and Lookup Functions, Component Overview- Validate Components, performance tuning and more.

Course Syllabus

Introduction to Ab initio software

Ab initio Architecture

Understanding GDE

Understanding Co-op System

Understanding EME/Application hub

Understanding Host connection settings

Configure host settings

Understanding Abinitio Jobs/ Graphs

Basics of Graphs

Building and executing graphs

Understanding Sandboxes/Projects

Understanding components of a graph

Record formats, DML, Transform functions, Key specifiers

Dataset components/ Table components/Program components

Layout

Parameters

Validating the graphs and understanding the error messages

Component Overview-Editors

Record Format Editors

Expression Editors

Transform Editors

Key Specifier Editors

Package Editors

Variable Editors

Component Overview-Dataset components

Input file

Output File

Intermediate file

Lookup files and Lookup Functions

Filter by expression

Dedup sorted

Reformat

Aggregate

Scan

Rollup

Join

Match sorted

Normalize

Denormalize Sorted

Understanding partitioning Concepts and multi files

Understanding Multi files

Understanding Parallelism

Data Parallelism

Component Parallelism

Pipeline parallelism

Component Overview-Partition components

Broadcast

Partition by key

Partition by expression

Partition by round robin

Partition by range

Partition by load balance

Component Overview-DE partition components

Gather

Concatenate

Interleave

Merge

Component Overview-Database Components, DBC file configuration

Understanding database configuration

Input table

Output table

Run SQL

Truncate Table

Update Table

Join with DB

Component Overview- Miscellaneous Components

Redefine format

Gather Logs

Replicate

Run Program

Trash

Component Overview- Validate Components

Check Order

Compare Records

Validate Records

Generate Record

Create Data

Compute Checksum

Understanding Performance Tuning

Component Folding

Additional Graph concepts

Tagging

Branching

Checking-Checkout

Locking

Common projects

Dependency analysis/Impact analysis

Generic Graphs/Reusable Graphs

Conditional components

Concept of Phases and check point

Understanding Plans/Conduct IT Understanding Continuous Flows

Understanding various command line options/ configuration Variables

M commands

Air commands

Configuration variables

Environment