DataStage overview
DataStage is one of the leading ETL products on the BI market. The tool integrates data across multiple systems and can process high volumes of data.
DataStage has a user-friendly graphical front end for designing jobs that collect, transform, validate and load data from multiple sources, such as enterprise applications (Oracle, SAP, PeopleSoft) and mainframes, into data warehouse systems.
The application can integrate metadata across the data environment to maintain consistent analytic interpretations.
DataStage supports data quality and reliability for accurate business analysis and reporting.
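The collect, transform, validate and load flow described above can be illustrated with a minimal sketch. This is generic Python, not DataStage code: DataStage jobs are designed graphically, and every name here (Customer, extract, transform, validate, run_job, the sample rows and the validation rule) is a hypothetical chosen only for illustration. An in-memory list stands in for the data warehouse.

```python
from dataclasses import dataclass


@dataclass
class Customer:
    customer_id: int
    name: str
    country: str


def extract(sources):
    """Collect raw rows (dicts) from several source systems."""
    for source_name, rows in sources.items():
        for row in rows:
            yield source_name, row


def transform(row):
    """Normalise field formats so rows from different systems line up."""
    return Customer(
        customer_id=int(row["id"]),
        name=row["name"].strip().title(),
        country=row["country"].strip().upper(),
    )


def validate(customer):
    """Accept only rows with a positive id and a two-letter country code."""
    return customer.customer_id > 0 and len(customer.country) == 2


def run_job(sources):
    """Run collect -> transform -> validate -> load; return (loaded, rejected)."""
    warehouse, rejects = [], []
    for source_name, row in extract(sources):
        customer = transform(row)
        if validate(customer):
            warehouse.append(customer)  # "load": the list stands in for the warehouse
        else:
            rejects.append((source_name, row))  # keep rejected rows for review
    return warehouse, rejects


# Hypothetical sample data keyed by source system name.
sources = {
    "oracle": [{"id": "1", "name": " alice smith ", "country": "us"}],
    "sap": [
        {"id": "2", "name": "BOB JONES", "country": "de "},
        {"id": "3", "name": "carol", "country": "Germany"},
    ],
}

loaded, rejected = run_job(sources)
print(len(loaded), "rows loaded,", len(rejected), "rows rejected")
```

Keeping rejected rows alongside their source name, rather than silently dropping them, mirrors the validation role the overview describes: bad records stay visible for later correction.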
DataStage history
DataStage was formerly known as Ardent DataStage and later as Ascential DataStage. In 2005 it was acquired by IBM and added to the WebSphere family. Since 2006 its official name has been IBM WebSphere DataStage.
DataStage versions
The DataStage server is available and fully supported in Windows and Unix environments.
Editions:
- Server Edition - contains and supports server jobs (the etl-tools.info tutorial is based on the DS 7.5.1 Server Edition).
- Enterprise Edition - includes parallel and server jobs. It is much more scalable than the Server Edition.
- MVS Edition - for mainframe systems. Jobs are developed on a Windows or Unix/Linux platform and transferred to the mainframe as compiled mainframe jobs.
- DataStage for PeopleSoft - a server edition with prebuilt PeopleSoft EPM jobs.
- DataStage TX - formerly known as Mercator; supports complex transactions.
- DataStage SOA - the Real Time Integration pack can turn server or parallel jobs into SOA (Service Oriented Architecture) services.
DataStage components
DataStage client applications are as follows:
- Administrator - administers DataStage projects, manages global settings and interacts with the system.
- Designer - used to create DataStage jobs, which are compiled into executable programs. It is the main module for developers.
- Director - used to run and monitor DataStage jobs. It is mainly used by operators and testers.
- Manager - allows browsing and editing the metadata repository.