Sap Exchange Infrastructure and in for Ma Tic A
Transcript of Sap Exchange Infrastructure and in for Ma Tic A
-
8/6/2019 Sap Exchange Infrastructure and in for Ma Tic A
1/12
W H I T E P A P E R
SAP Exchange Infrastructure and Informatica PowerCenter:Complementary Data Integration Capabilities to Support SAP Data Migration Projects
-
8/6/2019 Sap Exchange Infrastructure and in for Ma Tic A
2/12
This document contains Confidential, Proprietary and Trade Secret Information (Confidential Information) ofInformatica Corporation and may not be copied, distributed, duplicated, or otherwise reproduced in any mannerwithout the prior written consent of Informatica.
While every attempt has been made to ensure that the information in this document is accurate and complete,some typographical errors or technical inaccuracies may exist. Informatica does not accept responsibility for anykind of loss resulting from the use of information contained in this document. The information contained in thisdocument is subject to change without notice.
The incorporation of the product attributes discussed in these materials into any release or upgrade of anyInformatica software productas well as the timing of any such release or upgradeis at the sole discretion ofInformatica.
Protected by one or more of the following U.S. Patents: 6,032,158; 5,794,246; 6,014,670; 6,339,775;6,044,374; 6,208,990; 6,208,990; 6,850,947; 6,895,471; or by the following pending U.S. Patents:09/644,280; 10/966,046; 10/727,700.
This edition published June 2006
-
8/6/2019 Sap Exchange Infrastructure and in for Ma Tic A
3/12
Table of Contents
Executive Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .2
Architectural and Application Data Integration Services
Components of SAP XI . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .3
SAP Implementation vs. Data Migration . . . . . . . . . . . . . . . . . . . . . . . . . . .4
Addressing the Requirements of SAP Data Migrations:SAP XI and PowerCenter . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .4
Data Analysis/Profiling of Legacy Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .5
Data Quality . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .5
Data Volumes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .5
Data Transformation/Conversion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .6
Legacy Data Access . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .6
Services-based Approach: SAP XI and PowerCenter . . . . . . . . . . . . . . . . . .7
About Informatica . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .9
1SAP Exchange Infrastructure and Informatica PowerCenter
White Paper
-
8/6/2019 Sap Exchange Infrastructure and in for Ma Tic A
4/12
Executive Summary
When evaluating technology options to meet the needs of enterprise data integration, many CIOs
and data architects consider enterprise application integration (EAI). After all, with features to
help address many data integration requirements, EAI toolsets seem like the right choice.
SAP XI and PowerCenter don't provide competing capabilities when it comes to data migration;
they complement one another, providing a comprehensive set of capabilities that ensure efficient
and successful SAP implementations.
When considering a long-term data integration solution, CIOs and data architects whose
technology infrastructure is based on SAP solutions should evaluate SAP Exchange
Infrastructure (SAP XI). SAP XI is native SAP data integration technology that offers capabilities
common to EAI tools and seamless integration into the SAP technology stack. They should also
evaluate Informatica PowerCenter, a single, unified enterprise data integration platform that
allows companies and government organizations of all sizes to access, discover, and integrate
data from virtually any business system, in any format, and deliver that data throughout the
enterprise at any speed.
The purpose of this white paper is to compare and contrast the role of SAP XI and PowerCenter
in data migration projects. This paper examines the differences between the requirements for
integrating business processes and the requirements for integrating data in the context of data
migration into SAP. This paper also explores the ways in which the capabilities of SAP XI and
PowerCenter complement one another and serve to ensure efficient and successful data
migrations and to reduce implementation risks.
2
-
8/6/2019 Sap Exchange Infrastructure and in for Ma Tic A
5/12
Architectural and Application Data Integration ServicesComponents of SAP XI
SAP's vision for SAP XI is to have the native data integration technology serve as a hub for
integrating both data and business processes to help orchestrate both internal and cross-
enterprise business events.
SAP XI is well suited to support this vision, primarily for its ability to meet transactional- and XML-
based data integration requirements. Figure 1 shows an illustration of SAP XI's architecture and
key components. Key capabilities of SAP XI include:
Business process management features essential in orchestrating services and crossapplication business events
Document- and message-oriented data integration Data movement-based XML message processing
Data transformation based on XSLT
Web services-based integration (either a consumer or provider of Web services)
:
Examples of business processes requiring data integration supported by SAP XI include:
Integrate inventory and procurement systems (e.g., when inventory drops below 500 units,send an order of 300 more to procurement system)
Integrate order, financial, and distribution systems (e.g., when each order is entered, generatean invoice and ship the products)
Integrate two transactional systems across the firewall (B2B) (e.g., when an order is placed inSAP of Partner A, send a electronic purchase order document to the fulfillment application ofPartner B.)
3SAP Exchange Infrastructure and Informatica PowerCenter
White Paper
Presentation and Analysis(SAP, GUI, SAP Portals)
Business Process Management(SAP XI)
Flat FilesApplications Databases Mainframe
Application Services
Transactional Data Access(via SAP XI adapters)
Business Process IntegrationSAP XI
Service Repository Service Container
Transaction Processing Message Routing
Reliable Messaging Event Processing
Messages XML UnstructuredData
PDF
Adobe
Figure 1: Architectural and Application Data Integration Services Components of SAP XI
-
8/6/2019 Sap Exchange Infrastructure and in for Ma Tic A
6/12
SAP Implementation vs. Data Migration
Every SAP implementation involves data migration. Most data migrations into SAP involve more
than one source system because organizations are increasing the scope of the businessprocesses supported with SAP solutions.
SAP implementations focus on reengineering and evolving enterprise business processes. The
challenge during the implementation is to determine how to configure SAP to support each
company's unique processes.
Data migration is about data readiness. A data migration is not just about moving the data into
SAP; it's about aligning the data to the business process within SAP. The data migrated into SAP
must be accurate and trustworthy to motivate the business users to transition from their legacy
applications and adopt the SAP solution.
It's a common assumption that migrating data from legacy applications to SAP is simply another
integration task that can be easily handled by SAP XI. However, an efficient and successful
migration of data into SAP involves massive amounts of data. This data must be accessed,
discovered, cleansed, and integrated before it's migrated to the SAP application. All tasks within
a migration-from profiling and analyzing source data, to data transformations and conversions-
must be applied to very large data sets.
To best support data migration into SAP, the selected data integration tool should be based on
an integrated, metadata-driven architecture to provide visibility into all data integration changes
across the entire migration process, including data access, extraction, profiling, transformation,
and loading. The data integration tool should provide the following data migration capabilities:
Upfront analysis and profilingof the complete data set across all sources of data
Extraction and movement of bulk dataincluding both master data and current and historicaltransactional data-from heterogeneous, legacy applications
Set-oriented transformations and conversions, since complex transformations must be applied
when data from multiple sources are joined and made to align with the new business process Centrally-managed, native source system access because migration projects require access to
a wide range of source systemsfrom homegrown mainframe applications to legacy client-server applications
Data audit and lineage capabilities that enable migration teams to trace and verify data,which gives the user community confidence in the quality of the data and encouragesadoption of the new system
Addressing the Requirements of SAP Data Migrations:SAP XI and PowerCenter
This section describes when and how PowerCenter's capabilities can complement those of SAP XI
during data migrationa critical phase of an SAP implementation that's often underestimated.
Since both SAP XI and PowerCenter extract, convert, and write data to targets, why should two
products be considered for SAP data migration?
SAP XI and PowerCenter don't provide competing capabilities when it comes to data migration;
they complement one another, providing a comprehensive set of capabilities that ensure efficient
and successful SAP implementations.
4
-
8/6/2019 Sap Exchange Infrastructure and in for Ma Tic A
7/12
Data Analysis/Profiling of Legacy Data
Data profiling is the analysis of data to understand its content, structure, quality, and
dependencies. As SAP XI does not offer native data profiling functionality, SAP data migration
teams traditionally take a manual approach to profiling legacy data. Manual profiling techniques
range from spot inspections of the legacy applications or sample data extracts, to analysis via
custom-coded reports or elaborate spreadsheets. These data profiling methods typically sample
data in a few key fields to get a sense of what the data is like in these columns, but the results
are often inconsistent and incomplete.
PowerCenter's data profiling capabilities provide comprehensive and accurate information about
the content, quality, and structure of data in virtually any operational system. Organizations can
reduce implementation risks and increase data confidence by automatically assessing the initial
and ongoing quality of data regardless of its location or type.
PowerCenter can help SAP migration teams move beyond wondering if their data is dirty and
onto fixing data issues.
Data Quality
Most organizations attempt to tackle data quality problems by implementing tactical solutions to
improve data quality within a single application or within a single business process. While this
approach may mitigate the problem for part of the organization in the short term, such limited
initiatives generally fail to achieve long-term data quality improvement on a broad scale.
Organizations need a solution to improve and maintain data quality enterprise-wide.
PowerCenter's data cleansing, parsing, and matching capabilities enable organizations to
standardize, validate, enhance, and correct enterprise data such as customer contact
information and corporate or third-party data. Powerful, algorithmic data matching
capabilities identify relationships between data records for de-duplication or group-based
processing. These capabilities:
Improve the accuracy of data, giving end users confidence in the quality of their data
Increases developer productivity, reducing the effort necessary to cleanse and transform data
Maintains data consistency and integrity throughout the enterprise by eliminating dataduplication
SAP XI excels at architecting and driving enterprise business processes. Data quality is another
area which SAP XI does not natively support. By acknowledging the challenge of establishing and
maintaining enterprise-wide data quality, organizations can use PowerCenter and SAP XI to model
a common set of business rules to drive their data quality initiatives.
Data Volumes
Legacy data migration projects involve extracting, converting, and importing data from multiple,
heterogeneous applications into SAP. When you consider that data migration requires themovement of master data (e.g., material/customer/vendor master, account codes, etc.), as well
as the movement of current and historical transactional data (e.g., purchase and sales orders,
etc.), the data volumes are massive-millions of rows and hundreds of gigabytes of data. SAP
migration teams are under pressure to minimize the amount of time critical applications will be
down in order to migrate large data volumes into them.
5SAP Exchange Infrastructure and Informatica PowerCenter
White Paper
-
8/6/2019 Sap Exchange Infrastructure and in for Ma Tic A
8/12
6
SAP XI's data movement capabilities are based exclusively on XML-based document processing.
While beneficial when addressing messaging-based interface scenarios, these capabilities may
not be robust enough to support the overhead of converting every row of data-whether that data
is coming from flat files, databases, or a business applications-to an XML format.Architected to support bulk movement of data, PowerCenter gives SAP migration teams the
confidence that they can meet their limited cutover windows for migration data into SAP.
Data Transformation/Conversion
Both SAP XI and PowerCenter offer mappings, which are graphical representations of data
integration processes, including the source, transformation of data, and target. Differences
between the two products arise when examining how mappings can be used for data migration.
As SAP XI is built primarily for transactional- and XML-based data integration, its data
transformation is based on XSLT. SAP XI's data transformation capabilities rely on creating the
XSLT-based transformations within the SAP XI toolset and are ideal for message-to-message-
based conversions. SAP XI does an excellent job in converting data from one XML format to
another. However, the transformation requirements of data migration projects are likely to exceed
the design parameters of XSLT. It's true that any transformation logic can be supported through
custom coding, but this approach can be costly when maintenance and reuse are considered.
PowerCenter complements SAP XI by exposing any of its transformations as a Web service. This
means that SAP users have the option to run PowerCenter data migration processes within the
SAP XI business process management framework. PowerCenter offers SAP data migration teams
powerful data transformation capabilities that can:
Improve the accuracy of data, giving end users confidence in the quality of their data
Address non-existing fields in legacy applications required by SAP
Fix name and address duplication
Join disparate tables and files (even across disparate applications)
Route data to heterogeneous applications in single mapping (e.g., one mapping can targetdata to SAP, mainframe, database table[s], and spreadsheet[s])
Legacy Data Access
SAP XI and PowerCenter provide different methods and approaches for sourcing legacy data.
SAP XI relies on a combination of native connectors for SAP APIs, along with a library of third-
party adapters. These adapters support integration via such message formats and protocols as
EDI, flat file, JDBC, and JCA. Any data sourced via SAP XI is always converted to an XML payload
prior to processing within SAP XI.
PowerCenter, in contrast, does not require all data to be converted to XML; it can access the
legacy application data directly. Nor does PowerCenter rely on third-party adapters to get to
non-SAP data sources. PowerCenter offers an extensive library of connectivity options.Figure 2 illustrates the diversity in the systems, message formats, and protocols natively
supported by PowerCenter.
-
8/6/2019 Sap Exchange Infrastructure and in for Ma Tic A
9/12
Without considering the context for specific data integration requirements, SAP XI and
PowerCenter seem to offer redundant levels of data access. However, each product offers unique
capabilities that offer specific advantages in the context of data migration. Let's use a real-world
business process to showcase these differences: purchase orders (PO).
Let's say that purchase orders created by a customer need to be sent hourly to the vendor. Each
PO is sent to the vendor via the EDI 850 message format across a VAN and needs to be
sourced, parsed, and mapped into the vendor's SAP ERP system. In this case, SAP XI is the ideal
data integration tool, since this process is a message-oriented scenario and involves EDI, which
is a source type supported by third-party SAP XI adapters.
Let's look at another example. This one involves six years of historical purchase order data thatneed to be migrated from disparate legacy applications based on relational databases,
mainframes, and flat files. Here PowerCenter complements SAP XI. PowerCenter provides native
access to challenging sources of data like mainframe applications. PowerCenter can perform
bulk movement of large data volumes in a limited window of time. Also, PowerCenter can
perform referential integrity checksfor example, PowerCenter can ensure that each PO has valid
vendor master and material master values and that each vendor master address entry is de-
duped and cleansed. This process can be supported in a single mapping with codeless
PowerCenter transformation objects. Trying to support this type of scenario using only SAP XI
would be challenging and time-consuming, requiring significant code to be created to address
performance and transformation requirements.
Services-based Approach: SAP XI and PowerCenterSAP XI is the enabling technology for the Enterprise Services Architecture (ESA), SAP's
blueprint for a Service-oriented Architecture (SOA). The purpose of the ESA is to show
SAP architects/developers how to use of SOA/Web services technology to better conform
to business goals.1
7SAP Exchange Infrastructure and Informatica PowerCenter
White Paper
Real-Time Data Sources TIBCO IBM WebSphere MQ JMS SAP MSMQ WEBM
Web Services
Unstructured DataPDF Word ExcelVertical Standards
(e.g., HL7, SWIFT, ACORD)Print Stream BLOBsAny proprietary data
format/standard
Open and RelationalData Sources
Oracle IBM MicrosoftSybase Informatix Teradata
Flat Files XML Web Logs
Informatica
PowerCenter
EnterpriseSoftware Sources
Mainframe AS/400 JDEPeopleSoft Siebel SAP
SAS Essbase Lotus Notes
Across the Firewall/WAN
Remote Data AccessRemote or OutsourcedBusiness Applications
Figure 2: PowerCenter Provides Access to All Sources of Enterprise Data
1 Mattern, Thomas and Matthias Haendly. "ESA: A 2005 'Business-Savvy' Take on SOAs,"Integration DeveloperNews, February 9, 2005.
-
8/6/2019 Sap Exchange Infrastructure and in for Ma Tic A
10/12
PowerCenter's data services-based approach supports the vision of SAP's ESA. Each data
integration requirement specific to SAP data migrations supported by PowerCenter can be
exposed to SAP XI via a data service. Building on SAP XI's architectural and application data
integration services components shown earlier in Figure 1, Figure 3 shows how PowerCenter'sdata services complement and co-exist with SAP XI.
Just as SAP XI would be used as an information hub to orchestrate services for transactional and
message-oriented business processes, it can also call out to any PowerCenter data service.
Figure 4 shows how an SAP XI business process flow can call out to a PowerCenter data service
for moving bulk legacy data or facilitating complex data transformations based on set based
aggregations, for example.
8
Presentation and Analysis(SAP, GUI, SAP Portals)
Business Process Management(SAP XI)
Flat FilesApplications Databases Mainframe
Application ServicesData Migration Services
Data Access for Migration Transactional Data Access
(via SAP XI adapters)
Business Process IntegrationSAP XI
Ser vi ce Re posi to ry Serv ice Conta iner
Transaction Processing Message Routing
Rel iable Me ssaging Even t Pr ocess ing
Messages XML UnstructuredData
PDF
Adobe
Enterprise Data ServicesPowerCenter Data Integration Services for Data Migration
Data Profiling
Data Transformation
Data Cleansing
Data Movement
Data Lineage
Data Quality
Figure 3: Enterprise Data Services Components Illustrate Complementary Data Integration Between PowerCenterand SAP XI
SAP XI Business Process Flow
PowerCenter Web Service Provider
PowerCenter Data Migration Service
S TF(x)
WS TranformationRequest
WS TranformationResponse
Figure 4: Detail of SAP XI Business Process Calling Out and Consuming a PowerCenter Data Service
-
8/6/2019 Sap Exchange Infrastructure and in for Ma Tic A
11/12
Conclusion
The chapter on "Analyzing and Cleansing Legacy Data" from SAP's online manual (available at
www.help.sap.com) states that the analysis of legacy data is critical for successful migration intoSAP and has the reader consider the following questions:
Which data exists?
How is the data structured (e.g., length, sequence)?
Which data can be transferred unmodified, which must be converted, and which cannot betransferred at all?
It does not elaborate how this analysis is to be done across the heterogeneous landscape of
existing legacy applications. The assumption is that identifying and preparing the legacy data for
input into SAP is the responsibility of the SAP implementation team. Assumptions such as this
one can lead SAP implementation teams to underestimate the scope and effort required to
successfully complete the data migration effort critical for any new or ongoing SAP
implementation phase.
While SAP XI is suited for transactional- and message-oriented-based data integration
requirements, SAP's native data integration technology can be complemented to extend its value
for data migration phases in SAP implementations. Informatica PowerCenter offers robust data
integration and transformation capabilities as data services that can be exposed as Web services
and leveraged on demand by SAP XI to ensure efficient and successful data migrations and
reduce implementation risks.
To find out more about PowerCenter can help you in your next SAP data migration project, please
visit us at www.informatica.com or call us at (800) 653-3871.
About Informatica
Informatica Corporation delivers data integration software and services to solve a problem facingmost large organizations: the fragmentation of data across disparate systems. Informatica helps
organizations gain greater business value from their information assets by integrating their
enterprise data. Informatica's open, platform-neutral software reduces costs, speeds time to
results, and scales to handle data integration projects of any size or complexity. With a proven
track record of success that extends back to 1993, Informatica helps companies and
government organizations of all sizes realize the full business potential of their enterprise data.
That's why Informatica is known as the data integration company.
9SAP Exchange Infrastructure and Informatica PowerCenter
White Paper
-
8/6/2019 Sap Exchange Infrastructure and in for Ma Tic A
12/12
Worldwide Headquarters, 100 Cardinal Way, Redwood City, CA 94063, USAphone: 650.385.5000 fax: 650.385.5500 toll-free in the US: 1.800.653.3871 www.informatica.com
Informatica Offices Around The Globe: Australia Belgium Canada China France Germany Japan Korea the Netherlands Singapore Switzerland United Kingdom USA
2006 Informatica Corporation. All rights reserved. Printed in the U.S.A. Informatica, the Informatica logo, and, PowerCenter are trademarks or registered trademarks of Informatica Corporation in the United States and in jurisdictionsthroughout the world. All other company and product names may be tradenames or trademarks of their respective owners.
J50949 6732 (06/26/06)