SAP Datasphere – integration of data from various sources | All for One Poland

SAP Datasphere – integration of data from various sources

From SAP systems through external SQL databases to Google solutions

SAP Datasphere is currently the only data warehouse developed by SAP.  It enables the integration, transformation, unification and storage of data from different source systems. In this cloud environment, GUI-based data modeling and view creation is extremely intuitive and simple. It does not require knowledge of programming or using SQL syntax. Let's see if the integration of SAP Datasphere itself with external systems is equally simple.

SAP Datasphere is currently the only data warehouse developed by SAP.  It enables the integration, transformation, unification and storage of data from different source systems. In this cloud environment, GUI-based data modeling and view creation is extremely intuitive and simple. It does not require knowledge of programming or using SQL syntax. Let's see if the integration of SAP Datasphere itself with external systems is equally simple.

When connecting SAP Datasphere to S/4HANA or SAP ECC systems, data can be taken over in several ways, using CDS tables and views. They can be loaded into SAP Datasphere or used in remote mode, especially when the data volume is smaller.

This raises the question: if the data is loaded into SAP Datasphere, is delta also supported? Can standard SAPI extractors be used, and if so, to what extent? What does the process of data feeding from external databases or source systems based on third-party cloud environments look like in SAP Datasphere?

In this article, we answer these questions, especially in the context of SAP systems. We present the possibilities of integrating SAP systems with a particular focus on S/4HANA and SAP ECC. We discuss how we handled SAPI extractors in SAP Datasphere for one of our customers. We describe the possibilities of integrating SAP Datasphere with external SQL databases, as well as cloud solutions, with a focus on Google products.

SAP S/4HANA

Undoubtedly, it is SAP S/4HANA that is best integrated with the SAP Datasphere data warehouse, both in on-premise and cloud versions of the system (public and private). The supported data sources for this type of system are tables, CDS (Core Data Services) views, and SAPI extractors. It is worth noting at this point that SAPI extractors commonly used when extracting data from SAP ECC systems to external data warehouses, e.g. SAP BW, can still be used in SAP Datasphere, however they are not recommended. This applies to the S/4HANA source system as well as all systems in the SAP portfolio. In the later section of the article, we elaborate on the limitations regarding SAPI extractors within SAP Datasphere, and discuss how they were overcome for one of our major customers.

In the S/4HANA systems, both CDS views and tables can be loaded into SAP Datasphere, or the data they contain can be accessed remotely. Remote access to data can be particularly justified for data with smaller volumes, less frequently needed for reporting, or when timeliness at a given moment is important. When loading CDS views into SAP Datasphere, delta is also supported. Appropriate annotations on the side of the view itself in the source system are required for this. It is also possible to replicate CDS tables and views in real time – every few tens of seconds.

With regard to S/4HANA systems, two more enhancements are worth mentioning:

enabling automatic creation of views on the SAP Datasphere side based on CDS views representing master data of dimensions or facts;

allowing for bulk updating of data from selected tables and/or CDS views.

SAP ECC system

The integration of SAP ECC systems with SAP Datasphere largely depends on the version of SAP you have, as well as the database on which the transaction system is based. Within the test environment, the oldest SAP system we were able to connect to SAP Datasphere was SAP ECC 6.0 in ABAP and BASIS version 7.0. on a SQL database. For this version of the system, the supported data sources were tables available in remote mode with the possibility of saving them in the cloud data warehouse.

Much more opportunities for integration are provided by SAP ECC 6.0 systems based on the HANA database, with the minimum 7.4. version of BASIS and ABAO components. Then, in addition to tables, CDS views can also be the data source. However, in this version of ECC systems, CDS views will not support delta. Furthermore, it is worth noting that in the case of a newer version of the system (BASIS 7.5 component), we managed to operate in the test environment an SLT server allowing for the replication of tables in real time (every few tens of seconds). SAPI extractors are also available, but are not recommended.

It is worth noting that for SAP HANA-based systems, it is possible to connect only the database. Then the source in SAP Datasphere can be defined calculation views.

Case study: Use of SAPI extractors

The SAPI extractors commonly used in SAP ECC systems to export data externally can also be used in SAP Datasphere, but this is not recommended. In the first place, it is not recommended to access them from SAP Datasphere in remote mode. This is because very often extractors are based on complex functional modules, which can significantly affect the performance of data extraction into the data warehouse.

On the other hand, when data is loaded into SAP Datasphere through data flow from extractors, the operation of the delta mechanism is hindered. For it to work effectively, the key fields of an extractor should be identical to the key fields of an internal table on the SAP Datasphere side. However, many standard extractors do not have such key fields defined, and it is not recommended to create them. So is it impossible to use SAPI extractors at all for SAP Datasphere?

For one of our customers, it was important to enable data extraction into SAP Datasphere using 2lis* extractors, which had been already widely used, tested and extended. Since it was not possible to support them as a standard, the problem was solved by using the SAP BW component built into SAP ECC. From a technical point of view, the solution required the following steps on the SAP ECC side in the RSA1 transaction:

  • Replication and activation of the data sources we wanted to use in SAP Datasphere;
  • Creation of infopackages that allowed us to support the standard delta mechanism;
  • Determining the technical name of the so-called PSA table, which was to receive data from the loaded deltas;
  • Implementing a data loading chain using infopackages into the PSA table and cleaning it regularly (every 30 days), which allowed the process to run every day.

Also on the SAP ECC side, but in the Eclipse environment, actions had to be taken to define appropriate CDS views that would support the delta mechanism in the context of SAP Datasphere:

  • Creating views that return the number of the last request from the PSA table for a given source so that the data exported to SAP Datasphere is limited only to data loaded into the PSA table from the current/last day;
  • Creating views containing all fields of the PSA table for a given source for the last request. The views defined in this way were the basis for replicating the necessary data in SAP Datasphere, based on SAPI extractors and taking into account the delta mechanism defined for them.

External SQL databases

External SQL databases can also be connected to SAP Datasphere. This can be done with most of the commonly used databases. However, you should check on the SAP website whether the version of your SQL database is supported. Some databases, such as Microsoft SQL Server and Oracle, have built-in connectors already in SAP Datasphere. This means that it is possible to connect them directly. In other cases, the connection of the databases is done via the universal JDBC connector.

The data sources for external SQL databases are the tables defined on them. They can be accessed remotely as well as by loading data depending on the database used and its version. In addition, in each case, the installation of adapters suitable for the existing SQL databases is required on the Cloud Connector side.

Google cloud solutions

Data sources for SAP Datasphere can be cloud solutions not only from SAP, but also from third parties. Examples include Google BigQuery and Google Cloud Storage. Especially the first of these tools seems interesting from the point of view of SAP Datasphere, as it does not require data replication. Data can be accessed from SAP Datasphere completely remotely. What does this mean in practice?

If we have defined Google BigQuery, which collects data from mechanisms such as Google Trends or Google Advertisements, we can use this data directly in SAP Datasphere It is possible to integrate or collate it in combination with data from other sources without having to store it in our data warehouse.

Only selected SAP Datasphere source systems and the method of their integration are discussed above. The list of connectors enabling connection to individual databases or systems is long. Even if a dedicated connector for a particular source system/database cannot be used, it is possible to use a more general mechanism, such as the already mentioned JDBC or Odata service.

In addition, SAP Datasphere has built-in mechanisms for exporting data externally, including to third-party solutions.

All of this makes SAP Datasphere seem an attractive solution for both existing SAP users and those who are just starting their adventure with SAP already in S/4HANA version or who do not have any SAP system at all. There are many possibilities for integration and access to data. It is worth noting that most data can be accessed from SAP Datasphere in remote mode, without the need to load data into the warehouse. The environment interface itself is also user-friendly and graphical, enabling advanced key users to model data.

The term “advanced" means here primarily that they must know where the data they are interested in is stored in the source system, i.e. in which tables or CDS views.

More about SAP Datasphere

SAP Datasphere is a cloud-based data storage solution that allows you to efficiently manage large amounts of data from different sources in a unified and integrated way. Key features and capabilities include data modeling, integration and virtualization. It also provides tools for information sharing and collaboration between project teams. It is the next generation of the SAP Data Warehouse Cloud solution.

Write us Call us Send email






    1. Personal data is processed pursuant to Article 6 (1) (a) of the Regulation of the European Parliament and of the Council (EU) 2016/679 of April 27, 2016 – the General Data Protection Regulation
    2. The data controller is All for One Poland sp. z o.o. with its registered office in Złotniki, ul. Krzemowa 1 62-002 Suchy Las. Contact data of the Data Protection Supervisor: iod@all-for-one.com.
    3. Consent to data processing is voluntary, but necessary for contact. Consent may be withdrawn at any time without prejudice to the lawfulness of the processing carried out on the basis of consent prior to its withdrawal.
    4. The data will be processed for the purposes stated above and until this consent is withdrawn, and access to the data will be granted only to selected persons who are duly authorised to process it.
    5. Any person providing personal data shall have the right of access to and rectification, erasure, restriction of processing, the right to object to the processing and to the transfer of data, the right to restriction of processing and the right to object to the processing, the right to data transfer.
    6. Every person whose data is processed has the right to lodge a complaint with the supervisory authority, which is the President of the Personal Data Protection Office (ul. Stawki 2, 00-193 Warsaw).
    7. Personal data may be made available to other entities from the group that All for One Poland sp. z o.o. is part of – also located outside the European Economic Area, for marketing purposes. All for One Poland ensures that the data provided to these entities is properly secured, and the person whose data is processed has the right to obtain a copy of the data provided and information on the location of the data provision.

    +48 61 827 70 00

    The office is open
    Monday to Friday
    from 8am to 4pm (CET)

    General contact for the company
    office.pl@all-for-one.com

    Question about products and services
    info.pl@all-for-one.com

    Question about work and internships
    kariera@all-for-one.com

    This site is registered on wpml.org as a development site. Switch to a production site key to remove this banner.