In todays article, we would like to give you a rough overview on SAP Data Hub. According to SAP, enterprises are looking for an integrative open solution to extend, accelerate and make the data flow accessible to all users across all their data landscapes . SAP also conducted a study which found that 74 percent of IT decision makers felt,their data landscape was so complex that it compromised flexibility. 86 percent said they could get much more out of their data, if they could only access it. The SAP Data Hub is supposed to be the solution to this.
What exactly does SAP Data Hub offer?
SAP Data Hub is a central platform for integrating data, managing data processing and managing metadata using SAP VORA. You can build powerful pipelines and manage, share and disseminate distributed data. SAP Data Hub offers various basic features that are very beneficial for a company. In the following, I will closely detail you these features.
As companies will become aware of opportunities and risks through a better understanding of the data landscape. SAP Data Hub provides organizations with a detailed and easy-to-understand view of their data landscapes. This can be SAP software sources such as SAP HANA or non-SAP sources such as Apache Hadoop. This new approach enables companies to a better understanding of the use and quality of their data and its respective correlations.
Image 1: SAP Data Hub
Furthermore, SAP Data Hub offers the extension and faster execution of data projects by building powerful data processing pipelines. The integrative environment allows these pipelines to be created and easily copied, modified, and reused. In addition, SAP Data Hub supports flexible, data-driven applications, that are shifted to the native environments (location of data) because of the relocation of computing processes, and hence, will result faster in better business results.
But what are the main arguments for the Data Hub?
The following 5 characteristics will emphasize its benefits:
The Data Hub provides a general view of the company and Big Data by aggregating all data from data sources, business applications and processes. This improves the transparency of the data landscapes and simplifies the identification of Big Data sources and their usability. For this purpose, the “landscape management” serves as a management tool for the connections, zones and systems of the landscapes. Employees can also use the Data Hub cockpit to improve their daily work through “self-service” interfaces. The following figure provides an example of such a cockpit.
Image 2: Cockpit of the Data Hub
Intelligent discovery of data correlations improves data quality through self-service. The SAP Data Hub provides a representation of the data correlations in your company. Your data is prepared, cleansed and its connections centrally controlled (Data Discovery and Preparation). The system and metadata recognition enable you to search any connected system (metadata management and cataloguing) and the “modeler” provides the data records, models data pipelines and manages metadata. This provides a visualization of the data flow in your systems and processes.
SAP Data Hub enables an efficient data enrichment. For this purpose, a variety of calculation methods such as OLAP and machine learning are used for the distributed data/data pipelines. To perform this function, the “modeler” is used.
Orchestrating the data from start to finish and process the data where it is located. With this option, expensive and unnecessary data movements are avoided. It does not matter whether the data is in the cloud, “on-premise” or both. The status of these workflows can be very well planned and monitored by “monitoring and scheduling”.
Security policies are dynamically maintained in one place to meet regulatory and corporate requirements. Your metadata can be managed, quality defects can be detected and corrected. With the “security and policies” feature, necessary security settings and guidelines can be created.
With the Data Hub, SAP simplifies data handling. Using data pipelines, users direct and control the data flows and processing, whereby the data always remains at its storage location.
The following points are summarized below:
- Simpler, more scalable approach to data landscape integration, management and control
- Expand and accelerate data-driven projects through powerful data processing pipelines
- Modern, open architecture supports various data storage systems
- A central platform for data management in your company
We can already say that SAP Data Hub is very appealing for companies that want to get more out of their data. Lets look a bit further.
A look into the future
SAP Data Hub helps you to make full use of the added value of your data. When diving deeper into the topic, you will stumble across the term SAP Leonardo. Together with SAP Vora and SAP Cloud Platform Big Data Services, the SAP Data Hub forms the central building blocks for it. SAP Leonardo is the next step in data evaluation and THE system for digital innovations. But this isa topic you will get to read more about in a separate article within the next weeks.
Let’s summarize the topic once again
SAP is paving the way for a data-driven company with new solutions such as the SAP Data Hub. This will also enable digital innovations. As a central platform, SAP Data Hub aims at helping to make the very complex data streams controllable or scalable. The central tasks of the system are data integration, data orchestration and data governance. The data used can be stored in various systems or databases. Open source technologies can be integrated into SAP systems and databases, as well as those of third-party providers, using suitable interfaces. Something, that was previously difficult to achieve.
With Data Hub, SAP assists its customers in taking the right steps towards becoming data-driven companies. And, even though these can be cumbersome, they will be essential for the future.
Sources of the images: SAP SE