Informatica Enterprise Data Catalog (EDC)

Digital communication and processing have become indispensable in today's world. As a result, the flood of information is constantly increasing. Data is of inestimable value across all industries, but it also poses a major challenge. Evaluating large amounts of data from different sources efficiently requires appropriate technologies. Informatica created the Enterprise Data Catalog (EDC) to solve this problem; in-factory has been a Platinum Partner of Informatica since July 2021.

EDC is a data catalog that uses artificial intelligence and machine learning to merge and catalog large amounts of data from a variety of sources. Various features, such as semantic search and a comprehensive overview of the relationships between individual data records, greatly simplify the search and analysis of the data.

Data domains are a particularly practical feature of EDC. These are semantic labels that EDC can infer from data patterns and metadata, based on the semantics of column data or column names. Data similarity plays a crucial role here, but determining it is normally time-consuming. EDC uses machine learning to cluster similar columns and to identify those with high similarity quickly and effectively. Related data domains can be combined into logical groups (so-called data domain groups) for easier evaluation.
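The idea of clustering similar columns can be illustrated with a small Python sketch. It is not EDC's actual algorithm, which is not described here. It assumes that similarity is measured as the Jaccard overlap of the distinct values in two columns, and that columns are grouped as soon as one pair reaches a chosen threshold. The column names, the sample values and the functions jaccard and cluster_columns are hypothetical.

```python
from itertools import combinations


def jaccard(a: set, b: set) -> float:
    """Share of distinct values that two columns have in common (0.0 to 1.0)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def cluster_columns(columns: dict, threshold: float) -> list:
    """Group columns whose value overlap reaches the threshold (single linkage)."""
    values = {name: set(vals) for name, vals in columns.items()}
    parent = {name: name for name in columns}

    def find(name):
        # Follow parent links to the representative of the cluster.
        while parent[name] != name:
            parent[name] = parent[parent[name]]
            name = parent[name]
        return name

    # Merge the clusters of every pair that is similar enough.
    for left, right in combinations(columns, 2):
        if jaccard(values[left], values[right]) >= threshold:
            parent[find(left)] = find(right)

    clusters = {}
    for name in columns:
        clusters.setdefault(find(name), set()).add(name)
    return list(clusters.values())


# Hypothetical supplier columns from two merged companies.
columns = {
    "a.supplier_zip": ["80331", "20095", "10115", "50667"],
    "b.plz": ["80331", "20095", "10115", "01067"],
    "a.supplier_name": ["Acme GmbH", "Nordwind AG"],
}
clusters = cluster_columns(columns, threshold=0.5)
```

In this sketch the two postal code columns share three of five distinct values, so they end up in one cluster, while the name column stays on its own. Comparing every pair of columns is exactly the costly step the text mentions; a production system would need a faster approximation.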

A practical example: following the merger of two companies, all supplier data has been merged into a common database using EDC. Data domains are to be used so that relevant information can still be found and analyzed quickly and efficiently despite the increased volume of data. In addition, suppliers are to be allocated to the North East and Central regions to simplify the search.

With the help of Similarity Discovery, EDC automatically adds suitable data domains to the data assets. Individual rules can be set up for this, for example for the degree of similarity in percent. If EDC does not suggest a suitable data domain, data domains can be created, edited and, if necessary, deleted manually via the Catalog Administrator*. In the example, the data domain wohnort_plz (postal code of the place of residence) is created and assigned to the corresponding assets. Once the data domains have been assigned, suitable data domain groups can also be created in the Catalog Administrator (in the example, the regions North East and Central), and the corresponding data domains (in the example, wohnort_plz) can be assigned to them.
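How a rule with a percentage threshold might decide on a data domain can be sketched in Python. This is only an illustration and does not use the EDC or Catalog Administrator interfaces. It assumes that a rule is a regular expression, that a German postal code consists of exactly five digits, and that a domain is suggested once a minimum percentage of the non-empty values match. The functions conformance_percent and suggest_domain, the sample values and the group mapping are hypothetical.

```python
import re

# Assumption: a German postal code is modelled as exactly five digits.
PLZ_PATTERN = re.compile(r"\d{5}")


def conformance_percent(values, pattern):
    """Percentage of non-empty values that fully match the pattern."""
    non_empty = [v for v in values if v]
    if not non_empty:
        return 0.0
    hits = sum(1 for v in non_empty if pattern.fullmatch(v))
    return 100.0 * hits / len(non_empty)


def suggest_domain(values, rules, min_percent):
    """Return the first domain whose rule reaches min_percent, else None."""
    for domain, pattern in rules.items():
        if conformance_percent(values, pattern) >= min_percent:
            return domain
    return None


# Hypothetical rule set with the data domain from the example.
rules = {"wohnort_plz": PLZ_PATTERN}

# Hypothetical data domain group: a group name mapped to its data domains.
domain_groups = {"Central": ["wohnort_plz"]}

# Hypothetical column: one malformed value and one empty value.
sample = ["80331", "20095", "1011", "50667", ""]
suggestion = suggest_domain(sample, rules, min_percent=70)
```

Three of the four non-empty sample values match, so the column conforms to 75 percent. With a threshold of 70 the sketch suggests wohnort_plz; with a threshold of 80 it returns no suggestion, which corresponds to the case where a data domain has to be assigned manually.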

Example of different data domains:

Example of a Data Domain Group in Enterprise Data Catalog:

EDC is already being used in many projects and is becoming increasingly popular. In our digital world, such applications give companies a valuable advantage by supporting them in handling and efficiently evaluating the flood of data.


*Informatica Catalog Administrator is a tool for monitoring and managing resources, attributes and connections.
