Data lineage is a description of data movements and transformations at various abstraction levels along data chains and relationships between data at these levels. Data lineage ensures the traceability and transparency of data processing, transformation, and integration.
Data lineage helps a company to comply with various legislative requirements, support multiple business and IT changes, and reduce IT maintenance costs.
In our consulting journey, we realized that the implementation of a data management framework and data lineage documentation are two sides of the same coin:
The implementation of a data management framework follows the logic of data lineage documentation and vice versa.
We developed a data lineage metamodel based on the analysis of various industry guidelines and legislations. This model allows linking all DM capabilities and their artifacts. In this context, this data lineage metamodel also serves the purpose of documenting a knowledge graph of data assets. We use this metamodel in our practices.
We summarized our implementation experience in the 3-Phase method to implement a data lineage business case.