With the proliferation of data sharing, data analytics, and the diffusion of data in decision-making, a crucial concern has settled in the minds of decision makers, technologists, and business analysts: How can you trust the data that you rely on? In practice, questions about the origins of data and their processing are answered through the tracing of the provenance and lineage of data. The Support Centre for Data Sharing just published a technical guidance paper on data lineage and provenance to answer this question of data traceability in practice.
It is meant to be an information primer and guide, aimed at domain experts and decision-makers tasked with defining their organisation’s approach to data traceability. Practitioners should gain a more structured, categorical view of the relevant concepts, challenges, and technologies. To realise this objective, the report provides readers with:
- An in-depth understanding of data provenance and lineage;
- A clear sense for common applications areas and challenges; and
- Insights into approaches that allow the tracing of data provenance and lineage.