Introducing end-to-end data lineage (preview) visualization in Amazon DataZone
by Esra Kayabali | on 27 JUN 2024 | in Amazon DataZone, Analytics, Announcements. Feature, Launch, News | Permalink | Comments | Share
Voice by Polly
Amazon DataZone is a data management service to catalog, discover, analyze, share, and govern data between. Data producers and consumers in your organization. Engineers, data scientists, product managers, analysts, and business users can easily access data throughout your. Organization using a unifie data portal so. That they can discover, use, and collaborate to derive data-driven insights.
Now, I am excite to announce in preview a new
API-driven and OpenLineage compatible data Europe Cell Phone Number List lineage capability in Amazon DataZone. Which provides an end-to-end view of data movement over time. Data lineage is a new feature within Amazon. DataZone that helps users visualize and understand data provenance, trace change management. Conduct root cause analysis when a data error is reporte, and be prepare for questions on data movement from source to target. This feature provides a comprehensive view of lineage events. Capture automatically from Amazon DataZone’s catalog along with other events capture programmatically outside of Amazon DataZone by stitching them together for an asset.
When you need to validate how the data of interest originate
The organization, you may rely on manual documentation Afghanistan Phone Number List or human connections. This manual process is time-consuming and can result in inconsistency, which directly reduces your trust in the data. Data lineage in Amazon DataZone can raise trust by helping you understand where the data originated, how it has changed, and its consumption in time. For example, data lineage can be programmatically setup to show the data from the time it was captured as raw files in Amazon Simple Storage Service (Amazon S3), through its ETL transformations using AWS Glue, to the time it was consumed in tools such as Amazon QuickSight.
With Amazon DataZone’s data lineage, you can reduce the time spent mapping a data asset and its relationships, troubleshooting and developing pipelines, and asserting data governance practices. Data lineage helps you gather all lineage information in one place using API, and then provide a graphical view with which data users can be more productive, make better data-driven decisions, and also identify the root cause of data issues.
Let me tell you how to get started with data lineage in Amazon DataZone. Then, I will show you how data lineage enhances the Amazon DataZone data catalog experience by visually displaying connections about how a data asset came to be so you can make informed decisions when searching or using the data asset.