What is Data Quality?

Data quality is a perception or an assessment of how fit data is to serve its purpose in a given context.
It is described along several dimensions, such as:

Correctness / Accuracy: The accuracy of data is the degree to which the captured data correctly describes the real-world entity.

Consistency: This is about a single version of the truth. Consistency means that data throughout the enterprise should be in sync, so the same entity carries the same values in every system that stores it.
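
As a rough illustration of these two dimensions, the sketch below scores accuracy against a trusted reference and lists the records on which two systems disagree. The names crm, billing, reference, accuracy and inconsistent_keys are hypothetical and exist only for this example; in practice the reference would be whatever source the organization treats as the truth.

```python
def accuracy(captured, reference):
    """Fraction of reference records whose captured value matches the reference."""
    if not reference:
        return 0.0
    correct = sum(1 for key, value in reference.items() if captured.get(key) == value)
    return correct / len(reference)


def inconsistent_keys(source_a, source_b):
    """Keys present in both sources whose values differ."""
    shared = source_a.keys() & source_b.keys()
    return sorted(key for key in shared if source_a[key] != source_b[key])


# Hypothetical data: the same customers as stored in two systems,
# plus a trusted reference (an assumption made for this example).
crm = {"c1": "alice@example.com", "c2": "bob@example.com", "c3": "carol@exmaple.com"}
billing = {"c1": "alice@example.com", "c2": "bob@example.org", "c3": "carol@example.com"}
reference = {"c1": "alice@example.com", "c2": "bob@example.com", "c3": "carol@example.com"}

crm_accuracy = accuracy(crm, reference)      # 2 of 3 records match the reference
conflicts = inconsistent_keys(crm, billing)  # records where the two systems disagree
```

Note that the two measures answer different questions: two systems can agree with each other and still both be wrong, so a consistency check does not replace an accuracy check.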

More broadly, aspects of data quality include:

  • Accuracy
  • Completeness
  • Update status
  • Relevance
  • Consistency across data sources
  • Reliability
  • Appropriate presentation
  • Accessibility


Within an organization, acceptable data quality is crucial to operational and transactional processes and to the reliability of business analytics (BA) and business intelligence (BI) reporting. Data quality is affected by the way data is entered, stored, and managed. Data quality assurance (DQA) is the process of verifying the reliability and effectiveness of data.
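
One simple form a DQA check could take is a completeness report over required fields. The sketch below is an assumption-laden example, not a standard tool: the function names completeness and dqa_report, the sample rows, and the 0.75 threshold are all hypothetical.

```python
def completeness(rows, field):
    """Fraction of rows in which `field` is present and non-empty."""
    if not rows:
        return 0.0
    filled = sum(1 for row in rows if row.get(field) not in (None, ""))
    return filled / len(rows)


def dqa_report(rows, required_fields, threshold=0.9):
    """Score each required field and flag whether it meets the threshold."""
    report = {}
    for field in required_fields:
        score = completeness(rows, field)
        report[field] = {"completeness": score, "passed": score >= threshold}
    return report


# Hypothetical customer records with some missing values.
rows = [
    {"id": 1, "name": "Alice", "email": "alice@example.com"},
    {"id": 2, "name": "Bob", "email": ""},
    {"id": 3, "name": "", "email": "carol@example.com"},
    {"id": 4, "name": "Dan", "email": None},
]

# The threshold is an assumed business rule, chosen only for illustration.
report = dqa_report(rows, ["id", "name", "email"], threshold=0.75)
for field, result in report.items():
    print(field, result)
```

A real DQA process would typically add checks for the other aspects listed above, such as validity rules on values and freshness of the last update; completeness is used here only because it is the easiest to measure.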
