Monday, June 27, 2005

 

Data Quality - ความหมาย


เคยคุยเรื่อง Data Quality กับเพื่อนโดยส่วนมากได้ยินแล้วก็บอกว่ามันคืออะไร ก็มีหลายบทความที่ให้ความหมายไว้หลากหลาย วันนี้ไปเจอใน Wikipedia ค่อนข้างจะชอบใจกับความหมายที่ให้ไว้และมีอ้างอิงให้อย่างครบครันครับ

Data quality
From Wikipedia, the free encyclopedia.

Data quality refers to the quality of data. Data are of high quality "if they are fit for their intended uses in operations, decision making and planning" (J.M. Juran). Alternatively, the data are deemed of high quality if they correctly represent the real-world construct to which they refer. These two views can often be in disagreement, even about the same set of data used for the same purpose.

Overview

There are a number of theoretical frameworks for understanding data quality. One framework seeks to integrate the product perspective (conformance to specifications) and the service perspective (meeting consumers' expectations) (Kahn et al 2002). Another framework is based in semiotics to evaluate the quality of the form, meaning and use of the data (Price and Shanks, 2004). One highly theoretical approach analyzes the ontological nature of information systems to define data quality rigorously (Wand and Wang, 1996).

A considerable amount of data quality research involves investigating and describing various categories of desirable attributes (or dimensions) of data. These lists commonly include accuracy, correctness, currency, completeness and relevance. Nearly 200 such terms have been identified and there is little agreement in their nature (are these concepts, goals or criteria?), their definitions or measures (Wang et al, 1993). Software engineers may recognise this as a similar problem to the so-called Ilities.

MIT has a Total Data Quality Management program, led by Professor Richard Wang, which produces a large number of publications and hosts a significant international conference in this field.

In practice, data quality is a concern for professionals involved with a wide range of information systems, ranging from datawarehousing and business intelligence to customer relationship management and supply chain management. One industry study estimated the total cost to the US economy of data quality problems at over US$600 billion per annum (Eckerson, 2002).

The market is going some way to providing data quality assurance. A number of vendors make tools for analysing and repairing poor quality data in situ, service providers can clean the data on a contract basis and consultants can advise on fixing processes or systems to avoid data quality problems in the first place. There are several well-known authors and self-styled experts, with Larry English perhaps the most popular guru.


External links

* SearchCRM.com (http://searchcrm.techtarget.com?offer=Wikipedia) Original daily news, webcasts, expert advice, white papers and more resources on data quality.
* DMReview.com (http://www.dmreview.com/portals/portal.cfm?topicId=230005) Data Quality Portal for practitioners, containing articles, whitepapers and webinars related to data quality.
* MIT's Total Data Quality Management Program (http://web.mit.edu/tdqm/www/index.shtml) With academic and industry courses, publications and conferences.
* International Conference on Information Quality (http://www.iqconference.org/) Annual conference on Data Quality research for practitioners and academics, with previous publications.
* Interactive IQ Explorer (http://ghill.customer.netspace.net.au/iq_attr.html) For the exploration of data quality concepts, dimensions and attributes.
* Larry English's Homepage (http://www.infoimpact.com/) Conferences, books, whitepapers, case studies and consulting - Larry English does it all.
* Eckerson, W. (2002) "Data Warehousing Special Report: Data quality and the bottom line", Article (http://www.adtmag.com/article.asp?id=6321)
* Kahn, B., Strong, D., Wang, R. (2002) "Information Quality Benchmarks: Product and Service Performance," Communications of the ACM, April 2002. pp. 184-192. Article (http://www.iqconference.org/documents/publications/TDQMpub/ KahnStrongWangCACMApr02.pdf)
* Price, R. and Shanks, G. (2004) A Semiotic Information Quality Framework, Proc. IFIP International Conference on Decision Support Systems (DSS2004): Decision Support in an Uncertain and Complex World, Prato. Article (http://vishnu.sims.monash.edu.au:16080/dss2004/proceedings/ pdf/65_Price_Shanks.pdf)
* Redman, T. C. (2004) Data: An Unfolding Quality Disaster Article (http://www.dmreview.com/article_sub.cfm?articleId=1007211)
* Wand, Y. and Wang, R. (1996) “Anchoring Data Quality Dimensions in Ontological Foundations,” Communications of the ACM, November 1996. pp. 86-95. Article (http://www.iqconference.org/documents/publications/TDQMpub/ WandWangCACMNov96.pdf)
* Wang, R., Kon, H. & Madnick, S. (1993), Data Quality Requirements Analysis and Modelling, Ninth International Conference of Data Engineering, Vienna, Austria. Article

This page is powered by Blogger. Isn't yours?