A Framework for the Classification of Data and

download A Framework for the Classification of Data and

of 18

Transcript of A Framework for the Classification of Data and

  • 8/7/2019 A Framework for the Classification of Data and

    1/18

    M. Pamela Neely Business Research Consortium

    April 17, 2010

  • 8/7/2019 A Framework for the Classification of Data and

    2/18

    Agenday Introductiony Other D IQ Research Frameworksy DIQ PMOT Research Frameworky DIQ Research Topicsy Conclusions and Further Research

  • 8/7/2019 A Framework for the Classification of Data and

    3/18

    Introductiony The quality of data and information is recognized as

    an area of concern in both industry and academiay Emerging as an important area of research

    y Journal of Data and Information Q uality y International Journal of Data Q uality y International Conference on Information Q uality y Other practitioner conferences dedicated to D IQ

  • 8/7/2019 A Framework for the Classification of Data and

    4/18

    O ther D IQ Frameworksy A Framework for Analysis of Data Q uality Research

    (Wang, Storey, Firth, 1995)y

    Seven Elementsy Management Responsibilitiesy Operation and Assurance Costsy Research and Developmenty Productiony Distributiony Personnel Managementy Legal Function

  • 8/7/2019 A Framework for the Classification of Data and

    5/18

    O ther D IQ Frameworksy A Product Perspective on Total Data Q uality

    Management (Wang, 1998)y Extends the Information as Product analogy

    y Information Manufacturing System ( ISM) producesy Information Products ( IP)y Which have some value to the consumer

    y Four Roles Identifiedy Information suppliersy Information manufacturersy Information consumersy Information product managers

  • 8/7/2019 A Framework for the Classification of Data and

    6/18

    O ther D IQ Frameworksy Reflections on Data Q uality Research: A Review of the

    Literature since 1995 (Neely, 2005)y Further extends information as product adding fitness-for-

    use characteristics identified by J.M. Juran (1998)y Who are the users of the product or service?y H ow are the users actually putting the product or service to use?y What specific determinants, from a user perspective, make a

    product or service fit for use?y

    What are the ec onomi c res ou rces available to the producer anduser?y Is there a possibility or probability of danger to human s af e ty ?

    y Matrix is developed with Wang s dimensions as one axis andJuran s criteria as the other

  • 8/7/2019 A Framework for the Classification of Data and

    7/18

    O ther D IQ Frameworksy A Framework to Assess Decision Q uality (Ge and

    Helfert, 2007)y Three Categories for Classification

    y Information quality assessmenty Metric layery Dimension layery Methodology layer

    y

    Information quality managementy Contextual information quality y 2x2 matrix developed for classification of papersy Several research questions are posed

  • 8/7/2019 A Framework for the Classification of Data and

    8/18

    O ther D IQ Frameworksy Accounting Information Systems Research: Is it another

    QWERTY? (Poston and Grabski, 2000)y Information Lifecycle perspective

    y Database issuesy Data entry y Data storagey Data viewsy Overall database issues (entry, storage, and views)

    y Data then used to make decisions and support action takeny Decisions with respect to what data should be captured and entered in

    the A IS leads back to data entry y Overarching the whole thing is accountability of the management

    of the A ISy Underpinning it is data verification, supported by internal controls

    and auditing

  • 8/7/2019 A Framework for the Classification of Data and

    9/18

    O ther D IQ Frameworksy Beyond Accuracy: What Data Q uality Means to Data

    Consumers (Wang and Strong, 1996)y

    Definitionaly Four categories of data quality

    y Intrinsicy Contextualy Representationaly Accessibility

    y Fitness-for-use becomes a primary focus to determine thequality of the data

  • 8/7/2019 A Framework for the Classification of Data and

    10/18

    O ther D IQ Frameworksy An investigation of the impact of organization size on

    data quality issues (Nord, Nord and Xu, 2005)y

    Examine critical factors for D IQ in accounting informationsystems, defining stakeholdersy Information producersy Information custodiansy Information consumersy Data/database managersy Internal auditors

  • 8/7/2019 A Framework for the Classification of Data and

    11/18

    O ther D IQ Frameworksy Overview and Framework for Data and Information

    Q uality Research (Madnick, Wang, Lee, and Zhu,

    2009)y Describe a framework for classification of D IQ literaturey Research topicsy Research methods

    y Topics are drawn from the scope list of the Journal of Data andInformation Q uality

    y Eleven research methods broad and narrow are identifiedy 2x2 matrix developed

  • 8/7/2019 A Framework for the Classification of Data and

    12/18

    DIQ PM OT Research Frameworky None of the previously described frameworks

    adequately captures the complexity of the D IQ

    literaturey DIQ touches the entire organizationy DIQ draws on multiple disciplinesy DIQ crosses organizational boundariesy Context, or fitness-for-use, is a critical component in

    determining the quality of data and information

  • 8/7/2019 A Framework for the Classification of Data and

    13/18

    DIQ PM OT Research Frameworky Development of a research framework facilitates

    classification of the body of literaturey

    Provides researchers an opportunity to understandwhat exists and what still needs to be discoveredy Provides a foundation for the generation of research

    questions

  • 8/7/2019 A Framework for the Classification of Data and

    14/18

    DIQ PM OT Research Frameworky Extending previous research the elements of the

    PMOT framework are:y

    P eopley Processes

    y M anagementy perational

    y T echnology

    y Three major research strands ( MOT ) with P eoplebeing an integral part of all three strands

  • 8/7/2019 A Framework for the Classification of Data and

    15/18

    PM OT

    Man

  • 8/7/2019 A Framework for the Classification of Data and

    16/18

    DIQ Research Topics- Sample

    Research Q uestionsPolicies & Procedures

    What are upper management's roles and responsibilities in the creation, approval, and enforcement of D IQ -related policies,

    procedures and systems? What policies and procedures should exist specifically to deal with human safety issues such as privacy,

    accuracy and integrity which are important in fields like healthcare?

    Legal & EthicalWhat legal and regulatory requirements exist that pertain to D IQ (e.g., H IPPA, S OX , & GLB)? What internal controls should exist

    to ensure D IQ (e.g., H IPPA, S OX , & GLB)?

    Personnel ManagementWhat training is necessary and why? What motivates employees to care about D IQ ? What is the economic impact when

    personnel are improperly trained, resulting in poor D IQ ?

    ProductionWhat interventions are available to improve the quality of data (e.g., training, managerial direction, incentives, outlier reports)?What are the cost categories and benefits of production, including data cleaning and integrity checks?

    Distribution

    What tools are available to document the processes and metrics of a distribution system (e.g., IDEF0, IQ Matrix, IASDO Model)?What processes and metrics are or should be a part of the distribution system?What metadata is needed? What processes exist to prevent misdirecting or distributing information to unauthorized people?

    O perations & AssuranceWhat are the economic costs associated with D IQ in an information system, database or accounting system? What are the

    metrics for measuring the operation and assurance costs? What are the most important security threats to D IQ ? What is the

    return on investment of putting data quality systems in place?

    Dimensions & MeasurementWhat data quality dimensions and their corresponding metrics exist or should be monitored? What procedures, surveys andmetrics must be in place to validate and verify D IQ ?

    Systems & DesignWhat technologies and processes are available to manage data uncertainty (e.g., approximate, probabilistic, inexact, incomplete,

    imprecise, fuzzy, inaccurate data)? What costs (e.g., poor decision making, increased legal costs, and bad public relations) are

    associated with not incorporating D IQ into systems design?

    Information Products ( IP)

    What methods exist to measure IP quality? What are the factors that influence people's judgment about IQ (e.g., education,

    experience, website domain, industry, firm size, cognitive authority)? What are the costs and benefits of improving IP? What

    measures are used to show the impact of D IQ with respect to how IP affects human safety (e.g., percentage of loss on retirement

    accounts, investment decisions)?

  • 8/7/2019 A Framework for the Classification of Data and

    17/18

    C onclusionsy Several frameworks have been developed within the

    fieldy

    As the field matures it becomes increasingly clearerthat fitness-for-use is a critical component indetermining quality

    y DIQ reaches across the organization and quality is in

    the eye of the beholdery Understanding both the environment for D IQ and the

    literature that has been conducted in critical tomoving towards answers

  • 8/7/2019 A Framework for the Classification of Data and

    18/18

    Further Researchy Flesh out the list of questionsy Classify the body of literature along several metrics

    y PMOT question(s) addressedy Industry y Reference Discipliney Research Methody Researcher Goalsy Audiencey Key Findings