Data Governance and Big Data - A Necessary …...Confidential End-to-End Operational Model for Big...

9
Data Governance and Big Data - A Necessary Convergence Richard Goldberg Chief Data Governance Officer Citibank Global Consumer Bank

Transcript of Data Governance and Big Data - A Necessary …...Confidential End-to-End Operational Model for Big...

Page 1: Data Governance and Big Data - A Necessary …...Confidential End-to-End Operational Model for Big Data Analytics Management & Governance Controls & Compliance Technology Tools Data

Data Governance and Big Data -

A Necessary Convergence

Richard Goldberg

Chief Data Governance Officer

Citibank Global Consumer Bank

Page 2: Data Governance and Big Data - A Necessary …...Confidential End-to-End Operational Model for Big Data Analytics Management & Governance Controls & Compliance Technology Tools Data

- 1 -

As our businesses continue to expand its use and dependency on big data, it is also

evident that we need to extend the reach of data governance to ensure the data is

properly managed and leveraged within the confines of regulations and associated

business controls

The challenge is when and how to establish a data governance foundation; and what are

the main priorities on managing data in this new horizon

Key areas we will discuss include:

When should data governance become involved in “big data” initiatives?

What are the main issues associated with data governance and “big data”?

What are the main priorities associated with data governance and “big data”?

What is needed to address a data governance foundation around “big data”?

Data Governance and Big Data – A Necessary Convergence

Discussion Focus Area

Page 3: Data Governance and Big Data - A Necessary …...Confidential End-to-End Operational Model for Big Data Analytics Management & Governance Controls & Compliance Technology Tools Data

- 2 -

As The World is Changing….

Data is exploding

Compliance pressure

continues to increase

Business demands

better information for

faster decisions

More data available to

manage

Page 4: Data Governance and Big Data - A Necessary …...Confidential End-to-End Operational Model for Big Data Analytics Management & Governance Controls & Compliance Technology Tools Data

- 3 -

Big Data content doubles every 12 to 18 months!

Spreadsheets, homegrown

databases

Scattered

information

Master data

inconsistency

…..Today’s Data Challenges Continue to Proliferate

Data Governance is needed to

manage this environment

Page 5: Data Governance and Big Data - A Necessary …...Confidential End-to-End Operational Model for Big Data Analytics Management & Governance Controls & Compliance Technology Tools Data

- 4 -

Definition of Data Governance

Data Governance: A formal system of accountability designed to enforce proper management of

data assets and the performance of data functions. Data governance encompasses the actions

(includes people, processes, and technology) used to ensure that key information delivered

throughout the organization is appropriately defined, used and maintained

Data Accessibility

Data Availability

Data Quality

Data Consistency

Data Security

Data Auditability

Standards Policies & Processes Organization

Data Architecture

Data Governance

Data definitions & Taxonomies

Master/ Reference

data

Enterprise data model

Technology & Tools standards

Data definitions Monitoring & measurement

Data access & delivery

Data change management

Rules & Responsibilities

Training & education

Planning & prioritization

Org change management

Business

Page 6: Data Governance and Big Data - A Necessary …...Confidential End-to-End Operational Model for Big Data Analytics Management & Governance Controls & Compliance Technology Tools Data

- 5 -

Why do we need to Govern Data?

Information is a vital enterprise resource and is critical to business success

Consistent and accurate information is fundamental for financial reporting, business

intelligence and measuring the execution of corporate strategies

Regulatory compliance has come to the forefront of information initiatives

Historically, data has not been governed with the same rigor given to other vital assets,

as evidenced by the lack of quality, timeliness, transparency and reuse

The introduction of leveraging unstructured data (“big data”) has increased risk and

exposure if not managed properly

Page 7: Data Governance and Big Data - A Necessary …...Confidential End-to-End Operational Model for Big Data Analytics Management & Governance Controls & Compliance Technology Tools Data

Confidential

End-to-End Operational Model for Big Data

Analytics

Management & Governance Controls &

Compliance

Technology

Tools

Data Governance

Application

Development

Hadoop

Engineering

Data

Cleansing

Data

Transforming

Data

Integration

Operational

Support

Infrastructure

Hadoop

Environment

Big Data

Sandbox

Data

Provisioning

Analytics

Training

Data

Security

Knowledge

Management

Tools

Training

Idea Management

Portfolio

Management

Idea

Inventory

Value

Management Prioritization

Steering

Committees

Data & Analytics

Roadmap

Service

Management

Strategy &

Planning

Community

Management

Program

Management

Project

Management

Analytics

Tools

Data

Ingestion

Execution

Analytics

Execution

Model

Management

Model

Creation

Analytics

Support

Analytics

Peer Review

Analytics

Support

Data

Acquisition

Data

De-Identification Focus for

discussion Data

Privacy

Data

Access

Page 8: Data Governance and Big Data - A Necessary …...Confidential End-to-End Operational Model for Big Data Analytics Management & Governance Controls & Compliance Technology Tools Data

Confidential

Data Governance is Essential in Setting up a Big Data Environment

Data Governance – Big Data

Data

Cleansing

Data

Transforming

Data

Integration

Data

Provisioning

Data

Security

Data

Ingestion

Data

Acquisition

Data

De-Identification

Data

Privacy

Data

Access

Access to data must be approved by local business unit and Data Governance Officer

Access will be on a limited and timed basis

Data analytics will start out as R&D only and may shift to production after approvals

Data Privacy controls will be enforced

There will be no access to PII data

All data needs to be de-identified to avoid ability to link back to account owner

Access will be limited to a security zone (i.e. product/region) and users can see all of the

data sets associated only to that zone

If an analyst needs access to data from another line of Business, Data Governance

Office (DGO) pre-approval is required

Page 9: Data Governance and Big Data - A Necessary …...Confidential End-to-End Operational Model for Big Data Analytics Management & Governance Controls & Compliance Technology Tools Data

Confidential Confidential

Discussion/Questions