News Corp - Data Driven NYC // June 2014 (28)

DATA SCIENCE AT NEWS CORP Rachel Schutt, Chief Data Scientist 24 June, 2014

description

News Corp Chief Data Scientist Rachel Schutt presented at June's edition of Data Driven NYC. News Corp is a global vertically integrated media company with properties in film, television, cable, magazines, newspapers, and publishing.

Transcript of News Corp - Data Driven NYC // June 2014 (28)

Page 1: News Corp - Data Driven NYC // June 2014 (28)

DATA SCIENCE AT NEWS CORP
Rachel Schutt, Chief Data Scientist

24 June, 2014

Page 2: News Corp - Data Driven NYC // June 2014 (28)

DATA DRIVEN NYC / 24 JUNE, 2014

RACHEL SCHUTT

Became Chief Data Scientist at News Corp in October 2013. Previously a Data Scientist at Google in New York; also a published author and a professor at Columbia.

Page 3: News Corp - Data Driven NYC // June 2014 (28)

OUR BUSINESSES

Page 4: News Corp - Data Driven NYC // June 2014 (28)

DATA STRATEGY

• Chief Data Scientist: new role
• Responsible for global data strategy, as a component of the global technology strategy led by CTO Paul Cheesbrough
• Many interesting ways data & journalism intersect
• Building a data culture
• Initial strategy -> Shift in strategy
• In close collaboration with SVP Platforms, Simon Smith

Page 5: News Corp - Data Driven NYC // June 2014 (28)

OUR APPROACH

Make data part of our DNA

People
• Internal experts supported by world-class partners
• A cross-functional team of doers and implementers
• Prefer investment in talented people over tools

Technology
• Best-in-class data tech stack
• Everyone in the team should be able to code
• Create data products in preference to static reports

Values
• Agile: fast moving, focused on business results
• Collaborative engagement model with stakeholders
• Strong design ethos: using visuals to tell stories with data

Data / Technology / Journalism

Page 6: News Corp - Data Driven NYC // June 2014 (28)

WHAT KIND OF DATA DO WE HAVE?

Page 7: News Corp - Data Driven NYC // June 2014 (28)

MOST SYSTEMS PRODUCE LOGS

Commerce, Call Centre, Billing, Sales & Circ, Web Logs, Device Logs, Ad Logs, Social

Page 8: News Corp - Data Driven NYC // June 2014 (28)

WHY ARE LOGS IMPORTANT?

time

User ID   Sub Date     Cancel Date   Status
1         2014-04-01   -             Trialist
2         2014-03-15   -             Subscriber
3         2014-02-15   2014-04-15    Canceled

User1: Signup, Start Trial
User2: Signup, Start Trial, Finish Trial
User3: Signup, Start Trial, Finish Trial, Cancel

ULTIMATE TRUTH

SNAPSHOT IN TIME
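
One way to read this slide: the event log is the "ultimate truth", and the status table is only a "snapshot in time" that can be rebuilt from the log for any date. That pairing is an interpretation; the transcript does not state it explicitly. The sketch below is illustrative only and is not News Corp's implementation. The function name `snapshot`, the `STATUS_AFTER` mapping, and the trial start/finish dates are assumptions; only the signup and cancel dates come from the table on the slide.

```python
from datetime import date

# Event log: (user_id, event, day).
# Signup and cancel dates follow the slide's table; the trial start/finish
# dates are invented for illustration (the slide shows no timestamps).
EVENTS = [
    (3, "Signup", date(2014, 2, 15)),
    (3, "Start Trial", date(2014, 2, 15)),
    (3, "Finish Trial", date(2014, 3, 15)),
    (2, "Signup", date(2014, 3, 15)),
    (2, "Start Trial", date(2014, 3, 15)),
    (1, "Signup", date(2014, 4, 1)),
    (1, "Start Trial", date(2014, 4, 1)),
    (2, "Finish Trial", date(2014, 4, 14)),
    (3, "Cancel", date(2014, 4, 15)),
]

# Assumed mapping from a lifecycle event to the status it leaves the user in.
STATUS_AFTER = {
    "Start Trial": "Trialist",
    "Finish Trial": "Subscriber",
    "Cancel": "Canceled",
}


def snapshot(events, as_of):
    """Fold the event log into one row per user, as of the given day."""
    rows = {}
    # sorted() is stable, so same-day events keep their logged order.
    for user_id, event, day in sorted(events, key=lambda e: e[2]):
        if day > as_of:
            break  # all remaining events happen after the snapshot date
        row = rows.setdefault(
            user_id, {"sub_date": None, "cancel_date": None, "status": None}
        )
        if event == "Signup":
            row["sub_date"] = day
        elif event == "Cancel":
            row["cancel_date"] = day
        if event in STATUS_AFTER:
            row["status"] = STATUS_AFTER[event]
    return rows
```

Calling `snapshot(EVENTS, date(2014, 4, 30))` rebuilds the three rows of the table, while an earlier date such as `date(2014, 3, 31)` gives a different table from the same log. The reverse is not possible: the table alone cannot tell you when User2 finished the trial.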

Page 9: News Corp - Data Driven NYC // June 2014 (28)

OUR APPROACH

Traditional enterprise data warehouse approach…

Page 10: News Corp - Data Driven NYC // June 2014 (28)

OUR APPROACH

Our approach…

Page 11: News Corp - Data Driven NYC // June 2014 (28)

THE DATA SCIENCE PROCESS

Page 12: News Corp - Data Driven NYC // June 2014 (28)

SOME EXAMPLES: DATA SCIENCE IN ACTION

Churn Models
Propensity Analysis
User Behavior Modeling

Page 13: News Corp - Data Driven NYC // June 2014 (28)

DATA + JOURNALISM

• Business side vs news room
• Data Scientists should learn from journalists
• Data Visualization and Infographics in reporting
• NLP in news room
• Data-driven decision making in news room
• Driving high quality traffic that converts to subscriptions
• Paywall model
• Acquisition and retention: predictive modeling

• How can data help shape the future of news?

Page 14: News Corp - Data Driven NYC // June 2014 (28)

WE’RE HIRING [email protected]
