BBC News Labs at Yahoo! Hack Europe

BBC News Labs Yahoo! Hack Europe (London)


BBC News Labs

• Matt Shearer – Lead, BBC News Labs @completedespair

• Jeremy Tarling – Data Architect, BBC News @jeremytarling

• Paul Wilton – Tech Architect, Ontoba Ltd @pwilton

• and Viktor Tron, Matt Haynes, Mark Ransby

BBC News Labs

1. History
2. Now
3. What’s possible
4. APIs

(No live coding on stage, sorry.)

History

• The News Juicer is 1 year old

• 2012: "All we have is a bunch of articles... We need a semantic prototyping platform!"

The News Juicer:

1. Grab BBC News & Sport articles
2. Extract concepts
3. Match to DBpedia
4. Annotate article
5. Push to triplestore
6. Expose via API
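
The six Juicer steps above can be sketched as a toy pipeline. This is only an illustration of the flow, not the Juicer's actual code: the lookup table, the `Article` class, the `example.org` URLs and the `about` predicate are all assumptions made up for the example, and concept extraction is reduced to plain substring matching.

```python
from dataclasses import dataclass, field

# Toy lookup standing in for a real DBpedia matching service (assumption).
DBPEDIA_LOOKUP = {
    "David Cameron": "http://dbpedia.org/resource/David_Cameron",
    "Tesco": "http://dbpedia.org/resource/Tesco",
}


@dataclass
class Article:
    url: str
    text: str
    concepts: list = field(default_factory=list)  # matched DBpedia URIs


def extract_concepts(text):
    """Step 2: naive extraction, substring matching on known labels."""
    return [label for label in DBPEDIA_LOOKUP if label in text]


def match_to_dbpedia(labels):
    """Step 3: map each extracted label to a DBpedia resource URI."""
    return [DBPEDIA_LOOKUP[label] for label in labels]


def annotate(article):
    """Step 4: attach the matched URIs to the article."""
    article.concepts = match_to_dbpedia(extract_concepts(article.text))
    return article


def to_triples(article, predicate="http://example.org/vocab/about"):
    """Step 5: one (subject, predicate, object) triple per tagged concept.

    The predicate URI is hypothetical.
    """
    return [(article.url, predicate, uri) for uri in article.concepts]


# Step 1 would fetch real articles; a hard-coded one is used here.
article = annotate(
    Article("http://example.org/news/1", "David Cameron visited a Tesco store.")
)
store = to_triples(article)  # in-memory stand-in for a triplestore

# Step 6: an API would query the store; here it is simply printed.
for triple in store:
    print(triple)
```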

Now

• 84,400+ News & Sport articles tagged.

• 4 semantic APIs (there were only 3 yesterday)

• bbc.mashery.com (yahoo:hack)

What’s possible

APIs

Jeremy Tarling - Data Architect, BBC News @jeremytarling

The Juicer API has 4 endpoints:

1. article semantic annotation
2. article full text search
3. concept co-occurrence
4. concept search

API 1 - article semantic annotation

• GET a list of BBC News articles by concept

• support for SPARQL queries

• explore the DBpedia graph

• examples:
  o "articles about Conservative politicians"
  o "articles about places within 25 miles of Chester"
  o "articles about companies in the aerospace industry"
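
As a rough sketch of the first example, "articles about Conservative politicians" might be expressed as a SPARQL query that joins article annotations to the DBpedia graph. The slides do not show the Juicer's vocabulary, so the `ex:about` predicate and its `example.org` namespace are hypothetical, and it is an assumption that the party link is exposed through DBpedia's `dbo:party` property. The helper only builds the query string; it does not call any endpoint.

```python
def build_articles_by_party_query(party_uri, limit=10):
    """Build a SPARQL query for articles tagged with members of a party.

    ex:about is a hypothetical annotation predicate; the real Juicer
    vocabulary is not given in the slides.
    """
    return f"""PREFIX dbo: <http://dbpedia.org/ontology/>
PREFIX ex: <http://example.org/vocab/>
SELECT DISTINCT ?article WHERE {{
  ?article ex:about ?person .
  ?person a dbo:Person ;
          dbo:party <{party_uri}> .
}} LIMIT {int(limit)}"""


query = build_articles_by_party_query(
    "http://dbpedia.org/resource/Conservative_Party_(UK)", limit=5
)
print(query)
```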

API 2 - article full text search

• perform full text search of BBC News articles

• filter by section: "politics", "business"

• specify date range, limit + offset

• example: "find 5 articles containing the words 'horsemeat' and 'Tesco' from the UK section since Jan 1st 2012"
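
The horsemeat example could translate into a request URL along these lines. The slides do not give the endpoint path or parameter names, so the base URL and the `q`, `section`, `after`, `limit` and `offset` parameters below are placeholders chosen for illustration; check the API documentation for the real ones.

```python
from urllib.parse import urlencode

# Placeholder endpoint: the real path is not given in the slides.
BASE_URL = "https://api.example.invalid/juicer/articles"


def build_search_url(terms, section=None, since=None, limit=10, offset=0):
    """Assemble a full text search URL; all parameter names are assumed."""
    params = {"q": " ".join(terms), "limit": limit, "offset": offset}
    if section:
        params["section"] = section
    if since:
        params["after"] = since  # ISO date, start of the date range
    return f"{BASE_URL}?{urlencode(params)}"


url = build_search_url(
    ["horsemeat", "Tesco"], section="uk", since="2012-01-01", limit=5
)
print(url)
```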

API 3 - concept co-occurrence

• select a DBpedia concept: <http://dbpedia.org/resource/David_Cameron>

• specify the type: <http://dbpedia.org/ontology/Person>

• returns an ordered list of people who also appear in BBC News articles alongside David Cameron, with their frequencies
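
To make the idea of co-occurrence concrete, here is a small sketch that computes it over invented data. It does not call the API; the article tags and counts are made up, and the function only mirrors the described behaviour: pick a concept, restrict to a type, and rank the other concepts by how many articles they share with it.

```python
from collections import Counter

DBR = "http://dbpedia.org/resource/"
DBO = "http://dbpedia.org/ontology/"

# Invented toy data: concept URIs tagged on each article.
ARTICLES = {
    "a1": {DBR + "David_Cameron", DBR + "Nick_Clegg"},
    "a2": {DBR + "David_Cameron", DBR + "Nick_Clegg", DBR + "Tesco"},
    "a3": {DBR + "David_Cameron", DBR + "Ed_Miliband"},
    "a4": {DBR + "Ed_Miliband"},
}

# Invented type assignments for the concepts above.
TYPES = {
    DBR + "David_Cameron": DBO + "Person",
    DBR + "Nick_Clegg": DBO + "Person",
    DBR + "Ed_Miliband": DBO + "Person",
    DBR + "Tesco": DBO + "Company",
}


def co_occurrences(concept, type_uri):
    """Rank concepts of the given type by articles shared with `concept`."""
    counts = Counter()
    for tags in ARTICLES.values():
        if concept in tags:
            counts.update(
                t for t in tags if t != concept and TYPES.get(t) == type_uri
            )
    return counts.most_common()  # highest frequency first


ranked = co_occurrences(DBR + "David_Cameron", DBO + "Person")
for uri, frequency in ranked:
    print(frequency, uri)
```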

API 4 - find concepts

• full text search for concepts that are tagged on articles

• specify a search term or phrase

• returns OpenSearch Suggestions JSON for the semantic concepts
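
OpenSearch Suggestions JSON is a four-element array: the query string, a list of completions, and optional lists of descriptions and URLs. A minimal parser might look like the sketch below. The sample payload is invented, and it is an assumption that the Juicer places concept types in the descriptions and DBpedia URIs in the URL list.

```python
import json


def parse_suggestions(payload):
    """Turn an OpenSearch Suggestions array into a list of dicts."""
    data = json.loads(payload)
    query, completions = data[0], data[1]
    descriptions = data[2] if len(data) > 2 else []  # optional third array
    urls = data[3] if len(data) > 3 else []          # optional fourth array
    results = []
    for i, label in enumerate(completions):
        results.append({
            "label": label,
            "description": descriptions[i] if i < len(descriptions) else None,
            "url": urls[i] if i < len(urls) else None,
        })
    return query, results


# Invented sample response for the search term "cameron".
sample = json.dumps([
    "cameron",
    ["David Cameron", "James Cameron"],
    ["Person", "Person"],
    [
        "http://dbpedia.org/resource/David_Cameron",
        "http://dbpedia.org/resource/James_Cameron",
    ],
])

query, results = parse_suggestions(sample)
for item in results:
    print(item["label"], item["url"])
```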

Thanks and enjoy

@BBC_News_Labs