Gramsci source@lod2014 roma

Leveraging Linked Data technologies in the GramsciSource project. Christian Morbidoni, Università Politecnica delle Marche, Ancona, Italy. Supported by: http://spaziodati.eu/ , http://netseven.it/ , http://semedia.dibet.univpm.it/ . Monday, 24 February 2014.

description

This presentation was given at LOD2014, Rome. It describes a first prototype for exploring Gramsci's texts, based on semi-automatically extracted data.

Transcript of Gramsci source@lod2014 roma

Page 1: Gramsci source@lod2014 roma

Leveraging Linked Data technologies in the GramsciSource project

Christian Morbidoni
Università Politecnica delle Marche, Ancona, Italy

Supported by

http://spaziodati.eu/
http://netseven.it/
http://semedia.dibet.univpm.it/


Page 2: Gramsci source@lod2014 roma

About the GramsciSource project

• GramsciSource is funded by the MIUR under the FIRB action

• Interdisciplinary

• Humanities Scholarship (Rome, Bologna)

• Computer Science & DH (Ancona, Bologna)

• Extract and create meaningful data about the domain

• Provide tools for scholars (and other readers) that leverage data to search, organize and understand the texts.


Page 3: Gramsci source@lod2014 roma

Context

• The Goal:

• Bring data and data analysis to scholars in the form of tools to explore and understand their subject of study.

• A first simple prototype:

• Browse the texts along different axes...

• Main ingredients: Linked Data + NER


Page 4: Gramsci source@lod2014 roma

Let’s start ... from the end :-)

Try it at: http://goo.gl/PzQNsf


Page 5: Gramsci source@lod2014 roma

The Recipe

Mashup of Domain Dictionary with LOD data, served with Faceted Search


Page 6: Gramsci source@lod2014 roma

The Gramsci LOD Dictionary


Page 7: Gramsci source@lod2014 roma

The Gramsci LOD Dictionary

• A commented dictionary of the Gramsci


Page 8: Gramsci source@lod2014 roma

The Gramsci LOD Dictionary

• A commented dictionary of the Gramsci

• Well known and recognized by the community


Page 9: Gramsci source@lod2014 roma

The Gramsci LOD Dictionary

• A commented dictionary of the Gramsci

• Well known and recognized by the community


Page 10: Gramsci source@lod2014 roma

The Gramsci LOD Dictionary

• A commented dictionary of the Gramsci

• Well known and recognized by the community

• ... RDFized


Page 11: Gramsci source@lod2014 roma

Page 12: Gramsci source@lod2014 roma

Page 13: Gramsci source@lod2014 roma

Page 14: Gramsci source@lod2014 roma

Page 15: Gramsci source@lod2014 roma

Page 16: Gramsci source@lod2014 roma

Page 17: Gramsci source@lod2014 roma

Page 18: Gramsci source@lod2014 roma

purl.org/gramscisource/dictionary/entry/Potere


Page 19: Gramsci source@lod2014 roma

purl.org/gramscisource/dictionary/entry/Potere

http://gramscisource.org/quaderno/13/nota/31


Page 20: Gramsci source@lod2014 roma

purl.org/gramscisource/dictionary/entry/Potere

http://gramscisource.org/quaderno/13/nota/31

cito:cites


Page 21: Gramsci source@lod2014 roma

purl.org/gramscisource/dictionary/entry/Potere

http://gramscisource.org/quaderno/13/nota/31

cito:cites cito:cites
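The two resources and the cito:cites link on this slide can be written down as one RDF triple. The sketch below is only an illustration under stated assumptions: it takes the dictionary entry as the citing resource and the note as the cited one (the transcript does not preserve the arrow direction), it adds an http:// scheme to the entry URI, and the helper name ntriple is hypothetical. The CiTO namespace is the one used in the SPARQL queries later in the deck.

```python
# CiTO namespace, as it appears in the deck's SPARQL queries.
CITO = "http://purl.org/spar/cito/"


def ntriple(subject: str, predicate: str, obj: str) -> str:
    """Serialize one triple of IRIs as a single N-Triples line."""
    return f"<{subject}> <{predicate}> <{obj}> ."


# Assumption: the dictionary entry cites the note, not the other way round.
# Assumption: the entry URI uses the http:// scheme (the slide omits it).
entry = "http://purl.org/gramscisource/dictionary/entry/Potere"
note = "http://gramscisource.org/quaderno/13/nota/31"

line = ntriple(entry, CITO + "cites", note)
print(line)
```

A line in this form can be loaded into most triple stores, which is one plausible way to make the dictionary's citations queryable next to the notes.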


Page 22: Gramsci source@lod2014 roma

Page 23: Gramsci source@lod2014 roma

Page 24: Gramsci source@lod2014 roma

cito:mentions dbpedia:Firenze

cito:mentions dbpedia:Battaglia_di_Caporetto

cito:mentions dbpedia:Esperanto

cito:mentions dbpedia:Don_Chisciotte

cito:mentions dbpedia:Goethe


Page 25: Gramsci source@lod2014 roma

cito:mentions dbpedia:Firenze (City)

cito:mentions dbpedia:Battaglia_di_Caporetto (Event)

cito:mentions dbpedia:Esperanto (Language)

cito:mentions dbpedia:Don_Chisciotte (Book)

cito:mentions dbpedia:Goethe (Person/Writer)
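The entity types on this slide are what make browsing "along different axes" possible: each type can become its own facet. The sketch below shows one way that grouping might look. The pairing of each entity with its type is read off the slide labels and is an assumption, "Lingua" is rendered as "Language", "Person/Writer" is shortened to "Writer", and the naming rule in facet_field is hypothetical (it merely reproduces the field name mentions_book used in the indexing configuration on the following slides).

```python
from collections import defaultdict

# Entity/type pairs as read off the slide (the pairing is an assumption).
MENTIONS = [
    ("dbpedia:Firenze", "City"),
    ("dbpedia:Battaglia_di_Caporetto", "Event"),
    ("dbpedia:Esperanto", "Language"),
    ("dbpedia:Don_Chisciotte", "Book"),
    ("dbpedia:Goethe", "Writer"),
]


def facet_field(type_name: str) -> str:
    """Hypothetical naming rule: 'Book' -> 'mentions_book'."""
    return "mentions_" + type_name.lower()


def group_by_facet(mentions):
    """Group mentioned entities under one facet field per entity type."""
    fields = defaultdict(list)
    for entity, type_name in mentions:
        fields[facet_field(type_name)].append(entity)
    return dict(fields)


facets = group_by_facet(MENTIONS)
for name, entities in sorted(facets.items()):
    print(name, entities)
```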


Page 26: Gramsci source@lod2014 roma

FIELD_NAME = mentions_book
QUERY = select distinct ?uri ?value
where {
  ?uri rdf:type <http://purl.org/gramscisource/ont#Nota>.
  ?text :isPartOf ?uri.
  ?text <http://purl.org/spar/cito/mentions> ?entity.
  ?entity rdf:type <http://dbpedia.org/ontology/Book>.
  ?entity rdfs:label ?value.
}


Page 27: Gramsci source@lod2014 roma

FIELD_NAME = mentions_book
QUERY = select distinct ?uri ?value
where {
  ?uri rdf:type <http://purl.org/gramscisource/ont#Nota>.
  ?text :isPartOf ?uri.
  ?text <http://purl.org/spar/cito/mentions> ?entity.
  ?entity rdf:type <http://dbpedia.org/ontology/Book>.
  ?entity rdfs:label ?value.
}

[Diagram labels: OpenRDF Sesame, SPARQL]


Page 28: Gramsci source@lod2014 roma

FIELD_NAME = mentions_book
QUERY = select distinct ?uri ?value
where {
  ?uri rdf:type <http://purl.org/gramscisource/ont#Nota>.
  ?text :isPartOf ?uri.
  ?text <http://purl.org/spar/cito/mentions> ?entity.
  ?entity rdf:type <http://dbpedia.org/ontology/Book>.
  ?entity rdfs:label ?value.
}

[Diagram labels: OpenRDF Sesame, SPARQL, SPARQL result/Solr doc]


Page 29: Gramsci source@lod2014 roma

FIELD_NAME = mentions_book
QUERY = select distinct ?uri ?value
where {
  ?uri rdf:type <http://purl.org/gramscisource/ont#Nota>.
  ?text :isPartOf ?uri.
  ?text <http://purl.org/spar/cito/mentions> ?entity.
  ?entity rdf:type <http://dbpedia.org/ontology/Book>.
  ?entity rdfs:label ?value.
}

Solr Document

id = pur.org/gramscisource/quaderno/13/nota/31
field = mentions_book_ss
value = Divina Commedia

[Diagram labels: OpenRDF Sesame, SPARQL, SPARQL result/Solr doc]
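The slide pairs a SPARQL query with the Solr document it produces. A minimal sketch of that conversion step follows, assuming the endpoint returns the standard SPARQL JSON results layout. The function name to_solr_docs and the hand-written sample response are hypothetical, and the sample uses the note URI shown on an earlier slide rather than real query output. The "_ss" suffix is assumed to map to a multi-valued string field, as it does in Solr's default dynamic-field rules.

```python
import json
from collections import defaultdict

FIELD_NAME = "mentions_book"

# Hand-written sample in the SPARQL JSON results layout (not real output).
SAMPLE = json.loads("""
{"head": {"vars": ["uri", "value"]},
 "results": {"bindings": [
   {"uri": {"type": "uri",
            "value": "http://gramscisource.org/quaderno/13/nota/31"},
    "value": {"type": "literal", "value": "Divina Commedia"}}
 ]}}
""")


def to_solr_docs(results: dict, field_name: str) -> list:
    """Collect every ?value bound to a ?uri into one multi-valued field."""
    values = defaultdict(list)
    for row in results["results"]["bindings"]:
        values[row["uri"]["value"]].append(row["value"]["value"])
    # One Solr document per note URI; the URI doubles as the document id.
    return [{"id": uri, field_name + "_ss": vals}
            for uri, vals in values.items()]


docs = to_solr_docs(SAMPLE, FIELD_NAME)
print(json.dumps(docs, indent=2))
```

Grouping by ?uri matters because a note that mentions several books yields several result rows, and these should end up as several values of one field rather than several documents.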


Page 30: Gramsci source@lod2014 roma

FIELD_NAME = annotated_by
QUERY = select distinct ?uri ?value
where {
  ?n <http://purl.org/pundit/ont/ao#includes> ?a.
  ?n <http://purl.org/dc/terms/creator> ?auth.
  ?auth <http://xmlns.com/foaf/0.1/name> ?value.
  ?a <http://www.openannotation.org/ns/hasTarget> ?f.
  ?f <http://purl.org/dc/terms/isPartOf> ?uri.
  FILTER regex(str(?uri),"http://gramscisource","i")
}


Page 31: Gramsci source@lod2014 roma

FIELD_NAME = annotated_by
QUERY = select distinct ?uri ?value
where {
  ?n <http://purl.org/pundit/ont/ao#includes> ?a.
  ?n <http://purl.org/dc/terms/creator> ?auth.
  ?auth <http://xmlns.com/foaf/0.1/name> ?value.
  ?a <http://www.openannotation.org/ns/hasTarget> ?f.
  ?f <http://purl.org/dc/terms/isPartOf> ?uri.
  FILTER regex(str(?uri),"http://gramscisource","i")
}

[Diagram labels: Server, SPARQL]


Page 32: Gramsci source@lod2014 roma

FIELD_NAME = annotated_by
QUERY = select distinct ?uri ?value
where {
  ?n <http://purl.org/pundit/ont/ao#includes> ?a.
  ?n <http://purl.org/dc/terms/creator> ?auth.
  ?auth <http://xmlns.com/foaf/0.1/name> ?value.
  ?a <http://www.openannotation.org/ns/hasTarget> ?f.
  ?f <http://purl.org/dc/terms/isPartOf> ?uri.
  FILTER regex(str(?uri),"http://gramscisource","i")
}

Solr Document

id = pur.org/gramscisource/quaderno/13/nota/31
field = annotated_by_ss
value = Christian Morbidoni

[Diagram labels: SPARQL result/Solr doc, Server, SPARQL]
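Once notes are indexed with fields such as mentions_book_ss and annotated_by_ss, the faceted search itself comes down to ordinary Solr request parameters. The sketch below builds such a request URL. The endpoint and the core name "gramsci" are hypothetical (the deck does not give them), the helper name facet_query_url is made up for illustration, and filter values are assumed to contain no double quotes.

```python
from urllib.parse import urlencode

# Hypothetical endpoint: the deck does not state the Solr URL or core name.
SOLR_SELECT = "http://localhost:8983/solr/gramsci/select"


def facet_query_url(facet_fields, filters=None, rows=10):
    """Build a Solr select URL that returns facet counts for the fields."""
    params = [("q", "*:*"), ("rows", str(rows)),
              ("wt", "json"), ("facet", "true")]
    # facet.field may be repeated, once per field to count.
    params += [("facet.field", name) for name in facet_fields]
    # Each fq narrows the results to documents with that exact field value.
    for field, value in (filters or {}).items():
        params.append(("fq", f'{field}:"{value}"'))
    return SOLR_SELECT + "?" + urlencode(params)


url = facet_query_url(
    ["mentions_book_ss", "annotated_by_ss"],
    filters={"annotated_by_ss": "Christian Morbidoni"},
)
print(url)
```

Selecting a facet value in the interface would then correspond to adding one more fq filter and repeating the request.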


Page 33: Gramsci source@lod2014 roma


Page 34: Gramsci source@lod2014 roma

Supported by

http://spaziodati.eu/
http://netseven.it/
http://semedia.dibet.univpm.it/


Page 35: Gramsci source@lod2014 roma

Next steps

Acquire more domain knowledge

Make use of more data from it.DBPedia.org

Explore Knowledge Extraction ...

... as well as quantitative analysis (e.g. n-grams)

Build specialized visualizations (e.g. graph based)
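As a rough idea of what the n-gram analysis mentioned above could involve, the sketch below counts the most frequent word bigrams in a string. It is not part of the project: the function names are hypothetical, the sample sentence is an invented placeholder rather than a quotation, and tokenization is a plain whitespace split.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return the list of consecutive n-token tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def top_ngrams(text, n=2, k=3):
    """Count n-grams over lowercased, whitespace-split tokens."""
    tokens = text.lower().split()
    return Counter(ngrams(tokens, n)).most_common(k)


# Invented placeholder text, not taken from the notebooks.
sample = "the modern prince and the modern state"
print(top_ngrams(sample))
```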


Page 36: Gramsci source@lod2014 roma

Thank you.

@ChrMor
