Ramses An annotated corpus of Late Egyptian texts · of Late Egyptian texts Background information,...

64
+ Ramses: An annotated corpus of Late Egyptian texts Background information, recent developments and work in progress Stéphane Polis (F.R.S.-FNRS – ULg)

Transcript of Ramses An annotated corpus of Late Egyptian texts · of Late Egyptian texts Background information,...

  • +

    Ramses: An annotated corpus of Late Egyptian texts Background information, recent developments and work in progress

    Stéphane Polis (F.R.S.-FNRS – ULg)

  • +Outline of the talk

    n  Background information about the Ramses Project

    2

  • +Outline of the talk

    n  Background information about the Ramses Project

    n  Ramses Online (2015)

    3

  • +Outline of the talk

    n  Background information about the Ramses Project

    n  Ramses Online (2015)

    n  Responsive website based on Bootstrap

    4

  • +Outline of the talk

    n  Background information about the Ramses Project

    n  Ramses Online (2015)

    n  Responsive website based on Bootstrap

    5

  • +Outline of the talk

    n  Background information about the Ramses Project

    n  Ramses Online (2015)

    n  Responsive website based on Bootstrap

    6

  • +Outline of the talk

    n  Background information about the Ramses Project

    n  Ramses Online (2015)

    n  Responsive website based on Bootstrap

    7

  • +Outline of the talk

    n  Background information about the Ramses Project

    n  Ramses Online (2015)

    n  Responsive website based on Bootstrap

    n  With powerful linguistic searching capabilities

    8

  • +Outline of the talk

    n  Background information about the Ramses Project

    n  Ramses Online (2015)

    n  Responsive website based on Bootstrap

    n  With powerful linguistic searching capabilities

    n  In interaction with its users

    9

  • +Outline of the talk

    n  Background information about the Ramses Project

    n  Ramses Online (2015)

    n  Responsive website based on Bootstrap

    n  With powerful linguistic searching capabilities

    n  In interaction with its users

    10

  • +Outline of the talk

    n  Background information about the Ramses Project

    n  Ramses Online (2015)

    n  Responsive website based on Bootstrap

    n  With powerful linguistic searching capabilities

    n  In interaction with its users

    11

  • +Outline of the talk

    n  Background information about the Ramses Project

    n  Ramses Online (2015)

    n  Responsive website based on Bootstrap

    n  With powerful linguistic searching capabilities

    n  In interaction with its users

    12

  • +Outline of the talk

    n  Background information about the Ramses Project

    n  Ramses Online (2015)

    n  Recent developments and work in progress

    13

  • +Outline of the talk

    n  Background information about the Ramses Project

    n  Ramses Online (2015)

    n  Recent developments and work in progress

    n  Event sourcing

    n  TEI interchange format

    n  Ontologies and metadata thesauri

    n  Linked data

    14

  • +The Ramses Project Background information

    15

  • +The Ramses Project

    History n  July 2006

    n  « Informatique & Égyptologie » (Oxford)

    16

  • +The Ramses Project

    Goal n  Build a richly annotated corpus of Late Egyptian texts

    17

    1000 - 3000 - 2000 - 1000 0

  • +The Ramses Project

    Goal n  Build a richly annotated corpus of Late Egyptian texts

    18

    1000 - 3000 - 2000 - 1000 0

  • +The Ramses Project

    Goal n  Build a richly annotated corpus of Late Egyptian texts

    n  Useful both for philologists and linguists

    19

  • +The Ramses Project

    JAVA software (MySQL – texts stored in XML) n  LexiconEditor

    n  TextEditor

    20

  • +The Ramses Project

    What kind of data? n  Hieroglyphic spellings

    21

  • +The Ramses Project

    What kind of data? n  Hieroglyphic spellings

    n  Lemmatization and morphological annotation

    22

  • +The Ramses Project

    What kind of data? n  Hieroglyphic spellings

    n  Lemmatization and morphological annotation

    n  Textual criticism

    23

  • +The Ramses Project

    What kind of data? n  Hieroglyphic spellings

    n  Lemmatization and morphological annotation

    n  Textual criticism

    n  Translation (French / English)

    24

  • +The Ramses Project

    History n  2015

    25

  • +The Ramses Project

    History n  2015

    26

  • +The Ramses Project

    The corpus n  Number of texts

    27

    0

    500

    1000

    1500

    2000

    2500

    3000

    3500

    4000

    4500

    2006 2007 2008 2009 2010 2011 2012 2013 2014

  • +The Ramses Project

    The corpus n  Number of occurrences

    28

    0

    100000

    200000

    300000

    400000

    500000

    600000

    2006 2007 2008 2009 2010 2011 2012 2013 2014

  • +Ramses Online An overview

    29

  • +Ramses Online

    30

  • +Ramses Online

    31

  • + 32

  • + 33 Ramses Online

  • +Ramses Online Simple queries

    34

  • +Ramses Online Simple queries

    35

  • +Ramses Online Simple queries

    36

  • +Ramses Online Complex queries

    37

  • +Ramses Online Complex queries

    38

  • +Ramses Online Interaction with the users

    39

  • +Ramses Online Interaction with the users

    40

  • +Ramses Online Interaction with the users

    41

  • +Ramses Online Interaction with the users

    42

  • +Ramses Online Interaction with the users

    43

  • +Ramses Online Interaction with the users

    44

  • +Ramses Online Interaction with the users

    45

  • +Ramses Online Interaction with the users

    46

  • +Recent developments and work in progress

    47

  • +Recent developments and work in progress

    n  Event sourcing

    48

  • +Recent developments and work in progress

    n  Event sourcing n  Evolution at different levels

    n  Hieroglyphic encoding and annotations

    n  Structure of the database

    49

  • +Recent developments and work in progress

    n  Event sourcing n  Evolution at different levels

    n  Hieroglyphic encoding and annotations

    n  Structure of the database

    n  The history of the database is the database = data are (results of) events

    n  One can visualize the data at any point in time

    n  Search for any type of event

    50

  • +Recent developments and work in progress

    n  Event sourcing

    n  TEI interchange format

    51

  • +Recent developments and work in progress

    n  Event sourcing

    n  TEI interchange format n  Laurent Coulon (EPHE – Paris)

    n  Frederik Elwert (CERES – Bochum)

    n  Emmanuelle Morlock (HiSoMA – CNRS)

    n  Stéphane Polis (F.R.S.-FNRS – Liège)

    n  Vincent Razanajao (ULg – Liège)

    n  Serge Rosmorduc (CNAM – Paris)

    n  Simon Schweitzer (BBAW – Berlin)

    n  Daniel A. Werning (EXC Topoi – Berlin)

    52

  • +Recent developments and work in progress

    n  Event sourcing

    n  TEI interchange format

    53

  • +Recent developments and work in progress

    n  Event sourcing

    n  TEI interchange format

    54

  • Block statue of Ḥr (III), son of Nsr-Jmn. Cairo CG 42230 Upper side – texts on right and left shoulders

    Text layouts /TEI/teiHeader/fileDesc/sourceDesc/msDesc/ physDesc/objectDesc/layoutDesc Deux colonnes de texte sur l'épaule droite Deux colonnes de texte sur l'épaule gauche

    Formule de donation royale

    Abstract text /TEI/teiHeader/fileDesc/sourceDesc/msDesc/msContents/msItem

    dy m ḥsw.t n.ṯ ḫr nswt ḥry sšw ḥw.t-nṯr n pr Ỉmn Ḥr mȝ‘ ḫrw sȝ mỉ n

  • +Name Type Data sample and/or source Use

    Title types ODD ancient, modern /TEI/teiHeader/fileDesc/sourceDesc/msDesc/msContents/msItem/msItem/title/@typeBibliographical types ODD Monograph, Article /TEI/teiHeader/fileDesc/sourceDesc/msDesc/additional/listBibl/bibl/@typeBibliographical subtypes

    ODD in a periodical, i a collected volume

    /TEI/teiHeader/fileDesc/sourceDesc/msDesc/additional/listBibl/bibl/@subtype

    Reference types ODD oeb, Aigyptos /TEI/teiHeader/fileDesc/sourceDesc/msDesc/additional/listBibl/bibl/ref/@typeTitle level types ODD TEI /TEI/teiHeader[1]/fileDesc[1]/sourceDesc[1]/msDesc[1]/additional[1]/listBibl[1]/bibl[2]/title[1]/@levelbiblScope types ODD Issue, page, col. /TEI/teiHeader[1]/fileDesc[1]/sourceDesc[1]/msDesc[1]/additional[1]/listBibl[1]/bibl[3]/biblScope[1]Bibliographical types ODD Monograph, Article /TEI/teiHeader/fileDesc/sourceDesc/msDesc/additional/listBibl/bibl/@typeHand style types ODD Calligraphic cursive /TEI/teiHeader/fileDesc/sourceDesc/msDesc/physDesc/handDesc/handNote/@scriptRole types ODD Finder, Buyer, Seller /TEI/teiHeader/fileDesc/sourceDesc/msDesc/msPart/history/provenance/persName/@role

    /TEI/teiHeader/fileDesc/sourceDesc/msDesc/msPart/history/acquisition/listPerson/person/persName/@roleTypes of list of people ODD Editorial, peopleInDocument,

    deitiesInDocument/TEI/teiHeader/profileDesc/particDesc/listPerson/@type

    Revision status types ODD TEI /TEI/teiHeader/revisionDescText div types ODD edition, translation, apparatus,

    commentary/TEI/text/body/div/@type

    Text line orientations ODD verti-left-to-right, horiz-left-to-right, verti-right-to-left, horiz-right-to-left

     

    Supplied ODD Not in EpiDoc : Idealized, conjecture, haplography

    /TEI/text/body/div/div/ab/s/w//supplied/@reason

    rend for seg ODD Cartouche /TEI/text/body/div/div/ab/s/seg/@rendType of seg ODD oval, serekh, hwt /TEI/text/body/div/div/ab/s/seg/@typeSubtypes of seg ODD full, opening, closing /TEI/text/body/div/div/ab/s/seg/@subtypeText location types THS +

    ODD  /TEI/teiHeader/fileDesc/sourceDesc/msDesc/msContents/msItem/locus/@scheme

    /TEI/teiHeader/fileDesc/sourceDesc/msDesc/physDesc/objectDesc/supportDesc/support/list/item/@anaLanguage types THS +

    ODDOld Egyptian, Middle Egyptian, Late Egyptian

    /TEI/teiHeader/fileDesc/sourceDesc/msDesc/msContents/msItem/msItem/textLang/@mainLang /TEI/teiHeader/fileDesc/sourceDesc/msDesc/msContents/msItem/msItem/textLang/@otherLangs /TEI/teiHeader/profileDesc/langUsage/language/@ident /TEI/teiHeader/profileDesc/particDesc/listPerson/person/persName/@xml:lang

    Script types THS + ODD

    Hieroglyphic, hieratic, demotic /TEI/teiHeader/fileDesc/sourceDesc/msDesc/physDesc/scriptDesc/scriptNote/@scriptRef

    Writing tool THS + ODD

    Qalam, cisel /TEI/teiHeader/fileDesc/sourceDesc/msDesc/physDesc/scriptDesc/scriptNote/@medium

    Material THS + ODD

    Eagle /TEI/teiHeader/fileDesc/sourceDesc/msDesc/physDesc/objectDesc/supportDesc/support/material/@ref

    Preservation state types

    THS + ODD

    Eagle /TEI/teiHeader/fileDesc/sourceDesc/msDesc/physDesc/objectDesc/supportDesc/condition/rs/@ref

    Text types THS Documentary, literary, /TEI/teiHeader/fileDesc/sourceDesc/msDesc/msContents/msItem/@anaObject types THS Statue, ostracon, papyrus, temple

    wall, doorjamb/TEI/teiHeader/fileDesc/sourceDesc/msDesc/physDesc/objectDesc/supportDesc/support/objectType/@ref

    Dates THS Dynasty 18, Amenhotep III, Roman Period

    /TEI/teiHeader/fileDesc/sourceDesc/msDesc/history/origin/origDate/date/@datingPoint

    Types of provenance THS Excavations, market /TEI/teiHeader/fileDesc/sourceDesc/msDesc/history/provenance/@type /TEI/teiHeader/fileDesc/sourceDesc/msDesc/msPart/history/provenance/@type

    Repositories THS Egyptian Museum Cairo, Louvre /TEI/teiHeader/fileDesc/sourceDesc/msDesc/msPart/msIdentifier/repository/@refInv. numbering types THS JE (Cairo Museum), CG (Cairo

    Museum)/TEI/teiHeader/fileDesc/sourceDesc/msDesc/msPart/msIdentifier/idno/@type

  • +Name Type Data sample and/or source Use

    Title types ODD ancient, modern /TEI/teiHeader/fileDesc/sourceDesc/msDesc/msContents/msItem/msItem/title/@typeBibliographical types ODD Monograph, Article /TEI/teiHeader/fileDesc/sourceDesc/msDesc/additional/listBibl/bibl/@typeBibliographical subtypes

    ODD in a periodical, i a collected volume

    /TEI/teiHeader/fileDesc/sourceDesc/msDesc/additional/listBibl/bibl/@subtype

    Reference types ODD oeb, Aigyptos /TEI/teiHeader/fileDesc/sourceDesc/msDesc/additional/listBibl/bibl/ref/@typeTitle level types ODD TEI /TEI/teiHeader[1]/fileDesc[1]/sourceDesc[1]/msDesc[1]/additional[1]/listBibl[1]/bibl[2]/title[1]/@levelbiblScope types ODD Issue, page, col. /TEI/teiHeader[1]/fileDesc[1]/sourceDesc[1]/msDesc[1]/additional[1]/listBibl[1]/bibl[3]/biblScope[1]Bibliographical types ODD Monograph, Article /TEI/teiHeader/fileDesc/sourceDesc/msDesc/additional/listBibl/bibl/@typeHand style types ODD Calligraphic cursive /TEI/teiHeader/fileDesc/sourceDesc/msDesc/physDesc/handDesc/handNote/@scriptRole types ODD Finder, Buyer, Seller /TEI/teiHeader/fileDesc/sourceDesc/msDesc/msPart/history/provenance/persName/@role

    /TEI/teiHeader/fileDesc/sourceDesc/msDesc/msPart/history/acquisition/listPerson/person/persName/@roleTypes of list of people ODD Editorial, peopleInDocument,

    deitiesInDocument/TEI/teiHeader/profileDesc/particDesc/listPerson/@type

    Revision status types ODD TEI /TEI/teiHeader/revisionDescText div types ODD edition, translation, apparatus,

    commentary/TEI/text/body/div/@type

    Text line orientations ODD verti-left-to-right, horiz-left-to-right, verti-right-to-left, horiz-right-to-left

     

    Supplied ODD Not in EpiDoc : Idealized, conjecture, haplography

    /TEI/text/body/div/div/ab/s/w//supplied/@reason

    rend for seg ODD Cartouche /TEI/text/body/div/div/ab/s/seg/@rendType of seg ODD oval, serekh, hwt /TEI/text/body/div/div/ab/s/seg/@typeSubtypes of seg ODD full, opening, closing /TEI/text/body/div/div/ab/s/seg/@subtypeText location types THS +

    ODD  /TEI/teiHeader/fileDesc/sourceDesc/msDesc/msContents/msItem/locus/@scheme

    /TEI/teiHeader/fileDesc/sourceDesc/msDesc/physDesc/objectDesc/supportDesc/support/list/item/@anaLanguage types THS +

    ODDOld Egyptian, Middle Egyptian, Late Egyptian

    /TEI/teiHeader/fileDesc/sourceDesc/msDesc/msContents/msItem/msItem/textLang/@mainLang /TEI/teiHeader/fileDesc/sourceDesc/msDesc/msContents/msItem/msItem/textLang/@otherLangs /TEI/teiHeader/profileDesc/langUsage/language/@ident /TEI/teiHeader/profileDesc/particDesc/listPerson/person/persName/@xml:lang

    Script types THS + ODD

    Hieroglyphic, hieratic, demotic /TEI/teiHeader/fileDesc/sourceDesc/msDesc/physDesc/scriptDesc/scriptNote/@scriptRef

    Writing tool THS + ODD

    Qalam, cisel /TEI/teiHeader/fileDesc/sourceDesc/msDesc/physDesc/scriptDesc/scriptNote/@medium

    Material THS + ODD

    Eagle /TEI/teiHeader/fileDesc/sourceDesc/msDesc/physDesc/objectDesc/supportDesc/support/material/@ref

    Preservation state types

    THS + ODD

    Eagle /TEI/teiHeader/fileDesc/sourceDesc/msDesc/physDesc/objectDesc/supportDesc/condition/rs/@ref

    Text types THS Documentary, literary, /TEI/teiHeader/fileDesc/sourceDesc/msDesc/msContents/msItem/@anaObject types THS Statue, ostracon, papyrus, temple

    wall, doorjamb/TEI/teiHeader/fileDesc/sourceDesc/msDesc/physDesc/objectDesc/supportDesc/support/objectType/@ref

    Dates THS Dynasty 18, Amenhotep III, Roman Period

    /TEI/teiHeader/fileDesc/sourceDesc/msDesc/history/origin/origDate/date/@datingPoint

    Types of provenance THS Excavations, market /TEI/teiHeader/fileDesc/sourceDesc/msDesc/history/provenance/@type /TEI/teiHeader/fileDesc/sourceDesc/msDesc/msPart/history/provenance/@type

    Repositories THS Egyptian Museum Cairo, Louvre /TEI/teiHeader/fileDesc/sourceDesc/msDesc/msPart/msIdentifier/repository/@refInv. numbering types THS JE (Cairo Museum), CG (Cairo

    Museum)/TEI/teiHeader/fileDesc/sourceDesc/msDesc/msPart/msIdentifier/idno/@type

  • +Recent developments and work in progress

    n  Event sourcing

    n  TEI exchange format

    n  Thesauri and ontologies

    58

  • 59

  • +Recent developments and work in progress

    n  Event sourcing

    n  TEI exchange format

    n  Thesauri and ontologies

    n  Linked data

    60

  • +Recent developments and work in progress

    n  Event sourcing

    n  TEI exchange format

    n  Thesauri and ontologies

    n  Linked data

    61

  • +Recent developments and work in progress

    n  Event sourcing

    n  TEI exchange format

    n  Thesauri and ontologies

    n  Linked data

    62

  • +

    http://ramses.ulg.ac.be

    63

  • +

    Thanks!

    64