The Role of Ontology in the Era of Big Military Data

Distributed Common Ground System – Army (DCGS-A) Barry Smith Director National Center for Ontological Research The Role of Ontology in the Era of Big (Military) Data 1

description

Ontology and information integration for military intelligence http://ncor.buffalo.edu/OI2/

Transcript of The Role of Ontology in the Era of Big Military Data

Page 1: The Role of Ontology in the Era of Big Military Data

Distributed Common Ground System – Army (DCGS-A)

Barry Smith, Director

National Center for Ontological Research

The Role of Ontology in the Era of Big (Military) Data

1

Page 2: The Role of Ontology in the Era of Big Military Data

Distributed Development of a Shared Semantic Resource (SSR)

in support of US Army’s Distributed Common Ground System Standard Cloud (DSC) initiative

with thanks to: Tanya Malyuta, Ron Rudnicki

Background materials: http://x.co/yYxN

2

Page 3: The Role of Ontology in the Era of Big Military Data

3

Page 4: The Role of Ontology in the Era of Big Military Data

4

Making data (re-)usable through common controlled vocabularies

• Allow multiple databases to be treated as if they were a single data source by eliminating terminological redundancy in the ways data are described
– not ‘Person’, and ‘Human’, and ‘Human Being’, and ‘Pn’, and ‘HB’, but simply: person
• Allow development and use of common tools and techniques, common training, single validation of data, focused around
– semantic technology
– coordinated ontology development and use
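The idea of replacing many local labels with one controlled term can be sketched in a few lines. This is only an illustration: the synonym table, the record layout, and the function names are assumptions, not part of the slides.

```python
# Hypothetical synonym table: every local label maps to one preferred term.
PREFERRED_TERM = {
    "person": "person",
    "human": "person",
    "human being": "person",
    "pn": "person",
    "hb": "person",
}


def normalize_label(label: str) -> str:
    """Return the controlled-vocabulary term for a local label.

    Unknown labels are returned lower-cased but otherwise unchanged, so gaps
    in the vocabulary stay visible instead of being silently guessed.
    """
    key = label.strip().lower()
    return PREFERRED_TERM.get(key, key)


def merge_sources(*sources):
    """Merge records from several sources, rewriting only the 'type' field."""
    merged = []
    for source in sources:
        for record in source:
            merged.append({**record, "type": normalize_label(record["type"])})
    return merged


# Two hypothetical databases that describe the same kind of entity differently.
db_a = [{"id": "a1", "type": "Human Being"}]
db_b = [{"id": "b7", "type": "Pn"}, {"id": "b8", "type": "HB"}]

combined = merge_sources(db_a, db_b)
persons = [r["id"] for r in combined if r["type"] == "person"]
```

Once the labels are normalized, a single query for `person` reaches records from both databases, which is the sense in which they behave like one data source.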

Page 5: The Role of Ontology in the Era of Big Military Data

5

Ontology =def.

• controlled vocabulary organized as a graph
• nodes in the graph are terms representing types in reality
• each node is associated with definition and synonyms
• edges in the graph represent well-defined relations between these types
• the graph is structured hierarchically via subtype relations
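The definition above maps naturally onto a small data structure. The following is a minimal sketch under stated assumptions: the class names, the `is_a` relation label, and the example terms and definitions are illustrative and are not taken from any published ontology.

```python
from dataclasses import dataclass, field


@dataclass
class Term:
    """A node: a term representing a type, with its definition and synonyms."""
    name: str
    definition: str
    synonyms: list = field(default_factory=list)


class Ontology:
    def __init__(self):
        self.terms = {}   # name -> Term
        self.edges = []   # (subject, relation, object) triples

    def add_term(self, term):
        self.terms[term.name] = term

    def relate(self, subject, relation, obj):
        # Edges are only allowed between terms that are already defined.
        if subject not in self.terms or obj not in self.terms:
            raise KeyError("both terms must exist before relating them")
        self.edges.append((subject, relation, obj))

    def ancestors(self, name):
        """All supertypes reachable by following is_a edges upward."""
        found, frontier = [], [name]
        while frontier:
            current = frontier.pop()
            for s, rel, o in self.edges:
                if s == current and rel == "is_a" and o not in found:
                    found.append(o)
                    frontier.append(o)
        return found


# Hypothetical example terms; the definitions are placeholders.
onto = Ontology()
onto.add_term(Term("entity", "anything that exists"))
onto.add_term(Term("organism", "a living entity"))
onto.add_term(Term("person", "a human organism", synonyms=["Human", "Human Being"]))
onto.relate("organism", "is_a", "entity")
onto.relate("person", "is_a", "organism")

supertypes = onto.ancestors("person")
```

Keeping synonyms on the node, rather than as separate nodes, is what lets one term stand in for all of its local spellings.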

Page 6: The Role of Ontology in the Era of Big Military Data

6

Ontologies

• computer-tractable representations of types in specific areas of reality

• divided into more and less general
– upper = organizing ontologies, provide common architecture and thus promote interoperability
– lower = domain ontologies, provide grounding in reality
• reflecting top-down and bottom-up strategy

Page 7: The Role of Ontology in the Era of Big Military Data

8

Success story in biomedicine

Goal: integration of biological and clinical data
– across different species
– across levels of granularity (organ, organism, cell, molecule)
– across different perspectives (physical, biological, clinical)
– within and across domains (growth, aging, environment, genetic disease, toxicity …)

Page 8: The Role of Ontology in the Era of Big Military Data

The Open Biomedical Ontologies (OBO) Foundry

GRANULARITY (rows) by RELATION TO TIME (columns): CONTINUANT (INDEPENDENT, DEPENDENT), OCCURRENT

ORGAN AND ORGANISM
– Independent continuant: Organism (NCBI Taxonomy); Anatomical Entity (FMA, CARO)
– Dependent continuant: Organ Function (FMP, CPRO); Phenotypic Quality (PaTO)
– Occurrent: Biological Process (GO)

CELL AND CELLULAR COMPONENT
– Independent continuant: Cell (CL); Cellular Component (FMA, GO)
– Dependent continuant: Cellular Function (GO)

MOLECULE
– Independent continuant: Molecule (ChEBI, SO, RnaO, PrO)
– Dependent continuant: Molecular Function (GO)
– Occurrent: Molecular Process (GO)

9

Page 9: The Role of Ontology in the Era of Big Military Data

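Population-level ontologies

GRANULARITY (rows) by RELATION TO TIME (columns): CONTINUANT (INDEPENDENT, DEPENDENT), OCCURRENT

COMPLEX OF ORGANISMS
– Independent continuant: Family, Community, Population
– Dependent continuant: Population Phenotype
– Occurrent: Population Process

ORGAN AND ORGANISM
– Independent continuant: Organism (NCBI Taxonomy); Anatomical Entity (FMA, CARO)
– Dependent continuant: Organ Function (FMP, CPRO); Phenotypic Quality (PaTO)
– Occurrent: Biological Process (GO)

CELL AND CELLULAR COMPONENT
– Independent continuant: Cell (CL); Cellular Component (FMA, GO)
– Dependent continuant: Cellular Function (GO)

MOLECULE
– Independent continuant: Molecule (ChEBI, SO, RnaO, PrO)
– Dependent continuant: Molecular Function (GO)
– Occurrent: Molecular Process (GO)

10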

Page 10: The Role of Ontology in the Era of Big Military Data

Environment Ontology

GRANULARITY (rows) by RELATION TO TIME (columns): CONTINUANT (INDEPENDENT, DEPENDENT), OCCURRENT

ORGAN AND ORGANISM
– Independent continuant: Organism (NCBI Taxonomy); Anatomical Entity (FMA, CARO)
– Dependent continuant: Organ Function (FMP, CPRO); Phenotypic Quality (PaTO)
– Occurrent: Biological Process (GO)

CELL AND CELLULAR COMPONENT
– Independent continuant: Cell (CL); Cellular Component (FMA, GO)
– Dependent continuant: Cellular Function (GO)

MOLECULE
– Independent continuant: Molecule (ChEBI, SO, RnaO, PrO)
– Dependent continuant: Molecular Function (GO)
– Occurrent: Molecular Process (GO)

11

Page 11: The Role of Ontology in the Era of Big Military Data

rationale of OBO Foundry coverage

GRANULARITY (rows) by RELATION TO TIME (columns): CONTINUANT (INDEPENDENT, DEPENDENT), OCCURRENT

ORGAN AND ORGANISM
– Independent continuant: Organism (NCBI Taxonomy); Anatomical Entity (FMA, CARO)
– Dependent continuant: Organ Function (FMP, CPRO); Phenotypic Quality (PaTO)
– Occurrent: Organism-Level Process (GO)

CELL AND CELLULAR COMPONENT
– Independent continuant: Cell (CL); Cellular Component (FMA, GO)
– Dependent continuant: Cellular Function (GO)
– Occurrent: Cellular Process (GO)

MOLECULE
– Independent continuant: Molecule (ChEBI, SO, RNAO, PRO)
– Dependent continuant: Molecular Function (GO)
– Occurrent: Molecular Process (GO)

12

Page 12: The Role of Ontology in the Era of Big Military Data

OBO Foundry approach extended into other domains

13

• NIF Standard: Neuroscience Information Framework
• ISF Ontologies: Integrated Semantic Framework
• OGMS and Extensions: Ontology for General Medical Science
• IDO Consortium: Infectious Disease Ontology
• cROP: Common Reference Ontologies for Plants

Page 13: The Role of Ontology in the Era of Big Military Data

Modular organization + Extension strategy

top level
– Basic Formal Ontology (BFO)

domain level
– Anatomy Ontology (FMA*, CARO)
– Environment Ontology (EnvO)
– Infectious Disease Ontology (IDO*)
– Biological Process Ontology (GO*)
– Cell Ontology (CL)
– Cellular Component Ontology (FMA*, GO*)
– Phenotypic Quality Ontology (PaTO)
– Subcellular Anatomy Ontology (SAO)
– Sequence Ontology (SO*)
– Molecular Function (GO*)
– Protein Ontology (PRO*)

14

Page 14: The Role of Ontology in the Era of Big Military Data

~100 ontologies using BFO

US Army Biometrics Ontology

Brucella Ontology (IDO-BRU)

eagle-i and VIVO (NCRR)

Financial Report Ontology (to support SEC through XBRL)

IDO Infectious Disease Ontology (NIAID)

Malaria Ontology (IDO-MAL)

Nanoparticle Ontology (NPO)

Ontology for Risks Against Patient Safety (RAPS/REMINE)

Parasite Experiment Ontology (PEO)

Subcellular Anatomy Ontology (SAO) 

Vaccine Ontology (VO)

…

15

Page 15: The Role of Ontology in the Era of Big Military Data

Basic Formal Ontology

BFO:Entity
  BFO:Continuant
    BFO:Independent Continuant
    BFO:Dependent Continuant
      BFO:Disposition
  BFO:Occurrent
    BFO:Process

16

Page 16: The Role of Ontology in the Era of Big Military Data

Basic Formal Ontology and Mental Functioning Ontology (MFO)

[Diagram. Legend: BFO, MFO]
– BFO types: BFO:Entity, BFO:Continuant, BFO:Occurrent, BFO:Process, BFO:Independent Continuant, BFO:Dependent Continuant, BFO:Quality, BFO:Disposition
– MFO types: Organism, Behaviour inducing state, Mental Functioning Related Anatomical Structure, Cognitive Representation, Affective Representation, Mental Process, Bodily Process

17

Page 17: The Role of Ontology in the Era of Big Military Data

Emotion Ontology extends MFO

[Diagram. Legend: BFO, MFO, MFO-EM]
– BFO types: BFO:Entity, BFO:Continuant, BFO:Occurrent, BFO:Process, BFO:Independent Continuant, BFO:Dependent Continuant, BFO:Disposition
– MFO types: Organism, Cognitive Representation, Affective Representation, Mental Process, Bodily Process
– MFO-EM types: Emotion Occurrent, Emotional Action Tendencies, Appraisal, Subjective Emotional Feeling, Physiological Response to Emotion Process, Emotional Behavioural Process, Appraisal Process
– Relations shown: inheres_in, is_output_of, has_part, agent_of

Page 18: The Role of Ontology in the Era of Big Military Data

19

Sample from Emotion Ontology: Types of Feeling

Page 19: The Role of Ontology in the Era of Big Military Data

The problem of joint / coalition operations

• Fire Support
• Logistics
• Air Operations
• Intelligence
• Civil-Military Operations
• Targeting
• Maneuver & Blue Force Tracking

23

Page 20: The Role of Ontology in the Era of Big Military Data

US DoD Civil Affairs strategy for non-classified information sharing

24

Page 21: The Role of Ontology in the Era of Big Military Data

Ontologies / semantic technology can help to solve this problem

• Fire Support
• Logistics
• Air Operations
• Intelligence
• Civil-Military Operations
• Targeting
• Maneuver & Blue Force Tracking

25

Page 22: The Role of Ontology in the Era of Big Military Data

But if each community produces its own ontology, this will merely create new, semantic siloes

• Fire Support
• Logistics
• Air Operations
• Intelligence
• Civil-Military Operations
• Targeting
• Maneuver & Blue Force Tracking

26

Page 23: The Role of Ontology in the Era of Big Military Data

27

What we are doing to avoid the problem of semantic siloes

Distributed Development of a Shared Semantic Resource

Pilot testing to demonstrate feasibility

Page 24: The Role of Ontology in the Era of Big Military Data

creating the analog of this in the military domain

top level
– Basic Formal Ontology (BFO)

domain level
– Anatomy Ontology (FMA*, CARO)
– Environment Ontology (EnvO)
– Infectious Disease Ontology (IDO*)
– Biological Process Ontology (GO*)
– Cell Ontology (CL)
– Cellular Component Ontology (FMA*, GO*)
– Phenotypic Quality Ontology (PaTO)
– Subcellular Anatomy Ontology (SAO)
– Sequence Ontology (SO*)
– Molecular Function (GO*)
– Protein Ontology (PRO*)

28

Page 25: The Role of Ontology in the Era of Big Military Data

Semantic Enhancement

Annotation (tagging) of source data models using terms from coordinated ontologies
– data remain in their original state (are treated at arm's length)
– tagged using interoperable ontologies created in tandem
– can be as complete as needed, lossless, long-lasting because flexible and responsive
– big bang for buck – measurable benefit even from first small investments

Coordination through shared governance and training

29
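A minimal sketch of the arm's-length idea follows: the source rows are never rewritten, and the tags that link source columns to ontology terms live alongside them. The source names, column names, and the `PersonName` term are hypothetical.

```python
# Each hypothetical source keeps its own column names; only the tags are shared.
SOURCES = {
    "db_a": {
        "rows": [{"PersNm": "J. Doe"}],
        "tags": {"PersNm": "PersonName"},      # source column -> ontology term
    },
    "db_b": {
        "rows": [{"full_name": "A. Roe"}],
        "tags": {"full_name": "PersonName"},
    },
}


def values_for(term, sources):
    """Collect (source, value) pairs for every column tagged with the term.

    The source rows are only read, never modified.
    """
    hits = []
    for name, source in sources.items():
        for column, tag in source["tags"].items():
            if tag == term:
                hits.extend(
                    (name, row[column]) for row in source["rows"] if column in row
                )
    return hits


names = values_for("PersonName", SOURCES)
```

Because the tags are separate from the data, they can be added incrementally, which is one way to read the claim that even small investments bring a measurable benefit.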

Page 26: The Role of Ontology in the Era of Big Military Data

Main challenge: Will it scale?

The problem of scalability turns on
• the ability to accommodate ever increasing volumes and types of data and numbers of users
• can we preserve coordination (consistency, non-redundancy) as ever more domains become involved?
• can we respond in agile fashion to ever changing bodies of source data?

31

Page 27: The Role of Ontology in the Era of Big Military Data

Strategy for agile ontology creation

• Identify or create carefully validated general purpose plug-and-play reference ontology modules for principal domains

• Develop a method whereby these reference ontologies can be extended very easily to cope with specific, local data through creation of application ontologies

32

Page 28: The Role of Ontology in the Era of Big Military Data

vehicle =def. an object used for transporting people or goods

tractor =def. a vehicle that is used for towing

crane =def. a vehicle that is used for lifting and moving heavy objects

vehicle platform =def. a means of providing mobility to a vehicle

wheeled platform =def. a vehicle platform that provides mobility through the use of wheels

tracked platform =def. a vehicle platform that provides mobility through the use of continuous tracks

artillery vehicle =def. a vehicle designed for the transport of one or more artillery weapons

wheeled tractor =def. a tractor that has a wheeled platform

Russian wheeled tractor type T33 =def. a wheeled tractor of type T33 manufactured in Russia

Ukrainian wheeled tractor type T33 =def. a wheeled tractor of type T33 manufactured in Ukraine

Reference Ontology Application Ontology
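The way an application ontology extends a reference ontology can be sketched with the vehicle terms above. Where the slide draws the line between the two levels is not recoverable from the text, so the split used here is an assumption, as are the function names.

```python
# Assumed reference-level terms: name -> (parent, definition).
REFERENCE = {
    "vehicle": (None, "an object used for transporting people or goods"),
    "tractor": ("vehicle", "a vehicle that is used for towing"),
    "wheeled tractor": ("tractor", "a tractor that has a wheeled platform"),
}

# Assumed application-level terms: they add to the reference module, never edit it.
APPLICATION = {
    "Russian wheeled tractor type T33": (
        "wheeled tractor",
        "a wheeled tractor of type T33 manufactured in Russia",
    ),
    "Ukrainian wheeled tractor type T33": (
        "wheeled tractor",
        "a wheeled tractor of type T33 manufactured in Ukraine",
    ),
}


def extend(reference, application):
    """Return the combined ontology, refusing redefinitions and dangling parents."""
    clash = set(reference) & set(application)
    if clash:
        raise ValueError(f"application redefines reference terms: {sorted(clash)}")
    combined = {**reference, **application}
    for name, (parent, _) in application.items():
        if parent not in combined:
            raise ValueError(f"{name!r} has unknown parent {parent!r}")
    return combined


def is_a(term, ancestor, ontology):
    """Walk the parent chain to test whether term is a subtype of ancestor."""
    while term is not None:
        if term == ancestor:
            return True
        term = ontology[term][0]
    return False


combined = extend(REFERENCE, APPLICATION)
```

Each definition names its parent and then adds a differentiating condition, so every local term inherits what the reference terms already say.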

Page 30: The Role of Ontology in the Era of Big Military Data

AIRS Reference Ontologies

• Basic Formal Ontology (BFO)
• Extended Relation Ontology
• Time Ontology
• Quality Ontology
• Information Entity Ontology
• Geospatial Ontology
• Event Ontology
• Artifact Ontology
• Agent Ontology

Page 31: The Role of Ontology in the Era of Big Military Data

Agent Ontology

Social Network, Skills, and Occupations

Page 32: The Role of Ontology in the Era of Big Military Data

Event Ontology

Actions, Natural Events and Time-Dependent Attributes

Page 33: The Role of Ontology in the Era of Big Military Data

Geospatial Ontology

Regions, Geopolitical Entities, Geographic Features, and Locations

Page 34: The Role of Ontology in the Era of Big Military Data

40

http://milportal.org

Page 35: The Role of Ontology in the Era of Big Military Data

41

Page 36: The Role of Ontology in the Era of Big Military Data

42

Page 37: The Role of Ontology in the Era of Big Military Data

43

Page 38: The Role of Ontology in the Era of Big Military Data

An example of agile application ontology development:

The Bioweapons Ontology (BWO)

44

Page 39: The Role of Ontology in the Era of Big Military Data

Kinds of chemical and biological weapons

Chemical
– Nerve agents (sarin gas)
– Blister agents (mustard gas)
– Blood agents (cyanide gas)

Biological
– Infectious agents – BWO(I)
– Toxic agents (botulinum toxin, ricin) – BWO(T)

45

Page 40: The Role of Ontology in the Era of Big Military Data

We focus here on BWO(I)

Infectious agents
– Bacterial (anthrax, bubonic plague, tularemia, brucellosis, cholera …)
– Viral (Ebola, Marburg …)

46

Page 41: The Role of Ontology in the Era of Big Military Data

Examples of ontology terms

BFO: Independent Continuant
– IDO: Infectious disorder
– StaphIDO: Staph. aureus disorder

BFO: Dependent Continuant
– IDO: Infectious disease; Protective resistance
– StaphIDO: MRSA; Methicillin resistance

BFO: Occurrent
– IDO: Infectious disease course
– StaphIDO: MRSA course

47

Page 42: The Role of Ontology in the Era of Big Military Data

Infectious Disease Ontology (IDO)

IDO Core (Reference Ontology)
• General terms in the ID domain.

IDO Extensions (Application Ontologies)
• Disease-, host-, pathogen-specific.
• Developed by subject matter experts.

The hub-and-spokes strategy ensures that logical content of IDO Core is automatically inherited by the IDO Extensions

• with thanks to Lindsay Cowell (University of Texas SW Medical Center) and Albert Goldfain (Blue Highway, Inc.)
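One way to picture the automatic inheritance in the hub-and-spokes strategy is a layered lookup in which an extension consults its own terms first and falls back to the core. The sketch below uses Python's `collections.ChainMap` as a stand-in for an ontology import. The term sets are illustrative, and the `host` term added at the end is a hypothetical later addition to the core.

```python
from collections import ChainMap

# Hub: general terms, each mapped to the ontology that owns it.
ido_core = {
    "colonization": "IDO Core",
    "pathogen": "IDO Core",
    "infection": "IDO Core",
}

# Spoke: extension-specific terms layered on top of the hub.
staph_terms = {
    "MRSA": "StaphIDO",
    "methicillin resistance": "StaphIDO",
}

# Lookups try the extension first, then fall through to the core.
staph_ido = ChainMap(staph_terms, ido_core)

# A later, hypothetical addition to the core is visible in the extension
# immediately, because the ChainMap holds a reference to the core mapping.
ido_core["host"] = "IDO Core"

owner_of_pathogen = staph_ido["pathogen"]
owner_of_mrsa = staph_ido["MRSA"]
host_visible = "host" in staph_ido
```

The extension never copies core content, so there is no second version of a core term that could drift out of step.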

Page 43: The Role of Ontology in the Era of Big Military Data

IDO Core

• Contains general terms in the ID domain:
– E.g., ‘colonization’, ‘pathogen’, ‘infection’
• A contract between IDO extension ontologies and the datasets that use them.
• Intended to represent information along several dimensions:
– biological scale (gene, cell, organ, organism, population)
– discipline (clinical, immunological, microbiological)
– organisms involved (host, pathogen, and vector types)

Page 45: The Role of Ontology in the Era of Big Military Data

IDO Extensions

• IDO – Brucellosis
• IDO – Dengue Fever
• IDO – Influenza
• IDO – Malaria
• IDO – Staphylococcus Aureus Bacteremia
• IDO – Vector Surveillance and Management
• IDO – Plant
• VO – Vaccine Ontology
• BWO(I) – Bioweapons Ontology (Infectious Agents)

51

Page 46: The Role of Ontology in the Era of Big Military Data

How IDO evolves: the case of Staph. aureus

[Diagram nodes: IDOCore, IDOSa, IDOHumanSa, IDORatSa, IDOStrep, IDORatStrep, IDOHumanStrep, IDOMRSa, IDOHumanBacterial, IDOAntibioticResistant, IDOMAL, IDOHIV, IDOFLU]

HUB and SPOKES: Domain ontologies

SEMI-LATTICE: By subject matter experts in different communities of interest.

Page 47: The Role of Ontology in the Era of Big Military Data
Page 48: The Role of Ontology in the Era of Big Military Data

54

Page 49: The Role of Ontology in the Era of Big Military Data
Page 50: The Role of Ontology in the Era of Big Military Data

BWO:disease by infectious agent = def. a disease that is the consequence of the presence of pathogenic microbial agents, including pathogenic viruses, pathogenic bacteria, fungi, protozoa, multicellular parasites, and aberrant proteins known as prions

Page 51: The Role of Ontology in the Era of Big Military Data

Strategy used to build BWO(I)

with thanks to Lindsay Cowell and Oliver He (Michigan)

1. Start with a glossary such as: http://www.emedicinehealth.com/biological_warfare/

2. Select the corresponding terms needed to describe bioweapons from IDO Core and related ontologies such as the ChEBI chemistry ontology

3. All ontology terms keep their original definitions and IDs.

4. The result is a spreadsheet

57

Page 52: The Role of Ontology in the Era of Big Military Data

5. Where glossary terms have no ontology equivalent, create BWO ontology terms and definitions as needed

58

no corresponding ontology term

Page 53: The Role of Ontology in the Era of Big Military Data

6. Use the Ontofox tool to create the first version of the BWO(I) application ontology (http://ontofox.hegroup.org/)

7. Use BWO(I) in annotations, and where gaps are identified create extension terms, for instance
– weaponized brucella
– aerosol anthrax
– smallpox incubation period

This establishes a virtuous cycle between ontology development and use in annotations
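Steps 1 to 5 above can be sketched as a small script that matches glossary terms against existing ontology terms and writes the result as a spreadsheet. Everything named here is an assumption for illustration: the glossary list, the lookup table, the `EXAMPLE:` identifiers (placeholders, not real ontology IDs), and the `BWO:` numbering scheme. The sketch does not call Ontofox. It only covers the spreadsheet that precedes step 6.

```python
import csv
import io

# Step 1: hypothetical glossary terms.
GLOSSARY = ["anthrax", "brucellosis", "weaponized brucella"]

# Step 2: stand-in for terms found in IDO Core and related ontologies.
# The identifiers and definitions are placeholders.
EXISTING_TERMS = {
    "anthrax": ("EXAMPLE:0001", "definition as given in the source ontology"),
    "brucellosis": ("EXAMPLE:0002", "definition as given in the source ontology"),
}


def build_rows(glossary, existing, prefix="BWO"):
    """Return one spreadsheet row per glossary term."""
    rows, next_id = [], 1
    for term in glossary:
        if term in existing:
            # Step 3: reused terms keep their original ID and definition.
            term_id, definition = existing[term]
            origin = "reused"
        else:
            # Step 5: no ontology equivalent, so a new term is created.
            term_id = f"{prefix}:{next_id:07d}"
            next_id += 1
            definition = ""  # left for a subject matter expert to write
            origin = "new"
        rows.append(
            {"label": term, "id": term_id, "definition": definition, "origin": origin}
        )
    return rows


def to_csv(rows):
    """Step 4: serialize the rows as a CSV spreadsheet."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=["label", "id", "definition", "origin"]
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


rows = build_rows(GLOSSARY, EXISTING_TERMS)
sheet = to_csv(rows)
```

Rows marked `new` are the ones that feed the cycle described above: gaps found during annotation become candidate extension terms.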

59

Page 54: The Role of Ontology in the Era of Big Military Data

Potential uses of BWO

– semantic enhancement of bioweapons intelligence data
– results will be automatically interoperable with relevant bioinformatics and public health IT tools for dealing with infections, epidemics, vaccines, forensics, …
– to annotate research literature and research data on bioweapons
– to create computable definitions to substitute for definitions in free text glossaries

60

Page 55: The Role of Ontology in the Era of Big Military Data

Why do people think they need lexicons?

• Training
• Compiling lessons learned
• Compiling results of testing, e.g. of proposed new doctrine
• Collective inferencing
• Official reporting
• Doctrinal development
• Standard operating procedures
• Sharing of data
• People need to (ensure that they) understand each other