Nuxeo Iks 2009 11 13

8
Olivier Grisel - 2009-11-13 - IKS Semantic Lifting Named Entities Extraction with UIMA Thursday, November 12, 2009

description

Short introductionary slides introducing some of the work done on the Scribo project to extract Named Entities in textual documents with a UIMA engine.

Transcript of Nuxeo Iks 2009 11 13

Page 1: Nuxeo Iks 2009 11 13

Olivier Grisel - 2009-11-13 - IKS

Semantic LiftingNamed Entities Extraction with UIMA

Thursday, November 12, 2009

Page 2: Nuxeo Iks 2009 11 13

Nuxeo

• Open Source ECM

• Nuxeo DM 5.3 available

• office document management with workspaces

• download it at http://nuxeo.com

• Soon: Nuxeo DAM

• Multimedia content

• Full ajax search based browsing

2

Thursday, November 12, 2009

Page 3: Nuxeo Iks 2009 11 13

http://SCRIBO.ws

• Goal: content to knowledge using ontologies

• 3 academic research teams

• 2 NLP startups

• 2 Open Source ECM / Wiki software editors

• 2 use case providers:

• News agency

• Linux distribution

3

Thursday, November 12, 2009

Page 4: Nuxeo Iks 2009 11 13

UIMA

• Chain components to extract annotations on text and images

• Initially developed by IBM

• Now an Apache Software Foundation project

• Several existing components (OpenNLP, ClearTK, ...)

• Easy to wrap new libraries as UIMA annotators

4

Thursday, November 12, 2009

Page 5: Nuxeo Iks 2009 11 13

Scribo UIMA chain

5

Thursday, November 12, 2009

Page 6: Nuxeo Iks 2009 11 13

Scribo UIMA chain editor

6

Thursday, November 12, 2009

Page 7: Nuxeo Iks 2009 11 13

Embedded UIMA chain

7

Thursday, November 12, 2009

Page 8: Nuxeo Iks 2009 11 13

It’s Open Source

• Clone it!

• http://hg.nuxeo.org/sandbox/scribo

• http://hg.nuxeo.org/sandbox/nuxeo-uima

• Give me feedback!

• http://twitter.com/ogrisel

8

Thursday, November 12, 2009