Semantic Search in E-Discovery

18
Semantic Search in E-Discovery David Graus Research on the application of text mining and information retrieval for fact finding in regulatory investigations

description

 

Transcript of Semantic Search in E-Discovery

Page 1: Semantic Search in E-Discovery

Semantic Search in E-Discovery

David Graus

Research on the application of text mining and information retrieval for fact finding in regulatory investigations

Page 2: Semantic Search in E-Discovery

Semantic search in e-discovery

Who’s Involved?

2

Prof. dr. Maarten de RijkeDirector Intelligent Systems Lab, UvA

David van Dijk, MSc.Researcher E-Discovery, CREATE-IT applied research

Dr. Hans HenselerLector E-Discovery, CREATE-IT applied research

Menno Israël, MSc.Teamleader Knowledge and Expertise Centre for Intelligent Data Analysis (Kecida), NFI

David Graus, MSc.PhD Candidate, Semantic Search in E-Discovery, UvA

Zhaochun Ren, MSc.PhD Candidate, Semantic Search in E-Discovery, UvA

Page 3: Semantic Search in E-Discovery

Semantic search in e-discovery

Introduction

£ Semantic Search in E-Discovery

3

Page 4: Semantic Search in E-Discovery

Semantic search in e-discovery

What is

£ Semantic Search in E-Discovery� retrieving and securing digital forensic evidence

4

Page 5: Semantic Search in E-Discovery

Semantic search in e-discovery

What is

£ Semantic Search in E-Discovery

5

Page 6: Semantic Search in E-Discovery

Semantic search in e-discovery

What is

£ Semantic Search in E-Discovery� retrieving and securing digital forensic evidence� from emails, forums, etc...

6

Page 7: Semantic Search in E-Discovery

Semantic search in e-discovery

What is

£ Semantic Search in e-Discovery

7

Page 8: Semantic Search in E-Discovery

Semantic search in e-discovery

Challenge

8

¢ Finding out who knew what, from whom, and when

Page 9: Semantic Search in E-Discovery

Semantic search in e-discovery

Challenge

9

¢ Finding out who knew what, from whom, and when¢ Generic search is not the answer

Page 10: Semantic Search in E-Discovery

Semantic search in e-discovery

Finding evidence for E-Discovery

10

¢ We don’t know what we’re looking for¢ What we’re looking for might be deliberately hidden¢ Communication might be very domain-specific,

contextualized or incomplete

Page 11: Semantic Search in E-Discovery

Semantic search in e-discovery

Task

11

¢ Retrieve all relevant traces¢ Highly iterative search process¢ Support (re)formulating questions and hypotheses

Page 12: Semantic Search in E-Discovery

Semantic search in e-discovery

How do we approach this?

¢ Two subprojects:£ Information Retrieval

� Finding material of unstructured nature from large collections£ Information Extraction/Text Mining

� Discovering patterns in data

12

Page 13: Semantic Search in E-Discovery

Semantic search in e-discovery

How do we approach this?

¢ Information Retrieval£ Integrating structure/context of data in retrieval models

� Capturing forum and email context� Conversational search

13

Page 14: Semantic Search in E-Discovery

Semantic search in e-discovery

How do we approach this?

¢ Information Extraction/Text Mining£ Extracting structured knowledge from user generated

content� Semantic pre-processing� Social network inference� Information maps

14

Page 15: Semantic Search in E-Discovery

Semantic search in e-discovery

How do we approach this?

¢ Information Retrieval <-> Information Extraction

15

Page 16: Semantic Search in E-Discovery

Semantic search in e-discovery

Current work (first steps)

¢ Information Retrieval£ Twitter Mining (as a form of conversational search)

¢ Information Extraction/Text Mining£ Entity linking (for semantic document enrichment)

¢ TREC/TAC benchmarking events£ TREC Legal Track 2011 (2013?)

16

Page 17: Semantic Search in E-Discovery

Semantic search in e-discovery

Contributions

¢ xTAS: Open source text analysis toolkit¢ iColumbo: Internet monitoring framework¢ Used by:

£ Internet Recherche Netwerk£ Koninklijke Bibliotheek£ Beeld en Geluid£ ... You?

17

Page 18: Semantic Search in E-Discovery

Semantic search in e-discovery

Semantic search in E-discovery

¢ David Graus¢ [email protected]

18