Technology Assisted Review: Moving Beyond the First Generation of E-Discovery Review

Technology Assisted Review Moving Beyond the First Generation

John Tredennick CEO/Founder

Catalyst

§  1,800 Exabytes

§  1.8 million Petabytes

§  1.8 billion Terabytes

§  1.8 trillion Gigabytes

§  1.8 quadrillion Megabytes

1.8 Zettabytes a year

Library of Congress—30 Terabytes

Exploding Content >> Big Data

Sixty Million Libraries of Congress each year!

60 million libraries a year...

... and growing

2003" 2004" 2005" 2006" 2007" 2008" 2009" 2010" 2011" 2012"

Case Size (in Gigabytes)

Big Data >> Big Discovery

Telling Stories 1.  Your job has not changed. 2.  But it has gotten a bit harder. . .

þ  Find the story

þ  Tell the story

þ  Prove the story

Is This New?

We Already Use It

Predictive Ranking

What is the Process? 1.  Assemble your files

Shredding the Documents

What is the Process? 1.  Assemble your files 2.  Add seed documents to the mix 3.  Analyze seeds and rank similar

documents

How Does it Work?

§  Support Vector Machines §  Naïve Bayes §  K-Nearest Neighbor §  Geospatial Predictive Modeling §  Latent Semantic

"I may be less interested in the science behind the "black box” than in whether it produced responsive documents with reasonably high recall and high precision.“ Peck, M.J. (SDNY)

What Goes on Under the Hood?

The computer builds a big, complex search!

What terms are most likely to be associated with good documents?

What terms are most likely to be associated with bad documents?

What is the Process? 1.  Assemble your files 2.  Add seed documents to the mix 3.  Analyze seeds and rank similar

documents 4.  Test results and provide more

samples—iterative process 5.  Order review by ranking

Cut Point

Ranking a Document Set

Understanding the Savings

0%# 10%# 20%# 30%# 40%# 50%# 60%# 70%# 80%# 90%# 100%#

Percen

tage)of

vant)Docum

ents)Foun

call))

Percentage)of)Documents)Reviewed)

Yield)Curve)

Percentage of relevant documents found

Number of documents in the review

Linear Review

0%# 10%# 20%# 30%# 40%# 50%# 60%# 70%# 80%# 90%# 100%#

Yield&Curve&

%&of&Documents&

Review 12% and get 80% recall

0%# 10%# 20%# 30%# 40%# 50%# 60%# 70%# 80%# 90%# 100%#

Yield&Curve&

%&of&Documents&

Review 25% and get 95% recall

12,000

10,000

10,000 20,000 30,000 40,000 50,000 60,000 70,000 80,000 90,000

Reviewed

Wellington F Responsive Review

80% Recall Review 29,248

100% (Linear) Review 85,725

12,000

10,000

10,000 20,000 30,000 40,000 50,000 60,000 70,000 80,000 90,000

Reviewed

Wellington F Responsive Review

100% (Linear) Review 85,725

Predict(Review 80%(Recall 95%(RecallResponsive 9,168 10,887Reviewed 29,248 39,112Reduction 56,477 46,613Saving<($4<Doc) $225,908< $186,452<

1.  You only get one bite at the apple.

2.  Subject matter experts are required for training.

3.  You must train on randomly selected documents.

4.  You can’t start TAR training until you have all of your documents.

5.  TAR doesn’t work on foreign (Asian) language documents.

6.  TAR doesn’t work with sparse collections.

The Five Myths of TAR

Technology Assisted Review: Moving Beyond the First Generation of E-Discovery Review

Law

Transcript of Technology Assisted Review: Moving Beyond the First Generation of E-Discovery Review

M – 33 THE EFFECT OF MEDIA-ASSISTED GUIDED DISCOVERY ...

Removing limitations, redeﬁ ning discovery - EY · PDF fileRemoving limitations, redeﬁ ning discovery EY Technology Assisted Review: compelling, transparent and defensible

Judicial Acceptance of Technology Assisted Review (TAR)

Technology-assisted Review Hands-on Workshopilta.personifycloud.com/webfiles/productfiles/1501902/HAND3.pdf · Technology-assisted Review Hands-on Workshop ... All machine learning

2008 Assisted Living State Regulatory Review

CCNA Discovery Curriculum Review

REVIEW PAPER Model-assisted integration of physiological ...

Computer-Assisted Keyword and Document Set Discovery from Unstructured … · 2016-07-05 · Computer-Assisted Keyword and Document Set Discovery from Unstructured Text Gary Kingy

2016 Assisted Living State Regulatory Review

2011 Assisted Living State Regulatory Review

Physician Assisted Suicide: An Unbiased Review

Learning-assisted automated planning: review, appraisal ...

The E-Discovery Games: A Closer Look at Technology Assisted Document Review David D. Lewis, Ph.D., Information Retrieval Consultant

For Peer Review - UCL Discovery - UCL Discovery

Overview - BCS Technology Assisted Review

IN HCBS Redesign Data Report: Review of Assisted Living ... Assisted Living Data Report... · IN HCBS Redesign Data Report: Review of Assisted Living Services Introduction In April

Computer‐Assisted Keyword and Document Set Discovery from ... · Computer-Assisted Keyword and Document Set Discovery from Unstructured Text Gary King Harvard University Patrick

A Review of Laser-Assisted Versus Traditional ......REVIEW A Review of Laser-Assisted Versus Traditional Phacoemulsiﬁcation Cataract Surgery H. Burkhard Dick. Tim Schultz Received:

Assisted Discovery of On-Chip Debug Interfaces Joe Grand - Black Hat

Computer-Assisted Reading and Discovery for Student-Generated ...