© Relativity. All rights reserved.
7 Days of Playing Minesweeper, or
How to Shut Down Whistleblower Defense with Analytics
Elise Tropiano, Senior Technical Product Manager
Elise Tropiano
Senior Technical Product Manager, Analytics
Agenda
• Who We Are
• The Problems We Solve
• How We Solve These Problems
• Case Study
Who we are
• Fast-growing legal tech company
• Unstructured big data platform enhanced with advanced analytics, machine learning, and powerful visualizations
• 800+ employees worldwide
• Headquartered in Chicago, with offices in London, Kraków, Hong Kong, and Melbourne
Welcome to our Kraków office
• Product Innovation Center
• Opened in September 2015
• Focus on data transfer solutions
• Team growing to as many as 100 people this year
Relativity helps manage and analyze data
relevant to litigation and investigations
organize data
discover the truth
act on it
data problems we solve
We live in a world where there is an electronic trail of evidence in every potential
litigation. Every sent email or document created could be relevant in a trial.
Scale of a Hypothetical Large Case
500 people potentially involved
x 100 emails sent per person per day
x 180 working days a year
x 5 years
= 45,000,000 emails
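The estimate on this slide is a straight multiplication. A quick sketch of the arithmetic, where the variable names are illustrative and the four figures come from the slide:

```python
# Back-of-the-envelope email volume for the hypothetical large case.
# Variable names are illustrative; the four inputs are the slide's figures.
people = 500            # people potentially involved
emails_per_day = 100    # emails sent per person per day
working_days = 180      # working days per year
years = 5

total_emails = people * emails_per_day * working_days * years
print(f"{total_emails:,} emails")  # 45,000,000 emails
```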
It all could be potentially relevant in litigation.
And it’s not just emails that are relevant…
750M+ files
in the largest case in Relativity
our solution
SaaS platform
our platform
search & search workflow
machine learning
email analytics
workflow & applications
repository
data analytics & reporting
whistleblower or extortionist?
Drinker Biddle is a full-service law firm providing
litigation, regulatory and business solutions to public
and private corporations, multinational Fortune 100
companies and start-ups.
• When a senior executive for a publicly
traded company was fired for
underperformance, he made a serious
allegation on his way out the door.
• He claimed he was laid off because of
his repeated attempts to inform
officials that the company was
falsifying quarterly financial reports to
the public.
Two Options to Handle This Case
Traditional Approach
Kick off a long list of tasks for the
company, including waiting for a lawyer
to send them a demand letter, gearing up
for defense, coaxing out the facts as
knowledge evolves, and possibly settling
the case before even getting to the truth.
Cost: $1,500,000+
Analytical Approach
Start an internal investigation, figure
out exactly what happened, and decide
how to handle it.
Cost: $80,000
Investigation Timeline
Days 1-3
• Collected multi-sourced data
– One million emails + thousands of complex financial reports
• Relativity Analytics: Email threading
– Grouped email conversations and found initial set of relevant
meeting notes and presentations
• Relativity Analytics: Keyword expansion
– Ran terms found in these meeting notes through keyword
expansion to find additional relevant terms
• Relativity Analytics: Clustering
– Grouped documents into conceptually similar groups
– Prioritized the review of clusters containing relevant documents
to find additional sources of intelligence
What ACTUALLY Happened?
The former employee had been emailing
with his wife on his personal
account and forwarding those
emails to his work email.
The emails contained keywords and
phrases relevant to the investigation,
so Drinker Biddle was able to make a
case to collect from the former employee’s
personal email and load that into
Relativity for further review.
Drinker Biddle found evidence of the
former employee working with his wife,
an employment attorney, to develop a
case against the company, and they were
able to prove he was drafting emails
about fraudulent accounting before the
quarterly numbers were recorded.
Relativity Analytics helped shut down
a whistleblower defense in 7 days,
saving an estimated $1.5M+
How We Did It
Email Threading
1. DIVIDE emails into segments and understand each segment’s metadata.
2. COMBINE emails into conversation threads.
3. IDENTIFY inclusive emails for optimal efficiency.
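The three steps can be sketched in a few lines of Python. This is a simplified illustration, not Relativity's implementation. It assumes each quoted message starts with a "From:" line, that the newest message sits on top, and that a thread can be keyed on the subject with reply prefixes removed. The function names (divide, combine, inclusive) are invented here, and the sample messages are modeled on the demo emails in this deck.

```python
import re
from collections import defaultdict

HEADER = re.compile(r"^(From|Sent|To|Subject):\s*(.*)$")

def divide(raw):
    """Step 1: split a raw email into segments (newest first) and parse each segment's headers."""
    segments = []
    for line in raw.strip().splitlines():
        match = HEADER.match(line.strip())
        if match and match.group(1) == "From":
            segments.append({"headers": {}, "text": []})  # a "From:" line opens a new segment
        if not segments:
            continue  # ignore anything before the first "From:" line
        if match:
            segments[-1]["headers"][match.group(1)] = match.group(2)
        elif line.strip():
            segments[-1]["text"].append(line.strip())
    return segments

def fingerprint(segment):
    """Identify a segment by sender, sent time, and whitespace-normalized body text."""
    headers = segment["headers"]
    return (headers.get("From"), headers.get("Sent"), " ".join(segment["text"]))

def thread_key(segments):
    """Subject of the newest segment with Re:/Fw:/Fwd: prefixes removed (a simplification)."""
    subject = segments[0]["headers"].get("Subject", "")
    return re.sub(r"^((re|fw|fwd):\s*)+", "", subject.strip(), flags=re.I).lower()

def combine(raw_emails):
    """Step 2: group emails into conversation threads."""
    threads = defaultdict(list)
    for raw in raw_emails:
        segments = divide(raw)
        if not segments:
            continue
        threads[thread_key(segments)].append(tuple(fingerprint(s) for s in segments))
    return threads

def inclusive(emails):
    """Step 3: keep emails whose segments are not fully quoted at the end of another email."""
    def is_proper_suffix(short, long):
        return len(short) < len(long) and long[-len(short):] == short
    return [e for e in emails if not any(is_proper_suffix(e, other) for other in emails)]

original = """From: Brandon Gauthier
Sent: Tuesday, November 24, 2015 7:50 AM
To: Michael Di Salvo
Subject: Demo Email
I need some email data for a demo, can you reply
back to this email. Thanks!"""

reply = """From: Michael Di Salvo
Sent: Tuesday, November 24, 2015 9:52 AM
To: Brandon Gauthier
Subject: Re: Demo Email
This is me, replying to your e-mail.
You sir, are very welcome.
""" + original

thread = combine([original, reply])["demo email"]
to_review = inclusive(thread)
print(len(thread), "emails in thread,", len(to_review), "inclusive")  # 2 emails in thread, 1 inclusive
```

Because the reply quotes the original in full, a reviewer who reads only the inclusive email still sees every segment of the conversation.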
From: Brandon Gauthier
Sent: Tuesday, November 24, 2015 7:50 AM
To: Michael Di Salvo
Subject: Demo Email
I need some email data for a demo, can you reply
back to this email. Thanks!
From: Michael Di Salvo
Sent: Tuesday, November 24, 2015 9:52 AM
To: Brandon Gauthier
Subject: Re: Demo Email
This is me, replying to your e-mail.
You sir, are very welcome.
From: Brandon Gauthier
Sent: Tuesday, November 24, 2015 7:53 AM
To: Michael Di Salvo
Subject: Re: Demo Email
One more time! This way we can show email
segments better.
From: Michael Di Salvo
Sent: Tuesday, November 24, 2015 9:54 AM
To: Brandon Gauthier
Subject: Re: Demo Email
This is me, creating an additional segment.
From: Brandon Gauthier
Sent: Tuesday, November 24, 2015 7:50 AM
To: Michael Di Salvo
Subject: Demo Email
I need some email data for a demo, can you reply
back to this email. Thanks!
From: Michael Di Salvo
Sent: Tuesday, November 24, 2015 9:52 AM
To: Brandon Gauthier
Subject: Re: Demo Email
This is me, replying to your e-mail.
You sir, are very welcome.
How We Did It
Keyword Expansion
1. CREATE multi-dimensional space from terms in document text.
2. IDENTIFY terms that are conceptually similar to a user-provided query.
3. RETURN the terms for augmented searching.
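One way to picture these steps is a small sketch in which each term becomes a vector of its counts across documents and terms are ranked by cosine similarity to the query term. This is a stand-in technique chosen for illustration; the slide does not say how Relativity Analytics builds its space. The function names and the four toy documents are invented for this example.

```python
import math
import re
from collections import Counter, defaultdict

def build_term_space(documents):
    """Step 1: represent each term as a vector of its counts across documents."""
    space = defaultdict(Counter)
    for doc_id, text in enumerate(documents):
        for term in re.findall(r"[a-z][a-z-]+", text.lower()):
            space[term][doc_id] += 1
    return space

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def expand(space, query, top_n=3):
    """Steps 2 and 3: rank other terms by similarity to the query term and return the best ones."""
    if query not in space:
        return []
    scores = [(cosine(space[query], vec), term) for term, vec in space.items() if term != query]
    scores.sort(key=lambda pair: (-pair[0], pair[1]))  # highest score first, ties alphabetical
    return [term for score, term in scores[:top_n] if score > 0]

# Toy corpus, made up for this sketch.
documents = [
    "quarterly revenue forecast restated before earnings call",
    "restated revenue numbers for the quarterly earnings report",
    "lunch menu for the office party",
    "office party moved to friday",
]

space = build_term_space(documents)
print(expand(space, "revenue"))  # ['earnings', 'quarterly', 'restated']
```

The returned terms can then be added to the original search, which is the "augmented searching" the slide refers to.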
“Personally-held”
How We Did It
Clustering
1. INDEX documents into multi-dimensional space.
2. HIERARCHICALLY GROUP documents into conceptually similar groups using the document text.
3. VISUALIZE clusters for promoting corpus understanding and searching.
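A minimal sketch of the same three steps, assuming term-count vectors for the index and average-linkage agglomerative clustering for the grouping. Both are illustrative choices, not confirmed details of the product. The visualization step is reduced to listing each cluster's most frequent terms, and the function names, the similarity threshold, and the toy documents are all invented here.

```python
import math
import re
from collections import Counter

def index(documents):
    """Step 1: map each document to a term-count vector."""
    return [Counter(re.findall(r"[a-z]+", text.lower())) for text in documents]

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def cluster(vectors, threshold=0.3):
    """Step 2: repeatedly merge the two most similar clusters until none are similar enough."""
    clusters = [[i] for i in range(len(vectors))]
    merges = []  # merge history, which records the hierarchy

    def similarity(c1, c2):
        # Average pairwise similarity between the members of two clusters.
        return sum(cosine(vectors[i], vectors[j]) for i in c1 for j in c2) / (len(c1) * len(c2))

    while len(clusters) > 1:
        score, a, b = max(
            (similarity(clusters[a], clusters[b]), a, b)
            for a in range(len(clusters))
            for b in range(a + 1, len(clusters))
        )
        if score < threshold:
            break
        merges.append((clusters[a], clusters[b], round(score, 2)))
        merged = sorted(clusters[a] + clusters[b])
        clusters = [c for k, c in enumerate(clusters) if k not in (a, b)] + [merged]
    return sorted(clusters), merges

def describe(cluster_ids, vectors, top_n=3):
    """Step 3 (text-only stand-in for a visualization): most frequent terms in a cluster."""
    totals = Counter()
    for i in cluster_ids:
        totals.update(vectors[i])
    ranked = sorted(totals.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:top_n]]

# Toy corpus, made up for this sketch.
documents = [
    "quarterly revenue report restated",
    "restated quarterly revenue numbers",
    "office party lunch menu",
    "lunch menu for the office party",
]

vectors = index(documents)
clusters, merges = cluster(vectors)
for members in clusters:
    print(members, describe(members, vectors))
# [0, 1] ['quarterly', 'restated', 'revenue']
# [2, 3] ['lunch', 'menu', 'office']
```

A reviewer could then prioritize the cluster whose top terms match the investigation, as the timeline slide describes.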
“It’s incredibly unjust for companies to pay
a settlement if they’re unsure that a claim
has merit simply because they don’t have
the money or the resources to investigate
it properly.”
Bennett Borden, Chief Data Scientist and Partner, Drinker Biddle
any questions?
thank you

 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
 
Data Analyst Tasks to do the internship.pdf
Data Analyst Tasks to do the internship.pdfData Analyst Tasks to do the internship.pdf
Data Analyst Tasks to do the internship.pdf
 
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
 
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling ManjurJual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
 

7 Days of Playing Minesweeper, or How to Shut Down Whistleblower Defense with Analytics - Elise Tropiano, Relativity

  • 1. © Relativity. All rights reserved. 7 Days of Playing Minesweeper, or How to Shut Down Whistleblower Defense with Analytics Elise Tropiano, Senior Technical Product Manager
  • 2. Elise Tropiano, Senior Technical Product Manager, Analytics
  • 3. Agenda: Who We Are; The Problems We Solve; How We Solve These Problems; Case Study
  • 4. Who we are • Fast-growing legal tech company • Unstructured big data platform enhanced with advanced analytics, machine learning, and powerful visualizations • 800+ employees worldwide • Headquartered in Chicago, with offices in London, Kraków, Hong Kong and Melbourne
  • 5. Welcome to our Kraków office • Product Innovation Center • Opened in September 2015 • Focus on data transfer solutions • Team growing to up to 100 this year
  • 6. Relativity helps manage and analyze data relevant to litigation and investigations
  • 7. organize data; discover the truth; act on it
  • 8. data problems we solve
  • 9. We live in a world where there is an electronic trail of evidence in every potential litigation. Every sent email or document created could be relevant in a trial.
  • 10. Scale of a Hypothetical Large Case: 500 people potentially involved x 100 emails sent by each per day x 180 working days a year x 5 years = 45,000,000 emails
  • 11. It all could be potentially relevant in litigation. And it’s not just emails that are relevant…
  • 12. 750M+ files in the largest case in Relativity
  • 13. our solution
  • 14. SaaS platform
  • 15. our platform: search & search workflow; machine learning; email analytics; workflow & applications; repository; data analytics & reporting
  • 16. whistleblower or extortionist?
  • 17. Drinker Biddle is a full-service law firm providing litigation, regulatory and business solutions to public and private corporations, multinational Fortune 100 companies and start-ups.
  • 18. • When a senior executive for a publicly traded company was fired for underperformance, he made a serious allegation on his way out the door. • He claimed he was laid off because of his repeated attempts to inform officials that the company was falsifying quarterly financial reports to the public.
  • 19. Two Options to Handle This Case. Traditional Approach: Kick off a long list of tasks for the company, including waiting for a lawyer to send them a demand letter, gearing up for defense, coaxing out the facts as knowledge evolves, and possibly settling the case before even getting to the truth. Cost: $1,500,000+. Analytical Approach: Start an internal investigation and figure out exactly what happened and decide how to handle it. Cost: $80,000.
  • 20. Investigation Timeline, Days 1-3 • Collected multi-sourced data: one million emails plus thousands of complex financial reports • Relativity Analytics, Email threading: grouped email conversations and found an initial set of relevant meeting notes and presentations • Relativity Analytics, Keyword expansion: ran terms found in these meeting notes through keyword expansion to find additional relevant terms • Relativity Analytics, Clustering: grouped documents into conceptually similar groups and prioritized the review of clusters containing relevant documents to find additional sources of intelligence
  • 21. What ACTUALLY Happened? The former employee had been emailing with his wife on his personal account and forwarding those emails to his work email. The emails contained keywords and phrases relevant to the investigation, so Drinker Biddle were able to make a case to collect from the former employee’s personal email and load that into Relativity for further review. Drinker Biddle found evidence of the former employee working with his wife, an employment attorney, to develop a case against the company, and they were able to prove he was drafting emails about fraudulent accounting before the quarterly numbers were recorded.
  • 22. Relativity Analytics helped shut down a whistleblower defense in 7 days, saving an estimated $1.5M+
  • 23. How We Did It: Email Threading. 1. DIVIDE emails into segments and understand each segment’s metadata. 2. COMBINE emails into conversation threads. 3. IDENTIFY inclusive emails for optimal efficiency.
  • 24. From: Brandon Gauthier Sent: Tuesday, November 24, 2015 7:50 AM To: Michael Di Salvo Subject: Demo Email I need some email data for a demo, can you reply back to this email. Thanks! From: Michael Di Salvo Sent: Tuesday, November 24, 2015 9:52 AM To: Brandon Gauthier Subject: Re: Demo Email This is me, replying to your e-mail. You sir, are very welcome. From: Brandon Gauthier Sent: Tuesday, November 24, 2015 7:53 AM To: Michael Di Salvo Subject: Re: Demo Email One more time! This way we can show email segments better. From: Michael Di Salvo Sent: Tuesday, November 24, 2015 9:54 AM To: Brandon Gauthier Subject: Re: Demo Email This is me, creating an additional segment. From: Brandon Gauthier Sent: Tuesday, November 24, 2015 7:50 AM To: Michael Di Salvo Subject: Demo Email I need some email data for a demo, can you reply back to this email. Thanks! From: Michael Di Salvo Sent: Tuesday, November 24, 2015 9:52 AM To: Brandon Gauthier Subject: Re: Demo Email This is me, replying to your e-mail. You sir, are very welcome.
  • 25.
  • 26.
  • 27. How We Did It: Keyword Expansion. 1. CREATE multi-dimensional space from terms in document text. 2. IDENTIFY terms that are conceptually similar to a user-provided query. 3. RETURN the terms for augmented searching.
  • 28. “Personally-held”
  • 29.
  • 30. How We Did It: Clustering. 1. INDEX documents into multi-dimensional space. 2. HIERARCHICALLY GROUP documents into conceptually similar groups using the document text. 3. VISUALIZE clusters for promoting corpus understanding and searching.
  • 31.
  • 32.
  • 33. “It’s incredibly unjust for companies to pay a settlement if they’re unsure that a claim has merit simply because they don’t have the money or the resources to investigate it properly.” Bennett Borden, Chief Data Scientist and Partner, Drinker Biddle
  • 34. any questions?
  • 35. thank you
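
The three email threading steps on slide 23 (divide, combine, identify inclusive emails) can be illustrated with a minimal Python sketch. This is not Relativity's implementation. It assumes, for illustration only, that every segment starts with a line beginning "From: ", that quoted segments appear newest first without quote markers, and that an email counts as inclusive when no longer email in the same thread repeats all of its segments. The function names `divide`, `combine`, and `inclusive`, and the shortened sample messages, are hypothetical.

```python
import re
from collections import defaultdict

# Assumption: a segment begins at any line that starts with "From: ".
HEADER = re.compile(r"^From: ", re.MULTILINE)

def divide(raw: str) -> tuple[str, ...]:
    """Step 1: split one email into segments, returned oldest first."""
    starts = [m.start() for m in HEADER.finditer(raw)]
    bounds = starts + [len(raw)]
    # Collapse whitespace so a quoted copy compares equal to the original.
    segments = [" ".join(raw[a:b].split()) for a, b in zip(bounds, bounds[1:])]
    return tuple(reversed(segments))  # raw text lists the newest segment first

def combine(emails: dict[str, str]) -> dict[str, dict[str, tuple[str, ...]]]:
    """Step 2: group emails into threads keyed by their oldest segment."""
    threads: dict[str, dict[str, tuple[str, ...]]] = defaultdict(dict)
    for email_id, raw in emails.items():
        segments = divide(raw)
        if segments:
            threads[segments[0]][email_id] = segments
    return dict(threads)

def inclusive(thread: dict[str, tuple[str, ...]]) -> list[str]:
    """Step 3: keep emails whose segments are not fully repeated by a longer email."""
    keep = []
    for email_id, segments in thread.items():
        covered = any(
            len(other) > len(segments) and other[: len(segments)] == segments
            for other_id, other in thread.items()
            if other_id != email_id
        )
        if not covered:
            keep.append(email_id)
    return sorted(keep)

# Shortened, hypothetical versions of the demo messages on slide 24.
m1 = "From: Brandon Gauthier\nSubject: Demo Email\nI need some email data for a demo."
m2 = "From: Michael Di Salvo\nSubject: Re: Demo Email\nThis is me, replying.\n" + m1
m3 = "From: Brandon Gauthier\nSubject: Re: Demo Email\nOne more time!\n" + m2

threads = combine({"e1": m1, "e2": m2, "e3": m3})
inclusive_ids = inclusive(next(iter(threads.values())))
print(inclusive_ids)  # ['e3']
```

Under these assumptions a reviewer would only need to read `e3`, because it already contains the text of `e1` and `e2`. Real mail would also need handling for quote markers, edited quotes, and attachments, which this sketch ignores.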