AI for Good: Bad Guys, Messy Data, & NLP

  • 1. Gil Irizarry MAY 22, 2019 AI FOR GOOD Bad Guys, Messy Data, & NLP
  • 2. LIQUID TRAVEL BAN (AI FOR GOOD ● BASIS TECHNOLOGY)
  • 3. LIQUID BOMB PLOT
  • 4. JIHADI BRIDES TRAGEDY. Image sources: Bethnal trio: Mirror; Article: Independent
  • 5. ALL THE EVIDENCE EXISTS: Scotland Yard Report, ID, Social Activity. Image source: Tweet: ISD Global
  • 6. WHAT’S AT STAKE. FINANCIAL STABILITY: Global Money Laundering Operations, 1% of Illegal Funds Captured. PUBLIC SAFETY: Deaths from Terrorist Attacks in Europe, 11,288 from 1970-2017. Sources: Terrorism: Washington Post; Money laundering: Wall Street Journal
  • 7. UNPACKING THE AI SYSTEM
  • 8. THE PROPOSED SOLUTION: NLP/NLU
  • 9. COMMON PATTERN: COLLECT, EXTRACT, COMBINE, ANALYZE. 80% of data is unstructured. Join Processed and Structured Data into Knowledge Graph. 1) Natural Language Processing Extracts Facts; 2) Scored for confidence & relevance. Mine Graph For Patterns & Changes. People, Organizations, Locations, Relationships. Searching, Alerting, Anomaly Detection, Reporting.
  • 10. CHALLENGES AT EVERY LEVEL: COLLECT, EXTRACT, COMBINE, ANALYZE. ● Domains ● Languages ● Training Data ● Data Salad! ● Data Access ● Duplication ● Variation ● Ambiguity ● Semantics ● Honey Pots ● Training Data ● GIGO ● Data Overload ● Alert Bombs ● Privacy ● Trust
  • 11. CHALLENGES AND ANTI-PATTERNS: Identifying Context. 1) Reliance on Keywords; 2) Naive Rules. Leads to False Positives and False Negatives. Example text: “... government officials were convicted of corruption. ABC Company saw a drop in sales as …” COLLECT, EXTRACT, COMBINE, ANALYZE
  • 12. CHALLENGES AND ANTI-PATTERNS: Identifying Proper Names. 3) Name Variants; 4) Name Parts (common keys). Leads to False Positives and False Negatives. abdul rashid, abdal rashide, abdal-rasheed, abdul-rashiyd, abdul-rachid, abd-errshiyd, abd-errchide, abd-errcheed ➔ Abdul-Rasheed. COLLECT, EXTRACT, COMBINE, ANALYZE
  • 13. BOSTON BOMBING
  • 14. CHALLENGES AND ANTI-PATTERNS: 3) Failure to match variants; 4) Failure to disambiguate; 5) Failure to model what matters; 6) Monolingual design. “Operation Hairball”. COLLECT, EXTRACT, COMBINE, ANALYZE
  • 15. CROSS-LINGUAL SEMANTIC MODELING
  • 16. CROSS-LINGUAL SEMANTIC MODELING: Arabic, English, Chinese; Mapping Algorithm; Multilingual Embeddings Space
  • 17. CROSS-LINGUAL SEMANTIC MODELING: Machine Learning; למידה חישובית; Eagle Pharmaceuticals Inc.; Eagle Drugs, Co.; Tesla; Energy Storage; טסלה; AI; تيسال موتورز; 計算学習; אחסון אנרגיה
  • 18. AI BUILDING BLOCKS: Algorithms & High Quality Data. ● NN NER ● NN CLASS ● NN RELAX ● SVM ● TEXT EMBEDDINGS ● NNs ● NL SEARCH ● CLASSIC ML ● ANOMALY DETECTION ● HMM ● SEMANTIC MODELING ● GRAPH SIMILARITY ● Data Filtering ● Classification ● Deduplication ● High Quality Annotations ● Language & domain combos ● Active Learning Feedback ● High Quality Name Pairs in every language pair ● Confidence Modeling ● Semantic Model ● Baseline “normal” ● Queries ● Visualizations. COLLECT, EXTRACT, COMBINE, ANALYZE
  • 19. PUTTING IT ALL TOGETHER: COLLECT, EXTRACT, COMBINE, ANALYZE. People, Organizations, Locations, Relationships. Searching, Alerting, Anomaly Detection, Reporting.
  • 20. THIS TECHNOLOGY IS ALREADY AT WORK
  • 21. CAPTURING EL CHAPO. Source: U.S. Immigration and Customs Enforcement
  • 22. KEY DOMAINS OF IMPACT: National Security, Financial Services, Law Enforcement, Intelligence
  • 23. THANK YOU. Gil Irizarry ● Basis Technology ● I engineer NLP / NLU tech for good ● Please reach out! @conoagil
  • 24. Thank You
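
The name-variant problem on slide 12 can be made concrete with a small sketch. The deck does not describe the matching method it has in mind, so this swaps in plain character similarity from Python's standard library (`difflib.SequenceMatcher`); the helper names `normalize` and `similarity` are illustrative assumptions, not part of any product API.

```python
from difflib import SequenceMatcher

# Canonical form and spelling variants as listed on slide 12.
CANONICAL = "Abdul-Rasheed"
VARIANTS = [
    "abdul rashid", "abdal rashide", "abdal-rasheed", "abdul-rashiyd",
    "abdul-rachid", "abd-errshiyd", "abd-errchide", "abd-errcheed",
]


def normalize(name: str) -> str:
    """Lowercase the name and drop everything that is not a letter."""
    return "".join(ch for ch in name.lower() if ch.isalpha())


def similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1] between two normalized names."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


# Exact string comparison finds none of the variants.
exact_hits = [v for v in VARIANTS if v == CANONICAL]

# Fuzzy comparison gives every variant a graded score instead.
scores = {v: similarity(v, CANONICAL) for v in VARIANTS}

for variant, score in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(f"{variant:15s} {score:.2f}")
```

With this measure the `abd-err…` spellings score lower than the `abdul…` and `abdal…` spellings, so any single cut-off trades false negatives against false positives. That is the tension the slide points at, and it suggests why a character-only measure is a weak stand-in for a matcher that models transliteration.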

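The keyword anti-pattern on slide 11 can be sketched in the same spirit. The sample text below is adapted from the slide's example, with the elided parts left out; the function names `keyword_flag` and `sentence_flag` are hypothetical, and the sentence splitter is a deliberately crude regular expression.

```python
import re

# Illustrative text adapted from the example on slide 11.
TEXT = (
    "Government officials were convicted of corruption. "
    "ABC Company saw a drop in sales."
)


def keyword_flag(text: str, entity: str, keyword: str) -> bool:
    """Naive rule: flag the entity if the keyword appears anywhere in the text."""
    lowered = text.lower()
    return entity.lower() in lowered and keyword.lower() in lowered


def sentence_flag(text: str, entity: str, keyword: str) -> bool:
    """Require the entity and the keyword to occur in the same sentence."""
    sentences = re.split(r"(?<=[.!?])\s+", text)
    return any(
        entity.lower() in s.lower() and keyword.lower() in s.lower()
        for s in sentences
    )


# Document-level keyword matching links ABC Company to corruption: a false positive.
print(keyword_flag(TEXT, "ABC Company", "corruption"))

# Scoping the rule to one sentence removes this particular false positive.
print(sentence_flag(TEXT, "ABC Company", "corruption"))
```

The sentence-scoped version is still a naive rule in the slide's sense: it would miss a link expressed across two sentences and would wrongly flag a sentence such as one stating that a company was cleared of corruption.
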
Editor's Notes

  1. All the information that is needed to find and stop bad actors from entering our financial system already exists and is available to you today; it’s just buried in terabits of messy, unstructured data all over the internet. For those performing investigations and evaluating risk, this needle-in-a-stack-of-needles problem is huge and growing: unstructured data already dominates the web (growing exponentially year over year), and the traditional technology these departments use cannot keep up. Recent developments in natural language processing (NLP), the field of AI that focuses on human language, have, for the first time, made it possible for automated systems to find and deliver identity-relevant intelligence hidden in unstructured textual data. In this talk, I will share some of the common patterns, common mistakes, and opportunities that I see in the field. These innovations unlock a new world of actionable insight, providing much-needed ammunition in the fight against fraud, money laundering, financial crime, and terrorism.
  2. As we all know, you can’t take liquids or gels onto commercial flights, but most people don’t know the events that led up to that regulation. USA “3-1-1 Liquids Rule”: Each passenger may carry liquids, gels and aerosols in travel-size containers that are 3.4 ounces or 100 milliliters. Each passenger is limited to one quart-size bag of liquids, gels and aerosols. Common travel items that must comply with the 3-1-1 liquids rule include toothpaste, shampoo, conditioner, mouthwash and lotion. German rule: Containers holding liquids may not be larger than 100 ml, otherwise you may not carry them in your hand luggage. All such containers must be placed in a transparent, reclosable plastic bag with a capacity of no more than one liter (for example, an ordinary freezer bag with zipper). The bag may contain any number of containers as long as it is still possible to completely close it. Please remember: Each passenger may only take one such bag on board the plane.
  3. PUNCHLINE: In August 2006, seven aircraft did not explode during their flight over the Atlantic. Instead of plunging into the ocean, they landed on runways: a happy ending to what would have been a human tragedy had law enforcement not been tipped off by some carefully crafted AI.
  4. The sad part of this situation is that it could have been avoided. The data exists; the crime (which was effectively manipulating and kidnapping a minor) could have been prevented.
  5. For US audience: US Homeland Attacks (2015-2018): 64 plots disrupted, 21 plots executed.
  6. The data and technology exist to make the world a safer place, and it’s already begun to make an impact.
  7. A more recent example of NLP being used to analyze documents for national security is the 2016 capture and subsequent conviction of El Chapo. To find him, intelligence officers analyzed communications from the email, phone, and SMS traffic (SIGINT) of El Chapo’s network. They used semantic technology to look at the content of his and his network’s conversations, determine who was talking about drugs, and link the people mentioned in the text into a network. This led to the identification of the network of people involved with El Chapo, allowing agencies to find his location and capture him.
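
The linking step described in note 7 (find who is talking about a topic, then connect the people mentioned together) can be sketched as a co-occurrence graph. Everything here is a hypothetical illustration: the records stand in for the output of entity extraction and topic classification, the names are placeholders, and `build_network` and `degree` are assumed helper names rather than any agency's or vendor's method.

```python
from collections import Counter
from itertools import combinations

# Hypothetical, already-extracted records: (people mentioned, topic label).
MESSAGES = [
    ({"A", "B"}, "drugs"),
    ({"B", "C"}, "drugs"),
    ({"A", "D"}, "family"),
    ({"A", "B", "C"}, "drugs"),
]


def build_network(messages, topic):
    """Count how often each pair of people co-occurs in messages on a topic."""
    edges = Counter()
    for people, label in messages:
        if label != topic:
            continue
        for pair in combinations(sorted(people), 2):
            edges[pair] += 1
    return edges


def degree(edges):
    """Weighted degree: total co-occurrence weight attached to each person."""
    totals = Counter()
    for (left, right), weight in edges.items():
        totals[left] += weight
        totals[right] += weight
    return totals


network = build_network(MESSAGES, "drugs")
print(network)
print(degree(network).most_common(1))
```

Filtering by topic before linking keeps unrelated contacts (person D in this toy data) out of the graph, and the weighted degree gives a first, rough signal of who sits at the center of the conversation.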