Chris Mack
MAY 1, 2019
AI FOR GOOD
Bad Guys, Messy Data, & NLP
LIQUID TRAVEL BAN
AI FOR GOOD ● BASIS TECHNOLOGY
LIQUID BOMB PLOT
JIHADI BRIDES TRAGEDY
Image Sources:
- Bethnal trio: Mirror
- Article: Independent
ALL THE EVIDENCE EXISTS
Scotland Yard Report
ID
Social Activity
Image Sources:
- Tweet: ISD Global
WHAT’S AT STAKE
FINANCIAL STABILITY
Global Money Laundering Operations
1% of Illegal Funds Captured
PUBLIC SAFETY
Deaths from Terrorist Attacks in Europe
11,288 from 1970-2017
Sources:
- Terrorism: Washington Post
- Money laundering: Wall Street Journal
UNPACKING THE AI SYSTEM
THE PROPOSED SOLUTION: NLP/NLU
COMMON PATTERN
80% of data is unstructured
1) Natural Language Processing Extracts Facts
2) Scored for confidence & relevance
Join Processed and Structured Data into Knowledge Graph
Mine Graph For Patterns & Changes
People
Organizations
Locations
Relationships
Searching
Alerting
Anomaly Detection
Reporting
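The pattern above can be sketched in a few lines of Python. This is a minimal illustration, not the system described in the talk: the `Fact` type, the confidence values, the threshold, and the entity names are all hypothetical, and the "mining" step is reduced to finding nodes that two entities share.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Fact:
    subject: str
    relation: str
    obj: str
    confidence: float  # hypothetical score in [0, 1] from the NLP stage


def build_graph(facts, min_confidence=0.5):
    """Keep facts at or above the threshold and index them as an adjacency map."""
    graph = defaultdict(set)
    for f in facts:
        if f.confidence >= min_confidence:
            graph[f.subject].add((f.relation, f.obj))
            graph[f.obj].add((f"inverse:{f.relation}", f.subject))
    return graph


def shared_neighbors(graph, a, b):
    """A very simple 'pattern': nodes linked to both a and b."""
    na = {node for _, node in graph.get(a, ())}
    nb = {node for _, node in graph.get(b, ())}
    return na & nb


# Facts as an NLP extractor might emit them (values invented for illustration).
extracted = [
    Fact("Person A", "works_for", "Org X", 0.92),
    Fact("Person B", "works_for", "Org X", 0.81),
    Fact("Person B", "located_in", "City Y", 0.30),  # low confidence, dropped
]
# A record from an already structured source, treated here as fully trusted.
structured = [Fact("Org X", "located_in", "City Z", 1.0)]

graph = build_graph(extracted + structured)
print(shared_neighbors(graph, "Person A", "Person B"))
```

Filtering on confidence before the join keeps weak extractions out of the graph; a real system would more likely keep the score on each edge so that later queries can weigh it.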
CHALLENGES AT EVERY LEVEL
● Domains
● Languages
● Training Data
● Data Salad!
● Data Access
● Duplication
● Variation
● Ambiguity
● Semantics
● Honey Pots
● Training Data
● GIGO
● Data Overload
● Alert Bombs
● Privacy
● Trust
CHALLENGES AND ANTI-PATTERNS
Identifying Context
"... government officials were convicted of corruption. ABC Company saw a drop in sales as …"
1) Reliance on Keywords
2) Naive Rules
Leads to False Positives and False Negatives
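A small sketch of why keyword reliance misfires on the example text. The keyword set and both functions are hypothetical, written only to show the failure mode: a document-level keyword hit flags ABC Company even though the corruption sentence is about someone else.

```python
import re

RISK_KEYWORDS = {"corruption", "convicted"}  # hypothetical watch-list terms


def naive_flag(text, entity):
    """Anti-pattern: flag the entity if any risk keyword appears anywhere in the text."""
    lowered = text.lower()
    return entity.lower() in lowered and any(k in lowered for k in RISK_KEYWORDS)


def sentence_flag(text, entity):
    """Slightly narrower rule: keyword and entity must share a sentence."""
    for sentence in re.split(r"(?<=[.!?])\s+", text):
        lowered = sentence.lower()
        if entity.lower() in lowered and any(k in lowered for k in RISK_KEYWORDS):
            return True
    return False


text = ("... government officials were convicted of corruption. "
        "ABC Company saw a drop in sales as ...")

print(naive_flag(text, "ABC Company"))     # document-level keyword hit
print(sentence_flag(text, "ABC Company"))  # same-sentence requirement
```

The sentence-level check removes this particular false positive, but it is still a naive rule: it would miss a sentence that refers to the company by pronoun, which is the false-negative side of the same problem.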
CHALLENGES AND ANTI-PATTERNS
Identifying Proper Names
3) Name Variants
4) Name Parts (common keys)
Leads to False Positives and False Negatives
Abdul-Rasheed ➔
abdul rashid
abdal rashide
abdal-rasheed
abdul-rashiyd
abdul-rachid
abd-errshiyd
abd-errchide
abd-errcheed
abd-errchiyd …
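To make the variant problem concrete, here is a minimal sketch of fuzzy name matching over the spellings listed above. The consonant-skeleton heuristic, the `ch` to `sh` rule, and the 0.8 threshold are assumptions chosen for this illustration; they are not the matching method used by any product mentioned in the talk.

```python
import re


def skeleton(name):
    """Reduce a romanized name to a rough consonant skeleton (illustrative heuristic)."""
    s = re.sub(r"[^a-z]", "", name.lower())  # drop spaces, hyphens, punctuation
    s = s.replace("ch", "sh")                # treat French-style "ch" like "sh"
    s = re.sub(r"[aeiouy]", "", s)           # vowels vary most across romanizations
    return re.sub(r"(.)\1+", r"\1", s)       # collapse doubled letters


def edit_distance(a, b):
    """Classic Levenshtein distance via dynamic programming."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]


def similarity(a, b):
    """Similarity in [0, 1] between the skeletons of two names."""
    ka, kb = skeleton(a), skeleton(b)
    longest = max(len(ka), len(kb)) or 1
    return 1 - edit_distance(ka, kb) / longest


def is_match(a, b, threshold=0.8):  # threshold is an arbitrary illustrative value
    return similarity(a, b) >= threshold


QUERY = "Abdul-Rasheed"
VARIANTS = [
    "abdul rashid", "abdal rashide", "abdal-rasheed", "abdul-rashiyd",
    "abdul-rachid", "abd-errshiyd", "abd-errchide", "abd-errcheed",
    "abd-errchiyd",
]

matched = [v for v in VARIANTS if is_match(QUERY, v)]
print(f"{len(matched)} of {len(VARIANTS)} variants matched")
print(is_match(QUERY, "abdul rashid"), QUERY.lower() == "abdul rashid")
```

An exact string comparison misses every variant, while a looser threshold starts to pull in unrelated names. That trade-off is the false-positive and false-negative tension the slide points to.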
BOSTON BOMBING
Challenges & Anti-patterns
3) Failure to match variants
4) Failure to disambiguate
5) Failure to model what matters
6) Monolingual design
“Operation Hairball”
CROSS-LINGUAL SEMANTIC MODELING
Arabic
English
Chinese
Mapping algorithm
Multilingual embeddings space
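The slide does not say which mapping algorithm is meant. One common choice is an orthogonal (Procrustes) alignment learned from a seed dictionary of translation pairs. The sketch below shows that idea in two dimensions, where the best rotation has a closed form; the vectors and the "true" angle are invented so the result can be checked.

```python
import math


def rotate(v, theta):
    """Rotate a 2-D vector counter-clockwise by theta radians."""
    x, y = v
    c, s = math.cos(theta), math.sin(theta)
    return (c * x - s * y, s * x + c * y)


def fit_rotation(pairs):
    """Least-squares rotation angle aligning source vectors to target vectors.

    This is the 2-D special case of orthogonal Procrustes alignment.
    """
    cross = sum(sx * ty - sy * tx for (sx, sy), (tx, ty) in pairs)
    dot = sum(sx * tx + sy * ty for (sx, sy), (tx, ty) in pairs)
    return math.atan2(cross, dot)


# Invented 2-D "source language" vectors for three seed dictionary entries.
source = [(1.0, 0.0), (0.6, 0.8), (-0.5, 0.5)]

# Pretend the target-language space is the source space rotated by 0.5 rad.
true_theta = 0.5
target = [rotate(v, true_theta) for v in source]

theta = fit_rotation(list(zip(source, target)))
print(round(theta, 3))

# Any other source-language vector can now be mapped into the target space.
mapped = rotate((0.0, 1.0), theta)
print(tuple(round(c, 3) for c in mapped))
```

Real embedding spaces have hundreds of dimensions and the fit is done with a singular value decomposition, but the principle is the same: learn one transformation from a small set of known pairs, then apply it to every other vector.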
CROSS-LINGUAL SEMANTIC MODELING
Machine Learning
למידה חישובית
計算学習
AI
Eagle Pharmaceuticals Inc.
Eagle Drugs, Co.
Tesla
טסלה
تيسلا موتورز
Energy Storage
אחסון אנרגיה
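Once terms from different languages live in one space, cross-lingual lookup becomes a nearest-neighbor search. The sketch below uses terms like those on the slide with invented three-dimensional vectors; real embeddings are learned and far larger, so the numbers are assumptions made purely for illustration.

```python
import math

# Toy vectors standing in for a shared multilingual embedding space.
# All numbers are invented for illustration.
SHARED_SPACE = {
    "Machine Learning": (0.90, 0.10, 0.05),
    "למידה חישובית": (0.88, 0.12, 0.07),
    "計算学習": (0.86, 0.15, 0.04),
    "Tesla": (0.10, 0.92, 0.08),
    "טסלה": (0.12, 0.90, 0.10),
    "Energy Storage": (0.05, 0.20, 0.95),
    "אחסון אנרגיה": (0.07, 0.18, 0.93),
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def nearest(term, k=2):
    """Return the k terms closest to `term`, regardless of language."""
    query = SHARED_SPACE[term]
    scored = [
        (cosine(query, vec), other)
        for other, vec in SHARED_SPACE.items()
        if other != term
    ]
    return [other for _, other in sorted(scored, reverse=True)[:k]]


print(nearest("Tesla", k=1))
print(nearest("Machine Learning"))
```

The lookup never consults a translation table: an English query retrieves Hebrew and Japanese terms only because their vectors sit close together.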
AI BUILDING BLOCKS: Algorithms & High Quality Data
● NN NER
● NN CLASS
● NN RELAX
● SVM
● TEXT EMBEDDINGS
● NNs
● NL SEARCH
● CLASSIC ML
● ANOMALY DETECTION
● HMM
● SEMANTIC MODELING
● GRAPH SIMILARITY
● Data Filtering
● Classification
● Deduplication
● High Quality Annotations
● Language & domain combos
● Active Learning Feedback
● High Quality Name Pairs in every language pair
● Confidence Modeling
● Semantic Model
● Baseline “normal”
● Queries
● Visualizations
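As one illustration of the "Baseline normal" and anomaly detection items, the sketch below flags a value that sits far from a historical baseline using a z-score. The weekly counts, the threshold of 3.0, and the function name are assumptions for this example; the slide does not specify a detection method.

```python
from statistics import mean, stdev


def is_anomalous(history, new_value, z_threshold=3.0):
    """Flag new_value if it lies more than z_threshold standard deviations
    from the mean of the historical baseline."""
    mu = mean(history)
    sigma = stdev(history)
    if sigma == 0:
        # A perfectly flat baseline: any change at all is unusual.
        return new_value != mu
    return abs(new_value - mu) / sigma > z_threshold


# Invented weekly counts of how often an entity appears in incoming reports.
weekly_mentions = [4, 6, 5, 7, 5, 6, 4, 5]

print(is_anomalous(weekly_mentions, 6))   # within the usual range
print(is_anomalous(weekly_mentions, 25))  # a sharp spike
```

Keeping the threshold explicit matters for the "Alert Bombs" challenge listed earlier: a threshold set too low buries analysts in alerts, and one set too high hides real changes.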
PUTTING IT ALL TOGETHER
People
Organizations
Locations
Relationships
Searching
Alerting
Anomaly Detection
Reporting
THIS TECHNOLOGY IS ALREADY AT WORK
CAPTURING EL CHAPO
Source: U.S. Immigration and Customs Enforcement
Source: El Chapo recaptured in gun battle
KEY DOMAINS OF IMPACT
National Security
Financial Services
Law Enforcement
Intelligence
THANK YOU
Chris Mack
● Basis Technology
● I design & implement NLP / NLU solutions for good
● Please reach out!
@cgmack
Thank You
