Chris Mack
MAY 1, 2019
AI FOR GOOD
Bad Guys, Messy Data, & NLP
LIQUID TRAVEL BAN
AI FOR GOOD ● BASIS TECHNOLOGY
LIQUID BOMB PLOT
JIHADI BRIDES TRAGEDY
Image Sources:
- Bethnal trio: Mirror
- Article: Independent
ALL THE EVIDENCE EXISTS
Scotland Yard Report
ID
Social Activity
Image Sources:
- Tweet: ISD Global
WHAT’S AT STAKE
FINANCIAL STABILITY
Global Money Laundering Operations
1% of Illegal Funds Captured
PUBLIC SAFETY
Deaths from Terrorist Attacks in Europe
11,288 from 1970-2017
Sources:
- Terrorism: Washington Post
- Money laundering: Wall Street Journal
UNPACKING THE AI SYSTEM
THE PROPOSED SOLUTION: NLP/NLU
COMMON PATTERN
80% of data is unstructured
1) Natural Language Processing Extracts Facts
2) Scored for confidence & relevance
Join Processed and Structured Data into Knowledge Graph
Mine Graph For Patterns & Changes
People
Organizations
Locations
Relationships
Searching
Alerting
Anomaly Detection
Reporting
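
The pattern on this slide can be sketched in a few lines. This is a minimal illustration, not the speaker's system: the `Fact` record, the `build_graph` and `new_edges` helpers, the 0.5 confidence cutoff, and every entity name are hypothetical. It assumes an upstream NLP step has already produced scored facts, joins the confident ones into a small graph, and "mines" it by diffing against an earlier snapshot.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Fact:
    """One extracted statement; all fields here are illustrative."""
    subject: str
    relation: str
    obj: str
    confidence: float  # score in [0, 1] assigned by the extraction step


def build_graph(facts, min_confidence=0.5):
    """Join facts into an adjacency map, dropping low-confidence ones."""
    graph = defaultdict(set)
    for fact in facts:
        if fact.confidence >= min_confidence:
            graph[fact.subject].add((fact.relation, fact.obj))
    return graph


def new_edges(baseline, current):
    """Return (subject, relation, object) edges present only in `current`."""
    return {
        (subject, relation, obj)
        for subject, edges in current.items()
        for relation, obj in edges
        if (relation, obj) not in baseline.get(subject, set())
    }


# Hypothetical extraction output for two consecutive days.
day1_facts = [
    Fact("Person A", "works_for", "Org X", 0.9),
    Fact("Person A", "located_in", "City Y", 0.3),  # below cutoff, dropped
]
day2_facts = [Fact("Person A", "transfers_to", "Org Z", 0.8)]

baseline = build_graph(day1_facts)
current = build_graph(day1_facts + day2_facts)
changes = new_edges(baseline, current)
print(changes)
```

A real deployment would likely keep the confidence score on each edge rather than discarding it after filtering, so that alerts can be ranked.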
CHALLENGES AT EVERY LEVEL
● Domains
● Languages
● Training Data
● Data Salad!
● Data Access
● Duplication
● Variation
● Ambiguity
● Semantics
● Honey Pots
● Training Data
● GIGO
● Data Overload
● Alert Bombs
● Privacy
● Trust
CHALLENGES AND ANTI-PATTERNS
Identifying Context
1) Reliance on Keywords
2) Naive Rules
Leads to False Positives
and False Negatives
Example text:
... government officials were convicted of corruption. ABC Company saw a drop in sales as …
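
The example text on this slide shows why keyword rules misfire. The sketch below is an assumption-laden toy: the `RISK_TERMS` set and both rule functions are hypothetical and only contrast a document-level keyword rule with a slightly narrower sentence-level one. Neither is the NLP/NLU approach the deck proposes.

```python
import re

# Hypothetical risk vocabulary for the illustration.
RISK_TERMS = {"corruption", "convicted"}


def naive_flag(text, entity):
    """Flag the entity if it and any risk term appear anywhere in the text."""
    lowered = text.lower()
    return entity.lower() in lowered and any(t in lowered for t in RISK_TERMS)


def sentence_flag(text, entity):
    """Flag only when the entity and a risk term share a sentence."""
    for sentence in re.split(r"(?<=[.!?])\s+", text):
        lowered = sentence.lower()
        if entity.lower() in lowered and any(t in lowered for t in RISK_TERMS):
            return True
    return False


text = (
    "... government officials were convicted of corruption. "
    "ABC Company saw a drop in sales as ..."
)

print(naive_flag(text, "ABC Company"))     # document-level rule
print(sentence_flag(text, "ABC Company"))  # sentence-level rule
```

The document-level rule flags ABC Company only because the word "corruption" is nearby, which is a false positive. The sentence-level rule avoids this case but is still a naive rule: it would miss a pronoun reference in the next sentence, which is a false negative.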
CHALLENGES AND ANTI-PATTERNS
Identifying Proper Names
3) Name Variants
4) Name Parts (common keys)
Leads to False Positives
and False Negatives
Abdul-Rasheed ➔
abdul rashid
abdal rashide
abdal-rasheed
abdul-rashiyd
abdul-rachid
abd-errshiyd
abd-errchide
abd-errcheed
abd-errchiyd …
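
To make the variant problem concrete, here is a rough sketch of fuzzy name matching over the variants listed on this slide. It is not the method used by any product named in the deck. The `skeleton` key (drop vowels, treat "ch" as "sh", collapse doubled letters) and the 0.8 threshold are assumptions chosen for this example, and "Abdullah Rahman" is a made-up non-match.

```python
import re
from difflib import SequenceMatcher


def skeleton(name):
    """Crude consonant key: an illustrative normalization, not a standard."""
    letters = re.sub(r"[^a-z]", "", name.lower())     # drop spaces, hyphens
    letters = letters.replace("ch", "sh")             # rachid ~ rashid
    consonants = re.sub(r"[aeiouy]", "", letters)     # vowels vary the most
    return re.sub(r"(.)\1+", r"\1", consonants)       # collapse doubles


def similar(a, b, threshold=0.8):
    """Compare two names by the similarity of their consonant keys."""
    ratio = SequenceMatcher(None, skeleton(a), skeleton(b)).ratio()
    return ratio >= threshold


VARIANTS = [
    "abdul rashid", "abdal rashide", "abdal-rasheed", "abdul-rashiyd",
    "abdul-rachid", "abd-errshiyd", "abd-errchide", "abd-errcheed",
    "abd-errchiyd",
]

matches = [v for v in VARIANTS if similar("Abdul-Rasheed", v)]
print(len(matches), "of", len(VARIANTS), "variants matched")
print(similar("Abdul-Rasheed", "Abdullah Rahman"))
```

An exact comparison of keys would miss the "abd-err..." forms, because the key loses the "l"; scoring similarity between keys recovers them. The trade-off is the one the slide warns about: a key this coarse will also pull in unrelated names, so the threshold has to be tuned against labeled name pairs.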
BOSTON BOMBING
CHALLENGES AND ANTI-PATTERNS
3) Failure to match variants
4) Failure to disambiguate
5) Failure to model what matters
6) Monolingual design
“Operation Hairball”
CROSS-LINGUAL SEMANTIC MODELING
Mapping algorithm
Arabic
English
Chinese
Multilingual embeddings space
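
One common way to realize a "mapping algorithm" into a shared embedding space is to learn a rotation from a small seed dictionary of translation pairs. The deck does not say which algorithm is used, so the sketch below is only an assumption: a two-dimensional toy of the least-squares rotation (orthogonal Procrustes restricted to the plane), with invented vectors and invented `en:`/`ar:` labels.

```python
import math


def fit_rotation(pairs):
    """Angle of the 2-D rotation that best maps source vectors onto targets."""
    cross = sum(sx * ty - sy * tx for (sx, sy), (tx, ty) in pairs)
    dot = sum(sx * tx + sy * ty for (sx, sy), (tx, ty) in pairs)
    return math.atan2(cross, dot)


def rotate(vec, theta):
    """Apply the rotation to a single vector."""
    x, y = vec
    c, s = math.cos(theta), math.sin(theta)
    return (c * x - s * y, s * x + c * y)


def cosine(a, b):
    """Cosine similarity of two 2-D vectors."""
    return (a[0] * b[0] + a[1] * b[1]) / (math.hypot(*a) * math.hypot(*b))


def nearest(vec, space):
    """Label of the vector in `space` most similar to `vec`."""
    return max(space, key=lambda word: cosine(vec, space[word]))


# Toy monolingual spaces (made up): the second is the first rotated by 90 degrees.
source = {"en:bank": (1.0, 0.0), "en:river": (0.0, 1.0), "en:money": (0.8, 0.6)}
target = {"ar:bank": (0.0, 1.0), "ar:river": (-1.0, 0.0), "ar:money": (-0.6, 0.8)}

# Seed dictionary: two known translation pairs.
seed = [(source["en:bank"], target["ar:bank"]),
        (source["en:river"], target["ar:river"])]
theta = fit_rotation(seed)

# A word outside the seed dictionary lands next to its translation.
print(nearest(rotate(source["en:money"], theta), target))
```

Real embeddings have hundreds of dimensions, where the same idea is usually solved with a singular value decomposition rather than a single angle.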
CROSS-LINGUAL SEMANTIC MODELING
Machine Learning
למידה חישובית
Eagle Pharmaceuticals Inc.
Eagle Drugs, Co.
Tesla
Energy Storage
טסלה
AI
تيسلا موتورز
計算学習
אחסון אנרגיה
AI BUILDING BLOCKS: Algorithms & High Quality Data
● NN NER
● NN CLASS
● NN RELAX
● SVM
● TEXT EMBEDDINGS
● NNs
● NL SEARCH
● CLASSIC ML
● ANOMALY DETECTION
● HMM
● SEMANTIC MODELING
● GRAPH SIMILARITY
● Data Filtering
● Classification
● Deduplication
● High Quality Annotations
● Language & domain combos
● Active Learning Feedback
● High Quality Name Pairs in every language pair
● Confidence Modeling
● Semantic Model
● Baseline “normal”
● Queries
● Visualizations
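
Two of the building blocks above, a baseline of "normal" and anomaly detection, can be illustrated with a simple z-score check. This is a generic statistical sketch rather than the deck's method; the `is_anomalous` helper, the threshold of 3 standard deviations, and the daily counts are all hypothetical.

```python
from statistics import mean, stdev


def is_anomalous(baseline, value, z_threshold=3.0):
    """Flag `value` if it sits far outside the baseline's usual spread."""
    mu = mean(baseline)
    sigma = stdev(baseline)
    if sigma == 0:
        # A perfectly flat baseline: any deviation at all is unusual.
        return value != mu
    return abs(value - mu) / sigma > z_threshold


# Hypothetical daily counts of some event for one entity in the graph.
baseline_counts = [4, 5, 6, 5, 4, 6, 5]

print(is_anomalous(baseline_counts, 6))   # within the usual range
print(is_anomalous(baseline_counts, 12))  # well outside it
```

Tying the threshold to the baseline's spread is one way to limit the "alert bombs" listed among the challenges, since naturally noisy entities need a larger jump before they fire.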
PUTTING IT ALL TOGETHER
People
Organizations
Locations
Relationships
Searching
Alerting
Anomaly Detection
Reporting
THIS TECHNOLOGY IS ALREADY AT WORK
CAPTURING EL CHAPO
Source: U.S. Immigration and Customs Enforcement
CAPTURING EL CHAPO
Source: El Chapo recaptured in gun battle
KEY DOMAINS OF IMPACT
National Security
Financial Services
Law Enforcement
Intelligence
THANK YOU
Chris Mack
● Basis Technology
● I design & implement NLP / NLU solutions for good
● Please reach out!
@cgmack
