SlideShare a Scribd company logo
1 of 20
Download to read offline
IBM Cloud / DOC ID / Month XX, 2018 / © 2018 IBM Corporation
#PartyCloud
A journey into Data Science & AI:
“Latent Panelists Affinities” case study
—
IBM Data Workshop
20-09-2018
Gianmario Spacagna
Chief Scientist, Helixa AI
About me
2IBM Cloud / DOC ID / Month XX, 2018 / © 2018 IBM Corporation
University background:
● Telematics Engineering (Polytechnic of Turin)
● Software Engineering of Distributed Systems (KTH Stockholm)
Working experience:
● Predictive Marketing (AgilOne)
● Cyber Security (Cisco)
● Retail & Business Banking (Barclays)
● Automotive (Pirelli)
● Artificial Social Intelligence (Helixa)
External Projects
3
● Co-author of “Python Deep Learning”
book and
“The Professional Data Science
Manifesto”
● Founder of the “Data Science Milan”
community and the
Machine Intelligence Hub network
Intelligence
4
● Capacity to learn from experience *
● Ability to adapt to different context *
● The use of metacognition to enhance learning *
* Cognitive Psychology 4th edition, Robert J Sternberg, Chapter 13
Social Intelligence
5
● Ability to get along with others *
● Knowledge of social matters *
● Insight into moods and or underlying personality traits of
others *
* Cognitive Psychology 4th edition, Robert J Sternberg, Chapter 13
Artificial (Social) Intelligence
6
● The computational part of the ability to achieve (social)**
goals in the world*
● The application of machine intelligence techniques to social
phenomena ***
* Cognitive Psychology 4th edition, Robert J Sternberg, Chapter 13
** My own social re-interpretation
*** Artificial Social Intelligence, William Sims Bainbridge et al., Annual Review of Sociology, Vol. 20 (1994), pp. 407-436
Generative AI Technology
7
A generative algorithm ensembling AI models and prior knowledge of the
world in order to unify different data sources into a single population of
synthetic users representing an augmented view of U.S. consumers and
their affinities.
Case Study: Panelists Latent Affinities
Influencers &
Celebrities
Products &
Brands
Media &
Publishers
Anonymous Panelists
Survey
Unclassified Entities
9
Products and Brands
Media & Publishers
Influencers and Celebrities
?
?
?
Potentially millions of entities!
Partially-responded Survey
10
Are you
inspired by Elon
Musk?
Are you
interested on
SpaceX
mission?
Have you drunk
Starbucks
coffee in the
last month?
Do you read NY
Times at least
once a week?
Do you listen to
Led Zeppelin?
❌ N/A ✅ N/A ✅
❌ N/A N/A ❌ N/A
✅ ✅ ❌ N/A ✅
✅ N/A ❌ N/A ✅
Given a set of uncategorized entities and a
set of anonymized users along with some
observable affinities:
1. What category each entity belongs to?
2. What are his/her latent affinites?
11
Social Agent Goals
12
1. Identify the category of each entity (e.g. influencer, product, media)
2. Learn representations of the entities (e.g. grouping them in shared
topics such as sports, music genres, movie kinds)
3. Learn how to map one entity to another (e.g. Elon Musk : people =
SpaceX : technology)
4. Estimate latent affinities by reasoning on the available observations
(e.g. if you are interested in Rock music and 70s culture, you are very
likely to be a fan of Led Zeppelin)
Entities Classifier
13
Input:
● Entity attributes
● Survey statistics
Output:
● Influencer
● Product
● Media
Recommender System for Affinities
14
● Build a user-item matrix
● Use implicit feedback to
represent missing
affinities
● Decompose it in the
multiplication of
user-topic and topic-item
● Infer probability scores of
latent items
Alternating Least Squares algorithm
Data Lake
End-to-End Pipeline
15
Entities
Classifier
Recommender
System
REST
Dashboard
Aggregation
Engine
Data
Ingestion
Jobs
Aggregated
Insights
Training datasets
AI
Data product
Generative
Algorithm
Model
Validation
Research Spike Workflow
16IBM Cloud / DOC ID / Month XX, 2018 / © 2018 IBM Corporation
Deliverable:
● Findings report
● Insights
● Know-how
Development Workflow
17
1. Time-boxed research spikes followed by clearly defined feature stories
2. Notebooks good for analysis, entry points and results presentation
3. Code developed in modules and functions in a proper IDE
4. Unit tests: replace “assertEqual” with uncertainty ranges
5. Versioning: both code, priors, data, evaluation results and models
6. Git-flow branching system, pull request, peer review
7. End-to-end testing using small datasets and Continuous Integration
First release plan focused on a MVP satisfying the acceptance
criteria defined by early adopter customers instead of just PoC
Stack
18
IBM Cloud / DOC ID / Month XX, 2018 / © 2018 IBM Corporation
Programming Languages:
Scientific Tools:
Distributed Systems: Machine Learning: Visualization:
Storage:
ModelDB
DevOps:
Versioning:
The Crew
ML Engineer ML Scientist
Chief Scientist Mathematician
Psychologist Data Engineer
NLP Specialist
Challenges
20
1. Business integration (technical and cultural)
2. Lack of expertise in the market and steep learning curve
3. Too many different tools and technologies

More Related Content

What's hot

The A-Z of Data: Introduction to MLOps
The A-Z of Data: Introduction to MLOpsThe A-Z of Data: Introduction to MLOps
The A-Z of Data: Introduction to MLOpsDataPhoenix
 
Data ops: Machine Learning in production
Data ops: Machine Learning in productionData ops: Machine Learning in production
Data ops: Machine Learning in productionStepan Pushkarev
 
QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...
QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...
QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...QuantUniversity
 
BigML Release: PCA
BigML Release: PCABigML Release: PCA
BigML Release: PCABigML, Inc
 
Machine Learning Project Lifecycle
Machine Learning Project LifecycleMachine Learning Project Lifecycle
Machine Learning Project LifecycleAbdelhak MAHMOUDI
 
ALT-F1 Techtalk 3 - Google AppEngine
ALT-F1 Techtalk 3 - Google AppEngineALT-F1 Techtalk 3 - Google AppEngine
ALT-F1 Techtalk 3 - Google AppEngineAbdelkrim Boujraf
 
Wen Tang resume~
Wen Tang resume~Wen Tang resume~
Wen Tang resume~Tang Wen
 
Towards Human-Centered Machine Learning
Towards Human-Centered Machine LearningTowards Human-Centered Machine Learning
Towards Human-Centered Machine LearningSri Ambati
 
Interactive Browsing and Navigation in Relational Databases
Interactive Browsing and Navigation in Relational DatabasesInteractive Browsing and Navigation in Relational Databases
Interactive Browsing and Navigation in Relational DatabasesMinsuk Kahng
 
Basic Data Engineering
Basic Data EngineeringBasic Data Engineering
Basic Data EngineeringNovita Sari
 
Intuit - Machine learning platform lifecycle management 2018
Intuit - Machine learning platform lifecycle management  2018Intuit - Machine learning platform lifecycle management  2018
Intuit - Machine learning platform lifecycle management 2018Karthik Murugesan
 
TienResumeFinalV22016
TienResumeFinalV22016TienResumeFinalV22016
TienResumeFinalV22016Nora Tien
 
Carmelo Iaria, AI Academy - How The AI Academy is accelerating NLP projects w...
Carmelo Iaria, AI Academy - How The AI Academy is accelerating NLP projects w...Carmelo Iaria, AI Academy - How The AI Academy is accelerating NLP projects w...
Carmelo Iaria, AI Academy - How The AI Academy is accelerating NLP projects w...Sri Ambati
 
Machine Learning with GraphLab Create
Machine Learning with GraphLab CreateMachine Learning with GraphLab Create
Machine Learning with GraphLab CreateTuri, Inc.
 
RajaSekharPulaVenkata
RajaSekharPulaVenkataRajaSekharPulaVenkata
RajaSekharPulaVenkataRaja Sekhar
 
London atlassian meetup 31 jan 2016 jira metrics-extract slides
London atlassian meetup 31 jan 2016 jira metrics-extract slidesLondon atlassian meetup 31 jan 2016 jira metrics-extract slides
London atlassian meetup 31 jan 2016 jira metrics-extract slidesRudiger Wolf
 

What's hot (20)

The A-Z of Data: Introduction to MLOps
The A-Z of Data: Introduction to MLOpsThe A-Z of Data: Introduction to MLOps
The A-Z of Data: Introduction to MLOps
 
Data ops: Machine Learning in production
Data ops: Machine Learning in productionData ops: Machine Learning in production
Data ops: Machine Learning in production
 
Architecting for Data Science
Architecting for Data ScienceArchitecting for Data Science
Architecting for Data Science
 
QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...
QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...
QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...
 
BigML Release: PCA
BigML Release: PCABigML Release: PCA
BigML Release: PCA
 
Machine Learning Project Lifecycle
Machine Learning Project LifecycleMachine Learning Project Lifecycle
Machine Learning Project Lifecycle
 
ALT-F1 Techtalk 3 - Google AppEngine
ALT-F1 Techtalk 3 - Google AppEngineALT-F1 Techtalk 3 - Google AppEngine
ALT-F1 Techtalk 3 - Google AppEngine
 
Wen Tang resume~
Wen Tang resume~Wen Tang resume~
Wen Tang resume~
 
Towards Human-Centered Machine Learning
Towards Human-Centered Machine LearningTowards Human-Centered Machine Learning
Towards Human-Centered Machine Learning
 
Interactive Browsing and Navigation in Relational Databases
Interactive Browsing and Navigation in Relational DatabasesInteractive Browsing and Navigation in Relational Databases
Interactive Browsing and Navigation in Relational Databases
 
Basic Data Engineering
Basic Data EngineeringBasic Data Engineering
Basic Data Engineering
 
Intuit - Machine learning platform lifecycle management 2018
Intuit - Machine learning platform lifecycle management  2018Intuit - Machine learning platform lifecycle management  2018
Intuit - Machine learning platform lifecycle management 2018
 
TienResumeFinalV22016
TienResumeFinalV22016TienResumeFinalV22016
TienResumeFinalV22016
 
Carmelo Iaria, AI Academy - How The AI Academy is accelerating NLP projects w...
Carmelo Iaria, AI Academy - How The AI Academy is accelerating NLP projects w...Carmelo Iaria, AI Academy - How The AI Academy is accelerating NLP projects w...
Carmelo Iaria, AI Academy - How The AI Academy is accelerating NLP projects w...
 
Brian_Thomas_Resume_20160215
Brian_Thomas_Resume_20160215Brian_Thomas_Resume_20160215
Brian_Thomas_Resume_20160215
 
Machine Learning with GraphLab Create
Machine Learning with GraphLab CreateMachine Learning with GraphLab Create
Machine Learning with GraphLab Create
 
RajaSekharPulaVenkata
RajaSekharPulaVenkataRajaSekharPulaVenkata
RajaSekharPulaVenkata
 
London atlassian meetup 31 jan 2016 jira metrics-extract slides
London atlassian meetup 31 jan 2016 jira metrics-extract slidesLondon atlassian meetup 31 jan 2016 jira metrics-extract slides
London atlassian meetup 31 jan 2016 jira metrics-extract slides
 
Resume_ChiungLun_Hung
Resume_ChiungLun_HungResume_ChiungLun_Hung
Resume_ChiungLun_Hung
 
Iwesep19.ppt
Iwesep19.pptIwesep19.ppt
Iwesep19.ppt
 

Similar to Latent Panelists Affinities: a Helixa case study

Benefiting from Semantic AI along the data life cycle
Benefiting from Semantic AI along the data life cycleBenefiting from Semantic AI along the data life cycle
Benefiting from Semantic AI along the data life cycleMartin Kaltenböck
 
Understanding the New World of Cognitive Computing
Understanding the New World of Cognitive ComputingUnderstanding the New World of Cognitive Computing
Understanding the New World of Cognitive ComputingDATAVERSITY
 
Intel 20180608 v2
Intel 20180608 v2Intel 20180608 v2
Intel 20180608 v2ISSIP
 
Algorithm Marketplace and the new "Algorithm Economy"
Algorithm Marketplace and the new "Algorithm Economy"Algorithm Marketplace and the new "Algorithm Economy"
Algorithm Marketplace and the new "Algorithm Economy"Diego Oppenheimer
 
Wall Street Mastermind Sector Spotlight - Technology (October 2023).pdf
Wall Street Mastermind Sector Spotlight - Technology (October 2023).pdfWall Street Mastermind Sector Spotlight - Technology (October 2023).pdf
Wall Street Mastermind Sector Spotlight - Technology (October 2023).pdfSamShiah1
 
Ai open powermeetupmarch25th
Ai open powermeetupmarch25thAi open powermeetupmarch25th
Ai open powermeetupmarch25thIBM
 
icon-aiincs-obusolini201809131800-190310184140.pptx
icon-aiincs-obusolini201809131800-190310184140.pptxicon-aiincs-obusolini201809131800-190310184140.pptx
icon-aiincs-obusolini201809131800-190310184140.pptxyugandharadahiphale2
 
icon-aiincs-obusolini201809131800-190310184140.pptx
icon-aiincs-obusolini201809131800-190310184140.pptxicon-aiincs-obusolini201809131800-190310184140.pptx
icon-aiincs-obusolini201809131800-190310184140.pptxyugandharadahiphale2
 
Data Science, Personalisation & Product management
Data Science, Personalisation & Product managementData Science, Personalisation & Product management
Data Science, Personalisation & Product managementBhaskar Krishnan
 
20 Best Platforms to Learn Data Science and Machine Learning.pdf
20 Best Platforms to Learn Data Science and Machine Learning.pdf20 Best Platforms to Learn Data Science and Machine Learning.pdf
20 Best Platforms to Learn Data Science and Machine Learning.pdfFuture Education Magazine
 
Kazakhstan digital media_at_svl 20191018 v5
Kazakhstan digital media_at_svl 20191018 v5Kazakhstan digital media_at_svl 20191018 v5
Kazakhstan digital media_at_svl 20191018 v5ISSIP
 
2013 - Yhat - YC app.pdf
2013 - Yhat - YC app.pdf2013 - Yhat - YC app.pdf
2013 - Yhat - YC app.pdfAustin Ogilvie
 
ChatGPT for Social Media Listening: practical application with YouScan’s Insi...
ChatGPT for Social Media Listening: practical application with YouScan’s Insi...ChatGPT for Social Media Listening: practical application with YouScan’s Insi...
ChatGPT for Social Media Listening: practical application with YouScan’s Insi...Ray Poynter
 
Tutorial helsinki 20180313 v1
Tutorial helsinki 20180313 v1Tutorial helsinki 20180313 v1
Tutorial helsinki 20180313 v1ISSIP
 
Cognitive Digital Twin by Fariz Saračević
Cognitive Digital Twin by Fariz SaračevićCognitive Digital Twin by Fariz Saračević
Cognitive Digital Twin by Fariz SaračevićBosnia Agile
 
Social Media Data Collection & Analysis
Social Media Data Collection & AnalysisSocial Media Data Collection & Analysis
Social Media Data Collection & AnalysisScott Sanders
 
#OSSPARIS19 - Overcoming open source challenges in reinforcement learning - W...
#OSSPARIS19 - Overcoming open source challenges in reinforcement learning - W...#OSSPARIS19 - Overcoming open source challenges in reinforcement learning - W...
#OSSPARIS19 - Overcoming open source challenges in reinforcement learning - W...Paris Open Source Summit
 

Similar to Latent Panelists Affinities: a Helixa case study (20)

Benefiting from Semantic AI along the data life cycle
Benefiting from Semantic AI along the data life cycleBenefiting from Semantic AI along the data life cycle
Benefiting from Semantic AI along the data life cycle
 
Understanding the New World of Cognitive Computing
Understanding the New World of Cognitive ComputingUnderstanding the New World of Cognitive Computing
Understanding the New World of Cognitive Computing
 
Intel 20180608 v2
Intel 20180608 v2Intel 20180608 v2
Intel 20180608 v2
 
Bridging the Gap
Bridging the GapBridging the Gap
Bridging the Gap
 
Algorithm Marketplace and the new "Algorithm Economy"
Algorithm Marketplace and the new "Algorithm Economy"Algorithm Marketplace and the new "Algorithm Economy"
Algorithm Marketplace and the new "Algorithm Economy"
 
Wall Street Mastermind Sector Spotlight - Technology (October 2023).pdf
Wall Street Mastermind Sector Spotlight - Technology (October 2023).pdfWall Street Mastermind Sector Spotlight - Technology (October 2023).pdf
Wall Street Mastermind Sector Spotlight - Technology (October 2023).pdf
 
Ai open powermeetupmarch25th
Ai open powermeetupmarch25thAi open powermeetupmarch25th
Ai open powermeetupmarch25th
 
Resume kartikeya sharma
Resume kartikeya sharmaResume kartikeya sharma
Resume kartikeya sharma
 
icon-aiincs-obusolini201809131800-190310184140.pptx
icon-aiincs-obusolini201809131800-190310184140.pptxicon-aiincs-obusolini201809131800-190310184140.pptx
icon-aiincs-obusolini201809131800-190310184140.pptx
 
icon-aiincs-obusolini201809131800-190310184140.pptx
icon-aiincs-obusolini201809131800-190310184140.pptxicon-aiincs-obusolini201809131800-190310184140.pptx
icon-aiincs-obusolini201809131800-190310184140.pptx
 
Data Science, Personalisation & Product management
Data Science, Personalisation & Product managementData Science, Personalisation & Product management
Data Science, Personalisation & Product management
 
20 Best Platforms to Learn Data Science and Machine Learning.pdf
20 Best Platforms to Learn Data Science and Machine Learning.pdf20 Best Platforms to Learn Data Science and Machine Learning.pdf
20 Best Platforms to Learn Data Science and Machine Learning.pdf
 
Kazakhstan digital media_at_svl 20191018 v5
Kazakhstan digital media_at_svl 20191018 v5Kazakhstan digital media_at_svl 20191018 v5
Kazakhstan digital media_at_svl 20191018 v5
 
Data Analytics for IoT
Data Analytics for IoT Data Analytics for IoT
Data Analytics for IoT
 
2013 - Yhat - YC app.pdf
2013 - Yhat - YC app.pdf2013 - Yhat - YC app.pdf
2013 - Yhat - YC app.pdf
 
ChatGPT for Social Media Listening: practical application with YouScan’s Insi...
ChatGPT for Social Media Listening: practical application with YouScan’s Insi...ChatGPT for Social Media Listening: practical application with YouScan’s Insi...
ChatGPT for Social Media Listening: practical application with YouScan’s Insi...
 
Tutorial helsinki 20180313 v1
Tutorial helsinki 20180313 v1Tutorial helsinki 20180313 v1
Tutorial helsinki 20180313 v1
 
Cognitive Digital Twin by Fariz Saračević
Cognitive Digital Twin by Fariz SaračevićCognitive Digital Twin by Fariz Saračević
Cognitive Digital Twin by Fariz Saračević
 
Social Media Data Collection & Analysis
Social Media Data Collection & AnalysisSocial Media Data Collection & Analysis
Social Media Data Collection & Analysis
 
#OSSPARIS19 - Overcoming open source challenges in reinforcement learning - W...
#OSSPARIS19 - Overcoming open source challenges in reinforcement learning - W...#OSSPARIS19 - Overcoming open source challenges in reinforcement learning - W...
#OSSPARIS19 - Overcoming open source challenges in reinforcement learning - W...
 

More from Gianmario Spacagna

Managers guide to effective building of machine learning products
Managers guide to effective building of machine learning productsManagers guide to effective building of machine learning products
Managers guide to effective building of machine learning productsGianmario Spacagna
 
Anomaly Detection using Deep Auto-Encoders
Anomaly Detection using Deep Auto-EncodersAnomaly Detection using Deep Auto-Encoders
Anomaly Detection using Deep Auto-EncodersGianmario Spacagna
 
In-Memory Logical Data Warehouse for accelerating Machine Learning Pipelines ...
In-Memory Logical Data Warehouse for accelerating Machine Learning Pipelines ...In-Memory Logical Data Warehouse for accelerating Machine Learning Pipelines ...
In-Memory Logical Data Warehouse for accelerating Machine Learning Pipelines ...Gianmario Spacagna
 
Logical-DataWarehouse-Alluxio-meetup
Logical-DataWarehouse-Alluxio-meetupLogical-DataWarehouse-Alluxio-meetup
Logical-DataWarehouse-Alluxio-meetupGianmario Spacagna
 
Robust and declarative machine learning pipelines for predictive buying at Ba...
Robust and declarative machine learning pipelines for predictive buying at Ba...Robust and declarative machine learning pipelines for predictive buying at Ba...
Robust and declarative machine learning pipelines for predictive buying at Ba...Gianmario Spacagna
 
Parallel Tuning of Machine Learning Algorithms, Thesis Proposal
Parallel Tuning of Machine Learning Algorithms, Thesis ProposalParallel Tuning of Machine Learning Algorithms, Thesis Proposal
Parallel Tuning of Machine Learning Algorithms, Thesis ProposalGianmario Spacagna
 

More from Gianmario Spacagna (7)

Managers guide to effective building of machine learning products
Managers guide to effective building of machine learning productsManagers guide to effective building of machine learning products
Managers guide to effective building of machine learning products
 
Anomaly Detection using Deep Auto-Encoders
Anomaly Detection using Deep Auto-EncodersAnomaly Detection using Deep Auto-Encoders
Anomaly Detection using Deep Auto-Encoders
 
In-Memory Logical Data Warehouse for accelerating Machine Learning Pipelines ...
In-Memory Logical Data Warehouse for accelerating Machine Learning Pipelines ...In-Memory Logical Data Warehouse for accelerating Machine Learning Pipelines ...
In-Memory Logical Data Warehouse for accelerating Machine Learning Pipelines ...
 
Logical-DataWarehouse-Alluxio-meetup
Logical-DataWarehouse-Alluxio-meetupLogical-DataWarehouse-Alluxio-meetup
Logical-DataWarehouse-Alluxio-meetup
 
Robust and declarative machine learning pipelines for predictive buying at Ba...
Robust and declarative machine learning pipelines for predictive buying at Ba...Robust and declarative machine learning pipelines for predictive buying at Ba...
Robust and declarative machine learning pipelines for predictive buying at Ba...
 
TunUp final presentation
TunUp final presentationTunUp final presentation
TunUp final presentation
 
Parallel Tuning of Machine Learning Algorithms, Thesis Proposal
Parallel Tuning of Machine Learning Algorithms, Thesis ProposalParallel Tuning of Machine Learning Algorithms, Thesis Proposal
Parallel Tuning of Machine Learning Algorithms, Thesis Proposal
 

Recently uploaded

Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbuapidays
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfOverkill Security
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...apidays
 

Recently uploaded (20)

Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 

Latent Panelists Affinities: a Helixa case study

  • 1. IBM Cloud / DOC ID / Month XX, 2018 / © 2018 IBM Corporation #PartyCloud A journey into Data Science & AI: “Latent Panelists Affinities” case study — IBM Data Workshop 20-09-2018 Gianmario Spacagna Chief Scientist, Helixa AI
  • 2. About me 2IBM Cloud / DOC ID / Month XX, 2018 / © 2018 IBM Corporation University background: ● Telematics Engineering (Polytechnic of Turin) ● Software Engineering of Distributed Systems (KTH Stockholm) Working experience: ● Predictive Marketing (AgilOne) ● Cyber Security (Cisco) ● Retail & Business Banking (Barclays) ● Automotive (Pirelli) ● Artificial Social Intelligence (Helixa)
  • 3. External Projects 3 ● Co-author of “Python Deep Learning” book and “The Professional Data Science Manifesto” ● Founder of the “Data Science Milan” community and the Machine Intelligence Hub network
  • 4. Intelligence 4 ● Capacity to learn from experience * ● Ability to adapt to different context * ● The use of metacognition to enhance learning * * Cognitive Psychology 4th edition, Robert J Sternberg, Chapter 13
  • 5. Social Intelligence 5 ● Ability to get along with others * ● Knowledge of social matters * ● Insight into moods and or underlying personality traits of others * * Cognitive Psychology 4th edition, Robert J Sternberg, Chapter 13
  • 6. Artificial (Social) Intelligence 6 ● The computational part of the ability to achieve (social)** goals in the world* ● The application of machine intelligence techniques to social phenomena *** * Cognitive Psychology 4th edition, Robert J Sternberg, Chapter 13 ** My own social re-interpretation *** Artificial Social Intelligence, William Sims Bainbridge et al., Annual Review of Sociology, Vol. 20 (1994), pp. 407-436
  • 7. Generative AI Technology 7 A generative algorithm ensembling AI models and prior knowledge of the world in order to unify different data sources into a single population of synthetic users representing an augmented view of U.S. consumers and their affinities.
  • 8. Case Study: Panelists Latent Affinities Influencers & Celebrities Products & Brands Media & Publishers Anonymous Panelists Survey
  • 9. Unclassified Entities 9 Products and Brands Media & Publishers Influencers and Celebrities ? ? ? Potentially millions of entities!
  • 10. Partially-responded Survey 10 Are you inspired by Elon Musk? Are you interested on SpaceX mission? Have you drunk Starbucks coffee in the last month? Do you read NY Times at least once a week? Do you listen to Led Zeppelin? ❌ N/A ✅ N/A ✅ ❌ N/A N/A ❌ N/A ✅ ✅ ❌ N/A ✅ ✅ N/A ❌ N/A ✅
  • 11. Given a set of uncategorized entities and a set of anonymized users along with some observable affinities: 1. What category each entity belongs to? 2. What are his/her latent affinites? 11
  • 12. Social Agent Goals 12 1. Identify the category of each entity (e.g. influencer, product, media) 2. Learn representations of the entities (e.g. grouping them in shared topics such as sports, music genres, movie kinds) 3. Learn how to map one entity to another (e.g. Elon Musk : people = SpaceX : technology) 4. Estimate latent affinities by reasoning on the available observations (e.g. if you are interested in Rock music and 70s culture, you are very likely to be a fan of Led Zeppelin)
  • 13. Entities Classifier 13 Input: ● Entity attributes ● Survey statistics Output: ● Influencer ● Product ● Media
  • 14. Recommender System for Affinities 14 ● Build a user-item matrix ● Use implicit feedback to represent missing affinities ● Decompose it in the multiplication of user-topic and topic-item ● Infer probability scores of latent items Alternating Least Squares algorithm
  • 16. Research Spike Workflow 16IBM Cloud / DOC ID / Month XX, 2018 / © 2018 IBM Corporation Deliverable: ● Findings report ● Insights ● Know-how
  • 17. Development Workflow 17 1. Time-boxed research spikes followed by clearly defined feature stories 2. Notebooks good for analysis, entry points and results presentation 3. Code developed in modules and functions in a proper IDE 4. Unit tests: replace “assertEqual” with uncertainty ranges 5. Versioning: both code, priors, data, evaluation results and models 6. Git-flow branching system, pull request, peer review 7. End-to-end testing using small datasets and Continuous Integration First release plan focused on a MVP satisfying the acceptance criteria defined by early adopter customers instead of just PoC
  • 18. Stack 18 IBM Cloud / DOC ID / Month XX, 2018 / © 2018 IBM Corporation Programming Languages: Scientific Tools: Distributed Systems: Machine Learning: Visualization: Storage: ModelDB DevOps: Versioning:
  • 19. The Crew ML Engineer ML Scientist Chief Scientist Mathematician Psychologist Data Engineer NLP Specialist
  • 20. Challenges 20 1. Business integration (technical and cultural) 2. Lack of expertise in the market and steep learning curve 3. Too many different tools and technologies