SlideShare una empresa de Scribd logo
1 de 10
Descargar para leer sin conexión
The Beauty of (Big) Data
Privacy Engineering
Yangcheng Huang
Director of Software Engineering, Data & Analytics
Truata
Who we are
§ Truata was founded in 2018, with investment by
Mastercard and IBM
§ Our goal is to be the world’s leading provider of privacy-
enhanced data analytics and management solutions
§ Based in Dublin, we have a team of 70 people with an R&D
focus on developing cutting edge privacy enhancing
technologies (PETs)
§ International client base across major industry verticals
§ Multiple EU regulators consulted on the Truata solution
Truata Anonymization Service is a cutting-edge solution for GDPR-grade
data anonymization & analytics, allowing companies to analyse and monetize
customer data in fully-anonymized form.
§ Sophisticated and
proprietary
technologies for
data anonymization
and risk calibration
§ Able to generate
fully anonymized
data sets ideally
suited for privacy-
preserving analytics
§ Experienced in
delivering large-
scale anonymized
data analytics
projects
§ Able to drive
significant value
from data while
maintaining
customer trust
§ Delivered by our
customer success
team of data
science and privacy
experts
§ Fully focused on
using privacy-centric
techniques to
generate value from
data
§ Consulting solutions based on our proprietary
methodologies, IP and expertise delivered by
industry leading, subject matter experts.
A big-data privacy engineering problem
• Geo privacy
• Zip-level targeted advertising
• Lat/Long GPS
• Shapefile of zip codes
• Using neighbouring zip’s shopping behaviour
• Problems
• Lat/Long mapping (generalisation of GPS information) &
nearest 10
• (32m) customer’s Lat long mapping onto (1.7m) UK Zips
• (1.7m) nearest 10 Zips out of 1.3m Zips (with Customer
transactions)
• A ‘Trillion’ Problem
• 1,000,000,000,000
• Google processes 61.6 billion web pages today
• Dublin Population 2019: 1,214,666
• Measure the similarity of any two Dubliners
Definition of beauty
Definition of beauty
: the quality of being physically attractive.
: the qualities in a person or a thing that give
pleasure to the senses or the mind.
…
Beauty | Definition of Beauty by Merriam-Webster
https://www.merriam-webster.com › dictionary › beauty
Processing result is beautiful
Engineering (algorithm) is beautiful
Engineering (journey) is beautiful
A combination of 99% passion and 1% skills (big data engineering, spatial engineering and software engineering)
Craftsmanship spirit is beautiful
Summary
▪ Beauty of big data privacy engineering
▪ Geo anonymization and geospatial analytics
workloads with Spark
▪ Maximizing the performance by combining multi-
dimensional spatial indexing with Spark in-
memory computations
▪ Journey of productionizing the geo anonymization
workloads
▪ Craftsmanship spirit
▪ Ongoing work
▪ Mobility trajectory anonymization (patents pending)
▪ Mobility pattern anonymization
Contact
Email: yangcheng.huang_AT_truata.com
Linkedin: https://www.linkedin.com/in/yhuang
www.truata.com
Feedback
Your feedback is important to us.
Don’t forget to rate
and review the sessions.

Más contenido relacionado

La actualidad más candente

Weiyan Zhao, Nationwide Insurance - A Decade of Data Science. The Nationwide ...
Weiyan Zhao, Nationwide Insurance - A Decade of Data Science. The Nationwide ...Weiyan Zhao, Nationwide Insurance - A Decade of Data Science. The Nationwide ...
Weiyan Zhao, Nationwide Insurance - A Decade of Data Science. The Nationwide ...
Sri Ambati
 

La actualidad más candente (20)

Embedding Insight through Prediction Driven Logistics
Embedding Insight through Prediction Driven LogisticsEmbedding Insight through Prediction Driven Logistics
Embedding Insight through Prediction Driven Logistics
 
GraphTour - Popular Use Cases
GraphTour - Popular Use CasesGraphTour - Popular Use Cases
GraphTour - Popular Use Cases
 
Fueling AI & Machine Learning: Legacy Data as a Competitive Advantage
Fueling AI & Machine Learning: Legacy Data as a Competitive AdvantageFueling AI & Machine Learning: Legacy Data as a Competitive Advantage
Fueling AI & Machine Learning: Legacy Data as a Competitive Advantage
 
Ai based analytics in the cloud
Ai based analytics in the cloudAi based analytics in the cloud
Ai based analytics in the cloud
 
Shift AI 2020: Business benefits of privacy-preserving synthetic data | Sebas...
Shift AI 2020: Business benefits of privacy-preserving synthetic data | Sebas...Shift AI 2020: Business benefits of privacy-preserving synthetic data | Sebas...
Shift AI 2020: Business benefits of privacy-preserving synthetic data | Sebas...
 
Shift AI 2020: Building AI-first Products - Ehsan Yousefzadeh (AIG Investments)
Shift AI 2020: Building AI-first Products - Ehsan Yousefzadeh (AIG Investments)Shift AI 2020: Building AI-first Products - Ehsan Yousefzadeh (AIG Investments)
Shift AI 2020: Building AI-first Products - Ehsan Yousefzadeh (AIG Investments)
 
Weiyan Zhao, Nationwide Insurance - A Decade of Data Science. The Nationwide ...
Weiyan Zhao, Nationwide Insurance - A Decade of Data Science. The Nationwide ...Weiyan Zhao, Nationwide Insurance - A Decade of Data Science. The Nationwide ...
Weiyan Zhao, Nationwide Insurance - A Decade of Data Science. The Nationwide ...
 
A Connections-first Approach to Supply Chain Optimization
A Connections-first Approach to Supply Chain OptimizationA Connections-first Approach to Supply Chain Optimization
A Connections-first Approach to Supply Chain Optimization
 
Introduction to Neo4j
Introduction to Neo4jIntroduction to Neo4j
Introduction to Neo4j
 
Meg Mude, Intel - Data Engineering Lifecycle Optimized on Intel - H2O World S...
Meg Mude, Intel - Data Engineering Lifecycle Optimized on Intel - H2O World S...Meg Mude, Intel - Data Engineering Lifecycle Optimized on Intel - H2O World S...
Meg Mude, Intel - Data Engineering Lifecycle Optimized on Intel - H2O World S...
 
Eric van tol
Eric van tolEric van tol
Eric van tol
 
AI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation JourneyAI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation Journey
 
Produktdatenmanagement mit Neo4j
Produktdatenmanagement mit Neo4jProduktdatenmanagement mit Neo4j
Produktdatenmanagement mit Neo4j
 
Solution Centric Architectural Presentation - A Journey from Data Paralysis t...
Solution Centric Architectural Presentation - A Journey from Data Paralysis t...Solution Centric Architectural Presentation - A Journey from Data Paralysis t...
Solution Centric Architectural Presentation - A Journey from Data Paralysis t...
 
Knowledge Graphs for Transformation: Dynamic Context for the Intelligent Ente...
Knowledge Graphs for Transformation: Dynamic Context for the Intelligent Ente...Knowledge Graphs for Transformation: Dynamic Context for the Intelligent Ente...
Knowledge Graphs for Transformation: Dynamic Context for the Intelligent Ente...
 
Cloud-Native Microservices
Cloud-Native MicroservicesCloud-Native Microservices
Cloud-Native Microservices
 
Combining a Knowledge Graph and Graph Algorithms to Find Hidden Skills at NASA
Combining a Knowledge Graph and Graph Algorithms to Find Hidden Skills at NASACombining a Knowledge Graph and Graph Algorithms to Find Hidden Skills at NASA
Combining a Knowledge Graph and Graph Algorithms to Find Hidden Skills at NASA
 
Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...
Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...
Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...
 
The AI Mindset: Bridging Industry and Academic Perspectives
The AI Mindset: Bridging Industry and Academic PerspectivesThe AI Mindset: Bridging Industry and Academic Perspectives
The AI Mindset: Bridging Industry and Academic Perspectives
 
Getting Your Supply Chain Back on Track with AI
Getting Your Supply Chain Back on Track with AIGetting Your Supply Chain Back on Track with AI
Getting Your Supply Chain Back on Track with AI
 

Similar a The Beauty of (Big) Data Privacy Engineering

Connecta Event: Big Query och dataanalys med Google Cloud Platform
Connecta Event: Big Query och dataanalys med Google Cloud PlatformConnecta Event: Big Query och dataanalys med Google Cloud Platform
Connecta Event: Big Query och dataanalys med Google Cloud Platform
ConnectaDigital
 
Profile quick-2
Profile quick-2Profile quick-2
Profile quick-2
Edge Ict
 
CSA Global Technology Brochure 2017
CSA Global Technology Brochure 2017CSA Global Technology Brochure 2017
CSA Global Technology Brochure 2017
Galen White
 
Blackcoffer Business development
Blackcoffer Business  developmentBlackcoffer Business  development
Blackcoffer Business development
Harshita Singh
 
Blackcoffer Business Development
Blackcoffer Business Development Blackcoffer Business Development
Blackcoffer Business Development
Harshita Singh
 
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Matt Stubbs
 

Similar a The Beauty of (Big) Data Privacy Engineering (20)

Bdml ecom
Bdml ecomBdml ecom
Bdml ecom
 
Think Big | Enterprise Artificial Intelligence
Think Big | Enterprise Artificial IntelligenceThink Big | Enterprise Artificial Intelligence
Think Big | Enterprise Artificial Intelligence
 
ZIGRAM Introduction September 2020
ZIGRAM Introduction September 2020ZIGRAM Introduction September 2020
ZIGRAM Introduction September 2020
 
Spark: Building an application from Start to Finish
Spark: Building an application from Start to FinishSpark: Building an application from Start to Finish
Spark: Building an application from Start to Finish
 
Connecta Event: Big Query och dataanalys med Google Cloud Platform
Connecta Event: Big Query och dataanalys med Google Cloud PlatformConnecta Event: Big Query och dataanalys med Google Cloud Platform
Connecta Event: Big Query och dataanalys med Google Cloud Platform
 
Creating a Data Driven Organization - StampedeCon 2016
Creating a Data Driven Organization - StampedeCon 2016Creating a Data Driven Organization - StampedeCon 2016
Creating a Data Driven Organization - StampedeCon 2016
 
Profile quick-2
Profile quick-2Profile quick-2
Profile quick-2
 
Fisher Practice Areas 2012
Fisher Practice Areas 2012Fisher Practice Areas 2012
Fisher Practice Areas 2012
 
DataArt Custom Software Engineering with a Human Approach
DataArt Custom Software Engineering with a Human ApproachDataArt Custom Software Engineering with a Human Approach
DataArt Custom Software Engineering with a Human Approach
 
CSA Global Technology Brochure 2017
CSA Global Technology Brochure 2017CSA Global Technology Brochure 2017
CSA Global Technology Brochure 2017
 
HR Catalog (1).pdf
HR Catalog (1).pdfHR Catalog (1).pdf
HR Catalog (1).pdf
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Blackcoffer Business development
Blackcoffer Business  developmentBlackcoffer Business  development
Blackcoffer Business development
 
Blackcoffer Business Development
Blackcoffer Business Development Blackcoffer Business Development
Blackcoffer Business Development
 
Measure Camp Paris 2017 - Digital Analyst Job Description 2020
Measure Camp Paris 2017 - Digital Analyst Job Description 2020Measure Camp Paris 2017 - Digital Analyst Job Description 2020
Measure Camp Paris 2017 - Digital Analyst Job Description 2020
 
IMGS Geospatial User Group 2014: Hexagon Geospatial Vision, Mission and Strategy
IMGS Geospatial User Group 2014: Hexagon Geospatial Vision, Mission and StrategyIMGS Geospatial User Group 2014: Hexagon Geospatial Vision, Mission and Strategy
IMGS Geospatial User Group 2014: Hexagon Geospatial Vision, Mission and Strategy
 
Big Data
Big DataBig Data
Big Data
 
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
 
The Future Based on AI and Analytics
The Future Based on AI and AnalyticsThe Future Based on AI and Analytics
The Future Based on AI and Analytics
 
GraphTalks Stuttgart - Einführung in Graphdatenbanken und Neo4j
GraphTalks Stuttgart - Einführung in Graphdatenbanken und Neo4jGraphTalks Stuttgart - Einführung in Graphdatenbanken und Neo4j
GraphTalks Stuttgart - Einführung in Graphdatenbanken und Neo4j
 

Más de Databricks

Democratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized Platform
Databricks
 
Stage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationStage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI Integration
Databricks
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchSimplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Databricks
 
Raven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesRaven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction Queries
Databricks
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache Spark
Databricks
 

Más de Databricks (20)

DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptx
 
Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1
 
Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2
 
Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
 
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
 
Democratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized Platform
 
Learn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceLearn to Use Databricks for Data Science
Learn to Use Databricks for Data Science
 
Why APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML MonitoringWhy APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML Monitoring
 
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch FixThe Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
 
Stage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationStage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI Integration
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchSimplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorch
 
Scaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on KubernetesScaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on Kubernetes
 
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark PipelinesScaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
 
Sawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature AggregationsSawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature Aggregations
 
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen SinkRedis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
 
Re-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and SparkRe-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and Spark
 
Raven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesRaven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction Queries
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache Spark
 
Massive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta LakeMassive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta Lake
 

Último

Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
amitlee9823
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
JoseMangaJr1
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
amitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 

Último (20)

Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
ELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptx
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 

The Beauty of (Big) Data Privacy Engineering

  • 1. The Beauty of (Big) Data Privacy Engineering Yangcheng Huang Director of Software Engineering, Data & Analytics Truata
  • 2. Who we are § Truata was founded in 2018, with investment by Mastercard and IBM § Our goal is to be the world’s leading provider of privacy- enhanced data analytics and management solutions § Based in Dublin, we have a team of 70 people with an R&D focus on developing cutting edge privacy enhancing technologies (PETs) § International client base across major industry verticals § Multiple EU regulators consulted on the Truata solution Truata Anonymization Service is a cutting-edge solution for GDPR-grade data anonymization & analytics, allowing companies to analyse and monetize customer data in fully-anonymized form. § Sophisticated and proprietary technologies for data anonymization and risk calibration § Able to generate fully anonymized data sets ideally suited for privacy- preserving analytics § Experienced in delivering large- scale anonymized data analytics projects § Able to drive significant value from data while maintaining customer trust § Delivered by our customer success team of data science and privacy experts § Fully focused on using privacy-centric techniques to generate value from data § Consulting solutions based on our proprietary methodologies, IP and expertise delivered by industry leading, subject matter experts.
  • 3. A big-data privacy engineering problem • Geo privacy • Zip-level targeted advertising • Lat/Long GPS • Shapefile of zip codes • Using neighbouring zip’s shopping behaviour • Problems • Lat/Long mapping (generalisation of GPS information) & nearest 10 • (32m) customer’s Lat long mapping onto (1.7m) UK Zips • (1.7m) nearest 10 Zips out of 1.3m Zips (with Customer transactions) • A ‘Trillion’ Problem • 1,000,000,000,000 • Google processes 61.6 billion web pages today • Dublin Population 2019: 1,214,666 • Measure the similarity of any two Dubliners
  • 4. Definition of beauty Definition of beauty : the quality of being physically attractive. : the qualities in a person or a thing that give pleasure to the senses or the mind. … Beauty | Definition of Beauty by Merriam-Webster https://www.merriam-webster.com › dictionary › beauty
  • 7. Engineering (journey) is beautiful A combination of 99% passion and 1% skills (big data engineering, spatial engineering and software engineering)
  • 9. Summary ▪ Beauty of big data privacy engineering ▪ Geo anonymization and geospatial analytics workloads with Spark ▪ Maximizing the performance by combining multi- dimensional spatial indexing with Spark in- memory computations ▪ Journey of productionizing the geo anonymization workloads ▪ Craftsmanship spirit ▪ Ongoing work ▪ Mobility trajectory anonymization (patents pending) ▪ Mobility pattern anonymization Contact Email: yangcheng.huang_AT_truata.com Linkedin: https://www.linkedin.com/in/yhuang www.truata.com
  • 10. Feedback Your feedback is important to us. Don’t forget to rate and review the sessions.