SlideShare una empresa de Scribd logo
1 de 13
Learn How to
Turbocharge Your
AI/ML Data Workflows
with Data Enrichment
Tim McKenzie | Director, Solution Architecture
1
Location data challenges
• Location is Messy- Addresses, Lat/Long,
Shapes, Lines, Formats
• Complexity of Joining Location Based Data
Sources (3rd Party and Internal)
• Data Sourcing Challenges- Many Providers,
Many Formats, Many Pricing and Licensing
Differences
• Global Extensibility- Data Sources Tend to
Be Regional Yet Use Cases are Often
Global
• Need to Identify and Process Multi-Family
and Condo Properties
• De-centralized repositories of data
• Complex properties can often have multiple
valid addresses, parcels and buildings.
• Legal descriptions in variety of format
leading to discrepancy, inefficiencies, errors
and non-compliance
2
“For every minute spent in
organizing, an hour is earned.”
Benjamin Franklin
Inventor, Statesman, Insurer
Data prep slows data science
3%
19%
9%
4%
5%
What data
scientists spend
the most time
doing
Building data sets
Cleaning and organizing data
Collecting datasets
Mining data for patterns
Refining algorithms
Other
accounts for about 80%
of the work of data
scientists
3
3
Location enabling strategies for data analytics
03.
Analyze
Apply data science at
scale to gain a
competitive advantage
02.
Enrich
Leverage trusted ID to
join massive amounts of
your own and 3rd party
data sources
01.
Organize
Assign a trusted ID that is
unique and persistent to
each address
4
Fast, easy, and consistent data enrichment
5
Precisely’s Geo Addressing with hyper-accurate Master Location Data (MLD) reference data
• Belgium & Luxembourg
• Canada
• Finland
• France
• Germany
• Great Britain
• Ireland
• Netherlands
• Sweden
• Singapore
• United States
• More coming soon!
International
Coverage
Data
Sources
• Postal Authorities
• Government
datasets: local city,
county, and state
• Global Vendors
• Local Players
• Open Sources
• Proprietary
Sources
• Largest & Best available
• Unparalleled &
• Parent-child relationship,
• Unique and Persistent Identifier,
• Multi-sourced,
• Simplify data enrichment process,
MLD Attributes
Data Enrichment – A global product portfolio
Addresses & Property
Verified and validated address and
property data for map display and
analytics
Boundaries
Administrative, community, and
industry-specific boundaries for data
enrichment and territory analysis
Demographics
Demographic and consumer context
data for better understanding people
and behavior
Points of Interest
Detailed business, leisure, and
geographic features for location
and competitive intelligence
Streets
Robust street-level data for mapping,
analysis, routing, and geocoding
Risk
Natural hazard boundaries related to
flood, fire, earthquakes, and weather
Expertly curated datasets containing thousands of attributes for faster, confident decisions
6
Uniquely positioned to address data enrichment needs
Global coverage location enrichment data. Our portfolio includes:
• 400+ datasets
• 250+ countries and territories
• 100s of millions of data points
Datasets that are interoperable and are managed to quality standard, with consistent documentation, and
support e.g.
• Property Graph
• Market and Community Link
Ability to enrich with dynamic data (Dynamic Weather and Dynamic Demographics)
• Data that includes time as a dimension
• Creating insights from data that is updated at regular and short time intervals (e.g. 5 min)
Data experience through deep-domain expertise
• Adding data through, development, partnerships, and acquisitions
Best-in-class addressing and property datasets with a unique and persistent ID
• Link Precisely and customer address, buildings, demographics, risk, and more data using the PreciselyID,
a unique and persistent location identifier
7
Cloud-based location analytics technology
8
Spatial
Functions
30+ Common
Spatial Processes
Global
Geocoding
Forward & Reverse
Global Geocoding
and Trusted ID
Global
Addressing
Validate,
standardize and
parse global
addresses
Global Tax
Jurisdiction
Assign highly
granular tax
jurisdictions
globally.
Map
Visualization
Visualize Location
Data at Scale
Global Street
Routing
Assign isochrones
and isodistance
anywhere in the
world.
Location-enabled analytics
Bank Branch & ATM
Call Center/ Web
Customers by Product
Commercial & Mortgage
Active Mortgages
Historical Defaults
Geocoding and location
intelligence capabilities to
organize and enrich your data
Financial Transactions
All of your sources
Any structure
or frequency
Analytics capabilities for
any use case or persona
Ad Hoc Data Science
Low-cost, rapid experimentation with
new data and models.
Explainable Machine Learning
High volume, fine-grained analysis at scale
served in the tightest of service windows.
BI Reporting & Dashboarding
Power real-time dashboarding directly,
or feed data to a data warehouse for
high-concurrency reporting.
Real-time Applications
Provide real-time data to downstream
applications or power applications via APIs.
PreciselyID
ADMIN
BOUNDARIES
BANK DEPOSITS
MOBILE
MOVEMENT
WEATHER
EVENTS
HAZARD &
RISK DATA
AMENITIES &
COMPETITION
EVERY US/CAN
ADDRESS
BUSINESS
LOCATIONS
PROPERTY
ATTRIBUTES
SCHOOLS &
NEIGHBORHOODS
POPULATION
DEMOGRAPHICS
PARCELS &
BUILDINGS
Analytics Platform
Understanding the
data challenge
10
• Accessing the right raw data
• Keeping up with continuously changing data feeds
• Building features from raw data
• Combining features into training data
• Calculating and serving features in production
• Monitoring features in production
Key data challenges that organizations
face when productionizing ML systems
10
What is a “feature-based”
architecture?
11
A feature store is an ML-specific data system that:
• Runs data pipelines that transform raw data into
feature values
• Stores and manages the feature data itself, and
• Serves feature data consistently for training and
inference purposes
A feature is data used as an input
signal to a predictive model
11
12
Processing
Storage
Inputs
Location specific records Shape files Streaming records
Address Fabric
Analytics
Processing
• Model outputs
• Scores
• Computed columns
• Analysis outcome
Batch Geocoding
with the Operational
Addressing SDKs
• Vaildate input addresses
• Validate other data
• Locate addresses
• Match inputs
• Assign PreciselyID
• Relate data around
PrecisleyID
Batch Spatial
Processing
with the Location
Intelligence SDK
• Flatten shape files
• Compute PIP
• Compute D2P, D2L
• Compute basic scores
• Generate geohash
• Relate data around geohash
(where application)
Realtime Processing
with the Precisely SDKs
• Operational Addressing APIs
• Assign PreciselyID
• Generate geohash
• Relate data
Message Bus
Feature Store
In-stream Analytics Layer
Model outputs, scores, computed columns,
analysis outcomes
PrecisleyID Address
P0000MK1IAAD 287 E 300 S. Provo, UT 84606
P0000MK1DPRD 410 N University Ave. Provo, UT 84601
Vendor
data files
Customer Loyalty Records
Equipment Inventories
Franchise
Zones
Pricing Delivery
Territories
Mobile Trace
Data
POS/IOT
Data
Administration, Governance, Security, Connectivity, Schema, Catalog
Model
Training
EDW
precisely
Data subscriptions
with PreciselyID
PrecisleyID Address Name Type Score Location MICode PointCode DemoRgn
P0000MK1IAAD 287 E 300 S. Provo, UT 84606 Empas LLC REST 91.529 UT108 10020100 101067669 8926
P0000MK1DPRD 410 N University Ave. Provo, UT 84601 THAI HUT REST 65.981 UT108 10020100 100854441 4144
…. ….. ….. ….. ….. …. …. ….. ….
Thank you
Tim McKenzie
Tim.McKenzie@precisely.com
Phone: 678-428-1770

Más contenido relacionado

Similar a Learn How to Turbocharge Your AI/ML Data Workflows with Data Enrichment

Harnessing the Power of Advanced Insurance Analytics Through Property Data
Harnessing the Power of Advanced Insurance Analytics Through Property DataHarnessing the Power of Advanced Insurance Analytics Through Property Data
Harnessing the Power of Advanced Insurance Analytics Through Property Data
Precisely
 
Simplifying Data Interoperability with Geo Addressing and Enrichment
Simplifying Data Interoperability with Geo Addressing and EnrichmentSimplifying Data Interoperability with Geo Addressing and Enrichment
Simplifying Data Interoperability with Geo Addressing and Enrichment
Precisely
 
Big Data Matching - How to Find Two Similar Needles in a Really Big Haystack
Big Data Matching - How to Find Two Similar Needles in a Really Big HaystackBig Data Matching - How to Find Two Similar Needles in a Really Big Haystack
Big Data Matching - How to Find Two Similar Needles in a Really Big Haystack
Precisely
 

Similar a Learn How to Turbocharge Your AI/ML Data Workflows with Data Enrichment (20)

Harnessing the Power of Advanced Insurance Analytics Through Property Data
Harnessing the Power of Advanced Insurance Analytics Through Property DataHarnessing the Power of Advanced Insurance Analytics Through Property Data
Harnessing the Power of Advanced Insurance Analytics Through Property Data
 
ADV Slides: How to Improve Your Analytic Data Architecture Maturity
ADV Slides: How to Improve Your Analytic Data Architecture MaturityADV Slides: How to Improve Your Analytic Data Architecture Maturity
ADV Slides: How to Improve Your Analytic Data Architecture Maturity
 
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data LakesADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
 
Enrich Your Data Your Way
Enrich Your Data Your WayEnrich Your Data Your Way
Enrich Your Data Your Way
 
Sören Eickhoff, Informatica GmbH, "Informatica Intelligent Data Lake – Self S...
Sören Eickhoff, Informatica GmbH, "Informatica Intelligent Data Lake – Self S...Sören Eickhoff, Informatica GmbH, "Informatica Intelligent Data Lake – Self S...
Sören Eickhoff, Informatica GmbH, "Informatica Intelligent Data Lake – Self S...
 
The Shifting Landscape of Data Integration
The Shifting Landscape of Data IntegrationThe Shifting Landscape of Data Integration
The Shifting Landscape of Data Integration
 
Sami patel full_resume
Sami patel full_resumeSami patel full_resume
Sami patel full_resume
 
Building the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseBuilding the Artificially Intelligent Enterprise
Building the Artificially Intelligent Enterprise
 
Solving Volume, Velocity, and Variety Challenges with Location
Solving Volume, Velocity, and Variety Challenges with LocationSolving Volume, Velocity, and Variety Challenges with Location
Solving Volume, Velocity, and Variety Challenges with Location
 
ADV Slides: Data Pipelines in the Enterprise and Comparison
ADV Slides: Data Pipelines in the Enterprise and ComparisonADV Slides: Data Pipelines in the Enterprise and Comparison
ADV Slides: Data Pipelines in the Enterprise and Comparison
 
Simplifying Data Interoperability with Geo Addressing and Enrichment
Simplifying Data Interoperability with Geo Addressing and EnrichmentSimplifying Data Interoperability with Geo Addressing and Enrichment
Simplifying Data Interoperability with Geo Addressing and Enrichment
 
Oracle Stream Analytics - Developer Introduction
Oracle Stream Analytics - Developer IntroductionOracle Stream Analytics - Developer Introduction
Oracle Stream Analytics - Developer Introduction
 
Data Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricData Mesh using Microsoft Fabric
Data Mesh using Microsoft Fabric
 
Big Data Matching - How to Find Two Similar Needles in a Really Big Haystack
Big Data Matching - How to Find Two Similar Needles in a Really Big HaystackBig Data Matching - How to Find Two Similar Needles in a Really Big Haystack
Big Data Matching - How to Find Two Similar Needles in a Really Big Haystack
 
Empowering Business & IT Teams:  Modern Data Catalog Requirements
Empowering Business & IT Teams:  Modern Data Catalog RequirementsEmpowering Business & IT Teams:  Modern Data Catalog Requirements
Empowering Business & IT Teams:  Modern Data Catalog Requirements
 
(Data) Integrity Matters: Four Ways You Can Build Trust in Your Data
(Data) Integrity Matters: Four Ways You Can Build Trust in Your Data(Data) Integrity Matters: Four Ways You Can Build Trust in Your Data
(Data) Integrity Matters: Four Ways You Can Build Trust in Your Data
 
How a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 ViewHow a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 View
 
Transforming and Scaling Large Scale Data Analytics: Moving to a Cloud-based ...
Transforming and Scaling Large Scale Data Analytics: Moving to a Cloud-based ...Transforming and Scaling Large Scale Data Analytics: Moving to a Cloud-based ...
Transforming and Scaling Large Scale Data Analytics: Moving to a Cloud-based ...
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
 
Digital intelligence satish bhatia
Digital intelligence satish bhatiaDigital intelligence satish bhatia
Digital intelligence satish bhatia
 

Más de Precisely

How to Build Data Governance Programs That Last - A Business-First Approach.pdf
How to Build Data Governance Programs That Last - A Business-First Approach.pdfHow to Build Data Governance Programs That Last - A Business-First Approach.pdf
How to Build Data Governance Programs That Last - A Business-First Approach.pdf
Precisely
 
Zukuntssichere SAP Prozesse dank automatisierter Massendaten
Zukuntssichere SAP Prozesse dank automatisierter MassendatenZukuntssichere SAP Prozesse dank automatisierter Massendaten
Zukuntssichere SAP Prozesse dank automatisierter Massendaten
Precisely
 
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
Precisely
 
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3fTestjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
Precisely
 
AI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity WebinarAI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity Webinar
Precisely
 
Moving IBM i Applications to the Cloud with AWS and Precisely
Moving IBM i Applications to the Cloud with AWS and PreciselyMoving IBM i Applications to the Cloud with AWS and Precisely
Moving IBM i Applications to the Cloud with AWS and Precisely
Precisely
 
Automate Your Master Data Processes for Shared Service Center Excellence
Automate Your Master Data Processes for Shared Service Center ExcellenceAutomate Your Master Data Processes for Shared Service Center Excellence
Automate Your Master Data Processes for Shared Service Center Excellence
Precisely
 

Más de Precisely (20)

How to Build Data Governance Programs That Last - A Business-First Approach.pdf
How to Build Data Governance Programs That Last - A Business-First Approach.pdfHow to Build Data Governance Programs That Last - A Business-First Approach.pdf
How to Build Data Governance Programs That Last - A Business-First Approach.pdf
 
Zukuntssichere SAP Prozesse dank automatisierter Massendaten
Zukuntssichere SAP Prozesse dank automatisierter MassendatenZukuntssichere SAP Prozesse dank automatisierter Massendaten
Zukuntssichere SAP Prozesse dank automatisierter Massendaten
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power Systems
 
Crucial Considerations for AI-ready Data.pdf
Crucial Considerations for AI-ready Data.pdfCrucial Considerations for AI-ready Data.pdf
Crucial Considerations for AI-ready Data.pdf
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Justifying Capacity Managment Webinar 4/10
Justifying Capacity Managment Webinar 4/10Justifying Capacity Managment Webinar 4/10
Justifying Capacity Managment Webinar 4/10
 
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
 
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...
 
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3fTestjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
 
Data Innovation Summit: Data Integrity Trends
Data Innovation Summit: Data Integrity TrendsData Innovation Summit: Data Integrity Trends
Data Innovation Summit: Data Integrity Trends
 
AI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity WebinarAI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity Webinar
 
Optimisez la fonction financière en automatisant vos processus SAP
Optimisez la fonction financière en automatisant vos processus SAPOptimisez la fonction financière en automatisant vos processus SAP
Optimisez la fonction financière en automatisant vos processus SAP
 
SAPS/4HANA Migration - Transformation-Management + nachhaltige Investitionen
SAPS/4HANA Migration - Transformation-Management + nachhaltige InvestitionenSAPS/4HANA Migration - Transformation-Management + nachhaltige Investitionen
SAPS/4HANA Migration - Transformation-Management + nachhaltige Investitionen
 
Automatisierte SAP Prozesse mit Hilfe von APIs
Automatisierte SAP Prozesse mit Hilfe von APIsAutomatisierte SAP Prozesse mit Hilfe von APIs
Automatisierte SAP Prozesse mit Hilfe von APIs
 
Moving IBM i Applications to the Cloud with AWS and Precisely
Moving IBM i Applications to the Cloud with AWS and PreciselyMoving IBM i Applications to the Cloud with AWS and Precisely
Moving IBM i Applications to the Cloud with AWS and Precisely
 
Effective Security Monitoring for IBM i: What You Need to Know
Effective Security Monitoring for IBM i: What You Need to KnowEffective Security Monitoring for IBM i: What You Need to Know
Effective Security Monitoring for IBM i: What You Need to Know
 
Automate Your Master Data Processes for Shared Service Center Excellence
Automate Your Master Data Processes for Shared Service Center ExcellenceAutomate Your Master Data Processes for Shared Service Center Excellence
Automate Your Master Data Processes for Shared Service Center Excellence
 
5 Keys to Improved IT Operation Management
5 Keys to Improved IT Operation Management5 Keys to Improved IT Operation Management
5 Keys to Improved IT Operation Management
 
Unlock Efficiency With Your Address Data Today For a Smarter Tomorrow
Unlock Efficiency With Your Address Data Today For a Smarter TomorrowUnlock Efficiency With Your Address Data Today For a Smarter Tomorrow
Unlock Efficiency With Your Address Data Today For a Smarter Tomorrow
 
Navigating Cloud Trends in 2024 Webinar Deck
Navigating Cloud Trends in 2024 Webinar DeckNavigating Cloud Trends in 2024 Webinar Deck
Navigating Cloud Trends in 2024 Webinar Deck
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Último (20)

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 

Learn How to Turbocharge Your AI/ML Data Workflows with Data Enrichment

  • 1. Learn How to Turbocharge Your AI/ML Data Workflows with Data Enrichment Tim McKenzie | Director, Solution Architecture 1
  • 2. Location data challenges • Location is Messy- Addresses, Lat/Long, Shapes, Lines, Formats • Complexity of Joining Location Based Data Sources (3rd Party and Internal) • Data Sourcing Challenges- Many Providers, Many Formats, Many Pricing and Licensing Differences • Global Extensibility- Data Sources Tend to Be Regional Yet Use Cases are Often Global • Need to Identify and Process Multi-Family and Condo Properties • De-centralized repositories of data • Complex properties can often have multiple valid addresses, parcels and buildings. • Legal descriptions in variety of format leading to discrepancy, inefficiencies, errors and non-compliance 2 “For every minute spent in organizing, an hour is earned.” Benjamin Franklin Inventor, Statesman, Insurer
  • 3. Data prep slows data science 3% 19% 9% 4% 5% What data scientists spend the most time doing Building data sets Cleaning and organizing data Collecting datasets Mining data for patterns Refining algorithms Other accounts for about 80% of the work of data scientists 3 3
  • 4. Location enabling strategies for data analytics 03. Analyze Apply data science at scale to gain a competitive advantage 02. Enrich Leverage trusted ID to join massive amounts of your own and 3rd party data sources 01. Organize Assign a trusted ID that is unique and persistent to each address 4
  • 5. Fast, easy, and consistent data enrichment 5 Precisely’s Geo Addressing with hyper-accurate Master Location Data (MLD) reference data • Belgium & Luxembourg • Canada • Finland • France • Germany • Great Britain • Ireland • Netherlands • Sweden • Singapore • United States • More coming soon! International Coverage Data Sources • Postal Authorities • Government datasets: local city, county, and state • Global Vendors • Local Players • Open Sources • Proprietary Sources • Largest & Best available • Unparalleled & • Parent-child relationship, • Unique and Persistent Identifier, • Multi-sourced, • Simplify data enrichment process, MLD Attributes
  • 6. Data Enrichment – A global product portfolio Addresses & Property Verified and validated address and property data for map display and analytics Boundaries Administrative, community, and industry-specific boundaries for data enrichment and territory analysis Demographics Demographic and consumer context data for better understanding people and behavior Points of Interest Detailed business, leisure, and geographic features for location and competitive intelligence Streets Robust street-level data for mapping, analysis, routing, and geocoding Risk Natural hazard boundaries related to flood, fire, earthquakes, and weather Expertly curated datasets containing thousands of attributes for faster, confident decisions 6
  • 7. Uniquely positioned to address data enrichment needs Global coverage location enrichment data. Our portfolio includes: • 400+ datasets • 250+ countries and territories • 100s of millions of data points Datasets that are interoperable and are managed to quality standard, with consistent documentation, and support e.g. • Property Graph • Market and Community Link Ability to enrich with dynamic data (Dynamic Weather and Dynamic Demographics) • Data that includes time as a dimension • Creating insights from data that is updated at regular and short time intervals (e.g. 5 min) Data experience through deep-domain expertise • Adding data through, development, partnerships, and acquisitions Best-in-class addressing and property datasets with a unique and persistent ID • Link Precisely and customer address, buildings, demographics, risk, and more data using the PreciselyID, a unique and persistent location identifier 7
  • 8. Cloud-based location analytics technology 8 Spatial Functions 30+ Common Spatial Processes Global Geocoding Forward & Reverse Global Geocoding and Trusted ID Global Addressing Validate, standardize and parse global addresses Global Tax Jurisdiction Assign highly granular tax jurisdictions globally. Map Visualization Visualize Location Data at Scale Global Street Routing Assign isochrones and isodistance anywhere in the world.
  • 9. Location-enabled analytics Bank Branch & ATM Call Center/ Web Customers by Product Commercial & Mortgage Active Mortgages Historical Defaults Geocoding and location intelligence capabilities to organize and enrich your data Financial Transactions All of your sources Any structure or frequency Analytics capabilities for any use case or persona Ad Hoc Data Science Low-cost, rapid experimentation with new data and models. Explainable Machine Learning High volume, fine-grained analysis at scale served in the tightest of service windows. BI Reporting & Dashboarding Power real-time dashboarding directly, or feed data to a data warehouse for high-concurrency reporting. Real-time Applications Provide real-time data to downstream applications or power applications via APIs. PreciselyID ADMIN BOUNDARIES BANK DEPOSITS MOBILE MOVEMENT WEATHER EVENTS HAZARD & RISK DATA AMENITIES & COMPETITION EVERY US/CAN ADDRESS BUSINESS LOCATIONS PROPERTY ATTRIBUTES SCHOOLS & NEIGHBORHOODS POPULATION DEMOGRAPHICS PARCELS & BUILDINGS Analytics Platform
  • 10. Understanding the data challenge 10 • Accessing the right raw data • Keeping up with continuously changing data feeds • Building features from raw data • Combining features into training data • Calculating and serving features in production • Monitoring features in production Key data challenges that organizations face when productionizing ML systems 10
  • 11. What is a “feature-based” architecture? 11 A feature store is an ML-specific data system that: • Runs data pipelines that transform raw data into feature values • Stores and manages the feature data itself, and • Serves feature data consistently for training and inference purposes A feature is data used as an input signal to a predictive model 11
  • 12. 12 Processing Storage Inputs Location specific records Shape files Streaming records Address Fabric Analytics Processing • Model outputs • Scores • Computed columns • Analysis outcome Batch Geocoding with the Operational Addressing SDKs • Vaildate input addresses • Validate other data • Locate addresses • Match inputs • Assign PreciselyID • Relate data around PrecisleyID Batch Spatial Processing with the Location Intelligence SDK • Flatten shape files • Compute PIP • Compute D2P, D2L • Compute basic scores • Generate geohash • Relate data around geohash (where application) Realtime Processing with the Precisely SDKs • Operational Addressing APIs • Assign PreciselyID • Generate geohash • Relate data Message Bus Feature Store In-stream Analytics Layer Model outputs, scores, computed columns, analysis outcomes PrecisleyID Address P0000MK1IAAD 287 E 300 S. Provo, UT 84606 P0000MK1DPRD 410 N University Ave. Provo, UT 84601 Vendor data files Customer Loyalty Records Equipment Inventories Franchise Zones Pricing Delivery Territories Mobile Trace Data POS/IOT Data Administration, Governance, Security, Connectivity, Schema, Catalog Model Training EDW precisely Data subscriptions with PreciselyID PrecisleyID Address Name Type Score Location MICode PointCode DemoRgn P0000MK1IAAD 287 E 300 S. Provo, UT 84606 Empas LLC REST 91.529 UT108 10020100 101067669 8926 P0000MK1DPRD 410 N University Ave. Provo, UT 84601 THAI HUT REST 65.981 UT108 10020100 100854441 4144 …. ….. ….. ….. ….. …. …. ….. ….