Big Data & Real Estate
Welcomes
Jon Zifcak, MBA, MSIM
CEO, ZULLOO Inc.
@JonZifcak
Anton Polishko, PhD.
CTO, ZULLOO Inc.
@AntonPolishko
NoSQL
Why?
Benefits?
What is NoSQL?
NoSQL encompasses a wide variety of database technologies that were developed in response to the demands of building modern applications:
● Developers are working with applications that create massive volumes of new, rapidly changing data types: structured, semi-structured, unstructured and polymorphic data.
● Long gone is the twelve-to-eighteen-month waterfall development cycle. Now small teams work in agile sprints, iterating quickly and pushing code every week or two, some even multiple times a day.
● Applications that once served a finite audience are now delivered as services that must be always on, accessible from many different devices and scaled globally to millions of users.
● Organizations are now turning to scale-out architectures using open source software, commodity servers and cloud computing instead of large monolithic servers and storage infrastructure.
Business Application
● Personalization
● Profile Management
● Real-Time Big Data
● Content Management
● Catalog
● Customer 360° View
● Mobile Applications
● Internet of Things
● Digital Communications
● Fraud Detection
Technical Application
Real Estate Data
● Multiple Listing Service (MLS)
○ History of sales and current properties on the market
○ Property descriptions
○ Structured data (literally just a big spreadsheet)
○ However, fields change over time
○ Different locations have different MLS providers and therefore different formats
● Public Records
○ A lot of missing data
○ A huge variety of data (demographics, crime rates, etc.)
● 3rd party providers
○ School reviews
○ Proximity to POIs
○ Same as public records but cleaned up
● Taken together, these sources are heterogeneous
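To make the heterogeneity concrete, here is a minimal sketch of mapping records from two MLS feeds onto one document shape. The provider names, field names, and the `FIELD_ALIASES` table are all hypothetical, chosen only for illustration; real feeds will differ.

```python
from typing import Any

# Hypothetical aliases: each provider may name the same attribute differently.
FIELD_ALIASES = {
    "bedrooms": ("bedrooms", "beds", "BedroomsTotal"),
    "price": ("price", "list_price", "ListPrice"),
    "description": ("description", "remarks", "PublicRemarks"),
}
KNOWN_ALIASES = {alias for aliases in FIELD_ALIASES.values() for alias in aliases}


def normalize_listing(raw: dict[str, Any], source: str) -> dict[str, Any]:
    """Map a provider-specific record onto common field names.

    Missing values are left out of the document rather than stored as
    empty columns; fields we do not recognize are kept under "extra".
    """
    doc: dict[str, Any] = {"source": source}
    for common, aliases in FIELD_ALIASES.items():
        for alias in aliases:
            value = raw.get(alias)
            if value not in (None, ""):
                doc[common] = value
                break
    extra = {k: v for k, v in raw.items() if k not in KNOWN_ALIASES}
    if extra:
        doc["extra"] = extra
    return doc


listing_a = normalize_listing(
    {"beds": 3, "list_price": 450000, "hoa_fee": 120}, "mls_a"
)
listing_b = normalize_listing(
    {"BedroomsTotal": 2, "PublicRemarks": "Corner lot", "ListPrice": None}, "mls_b"
)
print(listing_a)
print(listing_b)
```

The two resulting documents have different sets of keys, which is the kind of record a document store accepts without a schema migration.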
AVMs
● An automated valuation model (AVM) is a service that provides real estate property valuations using mathematical modelling combined with a database
● A typical AVM uses hedonic regression, meaning the property value is decomposed into contributions from individual characteristics:
○ number of bedrooms, bathrooms
○ size of lot
○ distance to the city center, schools, etc.
○ etc.
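As a rough illustration of hedonic regression, the sketch below fits a plain linear model by least squares on synthetic data. The listings, the chosen features, and the coefficients are made up for the example, and a linear form is only one common choice; nothing here is taken from an actual AVM.

```python
import numpy as np

# Synthetic listings (made-up numbers):
# bedrooms, lot size in sq ft, distance to city center in km.
features = np.array([
    [2, 3000, 10.0],
    [3, 4000, 8.0],
    [3, 5000, 12.0],
    [4, 6000, 5.0],
    [5, 8000, 15.0],
    [2, 2500, 3.0],
])

# Assumed "true" contributions used to generate prices:
# intercept, per bedroom, per sq ft of lot, per km from the center.
TRUE_COEFFS = np.array([50_000.0, 30_000.0, 20.0, -4_000.0])

# Design matrix with a leading column of ones for the intercept.
X = np.column_stack([np.ones(len(features)), features])
prices = X @ TRUE_COEFFS

# Least-squares fit: recovers one coefficient per characteristic.
coeffs, *_ = np.linalg.lstsq(X, prices, rcond=None)


def estimate_price(bedrooms: float, lot_sqft: float, distance_km: float) -> float:
    """Sum the fitted contributions of each characteristic."""
    return float(coeffs @ np.array([1.0, bedrooms, lot_sqft, distance_km]))


print(coeffs)
print(estimate_price(3, 4500, 9.0))
```

Because the synthetic prices are noise-free, the fit recovers the generating coefficients; with real sales data the coefficients would be estimates with error, and missing features would need handling before fitting.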
Zulloo Approach
● Real Estate data:
○ Data is a combination of structured and unstructured
○ Missing features
○ Geo-specific
● Our goals
○ Common storage for web-development and data science teams
○ Horizontal scalability
○ Geo queries
[Diagram: data sources are 3rd party providers, MLS, and public records]
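A geo query here means something like "listings within a given radius of a point". The sketch below does that check in plain Python with the haversine formula, and also builds a filter in the shape MongoDB's `$near` operator takes on a field with a `2dsphere` index. The listing ids, coordinates, and the `location` field name are assumptions for illustration.

```python
import math

EARTH_RADIUS_M = 6_371_000  # mean Earth radius in meters


def haversine_m(lng1, lat1, lng2, lat2):
    """Great-circle distance in meters between two (longitude, latitude) points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lng2 - lng1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def listings_within(listings, center, radius_m):
    """Return ids of listings whose GeoJSON point lies within radius_m of center."""
    lng0, lat0 = center
    return [
        item["id"]
        for item in listings
        if haversine_m(lng0, lat0, *item["location"]["coordinates"]) <= radius_m
    ]


def near_filter(center, radius_m):
    """Filter document in the shape of a MongoDB $near query (needs a 2dsphere index)."""
    return {
        "location": {
            "$near": {
                "$geometry": {"type": "Point", "coordinates": list(center)},
                "$maxDistance": radius_m,
            }
        }
    }


# Hypothetical listings; GeoJSON stores coordinates as [longitude, latitude].
listings = [
    {"id": "downtown_loft", "location": {"type": "Point", "coordinates": [-118.2437, 34.0522]}},
    {"id": "santa_monica_condo", "location": {"type": "Point", "coordinates": [-118.4912, 34.0195]}},
    {"id": "pasadena_bungalow", "location": {"type": "Point", "coordinates": [-118.1445, 34.1478]}},
]
downtown = (-118.2437, 34.0522)
nearby = listings_within(listings, downtown, 20_000)
print(nearby)
print(near_filter(downtown, 20_000))
```

The local filter scans every listing, which is fine for a sketch; the point of a geospatial index is to avoid that scan as the collection grows.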
Things to consider
● As a startup
○ Availability of free support
○ Roadmap
● Speed considerations
○ Comparison of PostgreSQL vs. MongoDB
Current Setup
[Diagram components: Input data stream, MongoDB, Meteor app, Machine Learning]
Roadmap
[Diagram components: Input data stream, MongoDB, GraphQL, ApolloStack, Meteor app, Machine Learning]
Next big thing
● Graph databases
○ RE data is just a huge graph anyway
● GPU databases
○ Speed(!!!) but only SQL so far :(
● ApolloStack/GraphQL/etc.
○ Clean API between backends and frontends
○ Less time spent digging through API documentation
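One way to read "RE data is just a huge graph": properties, schools, and neighborhoods are nodes, and relations such as "zoned for" or "located in" are edges. The sketch below uses a plain adjacency map and breadth-first search rather than any graph database; the node names and edges are invented for the example.

```python
from collections import defaultdict, deque

# Hypothetical edges linking properties to the schools and neighborhoods they relate to.
edges = [
    ("property:1", "school:lincoln"),
    ("property:2", "school:lincoln"),
    ("property:2", "neighborhood:echo_park"),
    ("property:3", "neighborhood:echo_park"),
]

# Undirected adjacency map.
graph = defaultdict(set)
for a, b in edges:
    graph[a].add(b)
    graph[b].add(a)


def related_properties(start, max_hops=2):
    """Breadth-first search for other properties within max_hops edges of start."""
    seen = {start}
    queue = deque([(start, 0)])
    found = set()
    while queue:
        node, depth = queue.popleft()
        if depth == max_hops:
            continue  # do not expand beyond the hop limit
        for neighbor in graph[node]:
            if neighbor not in seen:
                seen.add(neighbor)
                queue.append((neighbor, depth + 1))
                if neighbor.startswith("property:"):
                    found.add(neighbor)
    return sorted(found)


print(related_properties("property:1"))
print(related_properties("property:1", max_hops=4))
```

Two hops finds properties sharing a school with the starting one; four hops also reaches properties connected through a shared neighborhood.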
Q&A
Info@ZULLOO.com
www.ZULLOO.com

Big Data Day LA 2016/ NoSQL track - Big Data and Real Estate, Jon Zifcak, CEO & Anton Polishko, CTO, Zulloo
