SlideShare una empresa de Scribd logo
1 de 22
Descargar para leer sin conexión
Common MongoDB Use-Cases

             Kevin Hanson
      Solutions Architect, 10gen
   @hungarianhc ~ kevin@10gen.com
Intro to NoSQL and
                  MongoDB
Folllow-up:            (completed)
@hungarianhc
kevin@10gen.com   How to Get Started
                  with your MongoDB
                  Pilot Project
                       (August 7th)
Emerging NoSQL Space

                   RDBMS                RDBMS



 RDBMS


                    Data          Data
                                                    NoSQL
                  Warehouse     Warehouse




The beginning   Last 10 years               Today
Qualities of NoSQL
                    Workloads

Flexible data models      High Throughput          Large Data Sizes
• Lists, Nested Objects   • Lots of reads          • Aggregate data size
• Sparse schemas          • Lots of writes         • Number of objects
• Semi-structured data
• Agile Development



Low Latency               Cloud Computing          Commodity
• Both reads and writes   • Run anywhere           Hardware
• Millisecond latency     • No assumptions about   • Ethernet
                            hardware               • Local disks
                          • No / Few Knobs
MongoDB was designed for
            this

Flexible data models      High Throughput             Large Data Sizes
• Lists, Nested Objects   • Lots of reads             • Aggregate data size
      • schemas
• SparseJSON based             • writes
                          • Lots of Replica Sets to   • Number of objects shards
                                                           • 1000’s of
• Semi-structuredmodel
          object data           scale reads                 in a single DB
      • Dynamic
• Agile Development           • Sharding to               • Partitioning of
        schemas                 scale writes                data

Low Latency               Cloud Computing             Commodity
• Both reads and writes   • Run anywhere              Hardware
      • In-memory
• Millisecond latency     • No • Scale-out to
                               assumptions about      • Ethernet
                                                           • Designed for
      cache                        overcome
                            hardware                  • Local disks
                          • No / Few Knobs                   “typical” OS and
    • Scale-out                    hardware
                                                             local file system
      working set                limitations
Example customers
Content Management       Operational Intelligence     Product Data Management




            User Data Management         High Volume Data Feeds
USE CASES THAT
LEVERAGE NOSQL
High Volume Data Feeds
  Machine      • More machines, more sensors, more
 Generated       data
   Data        • Variably structured


Stock Market   • High frequency trading
    Data

Social Media   • Multiple sources of data
 Firehose      • Each changes their format constantly
High Volume Data Feed
                              Flexible document
                              model can adapt to
                              changes in sensor
                                    format
   Asynchronous writes




 Data
  Data
Sources
    Data
 Sources
     Data                     Write to memory with
  Sources                      periodic disk flush
    Sources




          Scale writes over
           multiple shards
Operational Intelligence

               • Large volume of state about users
Ad Targeting   • Very strict latency requirements



 Customer      • Expose report data to millions of customers
  Facing       • Report on large volumes of data
               • Reports that update in real time
Dashboards

Social Media   • Need to join the conversation _now_
 Monitoring
Operational Intelligence
                                    Parallelize queries
               Low latency reads
                                   across replicas and
                                          shards




    API
                                      In database
                                      aggregation




Dashboards
                                    Flexible schema
                                   adapts to changing
                                       input data
Can use same cluster
to collect, store, and
   report on data
Behavioral Profiles
                                                               Rich profiles
                                                            collecting multiple
                                                             complex actions
1   See Ad

                Scale out to support   { cookie_id: “1234512413243”,
                 high throughput of      advertiser:{
                                            apple: {
                  activities tracked           actions: [
2   See Ad                                        { impression: ‘ad1’, time: 123 },
                                                  { impression: ‘ad2’, time: 232 },
                                                  { click: ‘ad2’, time: 235 },
                                                  { add_to_cart: ‘laptop’,
                                                     sku: ‘asdf23f’,
                                                     time: 254 },
    Click                                         { purchase: ‘laptop’, time: 354 }
3                                              ]
                                            }
                                         }
                                       }
                         Dynamic schemas
                        make it easy to track
                                                       Indexing and
4   Convert               vendor specific
                                                    querying to support
                            attributes
                                                    matching, frequency
                                                         capping
Product Data
E-Commerce
              • Diverse product portfolio
  Product     • Complex querying and filtering
  Catalog

              • Scale for short bursts of high volume traffic
Flash Sales   • Scalable, but consistent view of inventory
Product Data
                                                                   Indexing and rich
                                                                   query API for easy
                                                                 searching and sorting
    db.products.
       find({ “details.author”: “David Eggers” }).
       sort({ “title” : -1 });
                                                                      Flexible data model
                                                                         for similar, but
                                                                       different objects


{ sku: “00a9f3a”,                              { sku: “00e8da9b”,
  type: “Book”,                                  type: “MP3”,
  details: {                                     details: {
       author: “David Eggers”,                        artist: “John Coltrane”,
       title: “You shall know our velocity”,          title: “A love supreme”,
       isbn: “0-9703355-5-5”                          length: 123
  }                                              }
}                                              }
Content Management
               • Comments and user generated
 News Site       content
               • Personalization of content, layout

Multi-Device   • Generate layout on the fly for each
 rendering       device that connects
               • No need to cache static pages


               • Store large objects
  Sharing      • Simple modeling of metadata
Content Management
                                                                             Geo spatial indexing
                              Flexible data model                             for location based
GridFS for large
                                 for similar, but                                  searches
 object storage
                               different objects

                                                { camera: “Nikon d4”,
                                                  location: [ -122.418333, 37.775 ]
                                                }



                                                { camera: “Canon 5d mkII”,
                                                  people: [ “Jim”, “Carol” ],
                                                  taken_on: ISODate("2012-03-07T18:32:35.002Z")
                                                }


                                                { origin: “facebook.com/photos/xwdf23fsdf”,
                                                  license: “Creative Commons CC0”,
                                                  size: {
                                                     dimensions: [ 124, 52 ],
                                                     units: “pixels”
     Horizontal scalability                       }
      for large data sets                       }
User Data Management

            • User state and session
Video Games   management


              • Scale out to large graphs
Social Graphs
              • Easy to search and process


  Identity • Authentication, Authorization
Management   and Accounting
User Game State
                                                    Flexible documents
                   Easy to store entire             supports new game
                     player state in a                features without
                    single document.                 schema migration




                               Sharding enables
                             whole data set to be
JSON data model
                             in memory, ensuring
  maps well to
                                 low latency
HTML5/JS & Flash
  based clients
Social Graphs
 Native support for
Arrays makes it easy
to store connections
 inside user profile




                           Sharding partitions
                           user profiles across    Documents enable
            Social Graph    available servers       disk locality of all
                                                  profile data for a user
IS MY USE CASE A GOOD
FIT FOR MONGODB?
Good fits for MongoDB
Application Characteristic      Why MongoDB might be a good fit
Large number of objects to      Sharding lets you split objects across multiple
store                           servers
High write or read throughput   Sharding + Replication lets you scale read and
                                write traffic across multiple servers
Low Latency Access              Memory Mapped storage engine caches
                                documents in RAM, enabling in-memory
                                performance. Data locality of documents can
                                significantly improve latency over join based
                                approaches
Variable data in objects        Dynamic schema and JSON data model enable
                                flexible data storage without sparse tables or
                                complex joins
Cloud based deployment          Sharding and replication let you work around
                                hardware limitations in clouds.
Thanks!

Más contenido relacionado

La actualidad más candente

Common MongoDB Use Cases
Common MongoDB Use Cases Common MongoDB Use Cases
Common MongoDB Use Cases
MongoDB
 
An Introduction to Big Data, NoSQL and MongoDB
An Introduction to Big Data, NoSQL and MongoDBAn Introduction to Big Data, NoSQL and MongoDB
An Introduction to Big Data, NoSQL and MongoDB
William LaForest
 

La actualidad más candente (20)

Common MongoDB Use Cases
Common MongoDB Use Cases Common MongoDB Use Cases
Common MongoDB Use Cases
 
Introduction to MongoDB
Introduction to MongoDBIntroduction to MongoDB
Introduction to MongoDB
 
MongoDB company and case studies - john hong
MongoDB company and case studies - john hong MongoDB company and case studies - john hong
MongoDB company and case studies - john hong
 
Introduction to memcached
Introduction to memcachedIntroduction to memcached
Introduction to memcached
 
MongoDB Fundamentals
MongoDB FundamentalsMongoDB Fundamentals
MongoDB Fundamentals
 
Amazon Redshift
Amazon Redshift Amazon Redshift
Amazon Redshift
 
Distributed SQL Databases Deconstructed
Distributed SQL Databases DeconstructedDistributed SQL Databases Deconstructed
Distributed SQL Databases Deconstructed
 
How Insurance Companies Use MongoDB
How Insurance Companies Use MongoDB How Insurance Companies Use MongoDB
How Insurance Companies Use MongoDB
 
Introduction to NoSQL
Introduction to NoSQLIntroduction to NoSQL
Introduction to NoSQL
 
Event Driven-Architecture from a Scalability perspective
Event Driven-Architecture from a Scalability perspectiveEvent Driven-Architecture from a Scalability perspective
Event Driven-Architecture from a Scalability perspective
 
[245] presto 내부구조 파헤치기
[245] presto 내부구조 파헤치기[245] presto 내부구조 파헤치기
[245] presto 내부구조 파헤치기
 
Mongo db intro.pptx
Mongo db intro.pptxMongo db intro.pptx
Mongo db intro.pptx
 
Sizing MongoDB Clusters
Sizing MongoDB Clusters Sizing MongoDB Clusters
Sizing MongoDB Clusters
 
Capacity Planning For Your Growing MongoDB Cluster
Capacity Planning For Your Growing MongoDB ClusterCapacity Planning For Your Growing MongoDB Cluster
Capacity Planning For Your Growing MongoDB Cluster
 
Log System As Backbone – How We Built the World’s Most Advanced Vector Databa...
Log System As Backbone – How We Built the World’s Most Advanced Vector Databa...Log System As Backbone – How We Built the World’s Most Advanced Vector Databa...
Log System As Backbone – How We Built the World’s Most Advanced Vector Databa...
 
Rainbird: Realtime Analytics at Twitter (Strata 2011)
Rainbird: Realtime Analytics at Twitter (Strata 2011)Rainbird: Realtime Analytics at Twitter (Strata 2011)
Rainbird: Realtime Analytics at Twitter (Strata 2011)
 
[pgday.Seoul 2022] PostgreSQL with Google Cloud
[pgday.Seoul 2022] PostgreSQL with Google Cloud[pgday.Seoul 2022] PostgreSQL with Google Cloud
[pgday.Seoul 2022] PostgreSQL with Google Cloud
 
An Introduction to Big Data, NoSQL and MongoDB
An Introduction to Big Data, NoSQL and MongoDBAn Introduction to Big Data, NoSQL and MongoDB
An Introduction to Big Data, NoSQL and MongoDB
 
MongoDB Administration 101
MongoDB Administration 101MongoDB Administration 101
MongoDB Administration 101
 
Migrating from RDBMS to MongoDB
Migrating from RDBMS to MongoDBMigrating from RDBMS to MongoDB
Migrating from RDBMS to MongoDB
 

Destacado

Presentation on project report
Presentation on project reportPresentation on project report
Presentation on project report
ramesh_x
 

Destacado (12)

MongoDB Schema Design: Four Real-World Examples
MongoDB Schema Design: Four Real-World ExamplesMongoDB Schema Design: Four Real-World Examples
MongoDB Schema Design: Four Real-World Examples
 
The Right (and Wrong) Use Cases for MongoDB
The Right (and Wrong) Use Cases for MongoDBThe Right (and Wrong) Use Cases for MongoDB
The Right (and Wrong) Use Cases for MongoDB
 
Business Jumpstart: The Right (and Wrong) Use Cases for MongoDB
Business Jumpstart: The Right (and Wrong) Use Cases for MongoDBBusiness Jumpstart: The Right (and Wrong) Use Cases for MongoDB
Business Jumpstart: The Right (and Wrong) Use Cases for MongoDB
 
IPDB: A Public Database for the Planet
IPDB: A Public Database for the PlanetIPDB: A Public Database for the Planet
IPDB: A Public Database for the Planet
 
Add a bit of ACID to Cassandra. Cassandra Summit EU 2014
Add a bit of ACID to Cassandra. Cassandra Summit EU 2014Add a bit of ACID to Cassandra. Cassandra Summit EU 2014
Add a bit of ACID to Cassandra. Cassandra Summit EU 2014
 
No dues management system prepared by HRITIKA RAJ (Shivalik College of engg.,...
No dues management system prepared by HRITIKA RAJ (Shivalik College of engg.,...No dues management system prepared by HRITIKA RAJ (Shivalik College of engg.,...
No dues management system prepared by HRITIKA RAJ (Shivalik College of engg.,...
 
MongoDB Case Study in Healthcare
MongoDB Case Study in HealthcareMongoDB Case Study in Healthcare
MongoDB Case Study in Healthcare
 
BigchainDB: A Scalable Blockchain Database, In Python
BigchainDB: A Scalable Blockchain Database, In PythonBigchainDB: A Scalable Blockchain Database, In Python
BigchainDB: A Scalable Blockchain Database, In Python
 
How Retail Banks Use MongoDB
How Retail Banks Use MongoDBHow Retail Banks Use MongoDB
How Retail Banks Use MongoDB
 
Final Year Project Presentation
Final Year Project PresentationFinal Year Project Presentation
Final Year Project Presentation
 
Presentation on project report
Presentation on project reportPresentation on project report
Presentation on project report
 
MongoDB Schema Design
MongoDB Schema DesignMongoDB Schema Design
MongoDB Schema Design
 

Similar a Common MongoDB Use Cases

Common MongoDB Use Cases Webinar
Common MongoDB Use Cases WebinarCommon MongoDB Use Cases Webinar
Common MongoDB Use Cases Webinar
MongoDB
 
Common MongoDB Use Cases
Common MongoDB Use CasesCommon MongoDB Use Cases
Common MongoDB Use Cases
DATAVERSITY
 
Millions quotes per second in pure java
Millions quotes per second in pure javaMillions quotes per second in pure java
Millions quotes per second in pure java
Roman Elizarov
 
Microsoft Openness Mongo DB
Microsoft Openness Mongo DBMicrosoft Openness Mongo DB
Microsoft Openness Mongo DB
Heriyadi Janwar
 
Kognitio overview jan 2013
Kognitio overview jan 2013Kognitio overview jan 2013
Kognitio overview jan 2013
Kognitio
 

Similar a Common MongoDB Use Cases (20)

Common MongoDB Use Cases Webinar
Common MongoDB Use Cases WebinarCommon MongoDB Use Cases Webinar
Common MongoDB Use Cases Webinar
 
Common MongoDB Use Cases
Common MongoDB Use CasesCommon MongoDB Use Cases
Common MongoDB Use Cases
 
Nosql Now 2012: MongoDB Use Cases
Nosql Now 2012: MongoDB Use CasesNosql Now 2012: MongoDB Use Cases
Nosql Now 2012: MongoDB Use Cases
 
Webinar: Utilisations courantes de MongoDB
Webinar: Utilisations courantes de MongoDBWebinar: Utilisations courantes de MongoDB
Webinar: Utilisations courantes de MongoDB
 
Processing Big Data
Processing Big DataProcessing Big Data
Processing Big Data
 
Using Data Lakes: Data Analytics Week SF
Using Data Lakes: Data Analytics Week SFUsing Data Lakes: Data Analytics Week SF
Using Data Lakes: Data Analytics Week SF
 
C* Summit 2013: Cassandra at eBay Scale by Feng Qu and Anurag Jambhekar
C* Summit 2013: Cassandra at eBay Scale by Feng Qu and Anurag JambhekarC* Summit 2013: Cassandra at eBay Scale by Feng Qu and Anurag Jambhekar
C* Summit 2013: Cassandra at eBay Scale by Feng Qu and Anurag Jambhekar
 
Using Data Lakes
Using Data Lakes Using Data Lakes
Using Data Lakes
 
Millions quotes per second in pure java
Millions quotes per second in pure javaMillions quotes per second in pure java
Millions quotes per second in pure java
 
Top 6 Reasons to Use a Distributed Data Grid
Top 6 Reasons to Use a Distributed Data GridTop 6 Reasons to Use a Distributed Data Grid
Top 6 Reasons to Use a Distributed Data Grid
 
Using Data Lakes
Using Data LakesUsing Data Lakes
Using Data Lakes
 
Kafka & Hadoop in Rakuten
Kafka & Hadoop in RakutenKafka & Hadoop in Rakuten
Kafka & Hadoop in Rakuten
 
Azure DocumentDB Overview
Azure DocumentDB OverviewAzure DocumentDB Overview
Azure DocumentDB Overview
 
5 Comparing Microsoft Big Data Technologies for Analytics
5 Comparing Microsoft Big Data Technologies for Analytics5 Comparing Microsoft Big Data Technologies for Analytics
5 Comparing Microsoft Big Data Technologies for Analytics
 
Microsoft Openness Mongo DB
Microsoft Openness Mongo DBMicrosoft Openness Mongo DB
Microsoft Openness Mongo DB
 
Agility and Scalability with MongoDB
Agility and Scalability with MongoDBAgility and Scalability with MongoDB
Agility and Scalability with MongoDB
 
Kognitio overview jan 2013
Kognitio overview jan 2013Kognitio overview jan 2013
Kognitio overview jan 2013
 
Kognitio overview jan 2013
Kognitio overview jan 2013Kognitio overview jan 2013
Kognitio overview jan 2013
 
Using Distributed In-Memory Computing for Fast Data Analysis
Using Distributed In-Memory Computing for Fast Data AnalysisUsing Distributed In-Memory Computing for Fast Data Analysis
Using Distributed In-Memory Computing for Fast Data Analysis
 
Highlights of AWS ReInvent 2023 (Announcements and Best Practices)
Highlights of AWS ReInvent 2023 (Announcements and Best Practices)Highlights of AWS ReInvent 2023 (Announcements and Best Practices)
Highlights of AWS ReInvent 2023 (Announcements and Best Practices)
 

Más de DATAVERSITY

The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
DATAVERSITY
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
DATAVERSITY
 

Más de DATAVERSITY (20)

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data Literacy
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for You
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic Project
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and Forwards
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement Today
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best Practices
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
 

Último

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 

Último (20)

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 

Common MongoDB Use Cases

  • 1. Common MongoDB Use-Cases Kevin Hanson Solutions Architect, 10gen @hungarianhc ~ kevin@10gen.com
  • 2. Intro to NoSQL and MongoDB Folllow-up: (completed) @hungarianhc kevin@10gen.com How to Get Started with your MongoDB Pilot Project (August 7th)
  • 3. Emerging NoSQL Space RDBMS RDBMS RDBMS Data Data NoSQL Warehouse Warehouse The beginning Last 10 years Today
  • 4. Qualities of NoSQL Workloads Flexible data models High Throughput Large Data Sizes • Lists, Nested Objects • Lots of reads • Aggregate data size • Sparse schemas • Lots of writes • Number of objects • Semi-structured data • Agile Development Low Latency Cloud Computing Commodity • Both reads and writes • Run anywhere Hardware • Millisecond latency • No assumptions about • Ethernet hardware • Local disks • No / Few Knobs
  • 5. MongoDB was designed for this Flexible data models High Throughput Large Data Sizes • Lists, Nested Objects • Lots of reads • Aggregate data size • schemas • SparseJSON based • writes • Lots of Replica Sets to • Number of objects shards • 1000’s of • Semi-structuredmodel object data scale reads in a single DB • Dynamic • Agile Development • Sharding to • Partitioning of schemas scale writes data Low Latency Cloud Computing Commodity • Both reads and writes • Run anywhere Hardware • In-memory • Millisecond latency • No • Scale-out to assumptions about • Ethernet • Designed for cache overcome hardware • Local disks • No / Few Knobs “typical” OS and • Scale-out hardware local file system working set limitations
  • 6. Example customers Content Management Operational Intelligence Product Data Management User Data Management High Volume Data Feeds
  • 8. High Volume Data Feeds Machine • More machines, more sensors, more Generated data Data • Variably structured Stock Market • High frequency trading Data Social Media • Multiple sources of data Firehose • Each changes their format constantly
  • 9. High Volume Data Feed Flexible document model can adapt to changes in sensor format Asynchronous writes Data Data Sources Data Sources Data Write to memory with Sources periodic disk flush Sources Scale writes over multiple shards
  • 10. Operational Intelligence • Large volume of state about users Ad Targeting • Very strict latency requirements Customer • Expose report data to millions of customers Facing • Report on large volumes of data • Reports that update in real time Dashboards Social Media • Need to join the conversation _now_ Monitoring
  • 11. Operational Intelligence Parallelize queries Low latency reads across replicas and shards API In database aggregation Dashboards Flexible schema adapts to changing input data Can use same cluster to collect, store, and report on data
  • 12. Behavioral Profiles Rich profiles collecting multiple complex actions 1 See Ad Scale out to support { cookie_id: “1234512413243”, high throughput of advertiser:{ apple: { activities tracked actions: [ 2 See Ad { impression: ‘ad1’, time: 123 }, { impression: ‘ad2’, time: 232 }, { click: ‘ad2’, time: 235 }, { add_to_cart: ‘laptop’, sku: ‘asdf23f’, time: 254 }, Click { purchase: ‘laptop’, time: 354 } 3 ] } } } Dynamic schemas make it easy to track Indexing and 4 Convert vendor specific querying to support attributes matching, frequency capping
  • 13. Product Data E-Commerce • Diverse product portfolio Product • Complex querying and filtering Catalog • Scale for short bursts of high volume traffic Flash Sales • Scalable, but consistent view of inventory
  • 14. Product Data Indexing and rich query API for easy searching and sorting db.products. find({ “details.author”: “David Eggers” }). sort({ “title” : -1 }); Flexible data model for similar, but different objects { sku: “00a9f3a”, { sku: “00e8da9b”, type: “Book”, type: “MP3”, details: { details: { author: “David Eggers”, artist: “John Coltrane”, title: “You shall know our velocity”, title: “A love supreme”, isbn: “0-9703355-5-5” length: 123 } } } }
  • 15. Content Management • Comments and user generated News Site content • Personalization of content, layout Multi-Device • Generate layout on the fly for each rendering device that connects • No need to cache static pages • Store large objects Sharing • Simple modeling of metadata
  • 16. Content Management Geo spatial indexing Flexible data model for location based GridFS for large for similar, but searches object storage different objects { camera: “Nikon d4”, location: [ -122.418333, 37.775 ] } { camera: “Canon 5d mkII”, people: [ “Jim”, “Carol” ], taken_on: ISODate("2012-03-07T18:32:35.002Z") } { origin: “facebook.com/photos/xwdf23fsdf”, license: “Creative Commons CC0”, size: { dimensions: [ 124, 52 ], units: “pixels” Horizontal scalability } for large data sets }
  • 17. User Data Management • User state and session Video Games management • Scale out to large graphs Social Graphs • Easy to search and process Identity • Authentication, Authorization Management and Accounting
  • 18. User Game State Flexible documents Easy to store entire supports new game player state in a features without single document. schema migration Sharding enables whole data set to be JSON data model in memory, ensuring maps well to low latency HTML5/JS & Flash based clients
  • 19. Social Graphs Native support for Arrays makes it easy to store connections inside user profile Sharding partitions user profiles across Documents enable Social Graph available servers disk locality of all profile data for a user
  • 20. IS MY USE CASE A GOOD FIT FOR MONGODB?
  • 21. Good fits for MongoDB Application Characteristic Why MongoDB might be a good fit Large number of objects to Sharding lets you split objects across multiple store servers High write or read throughput Sharding + Replication lets you scale read and write traffic across multiple servers Low Latency Access Memory Mapped storage engine caches documents in RAM, enabling in-memory performance. Data locality of documents can significantly improve latency over join based approaches Variable data in objects Dynamic schema and JSON data model enable flexible data storage without sparse tables or complex joins Cloud based deployment Sharding and replication let you work around hardware limitations in clouds.