SlideShare una empresa de Scribd logo
1 de 45
Databases
    Alan Medlar

amedlar@cs.ucl.ac.uk
Schedule

• Today: Introduction
• Monday 2 Feb: Networking
           nd


• Monday 2 Feb: Principles of Transactions
           nd


• Tuesday 3 Feb: Concurrent Transactions
           rd


• To be decided: Distributed Transactions
Introduction
Introduction
•   Why do we care about databases?
Introduction
•   Why do we care about databases?

    •   Abstraction
Introduction
•   Why do we care about databases?

    •   Abstraction

        •   Details of storage and access are irrelevant
Introduction
•   Why do we care about databases?

    •   Abstraction

        •   Details of storage and access are irrelevant

        •   Concurrency
Introduction
•   Why do we care about databases?

    •   Abstraction

        •   Details of storage and access are irrelevant

        •   Concurrency

        •   Crash recovery
Introduction
•   Why do we care about databases?

    •   Abstraction

        •   Details of storage and access are irrelevant

        •   Concurrency

        •   Crash recovery

    •   Integrity and Security (privacy)
Introduction
•   Why do we care about databases?

    •   Abstraction

        •   Details of storage and access are irrelevant

        •   Concurrency

        •   Crash recovery

    •   Integrity and Security (privacy)

    •   Multi-user
Centralised Databases
Centralised Databases


                       Bottleneck!

Communication
  overhead!

                 Single point
                  of failure!
Centralised Databases
•   Bad news...
    •   Performance
        •   Processing
        •   I/O
    •   Distributed nature of data (departments,
        companies, “mashups”)
    •   Availability (single point of failure)
Distributed Databases
Distributed Databases
    Distributed
    Processing
                      Distributed




       {
                       Storage

Localised
 Traffic
Distributed Databases


• Definition: A single DBMS running across
  multiple CPUs, disks and/or networks,
  designed to permit safe parallel access.
Concepts
•   Distributed Processing

    •   Central database, distributed processing
Distributed Processing

• Multiple processors or cores
 • Same memory (multi-core)
 • Same disk (networked storage)
• Concurrency Control provided by
  transactions
Concepts
•   Distributed Processing

    •   Central database, distributed processing

•   Ad-hoc Distributed Database

    •   Database physically distributed over
        network
Distributed Databases
•   Fragmentation

    •   Large dataset broken up into smaller components
Distributed Databases
•   Fragmentation

    •   Large dataset broken up into smaller components

•   Allocation

    •   Fragments should be stored according to usage
Distributed Databases
•   Fragmentation

    •   Large dataset broken up into smaller components

•   Allocation

    •   Fragments should be stored according to usage

•   Replication

    •   Copy maintained at multiple sites to take
        advantage of additional processing power or
        decreased latency
Distributed Databases
•                                      Might be implicit!
    Fragmentation

    •   Large dataset broken up into smaller components

•   Allocation

    •   Fragments should be stored according to usage

•   Replication

    •   Copy maintained at multiple sites to take
        advantage of additional processing power or
        decreased latency
Distributed Databases (2)
•   How do we decide how to fragment a
    database?

    •   Data, application and usage dependant!
Distributed Databases (2)
•   How do we decide how to fragment a
    database?

    •   Data, application and usage dependant!

•   Goals:

    •   Locality

    •   Minimal Communication

    •   Balance storage, processing, monetary costs
Concepts
•   Distributed Processing

    •   Central database, distributed processing

•   Ad-hoc Distributed Database

    •   Database physically distributed over
        network

•   Distributed DBMS (DDBMS)

    •   Software that makes distribution transparent
Distributed DBMS
• Key Concept: Transparency
Distributed DBMS
• Key Concept: Transparency
 • Transparent Distribution
Distributed DBMS
• Key Concept: Transparency
 • Transparent Distribution
   • One centralised database from
      perspective of user (programmer)
Distributed DBMS
• Key Concept: Transparency
 • Transparent Distribution
   • One centralised database from
      perspective of user (programmer)
 • Transparent Transactions
Distributed DBMS
• Key Concept: Transparency
 • Transparent Distribution
   • One centralised database from
      perspective of user (programmer)
 • Transparent Transactions
  • Integrity of data maintained across
      multiple databases
Distributed DBMS (2)

• Requires advanced:
 • Recovery Services
 • Concurrency control
• Focus of this course
Summary
•   Advantages
Summary
•   Advantages

    •   Performance
Summary
•   Advantages

    •   Performance

    •   Reflect organisational structure
Summary
•   Advantages

    •   Performance

    •   Reflect organisational structure

    •   Economics
Summary
•   Advantages

    •   Performance

    •   Reflect organisational structure

    •   Economics

    •   Modular Growth
Summary
•   Advantages

    •   Performance

    •   Reflect organisational structure

    •   Economics

    •   Modular Growth

    •   Availability
Summary
•   Advantages

    •   Performance

    •   Reflect organisational structure

    •   Economics

    •   Modular Growth

    •   Availability

    •   Reliability
Summary (2)
•   Disadvantages
Summary (2)
•   Disadvantages
    •   Complexity (design, transparency, integrity)
Summary (2)
•   Disadvantages
    •   Complexity (design, transparency, integrity)
    •   Security more of an issue
Summary (2)
•   Disadvantages
    •   Complexity (design, transparency, integrity)
    •   Security more of an issue
    •   Maintenance Costs
Summary (2)
•   Disadvantages
    •   Complexity (design, transparency, integrity)
    •   Security more of an issue
    •   Maintenance Costs
    •   Lack of standards (so much dependant on
        data, application, usage, etc)
Next: Transactions...

Más contenido relacionado

Similar a 2011 Db Intro

Brian Oliver Pimp My Data Grid
Brian Oliver  Pimp My Data GridBrian Oliver  Pimp My Data Grid
Brian Oliver Pimp My Data Grid
deimos
 
Greatdebate Postgres vs Mysql
Greatdebate Postgres vs MysqlGreatdebate Postgres vs Mysql
Greatdebate Postgres vs Mysql
Krishna Infosoft
 
Next Generation Datacenter Oracle - Alan Hartwell
Next Generation Datacenter Oracle - Alan HartwellNext Generation Datacenter Oracle - Alan Hartwell
Next Generation Datacenter Oracle - Alan Hartwell
HPDutchWorld
 
Oracle - Next Generation Datacenter - Alan Hartwell
Oracle - Next Generation Datacenter - Alan HartwellOracle - Next Generation Datacenter - Alan Hartwell
Oracle - Next Generation Datacenter - Alan Hartwell
HPDutchWorld
 
Randy Shoup eBays Architectural Principles
Randy Shoup eBays Architectural PrinciplesRandy Shoup eBays Architectural Principles
Randy Shoup eBays Architectural Principles
deimos
 
Challenges and Opportunities of Big Data Genomics
Challenges and Opportunities of Big Data GenomicsChallenges and Opportunities of Big Data Genomics
Challenges and Opportunities of Big Data Genomics
Yasin Memari
 
Evolution Of Dedupe
Evolution Of DedupeEvolution Of Dedupe
Evolution Of Dedupe
rammotive
 

Similar a 2011 Db Intro (20)

Advanced Deployment
Advanced DeploymentAdvanced Deployment
Advanced Deployment
 
Brian Oliver Pimp My Data Grid
Brian Oliver  Pimp My Data GridBrian Oliver  Pimp My Data Grid
Brian Oliver Pimp My Data Grid
 
Greatdebate Postgres vs Mysql
Greatdebate Postgres vs MysqlGreatdebate Postgres vs Mysql
Greatdebate Postgres vs Mysql
 
The Great Debate: PostgreSQL vs MySQL
The Great Debate: PostgreSQL vs MySQLThe Great Debate: PostgreSQL vs MySQL
The Great Debate: PostgreSQL vs MySQL
 
Branch Office Infrastructure
Branch Office InfrastructureBranch Office Infrastructure
Branch Office Infrastructure
 
Next Generation Datacenter Oracle - Alan Hartwell
Next Generation Datacenter Oracle - Alan HartwellNext Generation Datacenter Oracle - Alan Hartwell
Next Generation Datacenter Oracle - Alan Hartwell
 
Oracle - Next Generation Datacenter - Alan Hartwell
Oracle - Next Generation Datacenter - Alan HartwellOracle - Next Generation Datacenter - Alan Hartwell
Oracle - Next Generation Datacenter - Alan Hartwell
 
Randy Shoup eBays Architectural Principles
Randy Shoup eBays Architectural PrinciplesRandy Shoup eBays Architectural Principles
Randy Shoup eBays Architectural Principles
 
Challenges and Opportunities of Big Data Genomics
Challenges and Opportunities of Big Data GenomicsChallenges and Opportunities of Big Data Genomics
Challenges and Opportunities of Big Data Genomics
 
Four Assumptions Killing Backup Storage Webinar
Four Assumptions Killing Backup Storage WebinarFour Assumptions Killing Backup Storage Webinar
Four Assumptions Killing Backup Storage Webinar
 
Storage Systems for High Scalable Systems Presentation
Storage Systems for High Scalable Systems PresentationStorage Systems for High Scalable Systems Presentation
Storage Systems for High Scalable Systems Presentation
 
How to build a state-of-the-art rails cluster
How to build a state-of-the-art rails clusterHow to build a state-of-the-art rails cluster
How to build a state-of-the-art rails cluster
 
Distributed systems - A Primer
Distributed systems - A PrimerDistributed systems - A Primer
Distributed systems - A Primer
 
Bit Level Preservation
Bit Level PreservationBit Level Preservation
Bit Level Preservation
 
Big data pipelines
Big data pipelinesBig data pipelines
Big data pipelines
 
Scalabe MySQL Infrastructure
Scalabe MySQL InfrastructureScalabe MySQL Infrastructure
Scalabe MySQL Infrastructure
 
Netcetera Proactive Management Service
Netcetera Proactive Management ServiceNetcetera Proactive Management Service
Netcetera Proactive Management Service
 
Evolution Of Dedupe
Evolution Of DedupeEvolution Of Dedupe
Evolution Of Dedupe
 
Solving the Database Problem
Solving the Database ProblemSolving the Database Problem
Solving the Database Problem
 
Stopping Storage Hardware Sprawl
Stopping Storage Hardware SprawlStopping Storage Hardware Sprawl
Stopping Storage Hardware Sprawl
 

Último

Breaking Down the Flutterwave Scandal What You Need to Know.pdf
Breaking Down the Flutterwave Scandal What You Need to Know.pdfBreaking Down the Flutterwave Scandal What You Need to Know.pdf
Breaking Down the Flutterwave Scandal What You Need to Know.pdf
UK Journal
 
Structuring Teams and Portfolios for Success
Structuring Teams and Portfolios for SuccessStructuring Teams and Portfolios for Success
Structuring Teams and Portfolios for Success
UXDXConf
 
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
panagenda
 

Último (20)

FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
 
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
 
ADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptxADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptx
 
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
 
State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!
 
Breaking Down the Flutterwave Scandal What You Need to Know.pdf
Breaking Down the Flutterwave Scandal What You Need to Know.pdfBreaking Down the Flutterwave Scandal What You Need to Know.pdf
Breaking Down the Flutterwave Scandal What You Need to Know.pdf
 
(Explainable) Data-Centric AI: what are you explaininhg, and to whom?
(Explainable) Data-Centric AI: what are you explaininhg, and to whom?(Explainable) Data-Centric AI: what are you explaininhg, and to whom?
(Explainable) Data-Centric AI: what are you explaininhg, and to whom?
 
WebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM PerformanceWebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM Performance
 
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdfSimplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
 
Intro in Product Management - Коротко про професію продакт менеджера
Intro in Product Management - Коротко про професію продакт менеджераIntro in Product Management - Коротко про професію продакт менеджера
Intro in Product Management - Коротко про професію продакт менеджера
 
Event-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream ProcessingEvent-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream Processing
 
The Metaverse: Are We There Yet?
The  Metaverse:    Are   We  There  Yet?The  Metaverse:    Are   We  There  Yet?
The Metaverse: Are We There Yet?
 
Structuring Teams and Portfolios for Success
Structuring Teams and Portfolios for SuccessStructuring Teams and Portfolios for Success
Structuring Teams and Portfolios for Success
 
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
 
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsContinuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
 
Oauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoftOauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoft
 
How we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfHow we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdf
 
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
 
WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024
 
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
 

2011 Db Intro

  • 1. Databases Alan Medlar amedlar@cs.ucl.ac.uk
  • 2. Schedule • Today: Introduction • Monday 2 Feb: Networking nd • Monday 2 Feb: Principles of Transactions nd • Tuesday 3 Feb: Concurrent Transactions rd • To be decided: Distributed Transactions
  • 4. Introduction • Why do we care about databases?
  • 5. Introduction • Why do we care about databases? • Abstraction
  • 6. Introduction • Why do we care about databases? • Abstraction • Details of storage and access are irrelevant
  • 7. Introduction • Why do we care about databases? • Abstraction • Details of storage and access are irrelevant • Concurrency
  • 8. Introduction • Why do we care about databases? • Abstraction • Details of storage and access are irrelevant • Concurrency • Crash recovery
  • 9. Introduction • Why do we care about databases? • Abstraction • Details of storage and access are irrelevant • Concurrency • Crash recovery • Integrity and Security (privacy)
  • 10. Introduction • Why do we care about databases? • Abstraction • Details of storage and access are irrelevant • Concurrency • Crash recovery • Integrity and Security (privacy) • Multi-user
  • 12. Centralised Databases Bottleneck! Communication overhead! Single point of failure!
  • 13. Centralised Databases • Bad news... • Performance • Processing • I/O • Distributed nature of data (departments, companies, “mashups”) • Availability (single point of failure)
  • 15. Distributed Databases Distributed Processing Distributed { Storage Localised Traffic
  • 16. Distributed Databases • Definition: A single DBMS running across multiple CPUs, disks and/or networks, designed to permit safe parallel access.
  • 17. Concepts • Distributed Processing • Central database, distributed processing
  • 18. Distributed Processing • Multiple processors or cores • Same memory (multi-core) • Same disk (networked storage) • Concurrency Control provided by transactions
  • 19. Concepts • Distributed Processing • Central database, distributed processing • Ad-hoc Distributed Database • Database physically distributed over network
  • 20. Distributed Databases • Fragmentation • Large dataset broken up into smaller components
  • 21. Distributed Databases • Fragmentation • Large dataset broken up into smaller components • Allocation • Fragments should be stored according to usage
  • 22. Distributed Databases • Fragmentation • Large dataset broken up into smaller components • Allocation • Fragments should be stored according to usage • Replication • Copy maintained at multiple sites to take advantage of additional processing power or decreased latency
  • 23. Distributed Databases • Might be implicit! Fragmentation • Large dataset broken up into smaller components • Allocation • Fragments should be stored according to usage • Replication • Copy maintained at multiple sites to take advantage of additional processing power or decreased latency
  • 24. Distributed Databases (2) • How do we decide how to fragment a database? • Data, application and usage dependant!
  • 25. Distributed Databases (2) • How do we decide how to fragment a database? • Data, application and usage dependant! • Goals: • Locality • Minimal Communication • Balance storage, processing, monetary costs
  • 26. Concepts • Distributed Processing • Central database, distributed processing • Ad-hoc Distributed Database • Database physically distributed over network • Distributed DBMS (DDBMS) • Software that makes distribution transparent
  • 27. Distributed DBMS • Key Concept: Transparency
  • 28. Distributed DBMS • Key Concept: Transparency • Transparent Distribution
  • 29. Distributed DBMS • Key Concept: Transparency • Transparent Distribution • One centralised database from perspective of user (programmer)
  • 30. Distributed DBMS • Key Concept: Transparency • Transparent Distribution • One centralised database from perspective of user (programmer) • Transparent Transactions
  • 31. Distributed DBMS • Key Concept: Transparency • Transparent Distribution • One centralised database from perspective of user (programmer) • Transparent Transactions • Integrity of data maintained across multiple databases
  • 32. Distributed DBMS (2) • Requires advanced: • Recovery Services • Concurrency control • Focus of this course
  • 33. Summary • Advantages
  • 34. Summary • Advantages • Performance
  • 35. Summary • Advantages • Performance • Reflect organisational structure
  • 36. Summary • Advantages • Performance • Reflect organisational structure • Economics
  • 37. Summary • Advantages • Performance • Reflect organisational structure • Economics • Modular Growth
  • 38. Summary • Advantages • Performance • Reflect organisational structure • Economics • Modular Growth • Availability
  • 39. Summary • Advantages • Performance • Reflect organisational structure • Economics • Modular Growth • Availability • Reliability
  • 40. Summary (2) • Disadvantages
  • 41. Summary (2) • Disadvantages • Complexity (design, transparency, integrity)
  • 42. Summary (2) • Disadvantages • Complexity (design, transparency, integrity) • Security more of an issue
  • 43. Summary (2) • Disadvantages • Complexity (design, transparency, integrity) • Security more of an issue • Maintenance Costs
  • 44. Summary (2) • Disadvantages • Complexity (design, transparency, integrity) • Security more of an issue • Maintenance Costs • Lack of standards (so much dependant on data, application, usage, etc)

Notas del editor