SlideShare una empresa de Scribd logo
1 de 32
Descargar para leer sin conexión
XtreemFS
Extreme cloud file system?!
         Udo Seidel




           OSDC 2012
Agenda
●   Background/motivation
●   High level overview
●   High Availability
●   Security
●   Summary




                          OSDC 2012
Distributed file systems
●   Part of shared file systems family
●   Around for a while
●   “back” in scope
    ●   Storage challenges
        –   More
        –   Faster
        –   Cheaper
    ●   XaaS


                             OSDC 2012
Shared file systems family
●   Multiple server access the same data
●   Different approaches
    ●   Network based, e.g. NFS, CIFS
    ●   Clustered
        –   Shared disk, e.g. CXFS, CFS, GFS(2), OCFS2
        –   Distributed, e.g. Lustre, CephFS, GlusterFS .... and
            XtreemFS




                                 OSDC 2012
Distributed file systems – why?
●   More efficient utilization of distributed hardware
    ●   Storage
    ●   CPU/Network
●   Scalability ... capacity demands
    ●   Amount
    ●   I/O requirements




                           OSDC 2012
Distributed file systems – which?
●   HDFS (Hadoop)
●   CephFS ..
●   GlusterFS .. RedHat
●   ...
●   XtreemFS




                          OSDC 2012
History
●   European Research project (2006-2010)
●   Part of XtreemOS
    ●   Linux based grid O/S
    ●   Member of OpenGridForum
    ●   Need of distributed file system




                             OSDC 2012
Implementation I
●   Java
    ●   Supported O/S
        –   Linux
        –   MacOS X with manual work
        –   Free/Net/OpenBSD?
        –   No Windows anymore
    ●   Server and Client (fuse) ... both in user space
●   Non-privileged user


                             OSDC 2012
Implementation II
●   IP based
    ●   Different ports for different XtreemFS services
    ●   Clear text vs. encrypted
●   Object based storage
    ●   Software implementation
    ●   OSD features in XtreemFS code
        –   Copy on write
        –   Snapshotting


                             OSDC 2012
XtreemFS – the architecture I
●   4 components
    ●   Object based Storage Devices
    ●   Meta Data and Replica Catalogue Servers
    ●   Directory Service
    ●   Clients ;-)




                            OSDC 2012
XtreemFS – the architecture II




             OSDC 2012
XtreemFS services
●   Several
    ●   OSD
    ●   MRC
    ●   Volumes
●   UUID's
    ●   Abstraction from network
    ●   Change requires outage
    ●   Plans for topology

                             OSDC 2012
XtreemFS – DIR/MRC data
●   Data stored locally
    ●   BabuDB
    ●   Independent of OSD
●   Write buffers
            Modus                          Description
ASYNC                     Asynchronous log entry write
FSYNC                     Fsync() called after log entry write and before
                          ack'ing of operation
SYNC_WRITE                Synchronous log entry write, ack'ing of operation
                          before meta data update
SYNC_WRITE_METADATA       Synchronous log entry write and meta data
                          update before ack'ing of operation
                          OSDC 2012
XtreemFS – OSD data
●   File cut in 128 Kbyte pieces
●   Default: entire file on one OSD
●   Distribution across multiple OSD's possible
    ●   RAID 0 implemented
    ●   RAID 5 planned
    ●   Parallel reads/writes




                                OSDC 2012
XtreemFS interfaces
●   HTTP
    ●   Read-only
    ●   Read-write planned
●   Command line
    ●   All purposes




                             OSDC 2012
XtreemFS interfaces




       OSDC 2012
XtreemFS – high level summary
●   Multi-platform
●   Abstraction via UUID
●   Communication separation
●   Freedom of choice of OSD backend file system
●   HPC out of scope




                       OSDC 2012
XtreemFS – HA in general
●   One part: OSD
    ●   Replication via policies
●   Other part: MRC and DIR
    ●   Local data stored in BabuDB's
    ●   Synchronization via BabuDB methods




                             OSDC 2012
XtreemFS – HA for MRC/DIR
●   Master/slave
    ●   Master changes -> log file without buffering
    ●   Log file entries propagation to slaves
    ●   Quorum needed => at least 3 instances
    ●   No automation for DIR
●   Synchronization
    ●   in clear text
    ●   Encryption via SSL possible

                            OSDC 2012
XtreemFS OSD replication
●   File replication
    ●   Read-only
         –   Since 1.0
         –   Easy to handle
    ●   Read-write
         –   Only since 1.3
         –   Later more
●   Copies
    ●   Full
    ●   Partial aka on-demand
                              OSDC 2012
XtreemFS r/o replication
●   Arbitrary amount of replicas
●   Equally treated replicas
●   Only OSD local access
●   No sync needed
●   Use case
    ●   Static files :-)
    ●   Low bandwidth (partial replica)
    ●   Big static files (partial replica)
                               OSDC 2012
XtreemFS r/w replication
●   Primary/secondary
●   Election on demand with leases
●   Read/write access
    ●   First primary
    ●   Propagated to secondaries




                          OSDC 2012
XtreemFS r/w replication - failure
●   Secondary
    ●   Behaviour configurable
    ●   Write failure vs. Write on remaining
        –   Quorum needed
●   Primary
    ●   Behaviour configurable
    ●   Write failure vs. Write on remaining
        –   Quorum needed


                            OSDC 2012
XtreemFS OSD/replica policies
●   OSD selection for new files
●   Replica selection for new/additional copies
●   Categories: filter, group, sort
●   Combination of rules
                   Policy                        Category
      Standard OSD                filter
      FQDN based                  filter, group, sort
      UUID based                  filter
      Data center topology        group, sort
      random                      sort

                             OSDC 2012
XtreemFS HA summary
●   Homework needed for DIR and MRC
●   OSD
    ●   Lateness of OSD read-write replication
    ●   OSD Read-only replication
         –   Mature and WAN ready
    ●   Access time improvement via striping
    ●   Flexibility of policies



                                  OSDC 2012
XtreemFS encryption
●   Not on file system level
●   For communication
    ●   Interaction of DIR, MRC and OSD
    ●   Data replication for HA for DIR and/or MRC




                           OSDC 2012
XtreemFS channel encryption
●   Via SSL
    ●   PCKS#12 or Java Key Store (JKS)
    ●   Locally stored
        –   service/client certificates
        –   root CA certificates
●   Two modes
    ●   All-Or-Nothing approach
    ●   Grid-SSL
        –   just authentication

                                  OSDC 2012
XtreemFS secure channel
                encryption
●   Password protection of certificates
    ●   MRC/DIR/OSD: stored service configuration
    ●   Client: via CLI!!




                            OSDC 2012
XtreemFS encryption summary
●   Data encryption on POSIX layer?
●   SSL obvious choice for TCP/IP channels
    ●   Missing PKI contradicts scalability
    ●   Password protection needs re-design




                            OSDC 2012
Summary
●   High self-defined goals
    ●   Some dropped?
    ●   Some partially implemented
●   Ok for R&D Labs
    ●   HA and housekeeping improvement needed
    ●   Encryption w/o PKI




                             OSDC 2012
References
●   http://www.xtreemfs.org
●   http://babudb.googlecode.com




                       OSDC 2012
Thank you!




   OSDC 2012

Más contenido relacionado

La actualidad más candente

Sdc 2012-challenges
Sdc 2012-challengesSdc 2012-challenges
Sdc 2012-challengesGluster.org
 
20160401 Gluster-roadmap
20160401 Gluster-roadmap20160401 Gluster-roadmap
20160401 Gluster-roadmapGluster.org
 
20160401 guster-roadmap
20160401 guster-roadmap20160401 guster-roadmap
20160401 guster-roadmapGluster.org
 
Gluster intro-tdose
Gluster intro-tdoseGluster intro-tdose
Gluster intro-tdoseGluster.org
 
Gluster d thread_synchronization_using_urcu_lca2016
Gluster d thread_synchronization_using_urcu_lca2016Gluster d thread_synchronization_using_urcu_lca2016
Gluster d thread_synchronization_using_urcu_lca2016Gluster.org
 
Introduction to highly_availablenfs_server_on_scale-out_storage_systems_based...
Introduction to highly_availablenfs_server_on_scale-out_storage_systems_based...Introduction to highly_availablenfs_server_on_scale-out_storage_systems_based...
Introduction to highly_availablenfs_server_on_scale-out_storage_systems_based...Gluster.org
 
Gluster technical overview
Gluster technical overviewGluster technical overview
Gluster technical overviewGluster.org
 
20160130 Gluster-roadmap
20160130 Gluster-roadmap20160130 Gluster-roadmap
20160130 Gluster-roadmapGluster.org
 
GlusterD - Daemon refactoring
GlusterD - Daemon refactoringGlusterD - Daemon refactoring
GlusterD - Daemon refactoringAtin Mukherjee
 
Gluster intro-tdose
Gluster intro-tdoseGluster intro-tdose
Gluster intro-tdoseGluster.org
 
Persistent Storage in Openshift using GlusterFS
Persistent Storage in Openshift using GlusterFSPersistent Storage in Openshift using GlusterFS
Persistent Storage in Openshift using GlusterFSHumble Chirammal
 
Scale out backups-with_bareos_and_gluster
Scale out backups-with_bareos_and_glusterScale out backups-with_bareos_and_gluster
Scale out backups-with_bareos_and_glusterGluster.org
 
Software defined storage
Software defined storageSoftware defined storage
Software defined storageGluster.org
 
Lt2013 glusterfs.talk
Lt2013 glusterfs.talkLt2013 glusterfs.talk
Lt2013 glusterfs.talkUdo Seidel
 
OSBConf 2015 | Scale out backups with bareos and gluster by niels de vos
OSBConf 2015 | Scale out backups with bareos and gluster by niels de vosOSBConf 2015 | Scale out backups with bareos and gluster by niels de vos
OSBConf 2015 | Scale out backups with bareos and gluster by niels de vosNETWAYS
 
Developing apps and_integrating_with_gluster_fs_-_libgfapi
Developing apps and_integrating_with_gluster_fs_-_libgfapiDeveloping apps and_integrating_with_gluster_fs_-_libgfapi
Developing apps and_integrating_with_gluster_fs_-_libgfapiGluster.org
 
Performant and Resilient Storage: The Open Source & Linux Way
Performant and Resilient Storage: The Open Source & Linux WayPerformant and Resilient Storage: The Open Source & Linux Way
Performant and Resilient Storage: The Open Source & Linux WayOpenNebula Project
 

La actualidad más candente (20)

Cncf meetup-rook
Cncf meetup-rookCncf meetup-rook
Cncf meetup-rook
 
Sdc 2012-challenges
Sdc 2012-challengesSdc 2012-challenges
Sdc 2012-challenges
 
20160401 Gluster-roadmap
20160401 Gluster-roadmap20160401 Gluster-roadmap
20160401 Gluster-roadmap
 
20160401 guster-roadmap
20160401 guster-roadmap20160401 guster-roadmap
20160401 guster-roadmap
 
Gluster intro-tdose
Gluster intro-tdoseGluster intro-tdose
Gluster intro-tdose
 
Gluster d thread_synchronization_using_urcu_lca2016
Gluster d thread_synchronization_using_urcu_lca2016Gluster d thread_synchronization_using_urcu_lca2016
Gluster d thread_synchronization_using_urcu_lca2016
 
Introduction to highly_availablenfs_server_on_scale-out_storage_systems_based...
Introduction to highly_availablenfs_server_on_scale-out_storage_systems_based...Introduction to highly_availablenfs_server_on_scale-out_storage_systems_based...
Introduction to highly_availablenfs_server_on_scale-out_storage_systems_based...
 
Gdeploy 2.0
Gdeploy 2.0Gdeploy 2.0
Gdeploy 2.0
 
Gluster technical overview
Gluster technical overviewGluster technical overview
Gluster technical overview
 
20160130 Gluster-roadmap
20160130 Gluster-roadmap20160130 Gluster-roadmap
20160130 Gluster-roadmap
 
GlusterD - Daemon refactoring
GlusterD - Daemon refactoringGlusterD - Daemon refactoring
GlusterD - Daemon refactoring
 
Gluster intro-tdose
Gluster intro-tdoseGluster intro-tdose
Gluster intro-tdose
 
Persistent Storage in Openshift using GlusterFS
Persistent Storage in Openshift using GlusterFSPersistent Storage in Openshift using GlusterFS
Persistent Storage in Openshift using GlusterFS
 
Scale out backups-with_bareos_and_gluster
Scale out backups-with_bareos_and_glusterScale out backups-with_bareos_and_gluster
Scale out backups-with_bareos_and_gluster
 
Software defined storage
Software defined storageSoftware defined storage
Software defined storage
 
Lt2013 glusterfs.talk
Lt2013 glusterfs.talkLt2013 glusterfs.talk
Lt2013 glusterfs.talk
 
OSBConf 2015 | Scale out backups with bareos and gluster by niels de vos
OSBConf 2015 | Scale out backups with bareos and gluster by niels de vosOSBConf 2015 | Scale out backups with bareos and gluster by niels de vos
OSBConf 2015 | Scale out backups with bareos and gluster by niels de vos
 
Developing apps and_integrating_with_gluster_fs_-_libgfapi
Developing apps and_integrating_with_gluster_fs_-_libgfapiDeveloping apps and_integrating_with_gluster_fs_-_libgfapi
Developing apps and_integrating_with_gluster_fs_-_libgfapi
 
Performant and Resilient Storage: The Open Source & Linux Way
Performant and Resilient Storage: The Open Source & Linux WayPerformant and Resilient Storage: The Open Source & Linux Way
Performant and Resilient Storage: The Open Source & Linux Way
 
Scaling Docker Registry
Scaling Docker RegistryScaling Docker Registry
Scaling Docker Registry
 

Destacado

One Size Doesn't Fit All: The New Database Revolution
One Size Doesn't Fit All: The New Database RevolutionOne Size Doesn't Fit All: The New Database Revolution
One Size Doesn't Fit All: The New Database Revolutionmark madsen
 
Linuxkongress2010.gfs2ocfs2.talk
Linuxkongress2010.gfs2ocfs2.talkLinuxkongress2010.gfs2ocfs2.talk
Linuxkongress2010.gfs2ocfs2.talkUdo Seidel
 
Cephfs - Red Hat Openstack and Ceph meetup, Pune 28th november 2015
Cephfs - Red Hat Openstack and Ceph meetup, Pune 28th november 2015Cephfs - Red Hat Openstack and Ceph meetup, Pune 28th november 2015
Cephfs - Red Hat Openstack and Ceph meetup, Pune 28th november 2015bipin kunal
 
History of storage devices
History of storage devicesHistory of storage devices
History of storage devicesdan13ljg
 
Storage devices powerpoint
Storage devices powerpointStorage devices powerpoint
Storage devices powerpointfairy_elephant
 
Determine the Right Analytic Database: A Survey of New Data Technologies
Determine the Right Analytic Database: A Survey of New Data TechnologiesDetermine the Right Analytic Database: A Survey of New Data Technologies
Determine the Right Analytic Database: A Survey of New Data Technologiesmark madsen
 
Future Information Growth And Storage Device Reliability 2007
Future Information Growth And Storage Device Reliability 2007Future Information Growth And Storage Device Reliability 2007
Future Information Growth And Storage Device Reliability 2007Andrei Khurshudov
 

Destacado (7)

One Size Doesn't Fit All: The New Database Revolution
One Size Doesn't Fit All: The New Database RevolutionOne Size Doesn't Fit All: The New Database Revolution
One Size Doesn't Fit All: The New Database Revolution
 
Linuxkongress2010.gfs2ocfs2.talk
Linuxkongress2010.gfs2ocfs2.talkLinuxkongress2010.gfs2ocfs2.talk
Linuxkongress2010.gfs2ocfs2.talk
 
Cephfs - Red Hat Openstack and Ceph meetup, Pune 28th november 2015
Cephfs - Red Hat Openstack and Ceph meetup, Pune 28th november 2015Cephfs - Red Hat Openstack and Ceph meetup, Pune 28th november 2015
Cephfs - Red Hat Openstack and Ceph meetup, Pune 28th november 2015
 
History of storage devices
History of storage devicesHistory of storage devices
History of storage devices
 
Storage devices powerpoint
Storage devices powerpointStorage devices powerpoint
Storage devices powerpoint
 
Determine the Right Analytic Database: A Survey of New Data Technologies
Determine the Right Analytic Database: A Survey of New Data TechnologiesDetermine the Right Analytic Database: A Survey of New Data Technologies
Determine the Right Analytic Database: A Survey of New Data Technologies
 
Future Information Growth And Storage Device Reliability 2007
Future Information Growth And Storage Device Reliability 2007Future Information Growth And Storage Device Reliability 2007
Future Information Growth And Storage Device Reliability 2007
 

Similar a Osdc2012 xtfs.talk

OSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo Seidel
OSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo SeidelOSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo Seidel
OSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo SeidelNETWAYS
 
CS 626 - March : Capsicum: Practical Capabilities for UNIX
CS 626 - March : Capsicum: Practical Capabilities for UNIXCS 626 - March : Capsicum: Practical Capabilities for UNIX
CS 626 - March : Capsicum: Practical Capabilities for UNIXruchith
 
Community Update at OpenStack Summit Boston
Community Update at OpenStack Summit BostonCommunity Update at OpenStack Summit Boston
Community Update at OpenStack Summit BostonSage Weil
 
Wheeler w 0450_linux_file_systems1
Wheeler w 0450_linux_file_systems1Wheeler w 0450_linux_file_systems1
Wheeler w 0450_linux_file_systems1sprdd
 
Wheeler w 0450_linux_file_systems1
Wheeler w 0450_linux_file_systems1Wheeler w 0450_linux_file_systems1
Wheeler w 0450_linux_file_systems1sprdd
 
2021.02 new in Ceph Pacific Dashboard
2021.02 new in Ceph Pacific Dashboard2021.02 new in Ceph Pacific Dashboard
2021.02 new in Ceph Pacific DashboardCeph Community
 
MongoDB: Advantages of an Open Source NoSQL Database
MongoDB: Advantages of an Open Source NoSQL DatabaseMongoDB: Advantages of an Open Source NoSQL Database
MongoDB: Advantages of an Open Source NoSQL DatabaseFITC
 
The Forefront of the Development for NVDIMM on Linux Kernel
The Forefront of the Development for NVDIMM on Linux KernelThe Forefront of the Development for NVDIMM on Linux Kernel
The Forefront of the Development for NVDIMM on Linux KernelYasunori Goto
 
What CloudStackers Need To Know About LINSTOR/DRBD
What CloudStackers Need To Know About LINSTOR/DRBDWhat CloudStackers Need To Know About LINSTOR/DRBD
What CloudStackers Need To Know About LINSTOR/DRBDShapeBlue
 
SJTU Summary report
SJTU Summary reportSJTU Summary report
SJTU Summary reportYves Chan
 
An Introduction to Redis for .NET Developers.pdf
An Introduction to Redis for .NET Developers.pdfAn Introduction to Redis for .NET Developers.pdf
An Introduction to Redis for .NET Developers.pdfStephen Lorello
 
7. Cloud Native Computing - Kubernetes - Bratislava - Rook.io
7. Cloud Native Computing - Kubernetes - Bratislava - Rook.io7. Cloud Native Computing - Kubernetes - Bratislava - Rook.io
7. Cloud Native Computing - Kubernetes - Bratislava - Rook.ioDávid Kőszeghy
 
OSDC 2015: John Spray | The Ceph Storage System
OSDC 2015: John Spray | The Ceph Storage SystemOSDC 2015: John Spray | The Ceph Storage System
OSDC 2015: John Spray | The Ceph Storage SystemNETWAYS
 
Post Mortem Debugging in Embedded Linux Systems
Post Mortem Debugging in Embedded Linux Systems Post Mortem Debugging in Embedded Linux Systems
Post Mortem Debugging in Embedded Linux Systems GlobalLogic Ukraine
 
Porting the drm/kms graphic drivers to DragonFlyBSD by Francois Tigeot
Porting the drm/kms graphic drivers to DragonFlyBSD by Francois TigeotPorting the drm/kms graphic drivers to DragonFlyBSD by Francois Tigeot
Porting the drm/kms graphic drivers to DragonFlyBSD by Francois Tigeoteurobsdcon
 

Similar a Osdc2012 xtfs.talk (20)

OSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo Seidel
OSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo SeidelOSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo Seidel
OSDC 2012 | Extremes Wolken Dateisystem!? by Dr. Udo Seidel
 
CS 626 - March : Capsicum: Practical Capabilities for UNIX
CS 626 - March : Capsicum: Practical Capabilities for UNIXCS 626 - March : Capsicum: Practical Capabilities for UNIX
CS 626 - March : Capsicum: Practical Capabilities for UNIX
 
Community Update at OpenStack Summit Boston
Community Update at OpenStack Summit BostonCommunity Update at OpenStack Summit Boston
Community Update at OpenStack Summit Boston
 
Wheeler w 0450_linux_file_systems1
Wheeler w 0450_linux_file_systems1Wheeler w 0450_linux_file_systems1
Wheeler w 0450_linux_file_systems1
 
Wheeler w 0450_linux_file_systems1
Wheeler w 0450_linux_file_systems1Wheeler w 0450_linux_file_systems1
Wheeler w 0450_linux_file_systems1
 
2021.02 new in Ceph Pacific Dashboard
2021.02 new in Ceph Pacific Dashboard2021.02 new in Ceph Pacific Dashboard
2021.02 new in Ceph Pacific Dashboard
 
XenSummit - 08/28/2012
XenSummit - 08/28/2012XenSummit - 08/28/2012
XenSummit - 08/28/2012
 
Block Storage For VMs With Ceph
Block Storage For VMs With CephBlock Storage For VMs With Ceph
Block Storage For VMs With Ceph
 
OpenZFS - AsiaBSDcon
OpenZFS - AsiaBSDconOpenZFS - AsiaBSDcon
OpenZFS - AsiaBSDcon
 
MongoDB: Advantages of an Open Source NoSQL Database
MongoDB: Advantages of an Open Source NoSQL DatabaseMongoDB: Advantages of an Open Source NoSQL Database
MongoDB: Advantages of an Open Source NoSQL Database
 
The Forefront of the Development for NVDIMM on Linux Kernel
The Forefront of the Development for NVDIMM on Linux KernelThe Forefront of the Development for NVDIMM on Linux Kernel
The Forefront of the Development for NVDIMM on Linux Kernel
 
What CloudStackers Need To Know About LINSTOR/DRBD
What CloudStackers Need To Know About LINSTOR/DRBDWhat CloudStackers Need To Know About LINSTOR/DRBD
What CloudStackers Need To Know About LINSTOR/DRBD
 
SJTU Summary report
SJTU Summary reportSJTU Summary report
SJTU Summary report
 
An Introduction to Redis for .NET Developers.pdf
An Introduction to Redis for .NET Developers.pdfAn Introduction to Redis for .NET Developers.pdf
An Introduction to Redis for .NET Developers.pdf
 
7. Cloud Native Computing - Kubernetes - Bratislava - Rook.io
7. Cloud Native Computing - Kubernetes - Bratislava - Rook.io7. Cloud Native Computing - Kubernetes - Bratislava - Rook.io
7. Cloud Native Computing - Kubernetes - Bratislava - Rook.io
 
OSDC 2015: John Spray | The Ceph Storage System
OSDC 2015: John Spray | The Ceph Storage SystemOSDC 2015: John Spray | The Ceph Storage System
OSDC 2015: John Spray | The Ceph Storage System
 
Vpm
VpmVpm
Vpm
 
Post Mortem Debugging in Embedded Linux Systems
Post Mortem Debugging in Embedded Linux Systems Post Mortem Debugging in Embedded Linux Systems
Post Mortem Debugging in Embedded Linux Systems
 
Linux on System z – disk I/O performance
Linux on System z – disk I/O performanceLinux on System z – disk I/O performance
Linux on System z – disk I/O performance
 
Porting the drm/kms graphic drivers to DragonFlyBSD by Francois Tigeot
Porting the drm/kms graphic drivers to DragonFlyBSD by Francois TigeotPorting the drm/kms graphic drivers to DragonFlyBSD by Francois Tigeot
Porting the drm/kms graphic drivers to DragonFlyBSD by Francois Tigeot
 

Último

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 

Último (20)

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 

Osdc2012 xtfs.talk

  • 1. XtreemFS Extreme cloud file system?! Udo Seidel OSDC 2012
  • 2. Agenda ● Background/motivation ● High level overview ● High Availability ● Security ● Summary OSDC 2012
  • 3. Distributed file systems ● Part of shared file systems family ● Around for a while ● “back” in scope ● Storage challenges – More – Faster – Cheaper ● XaaS OSDC 2012
  • 4. Shared file systems family ● Multiple server access the same data ● Different approaches ● Network based, e.g. NFS, CIFS ● Clustered – Shared disk, e.g. CXFS, CFS, GFS(2), OCFS2 – Distributed, e.g. Lustre, CephFS, GlusterFS .... and XtreemFS OSDC 2012
  • 5. Distributed file systems – why? ● More efficient utilization of distributed hardware ● Storage ● CPU/Network ● Scalability ... capacity demands ● Amount ● I/O requirements OSDC 2012
  • 6. Distributed file systems – which? ● HDFS (Hadoop) ● CephFS .. ● GlusterFS .. RedHat ● ... ● XtreemFS OSDC 2012
  • 7. History ● European Research project (2006-2010) ● Part of XtreemOS ● Linux based grid O/S ● Member of OpenGridForum ● Need of distributed file system OSDC 2012
  • 8. Implementation I ● Java ● Supported O/S – Linux – MacOS X with manual work – Free/Net/OpenBSD? – No Windows anymore ● Server and Client (fuse) ... both in user space ● Non-privileged user OSDC 2012
  • 9. Implementation II ● IP based ● Different ports for different XtreemFS services ● Clear text vs. encrypted ● Object based storage ● Software implementation ● OSD features in XtreemFS code – Copy on write – Snapshotting OSDC 2012
  • 10. XtreemFS – the architecture I ● 4 components ● Object based Storage Devices ● Meta Data and Replica Catalogue Servers ● Directory Service ● Clients ;-) OSDC 2012
  • 11. XtreemFS – the architecture II OSDC 2012
  • 12. XtreemFS services ● Several ● OSD ● MRC ● Volumes ● UUID's ● Abstraction from network ● Change requires outage ● Plans for topology OSDC 2012
  • 13. XtreemFS – DIR/MRC data ● Data stored locally ● BabuDB ● Independent of OSD ● Write buffers Modus Description ASYNC Asynchronous log entry write FSYNC Fsync() called after log entry write and before ack'ing of operation SYNC_WRITE Synchronous log entry write, ack'ing of operation before meta data update SYNC_WRITE_METADATA Synchronous log entry write and meta data update before ack'ing of operation OSDC 2012
  • 14. XtreemFS – OSD data ● File cut in 128 Kbyte pieces ● Default: entire file on one OSD ● Distribution across multiple OSD's possible ● RAID 0 implemented ● RAID 5 planned ● Parallel reads/writes OSDC 2012
  • 15. XtreemFS interfaces ● HTTP ● Read-only ● Read-write planned ● Command line ● All purposes OSDC 2012
  • 16. XtreemFS interfaces OSDC 2012
  • 17. XtreemFS – high level summary ● Multi-platform ● Abstraction via UUID ● Communication separation ● Freedom of choice of OSD backend file system ● HPC out of scope OSDC 2012
  • 18. XtreemFS – HA in general ● One part: OSD ● Replication via policies ● Other part: MRC and DIR ● Local data stored in BabuDB's ● Synchronization via BabuDB methods OSDC 2012
  • 19. XtreemFS – HA for MRC/DIR ● Master/slave ● Master changes -> log file without buffering ● Log file entries propagation to slaves ● Quorum needed => at least 3 instances ● No automation for DIR ● Synchronization ● in clear text ● Encryption via SSL possible OSDC 2012
  • 20. XtreemFS OSD replication ● File replication ● Read-only – Since 1.0 – Easy to handle ● Read-write – Only since 1.3 – Later more ● Copies ● Full ● Partial aka on-demand OSDC 2012
  • 21. XtreemFS r/o replication ● Arbitrary amount of replicas ● Equally treated replicas ● Only OSD local access ● No sync needed ● Use case ● Static files :-) ● Low bandwidth (partial replica) ● Big static files (partial replica) OSDC 2012
  • 22. XtreemFS r/w replication ● Primary/secondary ● Election on demand with leases ● Read/write access ● First primary ● Propagated to secondaries OSDC 2012
  • 23. XtreemFS r/w replication - failure ● Secondary ● Behaviour configurable ● Write failure vs. Write on remaining – Quorum needed ● Primary ● Behaviour configurable ● Write failure vs. Write on remaining – Quorum needed OSDC 2012
  • 24. XtreemFS OSD/replica policies ● OSD selection for new files ● Replica selection for new/additional copies ● Categories: filter, group, sort ● Combination of rules Policy Category Standard OSD filter FQDN based filter, group, sort UUID based filter Data center topology group, sort random sort OSDC 2012
  • 25. XtreemFS HA summary ● Homework needed for DIR and MRC ● OSD ● Lateness of OSD read-write replication ● OSD Read-only replication – Mature and WAN ready ● Access time improvement via striping ● Flexibility of policies OSDC 2012
  • 26. XtreemFS encryption ● Not on file system level ● For communication ● Interaction of DIR, MRC and OSD ● Data replication for HA for DIR and/or MRC OSDC 2012
  • 27. XtreemFS channel encryption ● Via SSL ● PCKS#12 or Java Key Store (JKS) ● Locally stored – service/client certificates – root CA certificates ● Two modes ● All-Or-Nothing approach ● Grid-SSL – just authentication OSDC 2012
  • 28. XtreemFS secure channel encryption ● Password protection of certificates ● MRC/DIR/OSD: stored service configuration ● Client: via CLI!! OSDC 2012
  • 29. XtreemFS encryption summary ● Data encryption on POSIX layer? ● SSL obvious choice for TCP/IP channels ● Missing PKI contradicts scalability ● Password protection needs re-design OSDC 2012
  • 30. Summary ● High self-defined goals ● Some dropped? ● Some partially implemented ● Ok for R&D Labs ● HA and housekeeping improvement needed ● Encryption w/o PKI OSDC 2012
  • 31. References ● http://www.xtreemfs.org ● http://babudb.googlecode.com OSDC 2012
  • 32. Thank you! OSDC 2012