MapR NFS
  7/6/2012

© 2012 MapR Technologies   Storage Service Architecture 1
MapR NFS
   Agenda
   • Before NFS
   • Exercise: Using hadoop fs
   • MapR NFS Overview
   • NFS Mounting Models
   • Exercise: Mount the Cluster to Itself and Copy a File




MapR NFS
   Objectives
   At the end of this module you will be able to:
   • Describe the tools that are used with HDFS and their limitations
   • List the features and benefits of MapR Direct Access NFS
   • Identify several use cases for NFS on Hadoop
   • Describe the different models for mounting NFS volumes with MapR




Before NFS




Before NFS
     • Getting work done without MapR NFS
      –   Example: mining web logs for meaningful data

          •   First: get the logs into the cluster
              – Do you stop writing to the logs?
              – How long does it take?
              – What do you do in the meantime?




                            hadoop fs -put, -copyFromLocal

                            distcp

                            Flume
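
The batch-copy step above can be sketched as follows. This is a minimal illustration, not part of the original deck: the helper name `hadoop_put_commands` and the file paths are hypothetical, and the sketch only builds the `hadoop fs -put` command lines rather than running them.

```python
import shlex


def hadoop_put_commands(local_files, cluster_dir):
    """Build one `hadoop fs -put` argument list per local log file.

    Each file is copied as a separate batch step, which is why the slide
    asks what happens to log lines written while the copy is running.
    """
    return [["hadoop", "fs", "-put", path, cluster_dir] for path in local_files]


if __name__ == "__main__":
    # Hypothetical log files and destination directory, for illustration only.
    commands = hadoop_put_commands(
        ["/var/log/web/access.log.1", "/var/log/web/access.log.2"],
        "/weblogs",
    )
    for argv in commands:
        print(shlex.join(argv))
```

Each list could be passed to `subprocess.run` on a host where the Hadoop client is installed.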




Before NFS
     • Getting work done without MapR NFS
      –   Example: mining web logs for meaningful data

          •   Next: getting answers from the data
              – What tools do you use to access data in your cluster?
              – Who is using these tools?




                         hadoop fs

                         hadoop fs -copyToLocal




Exercise:
                              hadoop fs



MapR NFS Overview




Introducing Direct Access NFS
     • Drag and drop
     • Open files and apps
     • Watch live data come in
     • Fully read/write
     • Multiple concurrent reads/writes
     • Easy to get data in/out
     • Same namespace paths as Hadoop
     • Mount with -o tcp,vers=3,nolock
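
As a rough sketch of the mount step, the helper below assembles a Linux `mount` command line using TCP, NFS version 3, and `nolock`. On Linux the version option is spelled `vers=3` (or `nfsvers=3`). The function name, the server hostname, and the `/mapr` export and mount point are assumptions made for illustration, not taken from the slide.

```python
def nfs_mount_command(server, export="/mapr", mount_point="/mapr",
                      options=("tcp", "vers=3", "nolock")):
    """Return the argument list for mounting an NFS export.

    `server`, `export`, and `mount_point` are assumed values; adjust them
    to match the cluster being mounted.
    """
    return [
        "mount", "-t", "nfs",
        "-o", ",".join(options),
        f"{server}:{export}",
        mount_point,
    ]


if __name__ == "__main__":
    # Hypothetical NFS server hostname.
    print(" ".join(nfs_mount_command("node1.example.com")))
```

The command needs root privileges and an existing mount point when it is actually run.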




Getting Data into the Cluster with NFS
     • Simply mount your cluster filesystem locally
      –   /mapr/my.cluster.com/myvolume/…
     • Redirect (or copy) logs to cluster filesystem
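
Once the cluster is mounted, redirecting a log is ordinary file I/O. The sketch below appends lines to a file with the standard `open` call. The helper name and the `logs/access.log` path under the mount are hypothetical, and the demo writes to a temporary directory so that it runs without a cluster.

```python
import os
import tempfile


def append_log_line(log_path, line):
    """Append one line to a log file, creating parent directories if needed."""
    parent = os.path.dirname(log_path)
    if parent:
        os.makedirs(parent, exist_ok=True)
    with open(log_path, "a", encoding="utf-8") as log_file:
        log_file.write(line.rstrip("\n") + "\n")


if __name__ == "__main__":
    # A temporary directory stands in for the NFS mount. On a real client the
    # path might look like /mapr/my.cluster.com/myvolume/logs/access.log
    # (hypothetical subdirectory and file name).
    with tempfile.TemporaryDirectory() as fake_mount:
        path = os.path.join(fake_mount, "logs", "access.log")
        append_log_line(path, "GET /index.html 200")
        append_log_line(path, "GET /missing 404")
        with open(path, encoding="utf-8") as log_file:
            print(log_file.read(), end="")
```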




     [Diagram: log data flowing into the cluster]




Using NFS with Clients
     • Mount your cluster filesystem locally
      –   /mapr/my.cluster.com/myvolume/…
     • Access all of the data in your cluster with standard applications
      –   Double-click a file (spreadsheet, report, etc.)
          •   edit, save, etc.
      –   Run a Perl script
      –   grep, sed, awk, shell scripts, etc.
     • Use Hadoop apps (Hive, Pig, HBase, etc.)
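
To illustrate the "standard applications" point, here is a small grep-like script that uses nothing but ordinary file APIs. It is a sketch under the assumption that the cluster is mounted somewhere on the local filesystem. The function name is made up for this example, and the demo runs against a temporary directory instead of a real mount.

```python
import os
import re
import tempfile


def grep_tree(root, pattern):
    """Yield (path, line_number, line) for every line under `root` that matches."""
    regex = re.compile(pattern)
    for dirpath, _dirnames, filenames in os.walk(root):
        for filename in sorted(filenames):
            path = os.path.join(dirpath, filename)
            with open(path, "r", encoding="utf-8", errors="replace") as handle:
                for number, line in enumerate(handle, start=1):
                    if regex.search(line):
                        yield path, number, line.rstrip("\n")


if __name__ == "__main__":
    # A temporary directory stands in for the mounted volume.
    with tempfile.TemporaryDirectory() as fake_mount:
        with open(os.path.join(fake_mount, "access.log"), "w", encoding="utf-8") as f:
            f.write("GET /index.html 200\nGET /missing 404\n")
        for path, number, line in grep_tree(fake_mount, r" 404$"):
            print(f"{os.path.basename(path)}:{number}: {line}")
```

The same function would work unchanged on a path under the NFS mount, because the script never needs to know the files live in the cluster.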




Using NFS with Clients
     • Data is constantly changing
     • How do I view the data at a point in time?
      –   Easy: take a snapshot
      –   cd to the snapshot volume
      –   View the data of interest
      –   Note: taking a snapshot has little to no overhead
          •   The only write overhead is the data that changed
              –   Recall snapshot architecture
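
A sketch of the "cd to the snapshot and view the data" step follows. It assumes snapshots appear under a `.snapshot` directory at the root of the volume. That directory name, the snapshot name, and the helper names are assumptions for illustration, so check them against your cluster. The demo fakes the layout in a temporary directory.

```python
import filecmp
import os
import tempfile


def snapshot_path(volume_root, snapshot_name, relative_path, snapshot_dir=".snapshot"):
    """Return the path of a file as it existed in the named snapshot."""
    return os.path.join(volume_root, snapshot_dir, snapshot_name, relative_path)


def changed_since_snapshot(volume_root, snapshot_name, relative_path):
    """Return True if the live file's contents differ from the snapshot copy."""
    live = os.path.join(volume_root, relative_path)
    frozen = snapshot_path(volume_root, snapshot_name, relative_path)
    return not filecmp.cmp(live, frozen, shallow=False)


if __name__ == "__main__":
    # Build a fake volume with one snapshot so the sketch runs anywhere.
    with tempfile.TemporaryDirectory() as volume:
        os.makedirs(os.path.join(volume, ".snapshot", "monday"))
        with open(snapshot_path(volume, "monday", "report.txt"), "w") as f:
            f.write("100 hits\n")
        with open(os.path.join(volume, "report.txt"), "w") as f:
            f.write("250 hits\n")
        print(changed_since_snapshot(volume, "monday", "report.txt"))
```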




Using NFS with Legacy Applications
     • What if I have legacy applications that work with my data?
      –   Log analysis, reporting tools, etc.
     • Mount the cluster where your tools run
      –   Take a snapshot if necessary
      –   Then use your app just as before




NFS Mounting Models




NFS Mounting Models

     • Export to the world
      –   NFS gateway runs on selected gateway hosts
     • Local server
      –   NFS server runs on the local host
      –   Enables local compression and checksumming
     • Export to self
      –   NFS gateway runs on all data nodes, mounted from localhost
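
One way to read the three models is by asking which host the client mounts from. The sketch below encodes that reading. The enum, the function name, and the gateway hostname are hypothetical, and the mapping is an interpretation of the bullets above rather than a MapR API.

```python
from enum import Enum


class MountModel(Enum):
    EXPORT_TO_WORLD = "export to the world"
    LOCAL_SERVER = "local server"
    EXPORT_TO_SELF = "export to self"


def nfs_server_host(model, gateway_host=None):
    """Return the host a client would mount from under the given model."""
    if model is MountModel.EXPORT_TO_WORLD:
        # Clients outside the cluster must name one of the gateway hosts.
        if not gateway_host:
            raise ValueError("export to the world needs a gateway host")
        return gateway_host
    # Local server and export to self both mount from the machine itself.
    return "localhost"


if __name__ == "__main__":
    # Hypothetical gateway hostname.
    print(nfs_server_host(MountModel.EXPORT_TO_WORLD, "gateway1.example.com"))
    print(nfs_server_host(MountModel.EXPORT_TO_SELF))
```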




Export to the World



     [Diagram: one NFS client and four NFS servers]




Local Server


     [Diagram: a client running an application and an NFS server, connected to the cluster nodes]



Universal Export to Self


     [Diagram: a cluster node running an application and an NFS server, alongside the other cluster nodes]




Nodes are Identical

     [Diagram: three cluster nodes, each running an application and an NFS server]



Exercise:
                       Mount the Cluster to
                       Itself and Copy a File


Questions




