SlideShare una empresa de Scribd logo
1 de 28
Descargar para leer sin conexión
OSC 2012 Tokyo


             openstack
         Open source software to build public and private clouds.


     Hadoop on OpenStack Swift
  - Experiment of using swift as storage
           for Apache Hadoop
                  2012.09.08
               OpenStack Japan
                  Zheng Xu
                                                                    1
Self introduction

 ●
     Software designer(engineer) for
     embedded system and web
     system(60%hobbit, 40%job).
 ●
     Major: openstack, linux, web browser,
     html, epub, OSS
 ●
     Contact
     ●
         @xz911
     ●
         https://www.facebook.com/xuzheng2001




                                                2
Abstract

●
    This slide is to introduce how to use OpenStack
    Swift as storage service for Apache Hadoop
    instead of HDFS(which is storage service of
    Hadoop project).
●
    This slide is based on
    http://bigdatacraft.com/archives/349, and really
    appreciate Constantine Peresypkin and David
    Gruzman for providing their idea and
    implementation.
                                                       3
Agenda
●
    OpenStack Swift


●
    Apache Hadoop and HDFS


●
    Experiment of replacing HDFS by OpenStack
    Swift



                                                4
What is OpenStack and Swift




  From http://www.openstack.org/
                                   5
What is OpenStack and Swift
                       User Application
                                               http


               Proxy Server            Proxy Server
                                                            http


   Account Server     Account Server       Account Server
                                                                   http




Container Server       Container Server           Container Server



Object Server Object Server        Object Server      Object Server

                                                                          6
What is OpenStack and Swift

●
    OpenSource written in Python
●
    diversity
    ●
        Swift can be a part of OpenStack or an individual
        service it self.
●
    zones, devices, partitions, and replicas
●
    No SPOF




                                                            7
Agenda
●
    OpenStack Swift


●
    Apache Hadoop and HDFS


●
    Experiment of replacing HDFS by OpenStack
    Swift



                                                8
Apache Hadoop and HDFS




         From http://hadoop.apache.org/
                                          9
Apache Hadoop and HDFS

             User Application


                                    Map-Reduce

               Name Node
                                             Hive



 Data Node   Data Node          Data Node




                                                    10
Agenda
●
    OpenStack Swift


●
    Apache Hadoop and HDFS


●
    Experiment of replacing HDFS by OpenStack
    Swift



                                                11
Experiment (Concept)

             User Application
                                      Map-Reduce




               Name Node                      Hive




 Data Node   Data Node          Data Node




                                                     12
Experiment (Concept)

                    User Application
                                                 Map-Reduce


                     java-cloudfiles   java-cloudfiles
                                                                    Hive
             http



                                                  java-cloudfiles
 Data Node          Data Node
             Swift


                                                                           13
Experiment (Software)
 ●
     Swift v1.6
     ●
         https://github.com/openstack/swift.git
     ●
         r21616cf, Jul 25
 ●
     Java Client java-cloudfiles
     ●
         https://github.com/rackspace/java-cloudfiles
     ●
         r0807fa6, Jun 4
 ●
     Apache Hadoop
     ●
         1.0.3
 ●
     Swift fs for Apache Hadoop(just part of following source code)
     ●
         https://github.com/Dazo-org/hadoop-common.git (branch-0.20-security-
         205.swift )

                                                                           14
Experiment (infra)




                     192.168.0.4
   192.168.0.9




                                   15
Experiment(install swift)

●
    Install swift based on
    http://docs.openstack.org/developer/swift/development_saio.html

●
    Do not forget to set bind_ip of proxy-server.conf
    ●
        192.168.0.9 in my case


    ●
        Suppose we have username as "test:tester" with
        password as "testing", the account name is
        AUTH_test and have some container based on
        steps in above Url.

                                                                      16
Experiment (cloudfiles)

●
    Run "ant compile"
●
    Change cloudfiles.properties to following
      # Auth info

      auth_url=http://192.168.0.9:8080/auth/v1.0

      auth_token_name=X-Auth-Token

      #auth_user_header=X-Storage-User

      #auth_pass_header=X-Storage-Pass



      # user properties

      username=test:tester

      password=testing



      # cloudfs properties

      version=v1

      connection_timeout=15000                     17
Experiment(cloudfiles)

●
    Connect cloudfiles to swift(this is option)
    ●
        Change cloudfiles.sh as following and run it to try
        connection with swift
        #!/bin/sh


        export CLASSPATH=lib/httpcore-4.1.4.jar:lib/commons-cli-
        1.1.jar:lib/httpclient-4.1.3.jar:lib/commons-lang-
        2.4.jar:lib/junit.jar:lib/commons-codec-1.3.jar:lib/commons-io-
        1.4.jar:lib/commons-logging-1.1.1.jar:lib/log4j-1.2.15.jar:dist/java-
        cloudfiles.jar:.
        java com.rackspacecloud.client.cloudfiles.sample.FilesCli $@

                                                                                18
Experiment (cloudfiles)

●
    Packaging java-cloudfiles to jar file for Apache
    Hadoop (clone java-cloudfiles to ~/java-
    cloudfiles)
    ●
        We need to put *.properties into java-cloudfiles.jar
        $ ant package
        $ cd cloudfiles/dist
        $ cp ../*.properties .
        $ rm java-cloudfiles.jar
        $ jar cvf java-cloudfiles.jar ./*
                                                               19
Experiment (hadoop)
●
    Prepare
    ●
        download hadoop to ~/hadoop-1.0.3 (newest stable
        version of original hadoop) and git clone
        https://github.com/Dazo-org/hadoop-common.git to
        ~/hadoop-common (old hadoop source code with
        swift fs plugin)
    ●
        At ~/hadoop-1.0.3 (copy java-cloudfiles and related
        library to hadoop lib folder)
        –   cd lib;cp ~/java-cloudfiles/cloudfiles/dist/java-cloudfiles.jar .
        –   cp ~/java-cloudfiles/lib/httpc* .

                                                                         20
Experiment (setting hadoop)

●
    ./hadoop-1.0.3/src/core/core-default.xml
    ●
        Add following to make hadoop can recognize
        handle "swift://" schema to SwiftFileSystem class
    <property>

     <name>fs.swift.impl</name>
    <value>org.apache.hadoop.fs.swift.SwiftFileSystem</value>

    <description>The FileSystem for swift: uris.</description>

    </property>




                                                                 21
Experiment (hadoop)

●
    Copy implementation for swift fs to hadoop
    1.0.3 and build
    ●
        cp -R ../hadoop-
        common/src/core/org/apache/hadoop/fs/swift
        ./src/core/org/apache/hadoop/fs
    ●
        ant




                                                     22
Experiment(hadoop setting)

●
    ./conf/core-site.xml (part1)
    ●
         Add following property for example
    <property>

        <name>fs.swift.userName</name>

        <value>test:tester</value>

    </property>




                                              23
Experiment (hadoop setting)
●
    ./conf/core-site.xml (part2)
    ●
         Add following property for example
    <property>

        <name>fs.swift.userPassword</name>

        <value>testing</value>

    </property>

    <property>

        <name>fs.swift.acccountname</name>

        <value>AUTH_test</value>

    </property>



                                              24
Experiment (hadoop setting)
●
    ./conf/core-site.xml (part3)
    ●
         Add following property for example
     <property>

        <name>fs.swift.authUrl</name>

        <value>http://192.168.0.9:8080/auth/v1.0</value>

     </property>



     <property>

    <name>fs.default.name</name>

    <value>swift://192.168.0.9:8080/v1/AUTH_test</value>

     </property>

                                                           25
Experiment (check swift fs)

●
    At this time, we should can list account
    information via following command
    ●
        ./bin/hadoop -fs -ls /
    ●
        or ./bin/hadoop fs -put ./conf/core-site.xml
        /test_container/core-site.xml (test_container is a test
        container created after swift installed)




                                                             26
Finally

●
    We installed swift for storage service of hadoop
●
    We built origin java-cloudfiles and created
    packages for hadoop
●
    We copied fs.swift plugin from
    https://github.com/Dazo-org/hadoop-common.git
    to new hadoop source tree and build hadoop
●
    We set up core-site.xml of hadoop to connect to
    swift via java-cloudfiles

                                                   27
Thank you for listening.




                           28

Más contenido relacionado

La actualidad más candente

Cloud Foundry 百日行 振り返り
Cloud Foundry 百日行 振り返りCloud Foundry 百日行 振り返り
Cloud Foundry 百日行 振り返りnota-ja
 
Docker to the Rescue of an Ops Team
Docker to the Rescue of an Ops TeamDocker to the Rescue of an Ops Team
Docker to the Rescue of an Ops TeamRachid Zarouali
 
Docker for Developers - Sunshine PHP
Docker for Developers - Sunshine PHPDocker for Developers - Sunshine PHP
Docker for Developers - Sunshine PHPChris Tankersley
 
Introduction and Deep Dive Into Containerd
Introduction and Deep Dive Into ContainerdIntroduction and Deep Dive Into Containerd
Introduction and Deep Dive Into ContainerdKohei Tokunaga
 
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-Ranchers
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-Ranchers제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-Ranchers
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-RanchersTommy Lee
 
Использование Docker в CI / Александр Акбашев (HERE Technologies)
Использование Docker в CI / Александр Акбашев (HERE Technologies)Использование Docker в CI / Александр Акбашев (HERE Technologies)
Использование Docker в CI / Александр Акбашев (HERE Technologies)Ontico
 
OpenStack for Centos
OpenStack for CentosOpenStack for Centos
OpenStack for CentosChandan Kumar
 
Dockerを利用したローカル環境から本番環境までの構築設計
Dockerを利用したローカル環境から本番環境までの構築設計Dockerを利用したローカル環境から本番環境までの構築設計
Dockerを利用したローカル環境から本番環境までの構築設計Koichi Nagaoka
 
Kubernetes Story - Day 2: Quay.io Container Registry for Publishing, Building...
Kubernetes Story - Day 2: Quay.io Container Registry for Publishing, Building...Kubernetes Story - Day 2: Quay.io Container Registry for Publishing, Building...
Kubernetes Story - Day 2: Quay.io Container Registry for Publishing, Building...Mihai Criveti
 
Build Your Own CaaS (Container as a Service)
Build Your Own CaaS (Container as a Service)Build Your Own CaaS (Container as a Service)
Build Your Own CaaS (Container as a Service)HungWei Chiu
 
Red Hat Enterprise Linux OpenStack Platform 7 - VM Instance HA Architecture
Red Hat Enterprise Linux OpenStack Platform 7 - VM Instance HA ArchitectureRed Hat Enterprise Linux OpenStack Platform 7 - VM Instance HA Architecture
Red Hat Enterprise Linux OpenStack Platform 7 - VM Instance HA ArchitectureEtsuji Nakai
 
Gdg cloud taipei ddt meetup #53 buildpack
Gdg cloud taipei ddt meetup #53 buildpackGdg cloud taipei ddt meetup #53 buildpack
Gdg cloud taipei ddt meetup #53 buildpackKAI CHU CHUNG
 
CoreOS + Kubernetes @ All Things Open 2015
CoreOS + Kubernetes @ All Things Open 2015CoreOS + Kubernetes @ All Things Open 2015
CoreOS + Kubernetes @ All Things Open 2015Brandon Philips
 
Exploring the Future of Helm
Exploring the Future of HelmExploring the Future of Helm
Exploring the Future of HelmMatthew Farina
 
Practical Docker for OpenStack (Juno Summit - May 15th, 2014)
Practical Docker for OpenStack (Juno Summit - May 15th, 2014)Practical Docker for OpenStack (Juno Summit - May 15th, 2014)
Practical Docker for OpenStack (Juno Summit - May 15th, 2014)Erica Windisch
 
State of Big Data on ARM64 / AArch64 - Apache Bigtop
State of Big Data on ARM64 / AArch64 - Apache BigtopState of Big Data on ARM64 / AArch64 - Apache Bigtop
State of Big Data on ARM64 / AArch64 - Apache BigtopGanesh Raju
 
Cluster Networking with Docker
Cluster Networking with DockerCluster Networking with Docker
Cluster Networking with DockerStefan Schimanski
 
How to operate containerized OpenStack
How to operate containerized OpenStackHow to operate containerized OpenStack
How to operate containerized OpenStackNalee Jang
 

La actualidad más candente (20)

Cloud Foundry 百日行 振り返り
Cloud Foundry 百日行 振り返りCloud Foundry 百日行 振り返り
Cloud Foundry 百日行 振り返り
 
Docker to the Rescue of an Ops Team
Docker to the Rescue of an Ops TeamDocker to the Rescue of an Ops Team
Docker to the Rescue of an Ops Team
 
Docker for Developers - Sunshine PHP
Docker for Developers - Sunshine PHPDocker for Developers - Sunshine PHP
Docker for Developers - Sunshine PHP
 
Introduction and Deep Dive Into Containerd
Introduction and Deep Dive Into ContainerdIntroduction and Deep Dive Into Containerd
Introduction and Deep Dive Into Containerd
 
App container rkt
App container rktApp container rkt
App container rkt
 
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-Ranchers
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-Ranchers제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-Ranchers
제4회 한국IBM과 함께하는 난공불락 오픈소스 인프라 세미나-Ranchers
 
Использование Docker в CI / Александр Акбашев (HERE Technologies)
Использование Docker в CI / Александр Акбашев (HERE Technologies)Использование Docker в CI / Александр Акбашев (HERE Technologies)
Использование Docker в CI / Александр Акбашев (HERE Technologies)
 
OpenStack for Centos
OpenStack for CentosOpenStack for Centos
OpenStack for Centos
 
Dockerを利用したローカル環境から本番環境までの構築設計
Dockerを利用したローカル環境から本番環境までの構築設計Dockerを利用したローカル環境から本番環境までの構築設計
Dockerを利用したローカル環境から本番環境までの構築設計
 
Kubernetes Story - Day 2: Quay.io Container Registry for Publishing, Building...
Kubernetes Story - Day 2: Quay.io Container Registry for Publishing, Building...Kubernetes Story - Day 2: Quay.io Container Registry for Publishing, Building...
Kubernetes Story - Day 2: Quay.io Container Registry for Publishing, Building...
 
Tp install anything
Tp install anythingTp install anything
Tp install anything
 
Build Your Own CaaS (Container as a Service)
Build Your Own CaaS (Container as a Service)Build Your Own CaaS (Container as a Service)
Build Your Own CaaS (Container as a Service)
 
Red Hat Enterprise Linux OpenStack Platform 7 - VM Instance HA Architecture
Red Hat Enterprise Linux OpenStack Platform 7 - VM Instance HA ArchitectureRed Hat Enterprise Linux OpenStack Platform 7 - VM Instance HA Architecture
Red Hat Enterprise Linux OpenStack Platform 7 - VM Instance HA Architecture
 
Gdg cloud taipei ddt meetup #53 buildpack
Gdg cloud taipei ddt meetup #53 buildpackGdg cloud taipei ddt meetup #53 buildpack
Gdg cloud taipei ddt meetup #53 buildpack
 
CoreOS + Kubernetes @ All Things Open 2015
CoreOS + Kubernetes @ All Things Open 2015CoreOS + Kubernetes @ All Things Open 2015
CoreOS + Kubernetes @ All Things Open 2015
 
Exploring the Future of Helm
Exploring the Future of HelmExploring the Future of Helm
Exploring the Future of Helm
 
Practical Docker for OpenStack (Juno Summit - May 15th, 2014)
Practical Docker for OpenStack (Juno Summit - May 15th, 2014)Practical Docker for OpenStack (Juno Summit - May 15th, 2014)
Practical Docker for OpenStack (Juno Summit - May 15th, 2014)
 
State of Big Data on ARM64 / AArch64 - Apache Bigtop
State of Big Data on ARM64 / AArch64 - Apache BigtopState of Big Data on ARM64 / AArch64 - Apache Bigtop
State of Big Data on ARM64 / AArch64 - Apache Bigtop
 
Cluster Networking with Docker
Cluster Networking with DockerCluster Networking with Docker
Cluster Networking with Docker
 
How to operate containerized OpenStack
How to operate containerized OpenStackHow to operate containerized OpenStack
How to operate containerized OpenStack
 

Destacado

The state of the art for OpenStack Data Processing (Hadoop on OpenStack) - At...
The state of the art for OpenStack Data Processing (Hadoop on OpenStack) - At...The state of the art for OpenStack Data Processing (Hadoop on OpenStack) - At...
The state of the art for OpenStack Data Processing (Hadoop on OpenStack) - At...spinningmatt
 
Hadoop on OpenStack
Hadoop on OpenStackHadoop on OpenStack
Hadoop on OpenStackSandeep Raju
 
Hadoop on OpenStack - Trove Day 2014
Hadoop on OpenStack - Trove Day 2014Hadoop on OpenStack - Trove Day 2014
Hadoop on OpenStack - Trove Day 2014Tesora
 
Apache Ambari BOF - OpenStack - Hadoop Summit 2013
Apache Ambari BOF - OpenStack - Hadoop Summit 2013Apache Ambari BOF - OpenStack - Hadoop Summit 2013
Apache Ambari BOF - OpenStack - Hadoop Summit 2013Hortonworks
 
Hadoop and OpenStack - Hadoop Summit San Jose 2014
Hadoop and OpenStack - Hadoop Summit San Jose 2014Hadoop and OpenStack - Hadoop Summit San Jose 2014
Hadoop and OpenStack - Hadoop Summit San Jose 2014spinningmatt
 
Savanna: Hadoop on OpenStack
Savanna: Hadoop on OpenStackSavanna: Hadoop on OpenStack
Savanna: Hadoop on OpenStackMirantis
 
Hadoop on OpenStack - Sahara @DevNation 2014
Hadoop on OpenStack - Sahara @DevNation 2014Hadoop on OpenStack - Sahara @DevNation 2014
Hadoop on OpenStack - Sahara @DevNation 2014spinningmatt
 

Destacado (8)

The state of the art for OpenStack Data Processing (Hadoop on OpenStack) - At...
The state of the art for OpenStack Data Processing (Hadoop on OpenStack) - At...The state of the art for OpenStack Data Processing (Hadoop on OpenStack) - At...
The state of the art for OpenStack Data Processing (Hadoop on OpenStack) - At...
 
Hadoop on OpenStack
Hadoop on OpenStackHadoop on OpenStack
Hadoop on OpenStack
 
Hadoop on OpenStack - Trove Day 2014
Hadoop on OpenStack - Trove Day 2014Hadoop on OpenStack - Trove Day 2014
Hadoop on OpenStack - Trove Day 2014
 
Hadoop For OpenStack Log Analysis
Hadoop For OpenStack Log AnalysisHadoop For OpenStack Log Analysis
Hadoop For OpenStack Log Analysis
 
Apache Ambari BOF - OpenStack - Hadoop Summit 2013
Apache Ambari BOF - OpenStack - Hadoop Summit 2013Apache Ambari BOF - OpenStack - Hadoop Summit 2013
Apache Ambari BOF - OpenStack - Hadoop Summit 2013
 
Hadoop and OpenStack - Hadoop Summit San Jose 2014
Hadoop and OpenStack - Hadoop Summit San Jose 2014Hadoop and OpenStack - Hadoop Summit San Jose 2014
Hadoop and OpenStack - Hadoop Summit San Jose 2014
 
Savanna: Hadoop on OpenStack
Savanna: Hadoop on OpenStackSavanna: Hadoop on OpenStack
Savanna: Hadoop on OpenStack
 
Hadoop on OpenStack - Sahara @DevNation 2014
Hadoop on OpenStack - Sahara @DevNation 2014Hadoop on OpenStack - Sahara @DevNation 2014
Hadoop on OpenStack - Sahara @DevNation 2014
 

Similar a 2012 09-08-josug-jeff

Kubernetes for the PHP developer
Kubernetes for the PHP developerKubernetes for the PHP developer
Kubernetes for the PHP developerPaul Czarkowski
 
[HKOSCON][20180616][Containerized High Availability Virtual Hosting Deploymen...
[HKOSCON][20180616][Containerized High Availability Virtual Hosting Deploymen...[HKOSCON][20180616][Containerized High Availability Virtual Hosting Deploymen...
[HKOSCON][20180616][Containerized High Availability Virtual Hosting Deploymen...Wong Hoi Sing Edison
 
Extending OpenShift Origin: Build Your Own Cartridge with Bill DeCoste of Red...
Extending OpenShift Origin: Build Your Own Cartridge with Bill DeCoste of Red...Extending OpenShift Origin: Build Your Own Cartridge with Bill DeCoste of Red...
Extending OpenShift Origin: Build Your Own Cartridge with Bill DeCoste of Red...OpenShift Origin
 
Part 4: Custom Buildpacks and Data Services (Pivotal Cloud Platform Roadshow)
Part 4: Custom Buildpacks and Data Services (Pivotal Cloud Platform Roadshow)Part 4: Custom Buildpacks and Data Services (Pivotal Cloud Platform Roadshow)
Part 4: Custom Buildpacks and Data Services (Pivotal Cloud Platform Roadshow)VMware Tanzu
 
Django deployment with PaaS
Django deployment with PaaSDjango deployment with PaaS
Django deployment with PaaSAppsembler
 
Custom Buildpacks and Data Services
Custom Buildpacks and Data ServicesCustom Buildpacks and Data Services
Custom Buildpacks and Data ServicesTom Kranz
 
DC HUG Hadoop for Windows
DC HUG Hadoop for WindowsDC HUG Hadoop for Windows
DC HUG Hadoop for WindowsTerry Padgett
 
PHP on Heroku: Deploying and Scaling Apps in the Cloud
PHP on Heroku: Deploying and Scaling Apps in the CloudPHP on Heroku: Deploying and Scaling Apps in the Cloud
PHP on Heroku: Deploying and Scaling Apps in the CloudSalesforce Developers
 
Ceph Day Shanghai - Hyper Converged PLCloud with Ceph
Ceph Day Shanghai - Hyper Converged PLCloud with Ceph Ceph Day Shanghai - Hyper Converged PLCloud with Ceph
Ceph Day Shanghai - Hyper Converged PLCloud with Ceph Ceph Community
 
Introduction to JIB and Google Cloud Run
Introduction to JIB and Google Cloud RunIntroduction to JIB and Google Cloud Run
Introduction to JIB and Google Cloud RunSaiyam Pathak
 
A DevOps guide to Kubernetes
A DevOps guide to KubernetesA DevOps guide to Kubernetes
A DevOps guide to KubernetesPaul Czarkowski
 
Baylisa - Dive Into OpenStack
Baylisa - Dive Into OpenStackBaylisa - Dive Into OpenStack
Baylisa - Dive Into OpenStackJesse Andrews
 
Scaleable PHP Applications in Kubernetes
Scaleable PHP Applications in KubernetesScaleable PHP Applications in Kubernetes
Scaleable PHP Applications in KubernetesRobert Lemke
 
Scale Apache with Nginx
Scale Apache with NginxScale Apache with Nginx
Scale Apache with NginxBud Siddhisena
 
Hadoop Everywhere & Cloudbreak
Hadoop Everywhere & CloudbreakHadoop Everywhere & Cloudbreak
Hadoop Everywhere & CloudbreakSean Roberts
 
Hortonworks Technical Workshop: HDP everywhere - cloud considerations using...
Hortonworks Technical Workshop:   HDP everywhere - cloud considerations using...Hortonworks Technical Workshop:   HDP everywhere - cloud considerations using...
Hortonworks Technical Workshop: HDP everywhere - cloud considerations using...Hortonworks
 
Apache Bigtop and ARM64 / AArch64 - Empowering Big Data Everywhere
Apache Bigtop and ARM64 / AArch64 - Empowering Big Data EverywhereApache Bigtop and ARM64 / AArch64 - Empowering Big Data Everywhere
Apache Bigtop and ARM64 / AArch64 - Empowering Big Data EverywhereGanesh Raju
 
[BarCamp2018][20180915][Tips for Virtual Hosting on Kubernetes]
[BarCamp2018][20180915][Tips for Virtual Hosting on Kubernetes][BarCamp2018][20180915][Tips for Virtual Hosting on Kubernetes]
[BarCamp2018][20180915][Tips for Virtual Hosting on Kubernetes]Wong Hoi Sing Edison
 
Deploying Hadoop-based Bigdata Environments
Deploying Hadoop-based Bigdata Environments Deploying Hadoop-based Bigdata Environments
Deploying Hadoop-based Bigdata Environments buildacloud
 
Deploying Hadoop-Based Bigdata Environments
Deploying Hadoop-Based Bigdata EnvironmentsDeploying Hadoop-Based Bigdata Environments
Deploying Hadoop-Based Bigdata EnvironmentsPuppet
 

Similar a 2012 09-08-josug-jeff (20)

Kubernetes for the PHP developer
Kubernetes for the PHP developerKubernetes for the PHP developer
Kubernetes for the PHP developer
 
[HKOSCON][20180616][Containerized High Availability Virtual Hosting Deploymen...
[HKOSCON][20180616][Containerized High Availability Virtual Hosting Deploymen...[HKOSCON][20180616][Containerized High Availability Virtual Hosting Deploymen...
[HKOSCON][20180616][Containerized High Availability Virtual Hosting Deploymen...
 
Extending OpenShift Origin: Build Your Own Cartridge with Bill DeCoste of Red...
Extending OpenShift Origin: Build Your Own Cartridge with Bill DeCoste of Red...Extending OpenShift Origin: Build Your Own Cartridge with Bill DeCoste of Red...
Extending OpenShift Origin: Build Your Own Cartridge with Bill DeCoste of Red...
 
Part 4: Custom Buildpacks and Data Services (Pivotal Cloud Platform Roadshow)
Part 4: Custom Buildpacks and Data Services (Pivotal Cloud Platform Roadshow)Part 4: Custom Buildpacks and Data Services (Pivotal Cloud Platform Roadshow)
Part 4: Custom Buildpacks and Data Services (Pivotal Cloud Platform Roadshow)
 
Django deployment with PaaS
Django deployment with PaaSDjango deployment with PaaS
Django deployment with PaaS
 
Custom Buildpacks and Data Services
Custom Buildpacks and Data ServicesCustom Buildpacks and Data Services
Custom Buildpacks and Data Services
 
DC HUG Hadoop for Windows
DC HUG Hadoop for WindowsDC HUG Hadoop for Windows
DC HUG Hadoop for Windows
 
PHP on Heroku: Deploying and Scaling Apps in the Cloud
PHP on Heroku: Deploying and Scaling Apps in the CloudPHP on Heroku: Deploying and Scaling Apps in the Cloud
PHP on Heroku: Deploying and Scaling Apps in the Cloud
 
Ceph Day Shanghai - Hyper Converged PLCloud with Ceph
Ceph Day Shanghai - Hyper Converged PLCloud with Ceph Ceph Day Shanghai - Hyper Converged PLCloud with Ceph
Ceph Day Shanghai - Hyper Converged PLCloud with Ceph
 
Introduction to JIB and Google Cloud Run
Introduction to JIB and Google Cloud RunIntroduction to JIB and Google Cloud Run
Introduction to JIB and Google Cloud Run
 
A DevOps guide to Kubernetes
A DevOps guide to KubernetesA DevOps guide to Kubernetes
A DevOps guide to Kubernetes
 
Baylisa - Dive Into OpenStack
Baylisa - Dive Into OpenStackBaylisa - Dive Into OpenStack
Baylisa - Dive Into OpenStack
 
Scaleable PHP Applications in Kubernetes
Scaleable PHP Applications in KubernetesScaleable PHP Applications in Kubernetes
Scaleable PHP Applications in Kubernetes
 
Scale Apache with Nginx
Scale Apache with NginxScale Apache with Nginx
Scale Apache with Nginx
 
Hadoop Everywhere & Cloudbreak
Hadoop Everywhere & CloudbreakHadoop Everywhere & Cloudbreak
Hadoop Everywhere & Cloudbreak
 
Hortonworks Technical Workshop: HDP everywhere - cloud considerations using...
Hortonworks Technical Workshop:   HDP everywhere - cloud considerations using...Hortonworks Technical Workshop:   HDP everywhere - cloud considerations using...
Hortonworks Technical Workshop: HDP everywhere - cloud considerations using...
 
Apache Bigtop and ARM64 / AArch64 - Empowering Big Data Everywhere
Apache Bigtop and ARM64 / AArch64 - Empowering Big Data EverywhereApache Bigtop and ARM64 / AArch64 - Empowering Big Data Everywhere
Apache Bigtop and ARM64 / AArch64 - Empowering Big Data Everywhere
 
[BarCamp2018][20180915][Tips for Virtual Hosting on Kubernetes]
[BarCamp2018][20180915][Tips for Virtual Hosting on Kubernetes][BarCamp2018][20180915][Tips for Virtual Hosting on Kubernetes]
[BarCamp2018][20180915][Tips for Virtual Hosting on Kubernetes]
 
Deploying Hadoop-based Bigdata Environments
Deploying Hadoop-based Bigdata Environments Deploying Hadoop-based Bigdata Environments
Deploying Hadoop-based Bigdata Environments
 
Deploying Hadoop-Based Bigdata Environments
Deploying Hadoop-Based Bigdata EnvironmentsDeploying Hadoop-Based Bigdata Environments
Deploying Hadoop-Based Bigdata Environments
 

Último

Sales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessSales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessAggregage
 
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
Keppel Ltd. 1Q 2024 Business Update  Presentation SlidesKeppel Ltd. 1Q 2024 Business Update  Presentation Slides
Keppel Ltd. 1Q 2024 Business Update Presentation SlidesKeppelCorporation
 
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdfRenandantas16
 
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...Dave Litwiller
 
Pharma Works Profile of Karan Communications
Pharma Works Profile of Karan CommunicationsPharma Works Profile of Karan Communications
Pharma Works Profile of Karan Communicationskarancommunications
 
Ensure the security of your HCL environment by applying the Zero Trust princi...
Ensure the security of your HCL environment by applying the Zero Trust princi...Ensure the security of your HCL environment by applying the Zero Trust princi...
Ensure the security of your HCL environment by applying the Zero Trust princi...Roland Driesen
 
Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...
Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...
Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...lizamodels9
 
Boost the utilization of your HCL environment by reevaluating use cases and f...
Boost the utilization of your HCL environment by reevaluating use cases and f...Boost the utilization of your HCL environment by reevaluating use cases and f...
Boost the utilization of your HCL environment by reevaluating use cases and f...Roland Driesen
 
Progress Report - Oracle Database Analyst Summit
Progress  Report - Oracle Database Analyst SummitProgress  Report - Oracle Database Analyst Summit
Progress Report - Oracle Database Analyst SummitHolger Mueller
 
Value Proposition canvas- Customer needs and pains
Value Proposition canvas- Customer needs and painsValue Proposition canvas- Customer needs and pains
Value Proposition canvas- Customer needs and painsP&CO
 
Call Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine ServiceCall Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine Serviceritikaroy0888
 
Tech Startup Growth Hacking 101 - Basics on Growth Marketing
Tech Startup Growth Hacking 101  - Basics on Growth MarketingTech Startup Growth Hacking 101  - Basics on Growth Marketing
Tech Startup Growth Hacking 101 - Basics on Growth MarketingShawn Pang
 
VIP Kolkata Call Girl Howrah 👉 8250192130 Available With Room
VIP Kolkata Call Girl Howrah 👉 8250192130  Available With RoomVIP Kolkata Call Girl Howrah 👉 8250192130  Available With Room
VIP Kolkata Call Girl Howrah 👉 8250192130 Available With Roomdivyansh0kumar0
 
Monthly Social Media Update April 2024 pptx.pptx
Monthly Social Media Update April 2024 pptx.pptxMonthly Social Media Update April 2024 pptx.pptx
Monthly Social Media Update April 2024 pptx.pptxAndy Lambert
 
VIP Call Girls In Saharaganj ( Lucknow ) 🔝 8923113531 🔝 Cash Payment (COD) 👒
VIP Call Girls In Saharaganj ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment (COD) 👒VIP Call Girls In Saharaganj ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment (COD) 👒
VIP Call Girls In Saharaganj ( Lucknow ) 🔝 8923113531 🔝 Cash Payment (COD) 👒anilsa9823
 
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Dipal Arora
 
Best Basmati Rice Manufacturers in India
Best Basmati Rice Manufacturers in IndiaBest Basmati Rice Manufacturers in India
Best Basmati Rice Manufacturers in IndiaShree Krishna Exports
 
The Coffee Bean & Tea Leaf(CBTL), Business strategy case study
The Coffee Bean & Tea Leaf(CBTL), Business strategy case studyThe Coffee Bean & Tea Leaf(CBTL), Business strategy case study
The Coffee Bean & Tea Leaf(CBTL), Business strategy case studyEthan lee
 

Último (20)

VVVIP Call Girls In Greater Kailash ➡️ Delhi ➡️ 9999965857 🚀 No Advance 24HRS...
VVVIP Call Girls In Greater Kailash ➡️ Delhi ➡️ 9999965857 🚀 No Advance 24HRS...VVVIP Call Girls In Greater Kailash ➡️ Delhi ➡️ 9999965857 🚀 No Advance 24HRS...
VVVIP Call Girls In Greater Kailash ➡️ Delhi ➡️ 9999965857 🚀 No Advance 24HRS...
 
Sales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessSales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for Success
 
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
Keppel Ltd. 1Q 2024 Business Update  Presentation SlidesKeppel Ltd. 1Q 2024 Business Update  Presentation Slides
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
 
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
 
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
 
Pharma Works Profile of Karan Communications
Pharma Works Profile of Karan CommunicationsPharma Works Profile of Karan Communications
Pharma Works Profile of Karan Communications
 
Ensure the security of your HCL environment by applying the Zero Trust princi...
Ensure the security of your HCL environment by applying the Zero Trust princi...Ensure the security of your HCL environment by applying the Zero Trust princi...
Ensure the security of your HCL environment by applying the Zero Trust princi...
 
Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...
Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...
Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...
 
Boost the utilization of your HCL environment by reevaluating use cases and f...
Boost the utilization of your HCL environment by reevaluating use cases and f...Boost the utilization of your HCL environment by reevaluating use cases and f...
Boost the utilization of your HCL environment by reevaluating use cases and f...
 
Progress Report - Oracle Database Analyst Summit
Progress  Report - Oracle Database Analyst SummitProgress  Report - Oracle Database Analyst Summit
Progress Report - Oracle Database Analyst Summit
 
Value Proposition canvas- Customer needs and pains
Value Proposition canvas- Customer needs and painsValue Proposition canvas- Customer needs and pains
Value Proposition canvas- Customer needs and pains
 
Call Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine ServiceCall Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine Service
 
Tech Startup Growth Hacking 101 - Basics on Growth Marketing
Tech Startup Growth Hacking 101  - Basics on Growth MarketingTech Startup Growth Hacking 101  - Basics on Growth Marketing
Tech Startup Growth Hacking 101 - Basics on Growth Marketing
 
VIP Kolkata Call Girl Howrah 👉 8250192130 Available With Room
VIP Kolkata Call Girl Howrah 👉 8250192130  Available With RoomVIP Kolkata Call Girl Howrah 👉 8250192130  Available With Room
VIP Kolkata Call Girl Howrah 👉 8250192130 Available With Room
 
Monthly Social Media Update April 2024 pptx.pptx
Monthly Social Media Update April 2024 pptx.pptxMonthly Social Media Update April 2024 pptx.pptx
Monthly Social Media Update April 2024 pptx.pptx
 
VIP Call Girls In Saharaganj ( Lucknow ) 🔝 8923113531 🔝 Cash Payment (COD) 👒
VIP Call Girls In Saharaganj ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment (COD) 👒VIP Call Girls In Saharaganj ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment (COD) 👒
VIP Call Girls In Saharaganj ( Lucknow ) 🔝 8923113531 🔝 Cash Payment (COD) 👒
 
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
 
Best Basmati Rice Manufacturers in India
Best Basmati Rice Manufacturers in IndiaBest Basmati Rice Manufacturers in India
Best Basmati Rice Manufacturers in India
 
The Coffee Bean & Tea Leaf(CBTL), Business strategy case study
The Coffee Bean & Tea Leaf(CBTL), Business strategy case studyThe Coffee Bean & Tea Leaf(CBTL), Business strategy case study
The Coffee Bean & Tea Leaf(CBTL), Business strategy case study
 
Nepali Escort Girl Kakori \ 9548273370 Indian Call Girls Service Lucknow ₹,9517
Nepali Escort Girl Kakori \ 9548273370 Indian Call Girls Service Lucknow ₹,9517Nepali Escort Girl Kakori \ 9548273370 Indian Call Girls Service Lucknow ₹,9517
Nepali Escort Girl Kakori \ 9548273370 Indian Call Girls Service Lucknow ₹,9517
 

2012 09-08-josug-jeff

  • 1. OSC 2012 Tokyo openstack Open source software to build public and private clouds. Hadoop on OpenStack Swift - Experiment of using swift as storage for Apache Hadoop 2012.09.08 OpenStack Japan Zheng Xu 1
  • 2. Self introduction ● Software designer(engineer) for embedded system and web system(60%hobbit, 40%job). ● Major: openstack, linux, web browser, html, epub, OSS ● Contact ● @xz911 ● https://www.facebook.com/xuzheng2001 2
  • 3. Abstract ● This slide is to introduce how to use OpenStack Swift as storage service for Apache Hadoop instead of HDFS(which is storage service of Hadoop project). ● This slide is based on http://bigdatacraft.com/archives/349, and really appreciate Constantine Peresypkin and David Gruzman for providing their idea and implementation. 3
  • 4. Agenda ● OpenStack Swift ● Apache Hadoop and HDFS ● Experiment of replacing HDFS by OpenStack Swift 4
  • 5. What is OpenStack and Swift From http://www.openstack.org/ 5
  • 6. What is OpenStack and Swift User Application http Proxy Server Proxy Server http Account Server Account Server Account Server http Container Server Container Server Container Server Object Server Object Server Object Server Object Server 6
  • 7. What is OpenStack and Swift ● OpenSource written in Python ● diversity ● Swift can be a part of OpenStack or an individual service it self. ● zones, devices, partitions, and replicas ● No SPOF 7
  • 8. Agenda ● OpenStack Swift ● Apache Hadoop and HDFS ● Experiment of replacing HDFS by OpenStack Swift 8
  • 9. Apache Hadoop and HDFS From http://hadoop.apache.org/ 9
  • 10. Apache Hadoop and HDFS User Application Map-Reduce Name Node Hive Data Node Data Node Data Node 10
  • 11. Agenda ● OpenStack Swift ● Apache Hadoop and HDFS ● Experiment of replacing HDFS by OpenStack Swift 11
  • 12. Experiment (Concept) User Application Map-Reduce Name Node Hive Data Node Data Node Data Node 12
  • 13. Experiment (Concept) User Application Map-Reduce java-cloudfiles java-cloudfiles Hive http java-cloudfiles Data Node Data Node Swift 13
  • 14. Experiment (Software) ● Swift v1.6 ● https://github.com/openstack/swift.git ● r21616cf, Jul 25 ● Java Client java-cloudfiles ● https://github.com/rackspace/java-cloudfiles ● r0807fa6, Jun 4 ● Apache Hadoop ● 1.0.3 ● Swift fs for Apache Hadoop(just part of following source code) ● https://github.com/Dazo-org/hadoop-common.git (branch-0.20-security- 205.swift ) 14
  • 15. Experiment (infra) 192.168.0.4 192.168.0.9 15
  • 16. Experiment(install swift) ● Install swift based on http://docs.openstack.org/developer/swift/development_saio.html ● Do not forget to set bind_ip of proxy-server.conf ● 192.168.0.9 in my case ● Suppose we have username as "test:tester" with password as "testing", the account name is AUTH_test and have some container based on steps in above Url. 16
  • 17. Experiment (cloudfiles) ● Run "ant compile" ● Change cloudfiles.properties to following # Auth info auth_url=http://192.168.0.9:8080/auth/v1.0 auth_token_name=X-Auth-Token #auth_user_header=X-Storage-User #auth_pass_header=X-Storage-Pass # user properties username=test:tester password=testing # cloudfs properties version=v1 connection_timeout=15000 17
  • 18. Experiment(cloudfiles) ● Connect cloudfiles to swift(this is option) ● Change cloudfiles.sh as following and run it to try connection with swift #!/bin/sh export CLASSPATH=lib/httpcore-4.1.4.jar:lib/commons-cli- 1.1.jar:lib/httpclient-4.1.3.jar:lib/commons-lang- 2.4.jar:lib/junit.jar:lib/commons-codec-1.3.jar:lib/commons-io- 1.4.jar:lib/commons-logging-1.1.1.jar:lib/log4j-1.2.15.jar:dist/java- cloudfiles.jar:. java com.rackspacecloud.client.cloudfiles.sample.FilesCli $@ 18
  • 19. Experiment (cloudfiles) ● Packaging java-cloudfiles to jar file for Apache Hadoop (clone java-cloudfiles to ~/java- cloudfiles) ● We need to put *.properties into java-cloudfiles.jar $ ant package $ cd cloudfiles/dist $ cp ../*.properties . $ rm java-cloudfiles.jar $ jar cvf java-cloudfiles.jar ./* 19
  • 20. Experiment (hadoop) ● Prepare ● download hadoop to ~/hadoop-1.0.3 (newest stable version of original hadoop) and git clone https://github.com/Dazo-org/hadoop-common.git to ~/hadoop-common (old hadoop source code with swift fs plugin) ● At ~/hadoop-1.0.3 (copy java-cloudfiles and related library to hadoop lib folder) – cd lib;cp ~/java-cloudfiles/cloudfiles/dist/java-cloudfiles.jar . – cp ~/java-cloudfiles/lib/httpc* . 20
  • 21. Experiment (setting hadoop) ● ./hadoop-1.0.3/src/core/core-default.xml ● Add following to make hadoop can recognize handle "swift://" schema to SwiftFileSystem class <property> <name>fs.swift.impl</name> <value>org.apache.hadoop.fs.swift.SwiftFileSystem</value> <description>The FileSystem for swift: uris.</description> </property> 21
  • 22. Experiment (hadoop) ● Copy implementation for swift fs to hadoop 1.0.3 and build ● cp -R ../hadoop- common/src/core/org/apache/hadoop/fs/swift ./src/core/org/apache/hadoop/fs ● ant 22
  • 23. Experiment(hadoop setting) ● ./conf/core-site.xml (part1) ● Add following property for example <property> <name>fs.swift.userName</name> <value>test:tester</value> </property> 23
  • 24. Experiment (hadoop setting) ● ./conf/core-site.xml (part2) ● Add following property for example <property> <name>fs.swift.userPassword</name> <value>testing</value> </property> <property> <name>fs.swift.acccountname</name> <value>AUTH_test</value> </property> 24
  • 25. Experiment (hadoop setting) ● ./conf/core-site.xml (part3) ● Add following property for example <property> <name>fs.swift.authUrl</name> <value>http://192.168.0.9:8080/auth/v1.0</value> </property> <property> <name>fs.default.name</name> <value>swift://192.168.0.9:8080/v1/AUTH_test</value> </property> 25
  • 26. Experiment (check swift fs) ● At this time, we should can list account information via following command ● ./bin/hadoop -fs -ls / ● or ./bin/hadoop fs -put ./conf/core-site.xml /test_container/core-site.xml (test_container is a test container created after swift installed) 26
  • 27. Finally ● We installed swift for storage service of hadoop ● We built origin java-cloudfiles and created packages for hadoop ● We copied fs.swift plugin from https://github.com/Dazo-org/hadoop-common.git to new hadoop source tree and build hadoop ● We set up core-site.xml of hadoop to connect to swift via java-cloudfiles 27
  • 28. Thank you for listening. 28