Project Savanna automates the deployment of Apache Hadoop on OpenStack. It provisions Hadoop clusters using templates, allows for elastic scaling of nodes, and provides multi-tenancy. The Hortonworks OpenStack plugin uses Ambari to install and manage Hadoop clusters on OpenStack. It demonstrates provisioning a Hadoop cluster with Ambari, installing services, and monitoring the cluster through the Ambari UI. OpenStack provides operational agility while Hadoop is well-suited for its scale-out architecture.
Note:Not Recommended: One cluster having multiple data nodes on the same hypervisor nodeAllowed: Multiple clusters having a data node on the same hypervisor nodeAllowed: One data node and multiple compute nodes from per hypervisor
Object Store (codenamed "Swift") provides object storage. It allows you to store or retrieve files (but not mount directories like a fileserver). Several companies provide commercial storage services based on Swift. These include KT, Rackspace (from which Swift originated) and Internap. Swift is also used internally at many large companies to store their data.Image (codenamed "Glance") provides a catalog and repository for virtual disk images. These disk images are mostly commonly used in OpenStack Compute. While this service is technically optional, any cloud of size will require it.Compute (codenamed "Nova") provides virtual servers upon demand. Rackspace and HP provide commercial compute services built on Nova and it is used internally at companies like Mercado Libre and NASA (where it originated).Dashboard (codenamed "Horizon") provides a modular web-based user interface for all the OpenStack services. With this web GUI, you can perform most operations on your cloud like launching an instance, assigning IP addresses and setting access controls.Identity (codenamed "Keystone") provides authentication and authorization for all the OpenStack services. It also provides a service catalog of services within a particular OpenStack cloud.Network (codenamed "Quantum") provides "network connectivity as a service" between interface devices managed by other OpenStack services (most likely Nova). The service works by allowing users to create their own networks and then attach interfaces to them. Quantum has a pluggable architecture to support many popular networking vendors and technologies.Block Storage (codenamed "Cinder") provides persistent block storage to guest VMs. This project was born from code originally in Nova (the nova-volume service described below). In the Folsom release, both the nova-volume service and the separate volume service are available.File STORAGE(NAS)– No Support. Currently, OpenStack Compute does not have any native support for this type of file storage inside of an instance. However, there is a Gluster storage connector for OpenStack that enables the use of the GlusterFS file system as a back-end for the Image service.
1. What is RDO?* Distribution of OpenStack - The OpenStack project produces code. Packaging, integration, installation and support is left to distributors and partners - In its current form, OpenStack is a toolbox for creating an IaaS cloud, RDO allows you to get started quickly* For RHEL, CentOS, Scientific Linux and other RHEL clones, and for Fedora - There is a demand for being able to try out OpenStack on the industry's most successful enterprise Linux platform - We welcome users and experiences from the Red Hat Enterprise Linux ecosystem, which includes CentOS and Scientific Linux - We also want to make it easy for users of Fedora to try the version of OpenStack they are interested in without necessarily upgrading their entire operating system* Community-driven - The RDO community site is a wiki, and a forum. We welcome the participation of community members sharing knowledge, helping each other - Support offered with RDO is of a standard which can be expected from a community supported project - we encourage anyone who is looking for enterprise level support to upgrade to Red Hat OpenStack
Every conversation with customers around Hadoop deployment model end with one word ‘flexibility’ . Customers want to be able to deploy Hadoop On prem – physical or over a virtual infrastructure and in the cloud. In the cloud OpenStack is emerging/ rather has emerged as the hands down dominant open source cloud management platform