SlideShare una empresa de Scribd logo
1 de 4
Descargar para leer sin conexión
Anita Ahuja, Ajay Kumar, Ramveer Singh / International Journal of Engineering Research and
                   Applications (IJERA) ISSN: 2248-9622 www.ijera.com
                    Vol. 2, Issue 5, September- October 2012, pp.352-355

An Approach for Virtualization and Integration of Heterogeneous
                       Cloud Databases

                   Anita Ahuja* Ajay Kumar ** Ramveer Singh***
          *(Department of Computer Science, Asst. Professor, Mewar University, Chittorgarh (India)
        ** (Department of Computer Science, Asst. Professor, Mewar University, Chittorgarh (India))
        *** ( Department of Computer Science,Professor, R.K.G.I.T.,Mahamaya Technical University,
                                            Ghaziabad(India)


ABSTRACT:
         Virtualization is the key technology            distribution transparency, Global schema- Common
behind cloud computing that allows the creation          data descriptions & Data placement information,
of an abstraction layer of the underlying cloud          Centralized admin through global catalog,
Infrastructure. Using virtualization, resources          Distributed      functions,   Query    processing,
(hardware and software) can be shared and                Transaction management, Access control etc[1].
utilized while hiding the complexity from the
cloud users. A lot of cloud database are available       II WHY NOT RDBMS?
that managed by different organization such as-                    RDBMS all have a distributed and parallel
Amazon Storage for the Cloud, Google Storage             version with SQL support for all kinds of data
for the Cloud, Hadoop Storage for the Cloud,             (structured, XML, multimedia, streams, etc.) [1]
Yahoo!’s PNUTS, Cassandra, CouchDB etc.                  Standard SQL a major argument for adoption by
   This paper is presented to propose a virtual          tool vendors (e.g. analytics, business intelligence),
Database framework that enables the centralized          but the “one size fits all” approach has reached the
global object oriented database. A virtually             limits result loss of performance.
integrated huge database that will hide the              Now simplicity and flexibility required for
heterogeneity of various cloud databases. Once           applications with specific, tight requirements. New
they are integrated a consistent access is provide       specialized DBMS engines more efficient: column-
to the end user.                                         oriented DBMS for OLAP, DSMS for stream
                                                         processing, SciDB[11] for scientific analytics, etc.
Keywords – OOMDS, Virtualization, Cloud,                 RDBMS provides ACID transactions, complex
Databases, cloud computing, Mediator                     query language, lots of tuning knobs but it is less
Framework, Peers.                                        suitable for specific optimizations for OLAP,
                                                         flexible programming model, flexible schema and
I. INTRODUCTION                                          scalability.
          Cloud computing is a model for enabling
convenient, on- demand network access to a shared        III INTEGRATED DATA                MANAGEMENT
pool of configurable computing resources (e.g.,          PROBLEM IN CLOUD
networks, servers, storage, applications, and                      Cloud data are very large (lots of data
services) that can be rapidly provisioned and            spaces, very large collections, multimedia etc).
released with minimal management effort or               They are Complex, unstructured or semi-structured
service provider interaction[3]                          often schema less but metadata (tags,). Different
The different cloud providers adopt different            file formats, access protocols and query languages
architecture and data models such as Amazon‟s            are used. Table decompositions may vary, column
storage building block Dynamo[6], S3, SimpleDB,          names (data labels) may be different (but have the
and RDS, S3, Google storage building blocks              same semantics), and data encoding schemes may
Bigtable, Hadoop‟s building block HDFS, Hive,            vary it also referred as schematic heterogeneity[8].
HadoopDB, and HBase, Yahoo‟s PNUTS,                      Cloud users and application developers are in very
Cassandra data model, CouchDB data model.                high numbers with very diverse expertise but very
It is realized that traditional DBMS does not fit        little DBMS expertise.
well for the cloud computing environment so new
data model row oriented, document oriented,              IV PROPOSED FRAMEWORK
widecolumn are widely used in cloud. Different           Object Oriented Mediator Database System
cloud providers use different architecture and data      (OOMDS):
models that best suit their application.                         The proposed system is object oriented
Now A Virtual integrated database management             mediator data base system of various
system should be developed that Provides                 heterogeneous cloud data bases that having object

                                                                                              352 | P a g e
Anita Ahuja, Ajay Kumar, Ramveer Singh / International Journal of Engineering Research and
                   Applications (IJERA) ISSN: 2248-9622 www.ijera.com
                    Vol. 2, Issue 5, September- October 2012, pp.352-355
oriented query language in which object oriented       for different application areas in mediator peers.
views of data can be specified .In OOMDS has           The object oriented data model provides very
primitive to translate data from different clouds      powerful query and data integration primitives
database into object oriented data base. These         which require advanced query optimization.
translated cloud data can be used to build views
.This OOMDS supports multiple data base exists         The mediator/wrapper approach has been used for
on cloud.                                              integrating heterogeneous data in several projects.
                                                       Most mediator systems integrate data through a
                                                       central mediator server accessing one or several
                                                       data sources through a number of “wrapper”
                                                       interfaces that translate data to a global data model.
                                                       However, one of the original goals for mediator
                                                       architectures      was that mediators should be
                                                       relatively simple distributed software modules
                                                       that    transparently      encode     domain-specific
                                                       knowledge about data and share abstractions of
                                                       that data with higher layers of mediators or
                                                       applications. Larger networks of mediators would
                                                       then be defined through these primitive mediators
                                                       by composing new mediators in terms of other
                                                       mediators and data sources. The core of OOMDS
                                                       is an open, light-weight, and extensible object
                                                       oriented database management system             with a
                                                       object oriented data model. Each OOMDS server
                                                       must contains all the traditional database facilities,
                                                       such as a storage manager, a recovery manager,
                                                       a transaction manager, and a functional query
                                                       language named OOMDSQL. The system can be
                                                       used as a single-user database or as a multi-user
                                                       server to applications and to other OOMDS peers.

FIGURE   : OBJECT         ORIENTED      MEDIATOR       DISTRIBUTION:
DATABASE SYSTEM                                                  OOMDS is a distributed mediator system
                                                       where several mediator peers communicate over
DATA INTEGRATION IN OODMS SYSTEM                       the Internet. Each mediator peer appears as a
         OOMDS is a distributed mediator system        virtual functional database layer having data
that uses a object oriented data model and has a       abstractions and       a object oriented      query
relationally complete object oriented query            language.       Object oriented views provide
language, OOMDSQL. Through its distributed             transparent access to data sources from clients and
object oriented multi-database facilities many         other mediator peers. Conflicts and overlaps
autonomous and distributed OOMDS peers can             between similar real- world entities          being
interoperate. Object oriented multi-database queries   modeled differently in different data sources are
and views can be defined where external data           reconciled through the mediation primitives of the
sources of different kinds are translated through      multi-mediator query language OOMBSQL. The
OOMDS and reconciled through its functional            mediation services allow transparent access to
object oriented mediation primitives. Each             similar data structures represented differently in
mediator peer provides a number of transparent         different data sources[13]. Applications access data
functional views of data reconciled from other         from distributed data sources through queries to
mediator peers, wrapped data sources, and data         views in some mediator peer[9].
stored in OOMDS itself. The composition of             Logical composition of mediators is achieved when
mediator peers in terms of other peers provides a      multi-database views in mediators are defined in
way to scale the data integration process by           terms of views, tables, and functions in other
composing mediation modules. The OOMDS                 mediators or data sources. The multi-database
data manager and query processor must be               views make the mediator peers appear to the user
extensible so that new application oriented data       as a single virtual database. OOMDS mediators are
types and operators can be added to OODMSQL,           compostable since a mediator peer can regard other
implemented in some external programming               mediator peers as data sources[16].
language (Java, C, C++ or Lisp). The extensibility
allows wrapping data representations specialized


                                                                                             353 | P a g e
Anita Ahuja, Ajay Kumar, Ramveer Singh / International Journal of Engineering Research and
                   Applications (IJERA) ISSN: 2248-9622 www.ijera.com
                    Vol. 2, Issue 5, September- October 2012, pp.352-355
WRAPPING DATA                                             The Object oriented Data Model and query
          In order to access data from external          language forming the basis for data integration in
data sources OOMDS mediators may contain one             OOMDS. The distributed multi-mediator query
or several wrappers which process data from              decomposition strategies used were summarized.
different kinds of external data sources[15], e.g.       The mediator peers are autonomous without any
ODBC-based access to relational databases, access        central schema. A special mediator, the central
to XML files, CAD systems, or Internet search            name server, keeps track of what mediator peers
engines to extract data from heterogeneous cloud         are members of a group. The central name servers
data bases. A wrapper is a procedure in OOMDS            can be queried for the location of mediator peers in
having specialized facilities for query processing       a group. Meta-queries to each mediator peer can be
and translation of data from a particular class of       posed to investigate the structure of its schema.
external data sources. It contains both interfaces to    Some unique features of OOMDS are: A
external data sources and knowledge of how to            distributed mediator Framework where query plans
efficiently translate and process queries involving      are distributed over several communicating
accesses to different cloud databases. In particular,    mediator peers. Using declarative object oriented
external OOMDS peers known to a mediator are             queries to model reconciled object oriented views
also regarded as external data sources and there is a    spanning multiple mediator peers. Query
special wrapper for accessing other OOMDS                processing and optimization techniques for queries
peers[18]. However, among the OOMDS peers                to reconcile views involving function overloading,
special query optimization methods are used              late binding, and type-aware query rewrites.
that take into account the distribution, capabilities,
costs, etc., of the different peers[20].
                                                         REFERENCES
THE CENTRAL NAME SERVER                                    [1] S. Aulbach, T. Grust, D. Jacobs, A. Kemper,
          Every mediator peer must belong to a                    and J. Rittinger. Multi-tenant databases for
group of mediator peers. The mediator peers in a                  software as a service: Schema-mapping
group are described through a meta-schema stored                  techniques. In SIGMOD, 2008.
in a mediator server called central name server.           [2] M. Brantner, D. Florescu, D. Graf, D.
The mediator peers are autonomous and there is no                 Kossmann, and T. Kraska. Building a
central schema in the name server [13]. The central               database on S3. In SIGMOD, 2008.
name server contains          only general meta-           [3] F. Chang, J. Dean, S. Ghemawat, W. Hsieh,
information such as the locations and names of the                D. Wallach, M. Burrows, T. Chandra, A.
peers in the group while each mediator peer has its               Fikes, and R. Gruber. Bigtable: A
own schema describing its local data and data                     distributed storage system for structured
sources. The information in the central name                      data. In OSDI, 2006.
server is managed without explicit operator                [4]     B. F. Cooper, R. Ramakrishnan, U.
intervention; its content is managed through                      Srivastava, A. Silberstein, P. Bohannon,
messages from the mediator peers. To avoid a                      H.-A. Jacobsen, N. Puz, D. Weaver, and
bottleneck, mediator peers usually communicate                    R. Yerneni. PNUTS: Yahoo!‟s hosted data
directly without involving the name server; it is                 serving platform. PVLDB, 1(2), 2008.
normally involved only when a connection to some           [5] C. Curino, E. Jones, Y. Zhang, and S.
new mediator peer is established [21].                            Madden. Schism: A Workload-Driven
                                                                  Approach to Database Replication and
                                                                  Partitioning. In VLDB, 2010.
CONCLUSION:                                                [6] E. Damiani, S. D. C. di Vimercati, S. Jajodia,
          We have given an overview of the                        S. Paraboschi, and P. Samarati. Balancing
OOMDS mediator system where groups of                             Confidentiality     and     Efficiency    in
distributed mediator peers are used to integrate data             Untrusted Relational DBMS. CCS, 2003.
from different sources. Each mediator in a group           [7] S. Das, D. Agrawal, and A. E. Abbadi.
has DBMS facilities for query compilation and                     ElasTraS: An elastic transactional data
exchange of data and meta-data with other                         store in the cloud. HotCloud, 2009.
mediator peers. Derived functions can be defined           [8] R. Freeman. Oracle Database 11g New
where data from several mediator peers is                         Features. McGraw-Hill, Inc., New York,
abstracted, transformed, and reconciled. Wrappers                 NY, USA, 2008.
are defined by interfacing OOMDS systems with              [9] R. Gennaro, C. Gentry, and B. Parno. Non-
external systems through its multi-directional                    Interactive      Verifiable     Computing:
foreign function interface.          OOMDS can                    Outsourcing Computation to Untrusted
furthermore be embedded in applications and used                  Workers. STOC,2010.
as stand-alone databases.


                                                                                              354 | P a g e
Anita Ahuja, Ajay Kumar, Ramveer Singh / International Journal of Engineering Research and
                   Applications (IJERA) ISSN: 2248-9622 www.ijera.com
                    Vol. 2, Issue 5, September- October 2012, pp.352-355
 [11]    H. Hacigumus, B. Iyer, C. Li, and S.                  system for data integration. PhD Thesis,
        Mehrotra. Executing SQL over Encrypted                 Linko¨        ping       U.,       Sweden.
        Data in the Database-Service-Provider                  http://www.dis.uu.se/˜udbl/publ/vanjaphd.
        Model. ACM SIGMOD, 2002.                               pdf [1999].
 [12]   “Kernel based virtual machine (KVM).”           [17]   Bukhres O, Elmagarmid A (eds.). Object-
        [Online].       Available:http://www.linux-            oriented Multidatabase Systems. Pretince
        kvm.org                                                Hall, 1996.
 [13]   G. Giunta, R. Montella, G. Agrillo,             [18]   Dayal U, Hwang H-Y. View definition
        and G. Coviello, “A                  GPGPU             and generalization for database integration
        transparent      virtualization component              in     a     multidatabase     system.IEEE
        for high        performance       computing            Transactions on Software Engineering
        clouds,” in Proceedings of the 16th                    1984; 10(6):628–645.
        international Euro-Par conference on            [19]   A. N. Laboratory. (2010, Jul.) Heckle.
        Parallel processing: Part           I, ser.            [Online].                         Available:
        EuroPar‟10.         Berlin,      Heidelberg:           http://trac.mcs.anl.gov/projects/Heckle/
        Springer-Verlag, 2010, pp. 379–391.             [20]    xCat Open         Source Project.
        [Online].                         Available:           (2011,May)xCat          extreme       cloud
        http://portal.acm.org/citation.cfm?id=1887             administration toolkit. [Online]. Available:
        695.1887738                                            http://xcat.sourceforge.net/
 [14]   L. Shi, H. Chen, and J. Sun, “vCUDA:            [21]   P. O. S. Project. (2010, Apr.) Perceus
        GPU        accelerated high performance                provision enterprise resources and clusters
        computing in virtual machines,” in                     enabling uniform systems. [Online].
        Proceedings      of     the    2009    IEEE            Available: http://www.perceus.org/
        International             Symposium       on
        Parallel&Distributed             Processing.   AUTHOR:
        Washington, DC, USA: IEEE Computer
        Society, 2009, pp. 1–11. [Online].                             ANITA AHUJA is an Asst.
        Available:http://portal.acm.org/citation.cf                    Professor in Department of
        m?id=1586640.1587737                                           Computer      Science     and
 [15]   F. Bellard, “QEMU, a fast and portable                         Information   Technology    at
        dynamic translator,” in Proceedings of the                     Mewar University, Chittorgarh
        annual conference on USENIX Annual             (Rajasthan).    She has completed „A‟ level
        Technical Conference, ser. ATEC ‟05.           DOEACC Society, M.Sc (IT) from M.C.R.P.V,
        Berkeley,       CA,       USA:      USENIX     Bhopal,M.Phil, Rajasthan Vidyapeeth, and
        Association, 2005, pp. 441. [Online].          Udaipur. And M.Tech.(P) at Mewar University,
        Available:http://portal.acm.org/citation.cf    Chittorgarh . Her research interest is in the
        m?id=1247360.1247401                           fields of Network Security, Cloud Computing ,
 [16]   Josifovski V. Design, implementation and       Advance Data Structure and Algorithms.
        evaluation of a distributed mediator




                                                                                            355 | P a g e

Más contenido relacionado

La actualidad más candente

Neuro-Fuzzy System Based Dynamic Resource Allocation in Collaborative Cloud C...
Neuro-Fuzzy System Based Dynamic Resource Allocation in Collaborative Cloud C...Neuro-Fuzzy System Based Dynamic Resource Allocation in Collaborative Cloud C...
Neuro-Fuzzy System Based Dynamic Resource Allocation in Collaborative Cloud C...neirew J
 
A Comparative Study of RDBMs and OODBMs in Relation to Security of Data
A Comparative Study of RDBMs and OODBMs in Relation to Security of DataA Comparative Study of RDBMs and OODBMs in Relation to Security of Data
A Comparative Study of RDBMs and OODBMs in Relation to Security of Datainscit2006
 
The Data Distribution Service: The Communication Middleware Fabric for Scala...
The Data Distribution Service: The Communication  Middleware Fabric for Scala...The Data Distribution Service: The Communication  Middleware Fabric for Scala...
The Data Distribution Service: The Communication Middleware Fabric for Scala...Angelo Corsaro
 
Cloud Storage Research
Cloud Storage ResearchCloud Storage Research
Cloud Storage ResearchMajed AlNemari
 
Guaranteed Availability of Cloud Data with Efficient Cost
Guaranteed Availability of Cloud Data with Efficient CostGuaranteed Availability of Cloud Data with Efficient Cost
Guaranteed Availability of Cloud Data with Efficient CostIRJET Journal
 
Distributed Algorithms with DDS
Distributed Algorithms with DDSDistributed Algorithms with DDS
Distributed Algorithms with DDSAngelo Corsaro
 
ANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUES
ANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUESANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUES
ANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUESneirew J
 
SQLBits X SQL Server 2012 Rich Unstructured Data
SQLBits X SQL Server 2012 Rich Unstructured DataSQLBits X SQL Server 2012 Rich Unstructured Data
SQLBits X SQL Server 2012 Rich Unstructured DataMichael Rys
 
Bigchaindb whitepaper
Bigchaindb whitepaperBigchaindb whitepaper
Bigchaindb whitepaperArek Talun
 
Robust Module based data management system
Robust Module based data management systemRobust Module based data management system
Robust Module based data management systemRahul Roi
 
International Journal of Computational Engineering Research(IJCER)
 International Journal of Computational Engineering Research(IJCER)  International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER) ijceronline
 
Authenticated Key Exchange Protocols for Parallel Network File Systems
Authenticated Key Exchange Protocols for Parallel Network File SystemsAuthenticated Key Exchange Protocols for Parallel Network File Systems
Authenticated Key Exchange Protocols for Parallel Network File Systems1crore projects
 
Using BIG DATA implementations onto Software Defined Networking
Using BIG DATA implementations onto Software Defined NetworkingUsing BIG DATA implementations onto Software Defined Networking
Using BIG DATA implementations onto Software Defined NetworkingIJCSIS Research Publications
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
Big Data Glossary of terms
Big Data Glossary of termsBig Data Glossary of terms
Big Data Glossary of termsKognitio
 

La actualidad más candente (19)

Neuro-Fuzzy System Based Dynamic Resource Allocation in Collaborative Cloud C...
Neuro-Fuzzy System Based Dynamic Resource Allocation in Collaborative Cloud C...Neuro-Fuzzy System Based Dynamic Resource Allocation in Collaborative Cloud C...
Neuro-Fuzzy System Based Dynamic Resource Allocation in Collaborative Cloud C...
 
P24120125
P24120125P24120125
P24120125
 
A Comparative Study of RDBMs and OODBMs in Relation to Security of Data
A Comparative Study of RDBMs and OODBMs in Relation to Security of DataA Comparative Study of RDBMs and OODBMs in Relation to Security of Data
A Comparative Study of RDBMs and OODBMs in Relation to Security of Data
 
The Data Distribution Service: The Communication Middleware Fabric for Scala...
The Data Distribution Service: The Communication  Middleware Fabric for Scala...The Data Distribution Service: The Communication  Middleware Fabric for Scala...
The Data Distribution Service: The Communication Middleware Fabric for Scala...
 
Cloud Storage Research
Cloud Storage ResearchCloud Storage Research
Cloud Storage Research
 
NuoDB Product Brochure
NuoDB Product BrochureNuoDB Product Brochure
NuoDB Product Brochure
 
Guaranteed Availability of Cloud Data with Efficient Cost
Guaranteed Availability of Cloud Data with Efficient CostGuaranteed Availability of Cloud Data with Efficient Cost
Guaranteed Availability of Cloud Data with Efficient Cost
 
[IJET-V1I6P11] Authors: A.Stenila, M. Kavitha, S.Alonshia
[IJET-V1I6P11] Authors: A.Stenila, M. Kavitha, S.Alonshia[IJET-V1I6P11] Authors: A.Stenila, M. Kavitha, S.Alonshia
[IJET-V1I6P11] Authors: A.Stenila, M. Kavitha, S.Alonshia
 
Distributed Algorithms with DDS
Distributed Algorithms with DDSDistributed Algorithms with DDS
Distributed Algorithms with DDS
 
ANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUES
ANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUESANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUES
ANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUES
 
Robust module based data management
Robust module based data managementRobust module based data management
Robust module based data management
 
SQLBits X SQL Server 2012 Rich Unstructured Data
SQLBits X SQL Server 2012 Rich Unstructured DataSQLBits X SQL Server 2012 Rich Unstructured Data
SQLBits X SQL Server 2012 Rich Unstructured Data
 
Bigchaindb whitepaper
Bigchaindb whitepaperBigchaindb whitepaper
Bigchaindb whitepaper
 
Robust Module based data management system
Robust Module based data management systemRobust Module based data management system
Robust Module based data management system
 
International Journal of Computational Engineering Research(IJCER)
 International Journal of Computational Engineering Research(IJCER)  International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
Authenticated Key Exchange Protocols for Parallel Network File Systems
Authenticated Key Exchange Protocols for Parallel Network File SystemsAuthenticated Key Exchange Protocols for Parallel Network File Systems
Authenticated Key Exchange Protocols for Parallel Network File Systems
 
Using BIG DATA implementations onto Software Defined Networking
Using BIG DATA implementations onto Software Defined NetworkingUsing BIG DATA implementations onto Software Defined Networking
Using BIG DATA implementations onto Software Defined Networking
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
Big Data Glossary of terms
Big Data Glossary of termsBig Data Glossary of terms
Big Data Glossary of terms
 

Destacado

IBGE - Emprego Março 2014
IBGE - Emprego Março 2014IBGE - Emprego Março 2014
IBGE - Emprego Março 2014Paulo Veras
 
10 Insightful Quotes On Designing A Better Customer Experience
10 Insightful Quotes On Designing A Better Customer Experience10 Insightful Quotes On Designing A Better Customer Experience
10 Insightful Quotes On Designing A Better Customer ExperienceYuan Wang
 
How to Build a Dynamic Social Media Plan
How to Build a Dynamic Social Media PlanHow to Build a Dynamic Social Media Plan
How to Build a Dynamic Social Media PlanPost Planner
 
Learn BEM: CSS Naming Convention
Learn BEM: CSS Naming ConventionLearn BEM: CSS Naming Convention
Learn BEM: CSS Naming ConventionIn a Rocket
 
SEO: Getting Personal
SEO: Getting PersonalSEO: Getting Personal
SEO: Getting PersonalKirsty Hulse
 

Destacado (7)

Textures Van Gogh
Textures Van GoghTextures Van Gogh
Textures Van Gogh
 
Andalucia
AndaluciaAndalucia
Andalucia
 
IBGE - Emprego Março 2014
IBGE - Emprego Março 2014IBGE - Emprego Março 2014
IBGE - Emprego Março 2014
 
10 Insightful Quotes On Designing A Better Customer Experience
10 Insightful Quotes On Designing A Better Customer Experience10 Insightful Quotes On Designing A Better Customer Experience
10 Insightful Quotes On Designing A Better Customer Experience
 
How to Build a Dynamic Social Media Plan
How to Build a Dynamic Social Media PlanHow to Build a Dynamic Social Media Plan
How to Build a Dynamic Social Media Plan
 
Learn BEM: CSS Naming Convention
Learn BEM: CSS Naming ConventionLearn BEM: CSS Naming Convention
Learn BEM: CSS Naming Convention
 
SEO: Getting Personal
SEO: Getting PersonalSEO: Getting Personal
SEO: Getting Personal
 

Similar a Approach for Virtualization and Integration of Heterogeneous Cloud Databases

Presentation on Databases in the Cloud
Presentation on Databases in the CloudPresentation on Databases in the Cloud
Presentation on Databases in the Cloudmoshfiq
 
Database Management Systems
Database Management SystemsDatabase Management Systems
Database Management SystemsGeorge Grayson
 
WHAT IS A DBMS? EXPLAIN DIFFERENT MYSQL COMMANDS AND CONSTRAINTS OF THE SAME.
WHAT IS A DBMS? EXPLAIN DIFFERENT MYSQL COMMANDS AND  CONSTRAINTS OF THE SAME.WHAT IS A DBMS? EXPLAIN DIFFERENT MYSQL COMMANDS AND  CONSTRAINTS OF THE SAME.
WHAT IS A DBMS? EXPLAIN DIFFERENT MYSQL COMMANDS AND CONSTRAINTS OF THE SAME.`Shweta Bhavsar
 
Quantitative Performance Evaluation of Cloud-Based MySQL (Relational) Vs. Mon...
Quantitative Performance Evaluation of Cloud-Based MySQL (Relational) Vs. Mon...Quantitative Performance Evaluation of Cloud-Based MySQL (Relational) Vs. Mon...
Quantitative Performance Evaluation of Cloud-Based MySQL (Relational) Vs. Mon...Darshan Gorasiya
 
Challenges Management and Opportunities of Cloud DBA
Challenges Management and Opportunities of Cloud DBAChallenges Management and Opportunities of Cloud DBA
Challenges Management and Opportunities of Cloud DBAinventy
 
NOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQLNOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQLRamakant Soni
 
NOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdfNOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdfajajkhan16
 
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...InfiniteGraph
 
Cloud Storage: Focusing On Back End Storage Architecture
Cloud Storage: Focusing On Back End Storage ArchitectureCloud Storage: Focusing On Back End Storage Architecture
Cloud Storage: Focusing On Back End Storage ArchitectureIOSR Journals
 
Cidr11 paper32
Cidr11 paper32Cidr11 paper32
Cidr11 paper32jujukoko
 
Megastore providing scalable, highly available storage for interactive services
Megastore providing scalable, highly available storage for interactive servicesMegastore providing scalable, highly available storage for interactive services
Megastore providing scalable, highly available storage for interactive servicesJoão Gabriel Lima
 
Database management-system
Database management-systemDatabase management-system
Database management-systemkalasalingam
 

Similar a Approach for Virtualization and Integration of Heterogeneous Cloud Databases (20)

Database
DatabaseDatabase
Database
 
Presentation on Databases in the Cloud
Presentation on Databases in the CloudPresentation on Databases in the Cloud
Presentation on Databases in the Cloud
 
Database Management Systems
Database Management SystemsDatabase Management Systems
Database Management Systems
 
WHAT IS A DBMS? EXPLAIN DIFFERENT MYSQL COMMANDS AND CONSTRAINTS OF THE SAME.
WHAT IS A DBMS? EXPLAIN DIFFERENT MYSQL COMMANDS AND  CONSTRAINTS OF THE SAME.WHAT IS A DBMS? EXPLAIN DIFFERENT MYSQL COMMANDS AND  CONSTRAINTS OF THE SAME.
WHAT IS A DBMS? EXPLAIN DIFFERENT MYSQL COMMANDS AND CONSTRAINTS OF THE SAME.
 
jose rizal
jose rizaljose rizal
jose rizal
 
Database
DatabaseDatabase
Database
 
Quantitative Performance Evaluation of Cloud-Based MySQL (Relational) Vs. Mon...
Quantitative Performance Evaluation of Cloud-Based MySQL (Relational) Vs. Mon...Quantitative Performance Evaluation of Cloud-Based MySQL (Relational) Vs. Mon...
Quantitative Performance Evaluation of Cloud-Based MySQL (Relational) Vs. Mon...
 
Challenges Management and Opportunities of Cloud DBA
Challenges Management and Opportunities of Cloud DBAChallenges Management and Opportunities of Cloud DBA
Challenges Management and Opportunities of Cloud DBA
 
Report 1.0.docx
Report 1.0.docxReport 1.0.docx
Report 1.0.docx
 
Aw4103303306
Aw4103303306Aw4103303306
Aw4103303306
 
NOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQLNOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQL
 
NOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdfNOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdf
 
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
 
Cloud Storage: Focusing On Back End Storage Architecture
Cloud Storage: Focusing On Back End Storage ArchitectureCloud Storage: Focusing On Back End Storage Architecture
Cloud Storage: Focusing On Back End Storage Architecture
 
K017146064
K017146064K017146064
K017146064
 
Ordbms
OrdbmsOrdbms
Ordbms
 
Cidr11 paper32
Cidr11 paper32Cidr11 paper32
Cidr11 paper32
 
Megastore providing scalable, highly available storage for interactive services
Megastore providing scalable, highly available storage for interactive servicesMegastore providing scalable, highly available storage for interactive services
Megastore providing scalable, highly available storage for interactive services
 
Database management-system
Database management-systemDatabase management-system
Database management-system
 
Report 2.0.docx
Report 2.0.docxReport 2.0.docx
Report 2.0.docx
 

Approach for Virtualization and Integration of Heterogeneous Cloud Databases

  • 1. Anita Ahuja, Ajay Kumar, Ramveer Singh / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 2, Issue 5, September- October 2012, pp.352-355 An Approach for Virtualization and Integration of Heterogeneous Cloud Databases Anita Ahuja* Ajay Kumar ** Ramveer Singh*** *(Department of Computer Science, Asst. Professor, Mewar University, Chittorgarh (India) ** (Department of Computer Science, Asst. Professor, Mewar University, Chittorgarh (India)) *** ( Department of Computer Science,Professor, R.K.G.I.T.,Mahamaya Technical University, Ghaziabad(India) ABSTRACT: Virtualization is the key technology distribution transparency, Global schema- Common behind cloud computing that allows the creation data descriptions & Data placement information, of an abstraction layer of the underlying cloud Centralized admin through global catalog, Infrastructure. Using virtualization, resources Distributed functions, Query processing, (hardware and software) can be shared and Transaction management, Access control etc[1]. utilized while hiding the complexity from the cloud users. A lot of cloud database are available II WHY NOT RDBMS? that managed by different organization such as- RDBMS all have a distributed and parallel Amazon Storage for the Cloud, Google Storage version with SQL support for all kinds of data for the Cloud, Hadoop Storage for the Cloud, (structured, XML, multimedia, streams, etc.) [1] Yahoo!’s PNUTS, Cassandra, CouchDB etc. Standard SQL a major argument for adoption by This paper is presented to propose a virtual tool vendors (e.g. analytics, business intelligence), Database framework that enables the centralized but the “one size fits all” approach has reached the global object oriented database. A virtually limits result loss of performance. integrated huge database that will hide the Now simplicity and flexibility required for heterogeneity of various cloud databases. Once applications with specific, tight requirements. New they are integrated a consistent access is provide specialized DBMS engines more efficient: column- to the end user. oriented DBMS for OLAP, DSMS for stream processing, SciDB[11] for scientific analytics, etc. Keywords – OOMDS, Virtualization, Cloud, RDBMS provides ACID transactions, complex Databases, cloud computing, Mediator query language, lots of tuning knobs but it is less Framework, Peers. suitable for specific optimizations for OLAP, flexible programming model, flexible schema and I. INTRODUCTION scalability. Cloud computing is a model for enabling convenient, on- demand network access to a shared III INTEGRATED DATA MANAGEMENT pool of configurable computing resources (e.g., PROBLEM IN CLOUD networks, servers, storage, applications, and Cloud data are very large (lots of data services) that can be rapidly provisioned and spaces, very large collections, multimedia etc). released with minimal management effort or They are Complex, unstructured or semi-structured service provider interaction[3] often schema less but metadata (tags,). Different The different cloud providers adopt different file formats, access protocols and query languages architecture and data models such as Amazon‟s are used. Table decompositions may vary, column storage building block Dynamo[6], S3, SimpleDB, names (data labels) may be different (but have the and RDS, S3, Google storage building blocks same semantics), and data encoding schemes may Bigtable, Hadoop‟s building block HDFS, Hive, vary it also referred as schematic heterogeneity[8]. HadoopDB, and HBase, Yahoo‟s PNUTS, Cloud users and application developers are in very Cassandra data model, CouchDB data model. high numbers with very diverse expertise but very It is realized that traditional DBMS does not fit little DBMS expertise. well for the cloud computing environment so new data model row oriented, document oriented, IV PROPOSED FRAMEWORK widecolumn are widely used in cloud. Different Object Oriented Mediator Database System cloud providers use different architecture and data (OOMDS): models that best suit their application. The proposed system is object oriented Now A Virtual integrated database management mediator data base system of various system should be developed that Provides heterogeneous cloud data bases that having object 352 | P a g e
  • 2. Anita Ahuja, Ajay Kumar, Ramveer Singh / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 2, Issue 5, September- October 2012, pp.352-355 oriented query language in which object oriented for different application areas in mediator peers. views of data can be specified .In OOMDS has The object oriented data model provides very primitive to translate data from different clouds powerful query and data integration primitives database into object oriented data base. These which require advanced query optimization. translated cloud data can be used to build views .This OOMDS supports multiple data base exists The mediator/wrapper approach has been used for on cloud. integrating heterogeneous data in several projects. Most mediator systems integrate data through a central mediator server accessing one or several data sources through a number of “wrapper” interfaces that translate data to a global data model. However, one of the original goals for mediator architectures was that mediators should be relatively simple distributed software modules that transparently encode domain-specific knowledge about data and share abstractions of that data with higher layers of mediators or applications. Larger networks of mediators would then be defined through these primitive mediators by composing new mediators in terms of other mediators and data sources. The core of OOMDS is an open, light-weight, and extensible object oriented database management system with a object oriented data model. Each OOMDS server must contains all the traditional database facilities, such as a storage manager, a recovery manager, a transaction manager, and a functional query language named OOMDSQL. The system can be used as a single-user database or as a multi-user server to applications and to other OOMDS peers. FIGURE : OBJECT ORIENTED MEDIATOR DISTRIBUTION: DATABASE SYSTEM OOMDS is a distributed mediator system where several mediator peers communicate over DATA INTEGRATION IN OODMS SYSTEM the Internet. Each mediator peer appears as a OOMDS is a distributed mediator system virtual functional database layer having data that uses a object oriented data model and has a abstractions and a object oriented query relationally complete object oriented query language. Object oriented views provide language, OOMDSQL. Through its distributed transparent access to data sources from clients and object oriented multi-database facilities many other mediator peers. Conflicts and overlaps autonomous and distributed OOMDS peers can between similar real- world entities being interoperate. Object oriented multi-database queries modeled differently in different data sources are and views can be defined where external data reconciled through the mediation primitives of the sources of different kinds are translated through multi-mediator query language OOMBSQL. The OOMDS and reconciled through its functional mediation services allow transparent access to object oriented mediation primitives. Each similar data structures represented differently in mediator peer provides a number of transparent different data sources[13]. Applications access data functional views of data reconciled from other from distributed data sources through queries to mediator peers, wrapped data sources, and data views in some mediator peer[9]. stored in OOMDS itself. The composition of Logical composition of mediators is achieved when mediator peers in terms of other peers provides a multi-database views in mediators are defined in way to scale the data integration process by terms of views, tables, and functions in other composing mediation modules. The OOMDS mediators or data sources. The multi-database data manager and query processor must be views make the mediator peers appear to the user extensible so that new application oriented data as a single virtual database. OOMDS mediators are types and operators can be added to OODMSQL, compostable since a mediator peer can regard other implemented in some external programming mediator peers as data sources[16]. language (Java, C, C++ or Lisp). The extensibility allows wrapping data representations specialized 353 | P a g e
  • 3. Anita Ahuja, Ajay Kumar, Ramveer Singh / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 2, Issue 5, September- October 2012, pp.352-355 WRAPPING DATA The Object oriented Data Model and query In order to access data from external language forming the basis for data integration in data sources OOMDS mediators may contain one OOMDS. The distributed multi-mediator query or several wrappers which process data from decomposition strategies used were summarized. different kinds of external data sources[15], e.g. The mediator peers are autonomous without any ODBC-based access to relational databases, access central schema. A special mediator, the central to XML files, CAD systems, or Internet search name server, keeps track of what mediator peers engines to extract data from heterogeneous cloud are members of a group. The central name servers data bases. A wrapper is a procedure in OOMDS can be queried for the location of mediator peers in having specialized facilities for query processing a group. Meta-queries to each mediator peer can be and translation of data from a particular class of posed to investigate the structure of its schema. external data sources. It contains both interfaces to Some unique features of OOMDS are: A external data sources and knowledge of how to distributed mediator Framework where query plans efficiently translate and process queries involving are distributed over several communicating accesses to different cloud databases. In particular, mediator peers. Using declarative object oriented external OOMDS peers known to a mediator are queries to model reconciled object oriented views also regarded as external data sources and there is a spanning multiple mediator peers. Query special wrapper for accessing other OOMDS processing and optimization techniques for queries peers[18]. However, among the OOMDS peers to reconcile views involving function overloading, special query optimization methods are used late binding, and type-aware query rewrites. that take into account the distribution, capabilities, costs, etc., of the different peers[20]. REFERENCES THE CENTRAL NAME SERVER [1] S. Aulbach, T. Grust, D. Jacobs, A. Kemper, Every mediator peer must belong to a and J. Rittinger. Multi-tenant databases for group of mediator peers. The mediator peers in a software as a service: Schema-mapping group are described through a meta-schema stored techniques. In SIGMOD, 2008. in a mediator server called central name server. [2] M. Brantner, D. Florescu, D. Graf, D. The mediator peers are autonomous and there is no Kossmann, and T. Kraska. Building a central schema in the name server [13]. The central database on S3. In SIGMOD, 2008. name server contains only general meta- [3] F. Chang, J. Dean, S. Ghemawat, W. Hsieh, information such as the locations and names of the D. Wallach, M. Burrows, T. Chandra, A. peers in the group while each mediator peer has its Fikes, and R. Gruber. Bigtable: A own schema describing its local data and data distributed storage system for structured sources. The information in the central name data. In OSDI, 2006. server is managed without explicit operator [4] B. F. Cooper, R. Ramakrishnan, U. intervention; its content is managed through Srivastava, A. Silberstein, P. Bohannon, messages from the mediator peers. To avoid a H.-A. Jacobsen, N. Puz, D. Weaver, and bottleneck, mediator peers usually communicate R. Yerneni. PNUTS: Yahoo!‟s hosted data directly without involving the name server; it is serving platform. PVLDB, 1(2), 2008. normally involved only when a connection to some [5] C. Curino, E. Jones, Y. Zhang, and S. new mediator peer is established [21]. Madden. Schism: A Workload-Driven Approach to Database Replication and Partitioning. In VLDB, 2010. CONCLUSION: [6] E. Damiani, S. D. C. di Vimercati, S. Jajodia, We have given an overview of the S. Paraboschi, and P. Samarati. Balancing OOMDS mediator system where groups of Confidentiality and Efficiency in distributed mediator peers are used to integrate data Untrusted Relational DBMS. CCS, 2003. from different sources. Each mediator in a group [7] S. Das, D. Agrawal, and A. E. Abbadi. has DBMS facilities for query compilation and ElasTraS: An elastic transactional data exchange of data and meta-data with other store in the cloud. HotCloud, 2009. mediator peers. Derived functions can be defined [8] R. Freeman. Oracle Database 11g New where data from several mediator peers is Features. McGraw-Hill, Inc., New York, abstracted, transformed, and reconciled. Wrappers NY, USA, 2008. are defined by interfacing OOMDS systems with [9] R. Gennaro, C. Gentry, and B. Parno. Non- external systems through its multi-directional Interactive Verifiable Computing: foreign function interface. OOMDS can Outsourcing Computation to Untrusted furthermore be embedded in applications and used Workers. STOC,2010. as stand-alone databases. 354 | P a g e
  • 4. Anita Ahuja, Ajay Kumar, Ramveer Singh / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 2, Issue 5, September- October 2012, pp.352-355 [11] H. Hacigumus, B. Iyer, C. Li, and S. system for data integration. PhD Thesis, Mehrotra. Executing SQL over Encrypted Linko¨ ping U., Sweden. Data in the Database-Service-Provider http://www.dis.uu.se/˜udbl/publ/vanjaphd. Model. ACM SIGMOD, 2002. pdf [1999]. [12] “Kernel based virtual machine (KVM).” [17] Bukhres O, Elmagarmid A (eds.). Object- [Online]. Available:http://www.linux- oriented Multidatabase Systems. Pretince kvm.org Hall, 1996. [13] G. Giunta, R. Montella, G. Agrillo, [18] Dayal U, Hwang H-Y. View definition and G. Coviello, “A GPGPU and generalization for database integration transparent virtualization component in a multidatabase system.IEEE for high performance computing Transactions on Software Engineering clouds,” in Proceedings of the 16th 1984; 10(6):628–645. international Euro-Par conference on [19] A. N. Laboratory. (2010, Jul.) Heckle. Parallel processing: Part I, ser. [Online]. Available: EuroPar‟10. Berlin, Heidelberg: http://trac.mcs.anl.gov/projects/Heckle/ Springer-Verlag, 2010, pp. 379–391. [20] xCat Open Source Project. [Online]. Available: (2011,May)xCat extreme cloud http://portal.acm.org/citation.cfm?id=1887 administration toolkit. [Online]. Available: 695.1887738 http://xcat.sourceforge.net/ [14] L. Shi, H. Chen, and J. Sun, “vCUDA: [21] P. O. S. Project. (2010, Apr.) Perceus GPU accelerated high performance provision enterprise resources and clusters computing in virtual machines,” in enabling uniform systems. [Online]. Proceedings of the 2009 IEEE Available: http://www.perceus.org/ International Symposium on Parallel&Distributed Processing. AUTHOR: Washington, DC, USA: IEEE Computer Society, 2009, pp. 1–11. [Online]. ANITA AHUJA is an Asst. Available:http://portal.acm.org/citation.cf Professor in Department of m?id=1586640.1587737 Computer Science and [15] F. Bellard, “QEMU, a fast and portable Information Technology at dynamic translator,” in Proceedings of the Mewar University, Chittorgarh annual conference on USENIX Annual (Rajasthan). She has completed „A‟ level Technical Conference, ser. ATEC ‟05. DOEACC Society, M.Sc (IT) from M.C.R.P.V, Berkeley, CA, USA: USENIX Bhopal,M.Phil, Rajasthan Vidyapeeth, and Association, 2005, pp. 441. [Online]. Udaipur. And M.Tech.(P) at Mewar University, Available:http://portal.acm.org/citation.cf Chittorgarh . Her research interest is in the m?id=1247360.1247401 fields of Network Security, Cloud Computing , [16] Josifovski V. Design, implementation and Advance Data Structure and Algorithms. evaluation of a distributed mediator 355 | P a g e