SlideShare una empresa de Scribd logo
1 de 5
Descargar para leer sin conexión
2011 10th IEEE/ACIS International Conference on Computer and Information Science



    A Novel Solution of Distributed Memory NoSQL Database for Cloud Computing

                                                     Jing Han, Meina Song, Junde Song
                               School of Computer, Beijing University of Posts and Telecommunication
                                                       Beijing 100876, China
                              babyblue110128@hotmail.com; mnsong@bupt.edu.cn; jdsong@bupt.edu.cn

                               Abstract
     The traditional relational database to online storage are                     (MapReduce[3][4]), Which constitutes Google's "cloud
 becoming increasingly problematic: the low performance do not                     computing" environment, other companies have also
 gracefully meet the needs of mass data, the storage approaches of                 proposed and implemented similar cloud system, including
 massive data are still not perfect presents. NoSQL and distributed                Amazon's EC2[5], S3[6], and IBM's "Blue Cloud" and so on.
 memory database technologies have the potential to simplify or                          If faced mass data, the existing database system would
 eliminate many of these challenges. NoSQL database technologies                   be lacked performance and cannot afford the high prices of
 can provide Key-value style of data storage and largely ensure
                                                                                   database software, in future mass data must be migration to
 high performance. Distributed memory database technologies
                                                                                   the cloud, but the ACID properties which database system
 provide a means for easily store mass data in cloud in a dynamic
 and scalable manner.
                                                                                   required will lead to serious performance reducing of part of
     This paper argues for a new architecture called CDSA, which                   operation when stored the data distributed.
 is a distributed memory NoSQL database architecture for Cloud                           To ensure high availability, high reliability and
 computing to improve the performance of querying data and                         economy, the cloud data storage must be stored redundant to
 ensure mass data storage in cloud by using rational strategy.                     ensure reliability. In addition, the cloud computing system
 Furthermore, add or deleted any node from the distributed                         needs to meet the needs of large number of users
 database cluster, the others node can work without stop service.                  simultaneously, provide service to users paralleled.
 We believe that CDSA can provide durable storage with high                        Therefore, data storage technology of cloud computing
 throughput and lower access latency                                               requires high throughput and high transfer rate.
                                                                                         This paper presents high performance data storage
     Keywords: cloud computing; NoSQL; cache; memory                               architecture for Cloud Computing to improve the
 database; distributed                                                             performance of storing distributed data by using more
                                                                                   rational strategy.
                         I.      INTRODUCTION                                            This paper is structured in the following way: Section 2
      In order to store and manage data in an efficient way,                       describes related work. Section 3 introduces the internal
 database management system (DBMS) was conceptualized                              architecture of Cloud Data Storage Architecture (CDSA).
 by industry in the 1960s and developed the early DBMS                             Section 4 concludes the paper.
 from file system. With the expansion of applications for
 handling more complex data model, more complex query
 operation and more extensive database, gradually, distributed                                          II.   RELATED WORK
 database was developed, parallel database, database cluster,
 data warehouse and so on. All of them were met the
 application requirements in some ways, but the support of                         A. Cloud Computing
 massive data are still not perfect. At the same time, Internet                          Cloud Computing[7] was mainly proposed by Google,
 becomes a huge repository of information and vast amounts                         Amazon and IBM and so on. There are several definitions
 of data generated every day. If want to retrieve the data, you                    currently, and each described cloud computing from a
 need massive storage space and large scale computing,                             different point of view.
 which existing database system cannot afford, and large                                 In which a comprehensive definition[8] is: “cloud
 server has issues such as limited performance and high price.                     computing is a platform (system) or a type of application. A
 Therefore, Google established a large scale cluster using a lot                   cloud computing platform provision, configuration,
 of cheap PC, designed and implemented a distributed file                          reconfigure and provision on-demand and so on. In a cloud
 system named GFS[1], a storage system named BigTable[2]                           computing platform, the server can be physical server or
 and a parallel programming environment                                            virtual server. Advanced computing clouds usually contain a
                                                                                   number of other computing resources, such as Storage Area
 -------------------------------------------------------------------------         Networks, network equipment, firewall and other safety
 This work is supported by the National Key project of                             equipment. Cloud computing describes a scalable
 Scientific and Technical Supporting Programs of China                             applications which can access through the Internet. ‘Cloud
 (Grant Nos.2008BAH24B04, 2008BAH21B03); the Program                               applications’ use large-scale data centers and a strong server,
 of the Co-Construction with Beijing Municipal Commission                          web application and network services can run in it. Any user
 of Education of China.                                                            can access a cloud computing applications through
                                                                                   appropriate equipment and a standard Internet browser. ”

978-0-7695-4401-4/11 $26.00 © 2011 IEEE                                      351
DOI 10.1109/ICIS.2011.61
Despite there exist a variety of understanding of cloud                  Challenges traditional relational database facing in
computing, it’s mainly the following three basic                         cloud computing[11]:
characteristics: infrastructure on a large scale low-cost server            1) High Performance - demand on the high concurrent
clusters; applications developed in collaboration with the               literacy of database
underlying services to maximize the use of resources; to                       In cloud computing, users’ complicated needs require
ensure high availability by the redundancy among multiple                real-time formation of dynamic pages and provide dynamic
low-cost servers.                                                        information, it cannot use technology to staticized dynamic
      The feature of data storage in cloud computing is that,            pages, so database concurrent is too high and tend to
data is automatically distributed in different storage nodes in          thousands of times reading request per second. A relational
the cluster, each data storage node only preserved a fragment            database is hard to cope with thousands of times SQL data
of whole data, at the same time, user can set to store data in           written request, also the hard disk cannot bear.
different storage nodes to ensure that a single point failure               2) Huge Storage – demand on the efficient storage and
will not cause data loss.                                                visit of mass data
                                                                               The mass data generated dynamically in cloud
B. Cloud Database                                                        computing, for relational database, recorded hundreds of
                                                                         millions of records in the storage in a table and the efficient
      As the development of large-scale distributed service
                                                                         to run SQL query is extremely low.
and distributed storage which clouds computing needs,
traditional relational database are facing many new                         3) High Scalability & High Availability
challenges, especially in scenarios of the large scale and high                In the web-based framework, database is too hard to
concurrent SNS applications, by the use of a relational                  extend horizontally. When the volume and traffic of an
database to store and inquires users’ dynamic data has been              application system is growing, for Web Server and App
ragged and exposes many problems to overcome, such as                    Server, adding more hardware and service node to expand
there need high real-time insert performance; Need mass data             the performance and load capacity, but for database there
storage capacity, still need high retrieval speed; Need to store         have no simple way like that. For a lot to application system
data seamless spreading throughout the distributed cluster               deployed in cloud should be provided 24-hour uninterrupted
environment, and have ability of online expansion, etc.                  service, upgrading and extension of database system are very
According this, NoSQL[9] is made.                                        difficult, often require downtime and data migration, but
                                                                         cannot through adding database services node continuously
                                                                         to implement expansion.
                                                                               Meanwhile, many of the main characters of relational
                                                                         database in the cloud often have no use.
                                                                            1) Database transaction consistency
                                                                               Many system deployed in the clouds does not require
                                                                         strict database transaction, and have low requirements of
                                                                         read consistency; some occasions write consistency
                                                                         requirement is not high also. Therefore, database transaction
                                                                         management became high load of an under-heavy-burden
                                                                         database.
                                                                            2) Complex SQL query, especially demand on
                                                                         multi-table association query
                                                                               Any system with mass data has difficulty on multiple
                                                                         associated queries among big tables, and complex data
                                                                         analysis and complex data report. In cloud computing,
                                                                         simple conditions paging query of single table with primary
                                                                         key is more often, association query of relational database
                                                                         query have been are greatly weakened.
               Figure 1. Simple Data R&W Process                               As cloud computing is an Internet-based application
                                                                         (environment), the database used in cloud computing are key
                                                                         / value model which is for Internet.
      NoSQL database broke the paradigm constraint of
traditional relational database. From a storage perspective,                               III.   CDS ARCHITECTRUE
NoSQL is not a relational database, but a hash database with                  This paper presents a solution which oriented to high
format of Key-value[10]. Due to the abandoned powerful                   performance cloud computing for distributed-memory
SQL query language, transaction-consistent and constraints               database NoSQL. The solution was used to solve the
in paradigm of relational database, NoSQL database largely               problems like data storage and access issues in the massive
solved challenges which the traditional relational database is           highly concurrent systems of cloud computing environment
facing.



                                                                   352
According to the functions of various components, the              meet the need of unknown data pattern of cloud database, the
system logic can be abstracted as three layers: Data Cache               core ideas of OP method is to update distributed cache before
layer, Memory Database layer and Disk Database Layer.                    the first query of user, the specific method is: record all the
                                                                         queries of users, predict queries may be ran, queries at the
                                                                         appropriate time to active cache to work, stored the relevant
                                                                         information in each distributed cache. This method will solve
                                                                         the problem of the first hit to cache and reduce the user delay
                                                                         when the first query, so that cache can play a better role.
                                                                               OP cache is a method to capture change of data; it is
                                                                         based on a prediction algorithm, in cache layer OP is used to
                                                                         improve query response speeds. Record the user's query
                                                                         history, extract the appropriate query script, abstracted the
                                                                         query with different parameter values as a template; statistic
                                                                         from the historical probably query parameters of each query
                                                                         and sort, OP objects is the high probability template and its
                                                                         parameters. However, typical applications of this method are:
                                                                         the use of access the database structure to create dynamic
                                                                         Web page, because the mode of database is predefined, it is
                                                                         appropriate for simple statistical analysis and access
                                                                         prediction.

                                                                         B. Memory Database Layer MDL
                   Figure 2. CDS Archetectrue.
                                                                               Before the persistence of data, the data in the cloud
                                                                         were stored in memory firstly, for the efficiency of memory
                                                                         storage is much higher than the performance of disk storage.
A. Data Cache Layer DCL                                                        Combined with the requirements of data storage which
                                                                         previous analyzed for high performance data and mass
      Currently, the way of centralized deployment was used
                                                                         storage in the cloud computing environment, NoSQL came
to handle the problem of cache. The distributed deployment
                                                                         to the only solution.
of database will lead to the change of cache when the data in
                                                                               Therefore, these two considerations can be deduced
any node was updated.
                                                                         from an idea: memory NoSQL database, use "memory" as a
      Particularly the larger of distributed database will takes
                                                                         "disk", read and write operations no longer interact directly
the problems like the bigger load of current cache in the
                                                                         with the disk database but with memory databases, which
middle tier, too expensive to maintain the system
                                                                         avoid the time delay and the same issues in simple NoSQL
consistently, and weak practically of system. In addition, the
                                                                         read-write- separation structure.
frequent consistency Maintenance will reduce the utilization
of cache. Therefore, we propose a new cache system which
was applied in distributed environment. And the primary
feature of this cache system is maximizes the role of the
cache. The definition of distributed cache is all the service
nodes just store and maintain the cache data in current
database and its corresponding query information. During
the data updating of all databases will not affect the cache
information of other databases. From the task of Agent in the
middle layer, the cache data of each service node can be used
to summary the query results.
      The structure of cache stored should same as table
structure in disk, and the data cannot have redundancy.
Although the rational use of this method can improve the
efficiency of database queries, but some common requests
are also not timely respond. For example, everyday users
statistic the sales the day before, at the first query in cache,               Figure 3. Read and write operations in Memory Database
they must not hit in the cache, so the first response time of
query is often very long, which is a shortcoming which cache
technology cannot overcome.
      We use "Optimal Prefetch(OP)" method to resolve this                     In the transactional database system, the memory
problem. Work presented in [12] discusses the Prefetch                   database required for a consistency with disk database in the
method, but the data pattern should be input in advance. To              startup. Therefore, the memory database need to have the
                                                                         same table definition with library database; and in the first


                                                                   353
time startup, it required to load all data in table of disk                        By separated the operation of reading and writing,
database into memory database.                                               vertical and horizontally segmentation, mass data can be
                                                                             stored Cloud memory NoSQL database cluster. As shown
                                                                             below, the database cluster distributed proxy is in charge of
C. Disk Database Layer DDL                                                   controls such as data routing.
     Key-Value database is mainly characteristic of high
concurrently read and write performance. Key-Value
memory database load the entire database in memory, which
flush data from memory database to disk through the
asynchronous operation regularly. In addition, by setting
expire time of Key-Value can also capture the changed data
in memory database. Because operations is ran purely in
memory, Key-Value memory database’s performance is very
good, more than million times read and write operations can
be handled per second.
     Changes of data in memory database need to be copied
to the disk database. Then copy the data from memory to
disk can be seen as the process of an original asynchronous
writing operation, obviously, the asynchronous write
operation make faster.

                                                                                         Figure 5. Memory NoSQL Database cluster

                                                                                  When copy data from memory database to disk
                                                                             database, it will use hybrid partition method. The partition
                                                                             which was committed by a NoSQL node in Memory
                                                                             Database Layer will be committed by a memory node and a
                                                                             common NoSQL node. The database schema will be formed
                                                                             by the two databases.




Figure 4. Data synchronization from Memory Database to Disk Database



D. Distributed storage in CDSA
      Distributed database in cloud computing is a database
service system which constitute from database distributed in
different nodes, and according this distributed architecture to
provide online flexible scalability. For example, more data                                      Figure 6. H-Shard cluster
nodes can be added or deleted without stop service and so on.
Cloud database is not just a database, but a distributed
database network service which was constituted with a large
number of database nodes. A write operation of cloud                                       IV.    PERFORMANCE OF CDSA
database will be copied to other nodes, read request will be                      MDL of CDSA is created by aggregating the main
routed to a node in cloud. For a cloud database cluster, the                 memories of servers. The CDSA can provide durable and
scalability is become easy; just add a node in the cluster                   available storage with 100x the throughput of only use disk
inside cloud. Cloud distributed database is necessarily a                    database and lower access latency. The combination of low
multiple-node system; its performance of concurrent depends                  latency and large scale will enable a new breed of data-
on the number of nodes in the system and routing efficiency.                 intensive applications.



                                                                       354
This section focused on the test of CDSA access                             3) Mass data storage
patterns. Through the basic operations of insert, read, update,                      By created a distributed cloud database cluster in the
and delete, to test both the throughput and average response                   cloud to achieve mass data storage;
time, at the same time to verify the correctness and                              4) High Availability
effectiveness of CDSA. Using the test environment                                    Based on the distributed database proxy, by using MDB
specifically as follows:                                                       master node and slave node to implement high availability
      To test throughput and response time of CDSA, firstly                    further; the secondary disk database can achieve fast data
nmoll is used as tools to capture the performance of the test                  recovery
program and write to a file, and then use the nmoll as an                            The statistical data shows that this architecture
analyser to analyze the data file, finally we select the test                  performs well and brings a lot of benefits. However, there is
results of CPU and I/O of disk to create a graphical report.                   limit in the management of user-defined data resources, the
      Test program insert, update, read or delete 500 million                  flexibility of cloud service controls, and search in different
records respectively, the time consuming was recorded. First                   data resources. We plan to address these problems in the
the test program opened the database, the records were read                    future releases.
from a disk database to memory corresponds, and then run
the Insert. Update, Read or Delete operations, finally
synchronized data from memory to disk file.                                                                  REFERENCES
           TABLE I.              TEST RESULT-DATA OPERATION
                                                                               [1]  S. Ghemawat, H. Gobioff, and S. Leung The Google File System In:
  Operations          open database          run        close database
                                                                                    Proc. of the Nineteenth ACM Sym-posium on Operating Systems
                           (s)                (s)             (s)
                                                                                    Principles, October, 2003
 insert         2                          17       4                          [2] Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh,
 update         0.7                        8        2                               Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes,
 read           1                          5        3                               and Robert E. Gruber Bigtable: A Distributed Storage System for
 delete         1                          5        2                               Structured Data. In Proceedings of OSDI'06: Seventh Symposium on
                                                                                    Operating System Design and Implementation, November, 2006
                                                                               [3] Amazon. Amazon elastic compute cloud (Amazon EC2). 2009.
                                                                                    http://aws.amazon.com/ec2/
    TABLE II.     TEST RESULT-AVG RESPONSETIME&THROUGHPUT
                                                                               [4] Amazon          Simple       Storage      Service       (S3).      2009.
   Operations           average response time            throughput                 http://www.amazon.com/s3/.
                                (μ s)                                          [5] Dean J, Ghemawat S. MapReduce: Simplified Data Processing on
                                                                                    Large Clusters. In OSDI '04: Sixth Symposium on Operating System
 insert               3                             316000
                                                                                    Design and Implementation, 2004
 update               1.5                           562000
 read                 1.2                           753000                     [6] Yang HC, Dasdan A, Hsiao RL, Parker DS. Map-Reduce-Merge:
                                                                                    Simplified relational data processing on large clusters In: Proc. of the
 delete               1                             671000
                                                                                    2007 ACM SIGMOD Int’ Conf on Management ofData. New York:
                                                                                    ACM Press,2007. 1029-1040.
      Table I and table      show the test result: the average                 [7] P. Mell and T. Grance, "Draft nist working definition of cloud
                                                                                    computing - v15," 21. Aug 2009, 2009
response time is in the level of microseconds. Because of
logic is relatively simple, the test results may be relative                   [8] Chen Kang, Zheng Weimin Cloud Computing: System Instances and
                                                                                    Current Research[J] Journal of Software, Vol.20, No.5, May 2009,
better than the practical, but it is certain that the CDSA is a                     pp.1337−1348.
good choice for cloud.                                                         [9] "NoSQL Relational Database Management System: Home Page".
                                                                                    Strozzi.it. 2007-10-02. Retrieved 2010-03-29
                                                                               [10] DeCandia G, Hastorun D, Jampani M, Kakulapati G, Lakshman A
                            V.     CONCLUSION                                       Pilchin A, Sivasubramanian S, Vosshnll P, Vogels W. Dynamo:
      According to the characteristics of distributed storage in                    Amazon’ s highly available key value store In: Proc. of the 21st
                                                                                    ACM Symp on Operating Systems Principles New York: ACM
cloud, by introducing memory database and NoSQL, we                                 Press, 2007. 205-220.
propose an architecture of distributed memory NO-SQL                           [11] Eelco Plugge, Tim Hawkins, Peter Membrey The Definitive Guide to
database for cloud, to address massive data storage and high                        MongoDB: The NoSQL Database for Cloud and Desktop Computing
concurrent access issues cloud in computing environments.                           Publisher Apress Berkely, CA, USA, 2010
The core idea is:                                                              [12] Wang D, Xie J. An approach toward web caching and pre-fetching
   1) High performance:                                                             for database management system. Paris, France: SIGMOD, 2001:
      By using a cache layer and a memory NoSQL database,                           204-216.
to provide high-performance database access service, which
is the main goal of this architecture;
   2) Persistence
      By using of memory database and disk database to
implement asynchronously copy from memory database to
disk database;



                                                                         355

Más contenido relacionado

La actualidad más candente

Efficient and reliable hybrid cloud architecture for big database
Efficient and reliable hybrid cloud architecture for big databaseEfficient and reliable hybrid cloud architecture for big database
Efficient and reliable hybrid cloud architecture for big databaseijccsa
 
Cloud Computing: A Perspective on Next Basic Utility in IT World
Cloud Computing: A Perspective on Next Basic Utility in IT World Cloud Computing: A Perspective on Next Basic Utility in IT World
Cloud Computing: A Perspective on Next Basic Utility in IT World IRJET Journal
 
A Virtualization Model for Cloud Computing
A Virtualization Model for Cloud ComputingA Virtualization Model for Cloud Computing
A Virtualization Model for Cloud ComputingSouvik Pal
 
05958007cloud
05958007cloud05958007cloud
05958007cloudyassram
 
IJSRED-V1I1P1
IJSRED-V1I1P1IJSRED-V1I1P1
IJSRED-V1I1P1IJSRED
 
A Threshold Secure Data Sharing Scheme for Federated Clouds
A Threshold Secure Data Sharing Scheme for Federated CloudsA Threshold Secure Data Sharing Scheme for Federated Clouds
A Threshold Secure Data Sharing Scheme for Federated CloudsIJORCS
 
35 content distribution with dynamic migration of services for minimum cost u...
35 content distribution with dynamic migration of services for minimum cost u...35 content distribution with dynamic migration of services for minimum cost u...
35 content distribution with dynamic migration of services for minimum cost u...INFOGAIN PUBLICATION
 
Analysis of SOFTWARE DEFINED STORAGE (SDS)
Analysis of SOFTWARE DEFINED STORAGE (SDS)Analysis of SOFTWARE DEFINED STORAGE (SDS)
Analysis of SOFTWARE DEFINED STORAGE (SDS)Kaushik Rajan
 
BFC: High-Performance Distributed Big-File Cloud Storage Based On Key-Value S...
BFC: High-Performance Distributed Big-File Cloud Storage Based On Key-Value S...BFC: High-Performance Distributed Big-File Cloud Storage Based On Key-Value S...
BFC: High-Performance Distributed Big-File Cloud Storage Based On Key-Value S...dbpublications
 
Access security on cloud computing implemented in hadoop system
Access security on cloud computing implemented in hadoop systemAccess security on cloud computing implemented in hadoop system
Access security on cloud computing implemented in hadoop systemJoão Gabriel Lima
 

La actualidad más candente (17)

Efficient and reliable hybrid cloud architecture for big database
Efficient and reliable hybrid cloud architecture for big databaseEfficient and reliable hybrid cloud architecture for big database
Efficient and reliable hybrid cloud architecture for big database
 
13
1313
13
 
WJCAT2-13707877
WJCAT2-13707877WJCAT2-13707877
WJCAT2-13707877
 
Cloud computing
Cloud computingCloud computing
Cloud computing
 
A REVIEW ON RESOURCE ALLOCATION MECHANISM IN CLOUD ENVIORNMENT
A REVIEW ON RESOURCE ALLOCATION MECHANISM IN CLOUD ENVIORNMENTA REVIEW ON RESOURCE ALLOCATION MECHANISM IN CLOUD ENVIORNMENT
A REVIEW ON RESOURCE ALLOCATION MECHANISM IN CLOUD ENVIORNMENT
 
Cloud Computing: A Perspective on Next Basic Utility in IT World
Cloud Computing: A Perspective on Next Basic Utility in IT World Cloud Computing: A Perspective on Next Basic Utility in IT World
Cloud Computing: A Perspective on Next Basic Utility in IT World
 
A Virtualization Model for Cloud Computing
A Virtualization Model for Cloud ComputingA Virtualization Model for Cloud Computing
A Virtualization Model for Cloud Computing
 
05958007cloud
05958007cloud05958007cloud
05958007cloud
 
IJSRED-V1I1P1
IJSRED-V1I1P1IJSRED-V1I1P1
IJSRED-V1I1P1
 
Chapter1
Chapter1Chapter1
Chapter1
 
Cloud versus cloud
Cloud versus cloudCloud versus cloud
Cloud versus cloud
 
A Threshold Secure Data Sharing Scheme for Federated Clouds
A Threshold Secure Data Sharing Scheme for Federated CloudsA Threshold Secure Data Sharing Scheme for Federated Clouds
A Threshold Secure Data Sharing Scheme for Federated Clouds
 
E0332427
E0332427E0332427
E0332427
 
35 content distribution with dynamic migration of services for minimum cost u...
35 content distribution with dynamic migration of services for minimum cost u...35 content distribution with dynamic migration of services for minimum cost u...
35 content distribution with dynamic migration of services for minimum cost u...
 
Analysis of SOFTWARE DEFINED STORAGE (SDS)
Analysis of SOFTWARE DEFINED STORAGE (SDS)Analysis of SOFTWARE DEFINED STORAGE (SDS)
Analysis of SOFTWARE DEFINED STORAGE (SDS)
 
BFC: High-Performance Distributed Big-File Cloud Storage Based On Key-Value S...
BFC: High-Performance Distributed Big-File Cloud Storage Based On Key-Value S...BFC: High-Performance Distributed Big-File Cloud Storage Based On Key-Value S...
BFC: High-Performance Distributed Big-File Cloud Storage Based On Key-Value S...
 
Access security on cloud computing implemented in hadoop system
Access security on cloud computing implemented in hadoop systemAccess security on cloud computing implemented in hadoop system
Access security on cloud computing implemented in hadoop system
 

Similar a A Novel Solution of Distributed Memory NoSQL Database for Cloud Computing

Cidr11 paper32
Cidr11 paper32Cidr11 paper32
Cidr11 paper32jujukoko
 
Megastore providing scalable, highly available storage for interactive services
Megastore providing scalable, highly available storage for interactive servicesMegastore providing scalable, highly available storage for interactive services
Megastore providing scalable, highly available storage for interactive servicesJoão Gabriel Lima
 
Cache and consistency in nosql
Cache and consistency in nosqlCache and consistency in nosql
Cache and consistency in nosqlJoão Gabriel Lima
 
Challenges Management and Opportunities of Cloud DBA
Challenges Management and Opportunities of Cloud DBAChallenges Management and Opportunities of Cloud DBA
Challenges Management and Opportunities of Cloud DBAinventy
 
Privacy preserving public auditing for secured cloud storage
Privacy preserving public auditing for secured cloud storagePrivacy preserving public auditing for secured cloud storage
Privacy preserving public auditing for secured cloud storagedbpublications
 
ANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUES
ANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUESANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUES
ANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUESijccsa
 
ANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUES
ANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUESANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUES
ANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUESijccsa
 
ANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUES
ANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUESANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUES
ANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUESijccsa
 
Cloud_on_Linux_Operating_System.pdf
Cloud_on_Linux_Operating_System.pdfCloud_on_Linux_Operating_System.pdf
Cloud_on_Linux_Operating_System.pdfPalanikumar72221
 
Improving the Latency Value by Virtualizing Distributed Data Center and Auto...
Improving the Latency Value by Virtualizing Distributed Data  Center and Auto...Improving the Latency Value by Virtualizing Distributed Data  Center and Auto...
Improving the Latency Value by Virtualizing Distributed Data Center and Auto...IOSR Journals
 
iFCloud Secure File Sharing
iFCloud Secure File SharingiFCloud Secure File Sharing
iFCloud Secure File Sharingifcloudus
 
Introduction to cloud Cambridge University.ppt
Introduction to cloud Cambridge University.pptIntroduction to cloud Cambridge University.ppt
Introduction to cloud Cambridge University.pptestabraqhm
 
iFCloud Secure File Sharing
iFCloud Secure File SharingiFCloud Secure File Sharing
iFCloud Secure File Sharingifcloudus
 
E newsletter promise_&_challenges_of_cloud storage-2
E newsletter promise_&_challenges_of_cloud storage-2E newsletter promise_&_challenges_of_cloud storage-2
E newsletter promise_&_challenges_of_cloud storage-2Anil Vasudeva
 

Similar a A Novel Solution of Distributed Memory NoSQL Database for Cloud Computing (20)

Cidr11 paper32
Cidr11 paper32Cidr11 paper32
Cidr11 paper32
 
Megastore providing scalable, highly available storage for interactive services
Megastore providing scalable, highly available storage for interactive servicesMegastore providing scalable, highly available storage for interactive services
Megastore providing scalable, highly available storage for interactive services
 
K017146064
K017146064K017146064
K017146064
 
Cache and consistency in nosql
Cache and consistency in nosqlCache and consistency in nosql
Cache and consistency in nosql
 
Challenges Management and Opportunities of Cloud DBA
Challenges Management and Opportunities of Cloud DBAChallenges Management and Opportunities of Cloud DBA
Challenges Management and Opportunities of Cloud DBA
 
Unit -3-Cloud.pptx
Unit -3-Cloud.pptxUnit -3-Cloud.pptx
Unit -3-Cloud.pptx
 
Distributed system.pptx
Distributed system.pptxDistributed system.pptx
Distributed system.pptx
 
Privacy preserving public auditing for secured cloud storage
Privacy preserving public auditing for secured cloud storagePrivacy preserving public auditing for secured cloud storage
Privacy preserving public auditing for secured cloud storage
 
ANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUES
ANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUESANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUES
ANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUES
 
ANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUES
ANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUESANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUES
ANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUES
 
ANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUES
ANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUESANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUES
ANALYSIS OF ATTACK TECHNIQUES ON CLOUD BASED DATA DEDUPLICATION TECHNIQUES
 
Cloud_on_Linux_Operating_System.pdf
Cloud_on_Linux_Operating_System.pdfCloud_on_Linux_Operating_System.pdf
Cloud_on_Linux_Operating_System.pdf
 
Improving the Latency Value by Virtualizing Distributed Data Center and Auto...
Improving the Latency Value by Virtualizing Distributed Data  Center and Auto...Improving the Latency Value by Virtualizing Distributed Data  Center and Auto...
Improving the Latency Value by Virtualizing Distributed Data Center and Auto...
 
iFCloud Secure File Sharing
iFCloud Secure File SharingiFCloud Secure File Sharing
iFCloud Secure File Sharing
 
Report 2.0.docx
Report 2.0.docxReport 2.0.docx
Report 2.0.docx
 
Introduction to cloud Cambridge University.ppt
Introduction to cloud Cambridge University.pptIntroduction to cloud Cambridge University.ppt
Introduction to cloud Cambridge University.ppt
 
Introduction.ppt
Introduction.pptIntroduction.ppt
Introduction.ppt
 
iFCloud Secure File Sharing
iFCloud Secure File SharingiFCloud Secure File Sharing
iFCloud Secure File Sharing
 
E newsletter promise_&_challenges_of_cloud storage-2
E newsletter promise_&_challenges_of_cloud storage-2E newsletter promise_&_challenges_of_cloud storage-2
E newsletter promise_&_challenges_of_cloud storage-2
 
P18 2 8-5
P18 2 8-5P18 2 8-5
P18 2 8-5
 

Más de João Gabriel Lima

Deep marketing - Indoor Customer Segmentation
Deep marketing - Indoor Customer SegmentationDeep marketing - Indoor Customer Segmentation
Deep marketing - Indoor Customer SegmentationJoão Gabriel Lima
 
Aplicações de Alto Desempenho com JHipster Full Stack
Aplicações de Alto Desempenho com JHipster Full StackAplicações de Alto Desempenho com JHipster Full Stack
Aplicações de Alto Desempenho com JHipster Full StackJoão Gabriel Lima
 
Realidade aumentada com react native e ARKit
Realidade aumentada com react native e ARKitRealidade aumentada com react native e ARKit
Realidade aumentada com react native e ARKitJoão Gabriel Lima
 
Big data e Inteligência Artificial
Big data e Inteligência ArtificialBig data e Inteligência Artificial
Big data e Inteligência ArtificialJoão Gabriel Lima
 
Mineração de Dados no Weka - Regressão Linear
Mineração de Dados no Weka -  Regressão LinearMineração de Dados no Weka -  Regressão Linear
Mineração de Dados no Weka - Regressão LinearJoão Gabriel Lima
 
Segurança na Internet - Estudos de caso
Segurança na Internet - Estudos de casoSegurança na Internet - Estudos de caso
Segurança na Internet - Estudos de casoJoão Gabriel Lima
 
Segurança na Internet - Google Hacking
Segurança na Internet - Google  HackingSegurança na Internet - Google  Hacking
Segurança na Internet - Google HackingJoão Gabriel Lima
 
Segurança na Internet - Conceitos fundamentais
Segurança na Internet - Conceitos fundamentaisSegurança na Internet - Conceitos fundamentais
Segurança na Internet - Conceitos fundamentaisJoão Gabriel Lima
 
Mineração de Dados com RapidMiner - Um Estudo de caso sobre o Churn Rate em...
Mineração de Dados com RapidMiner - Um Estudo de caso sobre o Churn Rate em...Mineração de Dados com RapidMiner - Um Estudo de caso sobre o Churn Rate em...
Mineração de Dados com RapidMiner - Um Estudo de caso sobre o Churn Rate em...João Gabriel Lima
 
Mineração de dados com RapidMiner + WEKA - Clusterização
Mineração de dados com RapidMiner + WEKA - ClusterizaçãoMineração de dados com RapidMiner + WEKA - Clusterização
Mineração de dados com RapidMiner + WEKA - ClusterizaçãoJoão Gabriel Lima
 
Mineração de dados na prática com RapidMiner e Weka
Mineração de dados na prática com RapidMiner e WekaMineração de dados na prática com RapidMiner e Weka
Mineração de dados na prática com RapidMiner e WekaJoão Gabriel Lima
 
Visualizacao de dados - Come to the dark side
Visualizacao de dados - Come to the dark sideVisualizacao de dados - Come to the dark side
Visualizacao de dados - Come to the dark sideJoão Gabriel Lima
 
REST x SOAP : Qual abordagem escolher?
REST x SOAP : Qual abordagem escolher?REST x SOAP : Qual abordagem escolher?
REST x SOAP : Qual abordagem escolher?João Gabriel Lima
 
Game of data - Predição e Análise da série Game Of Thrones a partir do uso de...
Game of data - Predição e Análise da série Game Of Thrones a partir do uso de...Game of data - Predição e Análise da série Game Of Thrones a partir do uso de...
Game of data - Predição e Análise da série Game Of Thrones a partir do uso de...João Gabriel Lima
 
E-trânsito cidadão - IPVA em suas mãos
E-trânsito cidadão - IPVA em suas mãosE-trânsito cidadão - IPVA em suas mãos
E-trânsito cidadão - IPVA em suas mãosJoão Gabriel Lima
 
[Estácio - IESAM] Automatizando Tarefas com Gulp.js
[Estácio - IESAM] Automatizando Tarefas com Gulp.js[Estácio - IESAM] Automatizando Tarefas com Gulp.js
[Estácio - IESAM] Automatizando Tarefas com Gulp.jsJoão Gabriel Lima
 
Hackeando a Internet das Coisas com Javascript
Hackeando a Internet das Coisas com JavascriptHackeando a Internet das Coisas com Javascript
Hackeando a Internet das Coisas com JavascriptJoão Gabriel Lima
 

Más de João Gabriel Lima (20)

Cooking with data
Cooking with dataCooking with data
Cooking with data
 
Deep marketing - Indoor Customer Segmentation
Deep marketing - Indoor Customer SegmentationDeep marketing - Indoor Customer Segmentation
Deep marketing - Indoor Customer Segmentation
 
Aplicações de Alto Desempenho com JHipster Full Stack
Aplicações de Alto Desempenho com JHipster Full StackAplicações de Alto Desempenho com JHipster Full Stack
Aplicações de Alto Desempenho com JHipster Full Stack
 
Realidade aumentada com react native e ARKit
Realidade aumentada com react native e ARKitRealidade aumentada com react native e ARKit
Realidade aumentada com react native e ARKit
 
JS - IA
JS - IAJS - IA
JS - IA
 
Big data e Inteligência Artificial
Big data e Inteligência ArtificialBig data e Inteligência Artificial
Big data e Inteligência Artificial
 
Mineração de Dados no Weka - Regressão Linear
Mineração de Dados no Weka -  Regressão LinearMineração de Dados no Weka -  Regressão Linear
Mineração de Dados no Weka - Regressão Linear
 
Segurança na Internet - Estudos de caso
Segurança na Internet - Estudos de casoSegurança na Internet - Estudos de caso
Segurança na Internet - Estudos de caso
 
Segurança na Internet - Google Hacking
Segurança na Internet - Google  HackingSegurança na Internet - Google  Hacking
Segurança na Internet - Google Hacking
 
Segurança na Internet - Conceitos fundamentais
Segurança na Internet - Conceitos fundamentaisSegurança na Internet - Conceitos fundamentais
Segurança na Internet - Conceitos fundamentais
 
Web Machine Learning
Web Machine LearningWeb Machine Learning
Web Machine Learning
 
Mineração de Dados com RapidMiner - Um Estudo de caso sobre o Churn Rate em...
Mineração de Dados com RapidMiner - Um Estudo de caso sobre o Churn Rate em...Mineração de Dados com RapidMiner - Um Estudo de caso sobre o Churn Rate em...
Mineração de Dados com RapidMiner - Um Estudo de caso sobre o Churn Rate em...
 
Mineração de dados com RapidMiner + WEKA - Clusterização
Mineração de dados com RapidMiner + WEKA - ClusterizaçãoMineração de dados com RapidMiner + WEKA - Clusterização
Mineração de dados com RapidMiner + WEKA - Clusterização
 
Mineração de dados na prática com RapidMiner e Weka
Mineração de dados na prática com RapidMiner e WekaMineração de dados na prática com RapidMiner e Weka
Mineração de dados na prática com RapidMiner e Weka
 
Visualizacao de dados - Come to the dark side
Visualizacao de dados - Come to the dark sideVisualizacao de dados - Come to the dark side
Visualizacao de dados - Come to the dark side
 
REST x SOAP : Qual abordagem escolher?
REST x SOAP : Qual abordagem escolher?REST x SOAP : Qual abordagem escolher?
REST x SOAP : Qual abordagem escolher?
 
Game of data - Predição e Análise da série Game Of Thrones a partir do uso de...
Game of data - Predição e Análise da série Game Of Thrones a partir do uso de...Game of data - Predição e Análise da série Game Of Thrones a partir do uso de...
Game of data - Predição e Análise da série Game Of Thrones a partir do uso de...
 
E-trânsito cidadão - IPVA em suas mãos
E-trânsito cidadão - IPVA em suas mãosE-trânsito cidadão - IPVA em suas mãos
E-trânsito cidadão - IPVA em suas mãos
 
[Estácio - IESAM] Automatizando Tarefas com Gulp.js
[Estácio - IESAM] Automatizando Tarefas com Gulp.js[Estácio - IESAM] Automatizando Tarefas com Gulp.js
[Estácio - IESAM] Automatizando Tarefas com Gulp.js
 
Hackeando a Internet das Coisas com Javascript
Hackeando a Internet das Coisas com JavascriptHackeando a Internet das Coisas com Javascript
Hackeando a Internet das Coisas com Javascript
 

Último

Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 

Último (20)

Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 

A Novel Solution of Distributed Memory NoSQL Database for Cloud Computing

  • 1. 2011 10th IEEE/ACIS International Conference on Computer and Information Science A Novel Solution of Distributed Memory NoSQL Database for Cloud Computing Jing Han, Meina Song, Junde Song School of Computer, Beijing University of Posts and Telecommunication Beijing 100876, China babyblue110128@hotmail.com; mnsong@bupt.edu.cn; jdsong@bupt.edu.cn Abstract The traditional relational database to online storage are (MapReduce[3][4]), Which constitutes Google's "cloud becoming increasingly problematic: the low performance do not computing" environment, other companies have also gracefully meet the needs of mass data, the storage approaches of proposed and implemented similar cloud system, including massive data are still not perfect presents. NoSQL and distributed Amazon's EC2[5], S3[6], and IBM's "Blue Cloud" and so on. memory database technologies have the potential to simplify or If faced mass data, the existing database system would eliminate many of these challenges. NoSQL database technologies be lacked performance and cannot afford the high prices of can provide Key-value style of data storage and largely ensure database software, in future mass data must be migration to high performance. Distributed memory database technologies the cloud, but the ACID properties which database system provide a means for easily store mass data in cloud in a dynamic and scalable manner. required will lead to serious performance reducing of part of This paper argues for a new architecture called CDSA, which operation when stored the data distributed. is a distributed memory NoSQL database architecture for Cloud To ensure high availability, high reliability and computing to improve the performance of querying data and economy, the cloud data storage must be stored redundant to ensure mass data storage in cloud by using rational strategy. ensure reliability. In addition, the cloud computing system Furthermore, add or deleted any node from the distributed needs to meet the needs of large number of users database cluster, the others node can work without stop service. simultaneously, provide service to users paralleled. We believe that CDSA can provide durable storage with high Therefore, data storage technology of cloud computing throughput and lower access latency requires high throughput and high transfer rate. This paper presents high performance data storage Keywords: cloud computing; NoSQL; cache; memory architecture for Cloud Computing to improve the database; distributed performance of storing distributed data by using more rational strategy. I. INTRODUCTION This paper is structured in the following way: Section 2 In order to store and manage data in an efficient way, describes related work. Section 3 introduces the internal database management system (DBMS) was conceptualized architecture of Cloud Data Storage Architecture (CDSA). by industry in the 1960s and developed the early DBMS Section 4 concludes the paper. from file system. With the expansion of applications for handling more complex data model, more complex query operation and more extensive database, gradually, distributed II. RELATED WORK database was developed, parallel database, database cluster, data warehouse and so on. All of them were met the application requirements in some ways, but the support of A. Cloud Computing massive data are still not perfect. At the same time, Internet Cloud Computing[7] was mainly proposed by Google, becomes a huge repository of information and vast amounts Amazon and IBM and so on. There are several definitions of data generated every day. If want to retrieve the data, you currently, and each described cloud computing from a need massive storage space and large scale computing, different point of view. which existing database system cannot afford, and large In which a comprehensive definition[8] is: “cloud server has issues such as limited performance and high price. computing is a platform (system) or a type of application. A Therefore, Google established a large scale cluster using a lot cloud computing platform provision, configuration, of cheap PC, designed and implemented a distributed file reconfigure and provision on-demand and so on. In a cloud system named GFS[1], a storage system named BigTable[2] computing platform, the server can be physical server or and a parallel programming environment virtual server. Advanced computing clouds usually contain a number of other computing resources, such as Storage Area ------------------------------------------------------------------------- Networks, network equipment, firewall and other safety This work is supported by the National Key project of equipment. Cloud computing describes a scalable Scientific and Technical Supporting Programs of China applications which can access through the Internet. ‘Cloud (Grant Nos.2008BAH24B04, 2008BAH21B03); the Program applications’ use large-scale data centers and a strong server, of the Co-Construction with Beijing Municipal Commission web application and network services can run in it. Any user of Education of China. can access a cloud computing applications through appropriate equipment and a standard Internet browser. ” 978-0-7695-4401-4/11 $26.00 © 2011 IEEE 351 DOI 10.1109/ICIS.2011.61
  • 2. Despite there exist a variety of understanding of cloud Challenges traditional relational database facing in computing, it’s mainly the following three basic cloud computing[11]: characteristics: infrastructure on a large scale low-cost server 1) High Performance - demand on the high concurrent clusters; applications developed in collaboration with the literacy of database underlying services to maximize the use of resources; to In cloud computing, users’ complicated needs require ensure high availability by the redundancy among multiple real-time formation of dynamic pages and provide dynamic low-cost servers. information, it cannot use technology to staticized dynamic The feature of data storage in cloud computing is that, pages, so database concurrent is too high and tend to data is automatically distributed in different storage nodes in thousands of times reading request per second. A relational the cluster, each data storage node only preserved a fragment database is hard to cope with thousands of times SQL data of whole data, at the same time, user can set to store data in written request, also the hard disk cannot bear. different storage nodes to ensure that a single point failure 2) Huge Storage – demand on the efficient storage and will not cause data loss. visit of mass data The mass data generated dynamically in cloud B. Cloud Database computing, for relational database, recorded hundreds of millions of records in the storage in a table and the efficient As the development of large-scale distributed service to run SQL query is extremely low. and distributed storage which clouds computing needs, traditional relational database are facing many new 3) High Scalability & High Availability challenges, especially in scenarios of the large scale and high In the web-based framework, database is too hard to concurrent SNS applications, by the use of a relational extend horizontally. When the volume and traffic of an database to store and inquires users’ dynamic data has been application system is growing, for Web Server and App ragged and exposes many problems to overcome, such as Server, adding more hardware and service node to expand there need high real-time insert performance; Need mass data the performance and load capacity, but for database there storage capacity, still need high retrieval speed; Need to store have no simple way like that. For a lot to application system data seamless spreading throughout the distributed cluster deployed in cloud should be provided 24-hour uninterrupted environment, and have ability of online expansion, etc. service, upgrading and extension of database system are very According this, NoSQL[9] is made. difficult, often require downtime and data migration, but cannot through adding database services node continuously to implement expansion. Meanwhile, many of the main characters of relational database in the cloud often have no use. 1) Database transaction consistency Many system deployed in the clouds does not require strict database transaction, and have low requirements of read consistency; some occasions write consistency requirement is not high also. Therefore, database transaction management became high load of an under-heavy-burden database. 2) Complex SQL query, especially demand on multi-table association query Any system with mass data has difficulty on multiple associated queries among big tables, and complex data analysis and complex data report. In cloud computing, simple conditions paging query of single table with primary key is more often, association query of relational database query have been are greatly weakened. Figure 1. Simple Data R&W Process As cloud computing is an Internet-based application (environment), the database used in cloud computing are key / value model which is for Internet. NoSQL database broke the paradigm constraint of traditional relational database. From a storage perspective, III. CDS ARCHITECTRUE NoSQL is not a relational database, but a hash database with This paper presents a solution which oriented to high format of Key-value[10]. Due to the abandoned powerful performance cloud computing for distributed-memory SQL query language, transaction-consistent and constraints database NoSQL. The solution was used to solve the in paradigm of relational database, NoSQL database largely problems like data storage and access issues in the massive solved challenges which the traditional relational database is highly concurrent systems of cloud computing environment facing. 352
  • 3. According to the functions of various components, the meet the need of unknown data pattern of cloud database, the system logic can be abstracted as three layers: Data Cache core ideas of OP method is to update distributed cache before layer, Memory Database layer and Disk Database Layer. the first query of user, the specific method is: record all the queries of users, predict queries may be ran, queries at the appropriate time to active cache to work, stored the relevant information in each distributed cache. This method will solve the problem of the first hit to cache and reduce the user delay when the first query, so that cache can play a better role. OP cache is a method to capture change of data; it is based on a prediction algorithm, in cache layer OP is used to improve query response speeds. Record the user's query history, extract the appropriate query script, abstracted the query with different parameter values as a template; statistic from the historical probably query parameters of each query and sort, OP objects is the high probability template and its parameters. However, typical applications of this method are: the use of access the database structure to create dynamic Web page, because the mode of database is predefined, it is appropriate for simple statistical analysis and access prediction. B. Memory Database Layer MDL Figure 2. CDS Archetectrue. Before the persistence of data, the data in the cloud were stored in memory firstly, for the efficiency of memory storage is much higher than the performance of disk storage. A. Data Cache Layer DCL Combined with the requirements of data storage which previous analyzed for high performance data and mass Currently, the way of centralized deployment was used storage in the cloud computing environment, NoSQL came to handle the problem of cache. The distributed deployment to the only solution. of database will lead to the change of cache when the data in Therefore, these two considerations can be deduced any node was updated. from an idea: memory NoSQL database, use "memory" as a Particularly the larger of distributed database will takes "disk", read and write operations no longer interact directly the problems like the bigger load of current cache in the with the disk database but with memory databases, which middle tier, too expensive to maintain the system avoid the time delay and the same issues in simple NoSQL consistently, and weak practically of system. In addition, the read-write- separation structure. frequent consistency Maintenance will reduce the utilization of cache. Therefore, we propose a new cache system which was applied in distributed environment. And the primary feature of this cache system is maximizes the role of the cache. The definition of distributed cache is all the service nodes just store and maintain the cache data in current database and its corresponding query information. During the data updating of all databases will not affect the cache information of other databases. From the task of Agent in the middle layer, the cache data of each service node can be used to summary the query results. The structure of cache stored should same as table structure in disk, and the data cannot have redundancy. Although the rational use of this method can improve the efficiency of database queries, but some common requests are also not timely respond. For example, everyday users statistic the sales the day before, at the first query in cache, Figure 3. Read and write operations in Memory Database they must not hit in the cache, so the first response time of query is often very long, which is a shortcoming which cache technology cannot overcome. We use "Optimal Prefetch(OP)" method to resolve this In the transactional database system, the memory problem. Work presented in [12] discusses the Prefetch database required for a consistency with disk database in the method, but the data pattern should be input in advance. To startup. Therefore, the memory database need to have the same table definition with library database; and in the first 353
  • 4. time startup, it required to load all data in table of disk By separated the operation of reading and writing, database into memory database. vertical and horizontally segmentation, mass data can be stored Cloud memory NoSQL database cluster. As shown below, the database cluster distributed proxy is in charge of C. Disk Database Layer DDL controls such as data routing. Key-Value database is mainly characteristic of high concurrently read and write performance. Key-Value memory database load the entire database in memory, which flush data from memory database to disk through the asynchronous operation regularly. In addition, by setting expire time of Key-Value can also capture the changed data in memory database. Because operations is ran purely in memory, Key-Value memory database’s performance is very good, more than million times read and write operations can be handled per second. Changes of data in memory database need to be copied to the disk database. Then copy the data from memory to disk can be seen as the process of an original asynchronous writing operation, obviously, the asynchronous write operation make faster. Figure 5. Memory NoSQL Database cluster When copy data from memory database to disk database, it will use hybrid partition method. The partition which was committed by a NoSQL node in Memory Database Layer will be committed by a memory node and a common NoSQL node. The database schema will be formed by the two databases. Figure 4. Data synchronization from Memory Database to Disk Database D. Distributed storage in CDSA Distributed database in cloud computing is a database service system which constitute from database distributed in different nodes, and according this distributed architecture to provide online flexible scalability. For example, more data Figure 6. H-Shard cluster nodes can be added or deleted without stop service and so on. Cloud database is not just a database, but a distributed database network service which was constituted with a large number of database nodes. A write operation of cloud IV. PERFORMANCE OF CDSA database will be copied to other nodes, read request will be MDL of CDSA is created by aggregating the main routed to a node in cloud. For a cloud database cluster, the memories of servers. The CDSA can provide durable and scalability is become easy; just add a node in the cluster available storage with 100x the throughput of only use disk inside cloud. Cloud distributed database is necessarily a database and lower access latency. The combination of low multiple-node system; its performance of concurrent depends latency and large scale will enable a new breed of data- on the number of nodes in the system and routing efficiency. intensive applications. 354
  • 5. This section focused on the test of CDSA access 3) Mass data storage patterns. Through the basic operations of insert, read, update, By created a distributed cloud database cluster in the and delete, to test both the throughput and average response cloud to achieve mass data storage; time, at the same time to verify the correctness and 4) High Availability effectiveness of CDSA. Using the test environment Based on the distributed database proxy, by using MDB specifically as follows: master node and slave node to implement high availability To test throughput and response time of CDSA, firstly further; the secondary disk database can achieve fast data nmoll is used as tools to capture the performance of the test recovery program and write to a file, and then use the nmoll as an The statistical data shows that this architecture analyser to analyze the data file, finally we select the test performs well and brings a lot of benefits. However, there is results of CPU and I/O of disk to create a graphical report. limit in the management of user-defined data resources, the Test program insert, update, read or delete 500 million flexibility of cloud service controls, and search in different records respectively, the time consuming was recorded. First data resources. We plan to address these problems in the the test program opened the database, the records were read future releases. from a disk database to memory corresponds, and then run the Insert. Update, Read or Delete operations, finally synchronized data from memory to disk file. REFERENCES TABLE I. TEST RESULT-DATA OPERATION [1] S. Ghemawat, H. Gobioff, and S. Leung The Google File System In: Operations open database run close database Proc. of the Nineteenth ACM Sym-posium on Operating Systems (s) (s) (s) Principles, October, 2003 insert 2 17 4 [2] Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, update 0.7 8 2 Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, read 1 5 3 and Robert E. Gruber Bigtable: A Distributed Storage System for delete 1 5 2 Structured Data. In Proceedings of OSDI'06: Seventh Symposium on Operating System Design and Implementation, November, 2006 [3] Amazon. Amazon elastic compute cloud (Amazon EC2). 2009. http://aws.amazon.com/ec2/ TABLE II. TEST RESULT-AVG RESPONSETIME&THROUGHPUT [4] Amazon Simple Storage Service (S3). 2009. Operations average response time throughput http://www.amazon.com/s3/. (μ s) [5] Dean J, Ghemawat S. MapReduce: Simplified Data Processing on Large Clusters. In OSDI '04: Sixth Symposium on Operating System insert 3 316000 Design and Implementation, 2004 update 1.5 562000 read 1.2 753000 [6] Yang HC, Dasdan A, Hsiao RL, Parker DS. Map-Reduce-Merge: Simplified relational data processing on large clusters In: Proc. of the delete 1 671000 2007 ACM SIGMOD Int’ Conf on Management ofData. New York: ACM Press,2007. 1029-1040. Table I and table show the test result: the average [7] P. Mell and T. Grance, "Draft nist working definition of cloud computing - v15," 21. Aug 2009, 2009 response time is in the level of microseconds. Because of logic is relatively simple, the test results may be relative [8] Chen Kang, Zheng Weimin Cloud Computing: System Instances and Current Research[J] Journal of Software, Vol.20, No.5, May 2009, better than the practical, but it is certain that the CDSA is a pp.1337−1348. good choice for cloud. [9] "NoSQL Relational Database Management System: Home Page". Strozzi.it. 2007-10-02. Retrieved 2010-03-29 [10] DeCandia G, Hastorun D, Jampani M, Kakulapati G, Lakshman A V. CONCLUSION Pilchin A, Sivasubramanian S, Vosshnll P, Vogels W. Dynamo: According to the characteristics of distributed storage in Amazon’ s highly available key value store In: Proc. of the 21st ACM Symp on Operating Systems Principles New York: ACM cloud, by introducing memory database and NoSQL, we Press, 2007. 205-220. propose an architecture of distributed memory NO-SQL [11] Eelco Plugge, Tim Hawkins, Peter Membrey The Definitive Guide to database for cloud, to address massive data storage and high MongoDB: The NoSQL Database for Cloud and Desktop Computing concurrent access issues cloud in computing environments. Publisher Apress Berkely, CA, USA, 2010 The core idea is: [12] Wang D, Xie J. An approach toward web caching and pre-fetching 1) High performance: for database management system. Paris, France: SIGMOD, 2001: By using a cache layer and a memory NoSQL database, 204-216. to provide high-performance database access service, which is the main goal of this architecture; 2) Persistence By using of memory database and disk database to implement asynchronously copy from memory database to disk database; 355