SlideShare a Scribd company logo
1 of 22
www.day.com



CRX 1.4

TarPersistenceManager and Clustering
with TarPersistenceManager




Speaker: Honwai Wong, SSE
Duration: 45 min

Feedback: techsummit@day.com

Day Technical Summit 2008              1
www.day.com



Agenda
 TarPM (TarPersistenceManager)
     Functionality
     Configuration
     Optimization
     Hot Backup
     Migration

 TarPM (TarPersistenceManager) Clustering
     Architecture
     Global Data Store
     Setup
     Configuration

Day Technical Summit 2008
www.day.com



TarPM
Functionality
 Disk-based PersistenceManager
 Uses standard Tar file format (POSIX standard)
 Append-only write operations, thus extremely efficient
 Particularly suitable for high data creation and modification
  use-cases
 Takes advantage of key-value pair data structure of CRX
 Maintains index files for fast access
 Hot backup capability



Day Technical Summit 2008                                         3
www.day.com



TarPM
Configuration
 TarPM configuration is done on a per workspace-level
    e.g. <crx_home>/workspaces/crx.default/workspace.xml


 no mandatory parameters, preset with default values
    <PersistenceManager class=
    quot;com.day.crx.persistence.tar.TarPersistenceManagerquot; />




Day Technical Summit 2008                                    4
www.day.com



TarPM
Configuration - Parameters
       Parameter            Description                                                        default

                            The directory where local files are stored. This can be an        base directory
       localPath            absolute or relative path.                                        of workspace


                            If the current data file grows larger than this number (in MB),
       maxFileSize                                                                                 64
                            a new data file is created.


                            After an abnormal termination, at most this much data (in
       maxIndexBuffer                                                                              32
                            MB) needs to be scanned to re-create the tar entry index.


       optimizeSleep        Number in milliseconds to wait after each optimization step.            1




Day Technical Summit 2008                                                                                      5
www.day.com



TarPM
Optimization
 Append-only operation leads to increased disk usage
 Data in the tar files is never overwritten
 Delete will append 0 length entries
 Optimization task copies active data from old tar files into new
  ones and subsequently deletes old tar files
 Different modes of operation supported
 Recommended to run during times of low system usage




Day Technical Summit 2008                                            6
www.day.com



TarPM
Optimization - Modes
 Manually trigger optimization from CRX Explorer
 Place a file called optimize.tar in data-directory
     TarPM detects this file and starts optimization
     optimize.tar is renamed to optimizeNow.tar
     after optimization finished, optimizeNow.tar is deleted automatically
     stop task by deleting this file
 Automate using cron-job
 Offline optimization using command-line tool
     java -cp <jars> com.day.crx.persistence.tar.TarUtils -optimize <directory>



Day Technical Summit 2008                                                          7
www.day.com



TarPM
Hot Backup
 Reminder: tar files are append-only
 Backup at any time including runtime
 Place file stopdelete.tar to prevent the TarPM from deleting old
  files while backing up
 Consistent backup by copying data_*.tar files, sorted by
  modification date, newest last
     e.g.          ls -tr data_*.tar | xargs -n1 -J % cp -v % /backup

 When restoring, incomplete transactions are rolled back



Day Technical Summit 2008                                                8
www.day.com



TarPM
Migration
 Migration of workspaces using CRX Console
 Low-level copy of existing workspace to new TarPM workspace
 Tool is provided with CRX
 Comprehensive documentation on docs.day.com
     see section CQ 4.2 / Setup / Migration
 Part of migration presentation from Tech Summit 2007
     http://daycare.day.com/home/day_public/tech_summit_2007.html
     Author: Dominique Jaeggi, SSE, Day



Day Technical Summit 2008                                            9
www.day.com



TarPM Clustering
Architecture
 Master/Slave relation between CRX cluster nodes participating
  in a cluster
 Consists of 2 or more CRX cluster nodes with TarPM
 Synchronization via file-based Cluster Journal
 Direct communication between cluster nodes via TCP/IP using
  HTTP
 Only Master CRX node writes data
 Master is elected automatically
 Automatic fail-over


Day Technical Summit 2008                                     10
www.day.com



TarPM Clustering
Architecture - Overview
                                       Cluster Node A                                Cluster Node B
                                          (Master)
                                  s
                              old
                            H      s                CRX                                  CRX
                               ock                          Journal posts
                             L                                to master


                                          TAR PM                                        TAR PM




                                                               master write

                                                                              read
                                          master write




                                                                                               read
                                                          Master Data TAR



                                                              Cluster
                                                            Journal (FS)




                                                                    FS
Day Technical Summit 2008                                                                             11
www.day.com



TarPM Clustering
Global Data Store
 Central storage for binary data, even beyond repository
  boundaries
 Only one copy per unique object is kept
 Storing and reading does not block other users, done outside
  Persistence Manager
 Objects in the Data Store are immutable
 Only unique data identifier of existing objects in the Data Store
  are stored in the Persistence Manager
 Transactional semantics guaranteed
 Hot Backup by simply copying all files :)

Day Technical Summit 2008                                         12
www.day.com



TarPM Clustering
Global Data Store - Configuration
 Configured in repository.xml of CRX
     e.g. <crx_home>/server/runtime/0/_crx/WEB-INF/repository.xml
 File-based or database-backed
     org.apache.jackrabbit.core.data.FileDataStore
     org.apache.jackrabbit.core.data.db.DbDataStore




Day Technical Summit 2008                                            13
www.day.com



TarPM Clustering
FileDataStore - Config Parameters

       Parameter            Description                                            default

                                                                                   repository.home/
       path                 The directory where to store binary objects.           repository/datastore


                            Binary objects bigger than this value (in bytes) are
       minRecordLength                                                             100
                            stored in the Data Store.




Day Technical Summit 2008                                                                                 14
www.day.com



TarPM Clustering
DbDataStore - Config Parameters
       Parameter            Description                                            default
       url                  The database URL used to access the database.             -

       user                 Name of the database-user.                                -

       password             Password of the user.                                     -

                            Binary objects bigger than this value (in bytes) are
       minRecordLength                                                               100
                            stored in the Data Store.

       maxConnections       The maximum number of open connections.                   3




Day Technical Summit 2008                                                                    15
www.day.com



TarPM Clustering
Architecture
                                       Cluster Node A                                Cluster Node B
                                          (Master)
                                  s
                              old
                            H      s
                               ock
                                                    CRX                                  CRX
                             L                              Journal posts
                                                              to master


                                          TAR PM                                        TAR PM




                                                               master write

                                                                              read
                                          master write




                                                                                               read
                                                          Master Data TAR



                                                              Cluster
                                                            Journal (FS)



                                                          Global Data Store




                                                                   FS
Day Technical Summit 2008                                                                             16
www.day.com



TarPM Clustering
Setup
 Install CRX
 Configure clustering in repository.xml
 Configure TarPM to run in cluster mode
 Setup additional CRX cluster node by copying complete
  instance
 Delete repository-local revision, if present
 On startup, CRX cluster node will sync up with master data
  based on journal



Day Technical Summit 2008                                      17
www.day.com



TarPM Clustering
Repository Configuration
 Enable clustering on a repository-wide level
     e.g. <crx_home>/runtime/0/_crx/WEB-INF/repository.xml
 Unique cluster id
 Cluster Journal
    <Cluster id=quot;cluster-node-1quot; syncDelay=quot;1quot;>
        <Journal class=quot;org.apache.jackrabbit.core.journal.FileJournalquot;>
            <param name=quot;revisionquot; value=quot;${rep.home}/revision.logquot; />
            <param name=quot;directoryquot; value=quot;/data/shared/journalquot; />
        </Journal>
    </Cluster>




Day Technical Summit 2008                                                  18
www.day.com



TarPM Clustering
Repository Configuration - Parameters
       luster
     C
              Parameter     Description
              id            This is required to be a unique literal id of the cluster node.

                            Delay in milliseconds before changes to the journal are
              syncDelay     automatically detected. Default: 5000


        urnal
     Jo
              Parameter     Description
                            FQN of org.apache.jackrabbit.core.journal.Journal interface
              class
                            implementation.

              revision      Location and filename of repository-local revision counter.


              directory     Shared directory of journal entries and global revision counter.



Day Technical Summit 2008                                                                      19
www.day.com



TarPM Clustering
Workspace Configuration
 enable clustering on the TarPM
     e.g. <crx_home>/crx/workspaces/crx.default/workspace.xml
 set cluster flag
 configure local and shared paths
    <PersistenceManager class=quot;com.day.crx.persistence.tar.TarPersistenceManagerquot;>
        <param name=quot;clusterquot; value=quot;truequot; />
        <param name=quot;localPathquot; value=quot;${wsp.home}quot; />
        <param name=quot;sharedPathquot; value=quot;/data/sharedquot; />
    </PersistenceManager>




Day Technical Summit 2008                                                            20
www.day.com



TarPM Clustering
TarPM Configuration - Parameters
              Parameter     Description                                        default

              cluster       Enables clustering.                                FALSE


              localPath     Path where to store local tar-files and index-files. workspace.home


              sharedPath    Path where to store shared data, i.e. tar-files.   workspace.home




Day Technical Summit 2008                                                                         21
www.day.com



TarPM


                            Questions?




Day Technical Summit 2008                22

More Related Content

What's hot

Compression Options in Hadoop - A Tale of Tradeoffs
Compression Options in Hadoop - A Tale of TradeoffsCompression Options in Hadoop - A Tale of Tradeoffs
Compression Options in Hadoop - A Tale of TradeoffsDataWorks Summit
 
Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...
Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...
Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...inside-BigData.com
 
Ceph on Intel: Intel Storage Components, Benchmarks, and Contributions
Ceph on Intel: Intel Storage Components, Benchmarks, and ContributionsCeph on Intel: Intel Storage Components, Benchmarks, and Contributions
Ceph on Intel: Intel Storage Components, Benchmarks, and ContributionsColleen Corrice
 
Hug Hbase Presentation.
Hug Hbase Presentation.Hug Hbase Presentation.
Hug Hbase Presentation.Jack Levin
 
Hadoop HDFS Architeture and Design
Hadoop HDFS Architeture and DesignHadoop HDFS Architeture and Design
Hadoop HDFS Architeture and Designsudhakara st
 
Red Hat Storage Server Administration Deep Dive
Red Hat Storage Server Administration Deep DiveRed Hat Storage Server Administration Deep Dive
Red Hat Storage Server Administration Deep DiveRed_Hat_Storage
 
Cephfs jewel mds performance benchmark
Cephfs jewel mds performance benchmarkCephfs jewel mds performance benchmark
Cephfs jewel mds performance benchmarkXiaoxi Chen
 
In-memory Caching in HDFS: Lower Latency, Same Great Taste
In-memory Caching in HDFS: Lower Latency, Same Great TasteIn-memory Caching in HDFS: Lower Latency, Same Great Taste
In-memory Caching in HDFS: Lower Latency, Same Great TasteDataWorks Summit
 
HDFS Futures: NameNode Federation for Improved Efficiency and Scalability
HDFS Futures: NameNode Federation for Improved Efficiency and ScalabilityHDFS Futures: NameNode Federation for Improved Efficiency and Scalability
HDFS Futures: NameNode Federation for Improved Efficiency and ScalabilityHortonworks
 
Ceph - High Performance Without High Costs
Ceph - High Performance Without High CostsCeph - High Performance Without High Costs
Ceph - High Performance Without High CostsJonathan Long
 
Apache HBase Performance Tuning
Apache HBase Performance TuningApache HBase Performance Tuning
Apache HBase Performance TuningLars Hofhansl
 
Hadoop Cluster With High Availability
Hadoop Cluster With High AvailabilityHadoop Cluster With High Availability
Hadoop Cluster With High AvailabilityEdureka!
 
QCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureQCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitecturePatrick McGarry
 
Hadoop Architecture | HDFS Architecture | Hadoop Architecture Tutorial | HDFS...
Hadoop Architecture | HDFS Architecture | Hadoop Architecture Tutorial | HDFS...Hadoop Architecture | HDFS Architecture | Hadoop Architecture Tutorial | HDFS...
Hadoop Architecture | HDFS Architecture | Hadoop Architecture Tutorial | HDFS...Simplilearn
 
Storage Systems for big data - HDFS, HBase, and intro to KV Store - Redis
Storage Systems for big data - HDFS, HBase, and intro to KV Store - RedisStorage Systems for big data - HDFS, HBase, and intro to KV Store - Redis
Storage Systems for big data - HDFS, HBase, and intro to KV Store - RedisSameer Tiwari
 
Real-Time Data Loading from MySQL to Hadoop with New Tungsten Replicator 3.0
Real-Time Data Loading from MySQL to Hadoop with New Tungsten Replicator 3.0Real-Time Data Loading from MySQL to Hadoop with New Tungsten Replicator 3.0
Real-Time Data Loading from MySQL to Hadoop with New Tungsten Replicator 3.0Continuent
 
Storage Geeks 101 - 2019
Storage Geeks 101 - 2019Storage Geeks 101 - 2019
Storage Geeks 101 - 2019Andrew McGee
 

What's hot (20)

Compression Options in Hadoop - A Tale of Tradeoffs
Compression Options in Hadoop - A Tale of TradeoffsCompression Options in Hadoop - A Tale of Tradeoffs
Compression Options in Hadoop - A Tale of Tradeoffs
 
Hadoop architecture by ajay
Hadoop architecture by ajayHadoop architecture by ajay
Hadoop architecture by ajay
 
Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...
Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...
Big Data Meets HPC - Exploiting HPC Technologies for Accelerating Big Data Pr...
 
Ceph on Intel: Intel Storage Components, Benchmarks, and Contributions
Ceph on Intel: Intel Storage Components, Benchmarks, and ContributionsCeph on Intel: Intel Storage Components, Benchmarks, and Contributions
Ceph on Intel: Intel Storage Components, Benchmarks, and Contributions
 
Hug Hbase Presentation.
Hug Hbase Presentation.Hug Hbase Presentation.
Hug Hbase Presentation.
 
Hadoop HDFS Architeture and Design
Hadoop HDFS Architeture and DesignHadoop HDFS Architeture and Design
Hadoop HDFS Architeture and Design
 
Red Hat Storage Server Administration Deep Dive
Red Hat Storage Server Administration Deep DiveRed Hat Storage Server Administration Deep Dive
Red Hat Storage Server Administration Deep Dive
 
Hadoop HDFS Concepts
Hadoop HDFS ConceptsHadoop HDFS Concepts
Hadoop HDFS Concepts
 
Cephfs jewel mds performance benchmark
Cephfs jewel mds performance benchmarkCephfs jewel mds performance benchmark
Cephfs jewel mds performance benchmark
 
In-memory Caching in HDFS: Lower Latency, Same Great Taste
In-memory Caching in HDFS: Lower Latency, Same Great TasteIn-memory Caching in HDFS: Lower Latency, Same Great Taste
In-memory Caching in HDFS: Lower Latency, Same Great Taste
 
Hadoop HDFS Concepts
Hadoop HDFS ConceptsHadoop HDFS Concepts
Hadoop HDFS Concepts
 
HDFS Futures: NameNode Federation for Improved Efficiency and Scalability
HDFS Futures: NameNode Federation for Improved Efficiency and ScalabilityHDFS Futures: NameNode Federation for Improved Efficiency and Scalability
HDFS Futures: NameNode Federation for Improved Efficiency and Scalability
 
Ceph - High Performance Without High Costs
Ceph - High Performance Without High CostsCeph - High Performance Without High Costs
Ceph - High Performance Without High Costs
 
Apache HBase Performance Tuning
Apache HBase Performance TuningApache HBase Performance Tuning
Apache HBase Performance Tuning
 
Hadoop Cluster With High Availability
Hadoop Cluster With High AvailabilityHadoop Cluster With High Availability
Hadoop Cluster With High Availability
 
QCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureQCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference Architecture
 
Hadoop Architecture | HDFS Architecture | Hadoop Architecture Tutorial | HDFS...
Hadoop Architecture | HDFS Architecture | Hadoop Architecture Tutorial | HDFS...Hadoop Architecture | HDFS Architecture | Hadoop Architecture Tutorial | HDFS...
Hadoop Architecture | HDFS Architecture | Hadoop Architecture Tutorial | HDFS...
 
Storage Systems for big data - HDFS, HBase, and intro to KV Store - Redis
Storage Systems for big data - HDFS, HBase, and intro to KV Store - RedisStorage Systems for big data - HDFS, HBase, and intro to KV Store - Redis
Storage Systems for big data - HDFS, HBase, and intro to KV Store - Redis
 
Real-Time Data Loading from MySQL to Hadoop with New Tungsten Replicator 3.0
Real-Time Data Loading from MySQL to Hadoop with New Tungsten Replicator 3.0Real-Time Data Loading from MySQL to Hadoop with New Tungsten Replicator 3.0
Real-Time Data Loading from MySQL to Hadoop with New Tungsten Replicator 3.0
 
Storage Geeks 101 - 2019
Storage Geeks 101 - 2019Storage Geeks 101 - 2019
Storage Geeks 101 - 2019
 

Viewers also liked

Embrace OSGi Apache Con Europe2009
Embrace OSGi Apache Con Europe2009Embrace OSGi Apache Con Europe2009
Embrace OSGi Apache Con Europe2009day
 
Scripting Yor Java Application with BSF3
Scripting Yor Java Application with BSF3Scripting Yor Java Application with BSF3
Scripting Yor Java Application with BSF3day
 
Java Persistence Frameworks
Java Persistence FrameworksJava Persistence Frameworks
Java Persistence Frameworksday
 
Scala for scripting
Scala for scriptingScala for scripting
Scala for scriptingday
 
Scala4sling
Scala4slingScala4sling
Scala4slingday
 
Testing Zen
Testing ZenTesting Zen
Testing Zenday
 
Performance Pack
Performance PackPerformance Pack
Performance Packday
 

Viewers also liked (8)

Embrace OSGi Apache Con Europe2009
Embrace OSGi Apache Con Europe2009Embrace OSGi Apache Con Europe2009
Embrace OSGi Apache Con Europe2009
 
Scripting Yor Java Application with BSF3
Scripting Yor Java Application with BSF3Scripting Yor Java Application with BSF3
Scripting Yor Java Application with BSF3
 
Java Persistence Frameworks
Java Persistence FrameworksJava Persistence Frameworks
Java Persistence Frameworks
 
Apache jMeter
Apache jMeterApache jMeter
Apache jMeter
 
Scala for scripting
Scala for scriptingScala for scripting
Scala for scripting
 
Scala4sling
Scala4slingScala4sling
Scala4sling
 
Testing Zen
Testing ZenTesting Zen
Testing Zen
 
Performance Pack
Performance PackPerformance Pack
Performance Pack
 

Similar to Tarpm Clustering

Operating and Supporting Delta Lake in Production
Operating and Supporting Delta Lake in ProductionOperating and Supporting Delta Lake in Production
Operating and Supporting Delta Lake in ProductionDatabricks
 
Accelerate and Scale Big Data Analytics with Disaggregated Compute and Storage
Accelerate and Scale Big Data Analytics with Disaggregated Compute and StorageAccelerate and Scale Big Data Analytics with Disaggregated Compute and Storage
Accelerate and Scale Big Data Analytics with Disaggregated Compute and StorageAlluxio, Inc.
 
Scale your Alfresco Solutions
Scale your Alfresco Solutions Scale your Alfresco Solutions
Scale your Alfresco Solutions Alfresco Software
 
Flink Forward Berlin 2018: Stefan Richter - "Tuning Flink for Robustness and ...
Flink Forward Berlin 2018: Stefan Richter - "Tuning Flink for Robustness and ...Flink Forward Berlin 2018: Stefan Richter - "Tuning Flink for Robustness and ...
Flink Forward Berlin 2018: Stefan Richter - "Tuning Flink for Robustness and ...Flink Forward
 
Tuning Flink For Robustness And Performance
Tuning Flink For Robustness And PerformanceTuning Flink For Robustness And Performance
Tuning Flink For Robustness And PerformanceStefan Richter
 
Can we run the Whole Web on Apache Sling?
Can we run the Whole Web on Apache Sling?Can we run the Whole Web on Apache Sling?
Can we run the Whole Web on Apache Sling?Bertrand Delacretaz
 
Inside PostgreSQL Shared Memory
Inside PostgreSQL Shared MemoryInside PostgreSQL Shared Memory
Inside PostgreSQL Shared MemoryEDB
 
10 things i wish i'd known before using spark in production
10 things i wish i'd known before using spark in production10 things i wish i'd known before using spark in production
10 things i wish i'd known before using spark in productionParis Data Engineers !
 
Geek Sync | Guide to Understanding and Monitoring Tempdb
Geek Sync | Guide to Understanding and Monitoring TempdbGeek Sync | Guide to Understanding and Monitoring Tempdb
Geek Sync | Guide to Understanding and Monitoring TempdbIDERA Software
 
Persistent Memory Programming with Java*
Persistent Memory Programming with Java*Persistent Memory Programming with Java*
Persistent Memory Programming with Java*Intel® Software
 
Running Apache Spark on a High-Performance Cluster Using RDMA and NVMe Flash ...
Running Apache Spark on a High-Performance Cluster Using RDMA and NVMe Flash ...Running Apache Spark on a High-Performance Cluster Using RDMA and NVMe Flash ...
Running Apache Spark on a High-Performance Cluster Using RDMA and NVMe Flash ...Databricks
 
Optimizing Presto Connector on Cloud Storage
Optimizing Presto Connector on Cloud StorageOptimizing Presto Connector on Cloud Storage
Optimizing Presto Connector on Cloud StorageKai Sasaki
 
Real World Storage in Treasure Data
Real World Storage in Treasure DataReal World Storage in Treasure Data
Real World Storage in Treasure DataKai Sasaki
 
Massively Parallel RISC-V Processing with Transactional Memory
Massively Parallel RISC-V Processing with Transactional MemoryMassively Parallel RISC-V Processing with Transactional Memory
Massively Parallel RISC-V Processing with Transactional MemoryNetronome
 
[REPEAT 1] Deep Dive on Amazon Aurora with MySQL Compatibility (DAT304-R1) - ...
[REPEAT 1] Deep Dive on Amazon Aurora with MySQL Compatibility (DAT304-R1) - ...[REPEAT 1] Deep Dive on Amazon Aurora with MySQL Compatibility (DAT304-R1) - ...
[REPEAT 1] Deep Dive on Amazon Aurora with MySQL Compatibility (DAT304-R1) - ...Amazon Web Services
 
MongoDB World 2018: MongoDB for High Volume Time Series Data Streams
MongoDB World 2018: MongoDB for High Volume Time Series Data StreamsMongoDB World 2018: MongoDB for High Volume Time Series Data Streams
MongoDB World 2018: MongoDB for High Volume Time Series Data StreamsMongoDB
 
Elastify Cloud-Native Spark Application with Persistent Memory
Elastify Cloud-Native Spark Application with Persistent MemoryElastify Cloud-Native Spark Application with Persistent Memory
Elastify Cloud-Native Spark Application with Persistent MemoryDatabricks
 
Building Your First Serverless Data Lake (ANT356-R1) - AWS re:Invent 2018
Building Your First Serverless Data Lake (ANT356-R1) - AWS re:Invent 2018Building Your First Serverless Data Lake (ANT356-R1) - AWS re:Invent 2018
Building Your First Serverless Data Lake (ANT356-R1) - AWS re:Invent 2018Amazon Web Services
 

Similar to Tarpm Clustering (20)

Operating and Supporting Delta Lake in Production
Operating and Supporting Delta Lake in ProductionOperating and Supporting Delta Lake in Production
Operating and Supporting Delta Lake in Production
 
Accelerate and Scale Big Data Analytics with Disaggregated Compute and Storage
Accelerate and Scale Big Data Analytics with Disaggregated Compute and StorageAccelerate and Scale Big Data Analytics with Disaggregated Compute and Storage
Accelerate and Scale Big Data Analytics with Disaggregated Compute and Storage
 
Masterclass Live: Amazon EMR
Masterclass Live: Amazon EMRMasterclass Live: Amazon EMR
Masterclass Live: Amazon EMR
 
Scale your Alfresco Solutions
Scale your Alfresco Solutions Scale your Alfresco Solutions
Scale your Alfresco Solutions
 
Flink Forward Berlin 2018: Stefan Richter - "Tuning Flink for Robustness and ...
Flink Forward Berlin 2018: Stefan Richter - "Tuning Flink for Robustness and ...Flink Forward Berlin 2018: Stefan Richter - "Tuning Flink for Robustness and ...
Flink Forward Berlin 2018: Stefan Richter - "Tuning Flink for Robustness and ...
 
Tuning Flink For Robustness And Performance
Tuning Flink For Robustness And PerformanceTuning Flink For Robustness And Performance
Tuning Flink For Robustness And Performance
 
Can we run the Whole Web on Apache Sling?
Can we run the Whole Web on Apache Sling?Can we run the Whole Web on Apache Sling?
Can we run the Whole Web on Apache Sling?
 
Inside PostgreSQL Shared Memory
Inside PostgreSQL Shared MemoryInside PostgreSQL Shared Memory
Inside PostgreSQL Shared Memory
 
10 things i wish i'd known before using spark in production
10 things i wish i'd known before using spark in production10 things i wish i'd known before using spark in production
10 things i wish i'd known before using spark in production
 
Geek Sync | Guide to Understanding and Monitoring Tempdb
Geek Sync | Guide to Understanding and Monitoring TempdbGeek Sync | Guide to Understanding and Monitoring Tempdb
Geek Sync | Guide to Understanding and Monitoring Tempdb
 
Persistent Memory Programming with Java*
Persistent Memory Programming with Java*Persistent Memory Programming with Java*
Persistent Memory Programming with Java*
 
Running Apache Spark on a High-Performance Cluster Using RDMA and NVMe Flash ...
Running Apache Spark on a High-Performance Cluster Using RDMA and NVMe Flash ...Running Apache Spark on a High-Performance Cluster Using RDMA and NVMe Flash ...
Running Apache Spark on a High-Performance Cluster Using RDMA and NVMe Flash ...
 
Optimizing Presto Connector on Cloud Storage
Optimizing Presto Connector on Cloud StorageOptimizing Presto Connector on Cloud Storage
Optimizing Presto Connector on Cloud Storage
 
Real World Storage in Treasure Data
Real World Storage in Treasure DataReal World Storage in Treasure Data
Real World Storage in Treasure Data
 
Massively Parallel RISC-V Processing with Transactional Memory
Massively Parallel RISC-V Processing with Transactional MemoryMassively Parallel RISC-V Processing with Transactional Memory
Massively Parallel RISC-V Processing with Transactional Memory
 
[REPEAT 1] Deep Dive on Amazon Aurora with MySQL Compatibility (DAT304-R1) - ...
[REPEAT 1] Deep Dive on Amazon Aurora with MySQL Compatibility (DAT304-R1) - ...[REPEAT 1] Deep Dive on Amazon Aurora with MySQL Compatibility (DAT304-R1) - ...
[REPEAT 1] Deep Dive on Amazon Aurora with MySQL Compatibility (DAT304-R1) - ...
 
MongoDB World 2018: MongoDB for High Volume Time Series Data Streams
MongoDB World 2018: MongoDB for High Volume Time Series Data StreamsMongoDB World 2018: MongoDB for High Volume Time Series Data Streams
MongoDB World 2018: MongoDB for High Volume Time Series Data Streams
 
Elastify Cloud-Native Spark Application with Persistent Memory
Elastify Cloud-Native Spark Application with Persistent MemoryElastify Cloud-Native Spark Application with Persistent Memory
Elastify Cloud-Native Spark Application with Persistent Memory
 
Amazon Aurora
Amazon AuroraAmazon Aurora
Amazon Aurora
 
Building Your First Serverless Data Lake (ANT356-R1) - AWS re:Invent 2018
Building Your First Serverless Data Lake (ANT356-R1) - AWS re:Invent 2018Building Your First Serverless Data Lake (ANT356-R1) - AWS re:Invent 2018
Building Your First Serverless Data Lake (ANT356-R1) - AWS re:Invent 2018
 

More from day

Tech Summit 08 Support Initiative
Tech Summit 08 Support InitiativeTech Summit 08 Support Initiative
Tech Summit 08 Support Initiativeday
 
Non Cms For Web Apps
Non Cms For Web AppsNon Cms For Web Apps
Non Cms For Web Appsday
 
Getting Into The Flow With Cq Dam
Getting Into The Flow With Cq DamGetting Into The Flow With Cq Dam
Getting Into The Flow With Cq Damday
 
Dispatcher Oom
Dispatcher OomDispatcher Oom
Dispatcher Oomday
 
Advanced Collaboration And Beyond
Advanced Collaboration And BeyondAdvanced Collaboration And Beyond
Advanced Collaboration And Beyondday
 
Wc Mand Connectors2
Wc Mand Connectors2Wc Mand Connectors2
Wc Mand Connectors2day
 
Jackrabbit Roadmap
Jackrabbit RoadmapJackrabbit Roadmap
Jackrabbit Roadmapday
 
Doc Book Vs Dita
Doc Book Vs DitaDoc Book Vs Dita
Doc Book Vs Ditaday
 
Doc Book Vs Dita Teresa
Doc Book Vs Dita TeresaDoc Book Vs Dita Teresa
Doc Book Vs Dita Teresaday
 
862
862862
862day
 
Apache Con Us2007 Sanselan
Apache Con Us2007 SanselanApache Con Us2007 Sanselan
Apache Con Us2007 Sanselanday
 
Apache Con Us2007 Jcr In Action
Apache Con Us2007 Jcr In ActionApache Con Us2007 Jcr In Action
Apache Con Us2007 Jcr In Actionday
 
Apache Con Us2007 Apachei Batis
Apache Con Us2007 Apachei BatisApache Con Us2007 Apachei Batis
Apache Con Us2007 Apachei Batisday
 
Apache Con U S07 F F T Sling
Apache Con U S07  F F T  SlingApache Con U S07  F F T  Sling
Apache Con U S07 F F T Slingday
 
200711 R E S T Apache Con
200711  R E S T  Apache Con200711  R E S T  Apache Con
200711 R E S T Apache Conday
 

More from day (15)

Tech Summit 08 Support Initiative
Tech Summit 08 Support InitiativeTech Summit 08 Support Initiative
Tech Summit 08 Support Initiative
 
Non Cms For Web Apps
Non Cms For Web AppsNon Cms For Web Apps
Non Cms For Web Apps
 
Getting Into The Flow With Cq Dam
Getting Into The Flow With Cq DamGetting Into The Flow With Cq Dam
Getting Into The Flow With Cq Dam
 
Dispatcher Oom
Dispatcher OomDispatcher Oom
Dispatcher Oom
 
Advanced Collaboration And Beyond
Advanced Collaboration And BeyondAdvanced Collaboration And Beyond
Advanced Collaboration And Beyond
 
Wc Mand Connectors2
Wc Mand Connectors2Wc Mand Connectors2
Wc Mand Connectors2
 
Jackrabbit Roadmap
Jackrabbit RoadmapJackrabbit Roadmap
Jackrabbit Roadmap
 
Doc Book Vs Dita
Doc Book Vs DitaDoc Book Vs Dita
Doc Book Vs Dita
 
Doc Book Vs Dita Teresa
Doc Book Vs Dita TeresaDoc Book Vs Dita Teresa
Doc Book Vs Dita Teresa
 
862
862862
862
 
Apache Con Us2007 Sanselan
Apache Con Us2007 SanselanApache Con Us2007 Sanselan
Apache Con Us2007 Sanselan
 
Apache Con Us2007 Jcr In Action
Apache Con Us2007 Jcr In ActionApache Con Us2007 Jcr In Action
Apache Con Us2007 Jcr In Action
 
Apache Con Us2007 Apachei Batis
Apache Con Us2007 Apachei BatisApache Con Us2007 Apachei Batis
Apache Con Us2007 Apachei Batis
 
Apache Con U S07 F F T Sling
Apache Con U S07  F F T  SlingApache Con U S07  F F T  Sling
Apache Con U S07 F F T Sling
 
200711 R E S T Apache Con
200711  R E S T  Apache Con200711  R E S T  Apache Con
200711 R E S T Apache Con
 

Recently uploaded

Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 

Recently uploaded (20)

Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 

Tarpm Clustering

  • 1. www.day.com CRX 1.4 TarPersistenceManager and Clustering with TarPersistenceManager Speaker: Honwai Wong, SSE Duration: 45 min Feedback: techsummit@day.com Day Technical Summit 2008 1
  • 2. www.day.com Agenda  TarPM (TarPersistenceManager)  Functionality  Configuration  Optimization  Hot Backup  Migration  TarPM (TarPersistenceManager) Clustering  Architecture  Global Data Store  Setup  Configuration Day Technical Summit 2008
  • 3. www.day.com TarPM Functionality  Disk-based PersistenceManager  Uses standard Tar file format (POSIX standard)  Append-only write operations, thus extremely efficient  Particularly suitable for high data creation and modification use-cases  Takes advantage of key-value pair data structure of CRX  Maintains index files for fast access  Hot backup capability Day Technical Summit 2008 3
  • 4. www.day.com TarPM Configuration  TarPM configuration is done on a per workspace-level e.g. <crx_home>/workspaces/crx.default/workspace.xml  no mandatory parameters, preset with default values <PersistenceManager class= quot;com.day.crx.persistence.tar.TarPersistenceManagerquot; /> Day Technical Summit 2008 4
  • 5. www.day.com TarPM Configuration - Parameters Parameter Description default The directory where local files are stored. This can be an base directory localPath absolute or relative path. of workspace If the current data file grows larger than this number (in MB), maxFileSize 64 a new data file is created. After an abnormal termination, at most this much data (in maxIndexBuffer 32 MB) needs to be scanned to re-create the tar entry index. optimizeSleep Number in milliseconds to wait after each optimization step. 1 Day Technical Summit 2008 5
  • 6. www.day.com TarPM Optimization  Append-only operation leads to increased disk usage  Data in the tar files is never overwritten  Delete will append 0 length entries  Optimization task copies active data from old tar files into new ones and subsequently deletes old tar files  Different modes of operation supported  Recommended to run during times of low system usage Day Technical Summit 2008 6
  • 7. www.day.com TarPM Optimization - Modes  Manually trigger optimization from CRX Explorer  Place a file called optimize.tar in data-directory  TarPM detects this file and starts optimization  optimize.tar is renamed to optimizeNow.tar  after optimization finished, optimizeNow.tar is deleted automatically  stop task by deleting this file  Automate using cron-job  Offline optimization using command-line tool  java -cp <jars> com.day.crx.persistence.tar.TarUtils -optimize <directory> Day Technical Summit 2008 7
  • 8. www.day.com TarPM Hot Backup  Reminder: tar files are append-only  Backup at any time including runtime  Place file stopdelete.tar to prevent the TarPM from deleting old files while backing up  Consistent backup by copying data_*.tar files, sorted by modification date, newest last  e.g. ls -tr data_*.tar | xargs -n1 -J % cp -v % /backup  When restoring, incomplete transactions are rolled back Day Technical Summit 2008 8
  • 9. www.day.com TarPM Migration  Migration of workspaces using CRX Console  Low-level copy of existing workspace to new TarPM workspace  Tool is provided with CRX  Comprehensive documentation on docs.day.com  see section CQ 4.2 / Setup / Migration  Part of migration presentation from Tech Summit 2007  http://daycare.day.com/home/day_public/tech_summit_2007.html  Author: Dominique Jaeggi, SSE, Day Day Technical Summit 2008 9
  • 10. www.day.com TarPM Clustering Architecture  Master/Slave relation between CRX cluster nodes participating in a cluster  Consists of 2 or more CRX cluster nodes with TarPM  Synchronization via file-based Cluster Journal  Direct communication between cluster nodes via TCP/IP using HTTP  Only Master CRX node writes data  Master is elected automatically  Automatic fail-over Day Technical Summit 2008 10
  • 11. www.day.com TarPM Clustering Architecture - Overview Cluster Node A Cluster Node B (Master) s old H s CRX CRX ock Journal posts L to master TAR PM TAR PM master write read master write read Master Data TAR Cluster Journal (FS) FS Day Technical Summit 2008 11
  • 12. www.day.com TarPM Clustering Global Data Store  Central storage for binary data, even beyond repository boundaries  Only one copy per unique object is kept  Storing and reading does not block other users, done outside Persistence Manager  Objects in the Data Store are immutable  Only unique data identifier of existing objects in the Data Store are stored in the Persistence Manager  Transactional semantics guaranteed  Hot Backup by simply copying all files :) Day Technical Summit 2008 12
  • 13. www.day.com TarPM Clustering Global Data Store - Configuration  Configured in repository.xml of CRX  e.g. <crx_home>/server/runtime/0/_crx/WEB-INF/repository.xml  File-based or database-backed  org.apache.jackrabbit.core.data.FileDataStore  org.apache.jackrabbit.core.data.db.DbDataStore Day Technical Summit 2008 13
  • 14. www.day.com TarPM Clustering FileDataStore - Config Parameters Parameter Description default repository.home/ path The directory where to store binary objects. repository/datastore Binary objects bigger than this value (in bytes) are minRecordLength 100 stored in the Data Store. Day Technical Summit 2008 14
  • 15. www.day.com TarPM Clustering DbDataStore - Config Parameters Parameter Description default url The database URL used to access the database. - user Name of the database-user. - password Password of the user. - Binary objects bigger than this value (in bytes) are minRecordLength 100 stored in the Data Store. maxConnections The maximum number of open connections. 3 Day Technical Summit 2008 15
  • 16. www.day.com TarPM Clustering Architecture Cluster Node A Cluster Node B (Master) s old H s ock CRX CRX L Journal posts to master TAR PM TAR PM master write read master write read Master Data TAR Cluster Journal (FS) Global Data Store FS Day Technical Summit 2008 16
  • 17. www.day.com TarPM Clustering Setup  Install CRX  Configure clustering in repository.xml  Configure TarPM to run in cluster mode  Setup additional CRX cluster node by copying complete instance  Delete repository-local revision, if present  On startup, CRX cluster node will sync up with master data based on journal Day Technical Summit 2008 17
  • 18. www.day.com TarPM Clustering Repository Configuration  Enable clustering on a repository-wide level  e.g. <crx_home>/runtime/0/_crx/WEB-INF/repository.xml  Unique cluster id  Cluster Journal <Cluster id=quot;cluster-node-1quot; syncDelay=quot;1quot;> <Journal class=quot;org.apache.jackrabbit.core.journal.FileJournalquot;> <param name=quot;revisionquot; value=quot;${rep.home}/revision.logquot; /> <param name=quot;directoryquot; value=quot;/data/shared/journalquot; /> </Journal> </Cluster> Day Technical Summit 2008 18
  • 19. www.day.com TarPM Clustering Repository Configuration - Parameters luster C Parameter Description id This is required to be a unique literal id of the cluster node. Delay in milliseconds before changes to the journal are syncDelay automatically detected. Default: 5000 urnal Jo Parameter Description FQN of org.apache.jackrabbit.core.journal.Journal interface class implementation. revision Location and filename of repository-local revision counter. directory Shared directory of journal entries and global revision counter. Day Technical Summit 2008 19
  • 20. www.day.com TarPM Clustering Workspace Configuration  enable clustering on the TarPM  e.g. <crx_home>/crx/workspaces/crx.default/workspace.xml  set cluster flag  configure local and shared paths <PersistenceManager class=quot;com.day.crx.persistence.tar.TarPersistenceManagerquot;> <param name=quot;clusterquot; value=quot;truequot; /> <param name=quot;localPathquot; value=quot;${wsp.home}quot; /> <param name=quot;sharedPathquot; value=quot;/data/sharedquot; /> </PersistenceManager> Day Technical Summit 2008 20
  • 21. www.day.com TarPM Clustering TarPM Configuration - Parameters Parameter Description default cluster Enables clustering. FALSE localPath Path where to store local tar-files and index-files. workspace.home sharedPath Path where to store shared data, i.e. tar-files. workspace.home Day Technical Summit 2008 21
  • 22. www.day.com TarPM Questions? Day Technical Summit 2008 22