3. Agenda
1 - General best practices on tuning
2 - Common mistakes
3 - Sizing, what to expect from a single server
4 - Solr tuning
5 - JVM tuning (Part 2)
6 - Caches (Part 2)
7 - Alfresco is running slow… where to start? (Part 2)
4. 1 - General Best Practices on Tuning
Disable unused services and features
• Disable virtual file-systems
• cifs.enabled=false, ftp.enabled=false
• webdav.enabled=false, nfs.enabled=false, imap.enabled=false
• Disable thumbnails and document previews
• system.thumbnail.generate=false
• Disable the Share web preview (in share-config-custom.xml)
<config evaluator="string-compare" condition="DocumentDetails" replace="true">
<document-details>
<!-- display web previewer on document details page -->
<display-web-preview>false</display-web-preview>
</document-details>
</config>
• Disable replication
• replication.enabled=false
• transferservice.receiver.enabled=false
5. 1 - General Best Practices on Tuning
Disable unused services and features (cont.)
• Disable cloud-sync features
• syncService.mode=OFF
• sync.mode=OFF
• sync.pullJob.enabled=false
• sync.pushJob.enabled=false
• Disable user quotas
• system.usages.enabled=false
• system.usages.clearBatchSize=0
• Disable eager creation of home folders
• home.folder.creation.eager=false
• Disable activities feed
• activities.feed.notifier.enabled=false
• activities.feed.cleaner.enabled=false
• activities.post.cleaner.enabled=false
6. 1 - General Best Practices on Tuning
Golden Rules for the Repository
• Limit the group hierarchy to 5 levels (nested groups)
• Use an inheritance-based permission model
• Limit the maximum number of nodes in a folder
• Keep some control over the number of sites: do you really need 10,000 sites?
• Keep a low ratio of group memberships per user
• Limit the depth of the folder hierarchy
7. 2 – Common Mistakes
• Not keeping extended configurations and customizations separate in the shared
directory. Do not put them in the configuration root. If you do, you will lose them
during upgrades.
• Not testing the backup strategy.
• Insufficient Monitoring, Insufficient troubleshooting tools
• Making changes to the system without testing them thoroughly on a test and
pre-production machine first.
• Forgetting to adjust the system sizing for increased users and sessions
• Increase the database connection pool
• Tune maxThreads on Tomcat
• Tune the JVM and GC
• Benchmarks and Stress tests
• Using a shared database with other applications
• Network/Infrastructure constraints
• Not following the SPM
8. 2 – Common Mistakes
• Customizations / Custom Code mistakes
• Not closing search result sets in try…catch…finally blocks (memory leaks)
• Incorrect usage of policies/behaviors (collisions, poor code quality)
• Using lower case versions of Alfresco beans
• Direct access to the database (should use Spring and the existing DAOs)
• Usage of private APIs
9. 2 – Common Mistakes
• Customizations / Custom Code mistakes
• Using TransactionService directly instead of RetryingTransactionHelper
• Not using the CMIS query language with SearchService
• Improper exception handling
10. 3 – Sizing - what to expect from a single server
Let's assume we're running Alfresco on a single server with the following hardware:
Red Hat Linux 64-bit, 16 GB RAM, 2 quad-core CPUs at 3.2 GHz, local SSD disk.
We will have 3 web applications running on the same JVM and container (i.e. Tomcat):
• Alfresco Repository
• Alfresco Share UI
• Solr
According to our internal benchmarks, and highly dependent on the specifics of each
use case, this server should be able to handle 200 concurrent users or up to
2000 casual users.
11. 3 – Sizing - what to expect from a single server
The following factors will affect sizing and architecture:
• Use Case
• Concurrent users
• Document types, sizes and distribution ratios
• Architecture (virtualization? failover? replication? integrations? component stack?)
• Authority structure
• Operations
• Components, Protocols and Apis
• Batch operations
• Response times requirements
13. 3 – Sizing: Divide and Conquer
• Know when, where and what processes are running on your
server, and which resources those processes affect.
• Do it with appropriate monitoring
• JavaMelody as a simple approach (DEMO)
• https://github.com/miguel-rodriguez/alfresco-monitoring
• Use support tools for troubleshooting (DEMO)
• https://github.com/Alfresco/alfresco-support-tools
• Have specific servers dedicated to specific tasks
• Offload the user facing nodes
16. 4 – Solr Tuning
Golden Rules for Solr
• Do you search on deleted content? If not, disable the archive core.
• Go to solrHome and edit the solr.xml file, commenting out the archive core
• Also disable the archive core backup scheduled task
• Do you search on content or only on metadata? If metadata only, you can disable full-text indexing
• alfresco.index.transformContent=false
• Alfresco can make use of transactional metadata queries (DB fetch)
• Is SSL really needed? Inside an intranet it can be disabled to reduce complexity.
• Optimize your ACL policy: re-use your permissions, use inheritance and use groups
17. 4 – Solr Tuning
Golden Rules for Solr Indexing
• Have local indexes (don't use shared folders or NFS); use fast hardware
(RAID, SSD, …)
• Tune the mergeFactor: 25 is ideal for indexing, while 2 is ideal for search.
• Tune your RAM buffer size (ramBufferSizeMB) in solrconfig.xml (32 MB by
default)
• Analyze your indexing processes (check alfresco repository health)
• Tune the transformations that occur on the repository side, set a
transformation timeout.
18. 4 – Solr Tuning
Golden Rules for Solr Indexing
• Closely monitor the Solr JVM (especially GC and heap usage)
• Enable GC logs, analyze GC performance, tune the GC algorithm
• Do you need tracking to happen every 15 seconds?
• Use a dedicated Alfresco tracking instance; several architecture options exist
• Increase your index batch counts to get more results on your indexing
webscript
• In each core's solrcore.properties, raise the batch count to 2000
• Impacting factors in Indexing
• JVM memory and CPU usage on the repository layer (text extraction/transformations)
• JVM memory, CPU, disk I/O and disk cache size on the Solr layer
• Number of indexing threads, Solr caches
19. 4 – Solr Tuning
Golden Rules for Solr Search
• Have local indexes (don't use shared folders or NFS); use fast hardware (RAID,
SSD, …)
• Tune the mergeFactor: 2 is ideal for search.
• Increase your query caches and the RAM buffer
• Avoid PATH queries (they are slow); avoid * searches; avoid ALL searches
• Avoid using sort; you can sort your results on the client side using JS or any
client-side framework of your choice.
• Search is CPU-intensive rather than RAM-intensive: increase CPU power.
• Upgrade your Alfresco release with the latest service packs and hotfixes. These
contain the latest Solr improvements and bug fixes, which can have a great impact on
overall search performance.
20. 4 – Solr Tuning
Solr Caches
Tracking the usage of the solr caches can help to tune them for your use case.
• http://<solr_server>:<solr_port>/solr/alfresco/admin/stats.jsp#cache
The URL above shows statistics on cache usage. If you see many evictions, you
should look into increasing that cache so all elements can fit (but don't
overdo it; adjust it and see what fits your setup). It is likewise a good idea to
decrease the size of caches that have a lot of unused slots.
The goal should be to get the hit rate as close to 1.00 as possible (1.00 being a 100% hit ratio).
21. 4 – Solr Tuning
Solr usage on Alfresco Share
• Solr indexing and search performance will positively affect the overall Share
performance. Share relies on Solr in the following situations:
• Full Text Search (search field in top right corner)
• Advanced Search
• Filters
• Tags
• Categories (implemented as facets)
• Dashlets such as the Recently Modified Documents
• Wildcard searches for People, Groups, Sites (uses database search if not wildcard)
Editor's notes
Disabling some features that are not being used will release important resources allowing them to be used for active tasks, contributing for increased performance.
Transformations
When users access Alfresco via the Share interface and open the document details page, a full document preview is generated. This involves calls to various third-party tools such as OpenOffice, Ghostscript and ImageMagick to create a Flash version of the document. If previews are not being used, we can prevent their creation by including the following snippet of XML in the share-config-custom.xml file.
User Quotas
Checking for user quotas can add some overhead to Alfresco. When not needed, this feature can also be disabled.
User home folders
Alfresco creates a home folder for each new user automatically. If your users are not using this folder for any business-related tasks, you can disable the automatic creation of home folders for new users.
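In Alfresco 4.x this is controlled by a single property in alfresco-global.properties (verify the property name against your version's documentation):

```properties
# Create home folders lazily, on first access, instead of eagerly at user creation
home.folder.creation.eager=false
```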
Activities feed
If this feature is not necessary and is not being used disabling it will prevent regular checks to the activities and will again save on system resources.
activities.feed.notifier.enabled=false
activities.feed.cleaner.enabled=false
activities.post.cleaner.enabled=false
1,2 - ACL checks are known to slow down performance when the maximum group hierarchy depth exceeds 5 levels. Our advice is, when possible, to limit the maximum group hierarchy depth to 5.
3 - When using Share or another client to browse a repository folder, Alfresco needs to perform a series of actions before it actually renders or delivers the content (permission checking, etc.). The more nodes that reside inside a specific folder, the slower the response time. We recommend, when possible, limiting the maximum number of document nodes inside a folder to 2000.
4 - The number of sites on the system has some influence on performance, especially when checking site membership for users. Although, of the three limits suggested here, this is the factor with the least impact, we recommend keeping the number of sites below 5000.
5 - The number of groups a user belongs to has an impact on performance while rendering some client pages (especially some Share dashlets, like the My Sites dashlet). Alfresco runs some complex queries (based on the user's group membership) while determining the assets to render on some Share pages. We advise keeping a low ratio of groups per user. When possible, and to optimize Share client rendering performance, a user should not belong to more than 5 or 6 groups.
6 - The depth of the folder hierarchy also has an impact when browsing and performing document actions under a certain folder. We recommend, when possible, limiting the maximum depth of a folder hierarchy to 15 levels.
Not closing resources
Certain resources in Alfresco (specifically search result objects) are not cleaned up automatically by Alfresco and must therefore be cleaned up correctly by extension code (i.e. in a “finally” block). Failing to clean up such resources results in leaks, not only of memory but also, in some cases, “real” operating system resources (such as file handles) as well.
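The pattern can be sketched as follows. `FakeResultSet` here is a hypothetical stand-in for Alfresco's `org.alfresco.service.cmr.search.ResultSet` (so the sketch stays self-contained); the real class holds index readers and file handles until `close()` is called:

```java
// Sketch of the mandatory close pattern for Alfresco search results.
// FakeResultSet is a hypothetical stand-in for the real Alfresco ResultSet.
class FakeResultSet implements AutoCloseable {
    boolean closed = false;
    int length() { return 3; }
    @Override public void close() { closed = true; }
}

public class ResultSetClosing {
    // Process results; the finally block guarantees close() even if an
    // exception is thrown while iterating the results.
    static int countResults(FakeResultSet results) {
        try {
            return results.length();
        } finally {
            results.close();
        }
    }

    public static void main(String[] args) {
        FakeResultSet rs = new FakeResultSet();
        int n = countResults(rs);
        System.out.println(n + " results, closed=" + rs.closed);
    }
}
```

The same try…finally shape applies to any extension code that obtains a result set from SearchService.query().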
Lower case beans
The “lower case” versions of the Alfresco service beans (i.e. those whose names start with a lowercase letter, e.g. nodeService) are configured to bypass Alfresco’s security, transaction and auditing checks, with no recourse for the administrator to turn them back on. There have been persistent (but incorrect) rumors in the Alfresco community that these versions of the services perform significantly better than the official (“upper case”) versions, but that hasn’t been the case since at least Alfresco v2.x.
Content policies
Content policies are wired in at a very low level in the repository and as a result can be called many hundreds of times a second in some cases (e.g. when content is being manipulated via CIFS). In addition they are executed synchronously within each Alfresco transaction. The result is that even the slightest poor performance in a custom content policy or behavior can have a profound impact on Alfresco performance. For that reason it is critically important that custom content policies / behaviours are either fast (conduct minimal computation and only perform minimal I/O to the repository) or are made asynchronous.
Direct access to database
Alfresco’s database schema and the SQL the product uses has been carefully tuned. Uncontrolled access to the same tables can interfere with Alfresco’s normal operation, impacting both performance and (in some cases) stability. Note that this includes reads (SELECTs), as this can block concurrent write operations in some circumstances.
Use of private API’s
Only the public Alfresco Java APIs may be used in a certified extension; the private APIs should not be used and are not supported. http://docs.alfresco.com/4.2/concepts/java-public-api-list.html
Using Transaction Service instead of RetryingTransactionHelper
As the name implies, RetryingTransactionHelper contains retry logic for certain recoverable, expected database exceptions (deadlocks, basically). It also uses Spring’s “template” pattern to ensure a transaction is always completed (committed or rolled back) correctly, regardless of what happens in the logic inside the transaction.
The “raw” TransactionService provides neither of these benefits, and for that reason is considered unsafe for use in extensions.
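A minimal, self-contained sketch of what the helper does internally (`DeadlockException` is a stand-in for the recoverable database exceptions the real helper recognizes; the real entry point is `transactionService.getRetryingTransactionHelper().doInTransaction(callback)`):

```java
import java.util.concurrent.Callable;

// Sketch of RetryingTransactionHelper's template pattern: run a unit of
// work, retry on recoverable errors (deadlocks), and always finish the
// "transaction" (commit or rollback) regardless of what the work does.
public class RetrySketch {
    static class DeadlockException extends RuntimeException {}

    static <T> T doInTransaction(Callable<T> work, int maxRetries) throws Exception {
        for (int attempt = 0; ; attempt++) {
            try {
                // begin transaction (elided in this sketch)
                T result = work.call();
                // commit (elided)
                return result;
            } catch (DeadlockException e) {
                // rollback (elided), then retry if attempts remain
                if (attempt >= maxRetries) throw e;
            }
        }
    }

    public static void main(String[] args) throws Exception {
        final int[] calls = {0};
        // The work fails once with a deadlock, then succeeds on retry.
        String out = doInTransaction(() -> {
            if (calls[0]++ == 0) throw new DeadlockException();
            return "committed";
        }, 3);
        System.out.println(out + " after " + calls[0] + " attempts");
    }
}
```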
CMIS query language
Alfresco’s SearchService API supports different “languages” (XPath, Lucene, SOLR and CMIS), which roughly correlate to the different underlying search implementations in Alfresco. Of these languages, only CMIS is fully abstracted away from the underlying implementation, and so is the only language that provides some guarantee of consistent behavior regardless of how a given Alfresco instance has been configured (SOLR vs Lucene, or MDQ, for example). Note however that SearchService doesn’t fully implement CMIS-QL: specifically, the “SELECT” clause in CMIS queries sent to the SearchService is not processed (it is silently ignored). SearchService, regardless of the query language used, always returns sets of matching NodeRefs.
Improper exception handling
Exceptions should either be caught and recovered from, or allowed to flow up the call stack. It is almost never appropriate to “swallow” an exception (catch it and do nothing), and excessive wrapping of exceptions inside other exceptions should be minimized (it makes triage more difficult).
Catching or throwing java.lang.Error instances, and catching java.lang.Throwable, is also inappropriate: these classes (java.lang.Error, specifically) indicate fatal JVM problems and therefore cannot be safely caught or thrown.
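A small, self-contained contrast of the two styles (the method names are illustrative only):

```java
// Sketch: handle what you can genuinely recover from; let the rest flow up.
public class ExceptionHandling {
    // Discouraged: swallowing hides the failure from callers and the logs.
    static int swallow(String s) {
        try {
            return Integer.parseInt(s);
        } catch (NumberFormatException e) {
            return -1; // silent fallback: the caller never learns why
        }
    }

    // Preferred when no meaningful recovery exists: let the exception
    // propagate unwrapped so triage sees the original failure.
    static int parseOrFail(String s) {
        return Integer.parseInt(s); // NumberFormatException flows to the caller
    }

    public static void main(String[] args) {
        System.out.println(swallow("x"));      // -1
        System.out.println(parseOrFail("42")); // 42
    }
}
```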
Alfresco sizing, like that of other systems, has many subtleties that are hard to fully systematize. Each deployment has specifics that demand consideration when estimating sizing for the different layers (Share front end, repository, indexing, transformation, storage). In any case, the numbers presented above assume certain fairly generic steps and concerns that are mostly common across the different use cases. Changes to those assumptions can drastically alter the predictions above.
For memory calculations, consider the repository L2 cache, plus initial JVM overhead, plus basic Alfresco system memory.
This means that you can run the Alfresco repository and web client, with many users accessing the system, on a basic single-CPU server. However, you must add memory as your user base grows, and add CPUs depending on the complexity of the tasks you expect your users to perform and on how many concurrent users access the client.
The terms concurrent users and casual users are used. Concurrent users are users who constantly access the system through Alfresco with only a small pause between requests (3-10 seconds maximum), with continuous access 24/7. Casual users are users who occasionally access the system through the Alfresco or WebDAV/CIFS interfaces with a large gap between requests (for example, occasional document access during the working day).
Common use cases
Alfresco use cases vary considerably because of the elasticity of the platform, and although we can enumerate some generic common use cases, the details in which each real implementation differs may be very important from an architecture and sizing perspective. Generally, Alfresco solutions can be classified into one of the two following cases:
• Collaboration
• Backend Repository
Authority Structure
We know from recent benchmark comparisons that the authority structure has a direct and important impact on performance, especially of SOLR. So when sizing SOLR it is important to consider the importance of search operations for the solution, the types of searches being executed, and the repository size and characteristics, but also, and equally important, the authority structure of the corresponding use case. Collaboration use cases will thus in principle be more demanding (keeping the other factors constant) than backend use cases with simpler authority structures.
Injection rate: the repository growth or document upload rate. The document types and sizes will impact transformation, text extraction and indexing. The impact of repository size on performance grows with the ratio of search operations expected in the solution.
The injection rate has an obvious impact on near-future repository sizes, but also on the capability of the different solution layers to handle the throughput, which stresses not only the content storage and database (metadata extraction/upload/rules) but also the indexing layer. Depending on the amount of document injection, this may imply that dedicated nodes are reserved for injection (clustered or not with the front-end service nodes) and that the Solr layer is scaled up/out. The requirements around uploading and downloading large or small documents, and around indexing and transforming different types of documents, will vary. This can have architectural consequences: a preference for certain protocols and mechanisms for large files (for example FTP or bulk ingestion), or the use of dedicated caching solutions for large document downloads. It also has consequences at the sizing level: indexing large documents is very different from indexing smaller ones, and it also differs between document types.
Repository size is also dynamic, and the project may expect, even in the near term, to go through different phases: first migrating a legacy repository, then new content roll-out, archival, etc. Sizing estimates should cover the different phases, especially in the short term.
One of the secrets of a successful architecture is to know exactly what, when and where the processes are occurring and which resources those processes affect. Having this information gives the architect the power to “Divide and Conquer”.
Working with a fairly flexible technology, the architect can wisely divide the overall processes across the resources (servers), achieving the necessary balance. Consider, for example, the scheduled jobs that Alfresco executes: in a distributed architecture there are many advantages to offloading some of those jobs to a specific server, releasing important resources from the servers that are actually serving user requests.
From an Alfresco perspective, offloading (disabling) these scheduled jobs from the front-end servers is no more than configuring their cron expressions to fire far in the future, and having a dedicated server (normally separate from the cluster) execute these jobs.
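As an illustration, the content store cleaner could be pushed far into the future on the front-end nodes via alfresco-global.properties (the property name below is from Alfresco 4.x's repository.properties; check the actual cron property of each job you offload):

```properties
# Front-end nodes: never fire the orphaned-content cleanup job here;
# a dedicated maintenance node keeps the normal schedule instead
system.content.orphanCleanup.cronExpression=* * * * * ? 2099
```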
Capacity planning is the science and art of estimating the space, computer hardware, software and connection infrastructure resources that will be needed over some future period of time. It's a means to predict the types, quantities and timing of critical resource capacities needed within an infrastructure to meet accurately forecasted workloads.
Predicting and sizing a system depends on a good understanding of user behavior. The following diagram represents the stages of a capacity-planning scenario.
Validate your predictions with stress tests; compare the obtained data with the LIVE data from production, adjusting the REAL user behaviors into your stress-test scenarios.
Performing a regular analysis of the monitoring/capacity-planning data will help you know exactly when and how you need to scale your architecture, allowing for incremental and continuous improvement. The more data that gets indexed into Elasticsearch over the application life cycle, the more accurate your capacity predictions will be. They represent the “real” application usage and how that usage affects the various layers of your application. This plays a very important role when modeling and sizing the architecture for future business requirements.
The peak-period methodology is an efficient way to implement a capacity planning strategy, allowing you to analyze vital performance information when the system is under the most load/stress; furthermore, it represents YOUR system. The peak period may be an hour, a day, 15 minutes, or any other period used to analyze the collected utilization statistics. Assumptions may be estimated based on business requirements or on specific benchmarks of a similar implementation.
The diagram shows the most important factors in the deployment that should be analyzed on a regular basis. Note that certain inspection targets have overhead while they are being inspected. These targets might not be appropriate for long-term monitoring, or may require tuning to minimize impact.
Pay attention to Alfresco transformations and text extraction
Alfresco executes a high number of transformations while working with documents; these include document text extraction (for indexing), preview generation, thumbnail generation and renditions. It's wise to regularly monitor the health and performance of transformations: check the longest-running transformations, measure transformation times, and use transformation limits when applicable.
HTTP Requests and Responses
Debugging HTTP requests can yield useful information to help you tune your applications: find out which components are making a page take so long to load, or make sure the JSON your web script returns looks like you expect it to. There are a couple of tools that can be used to debug HTTP requests and responses, like Charles or Fiddler; other tools are included in the browsers (Firebug, the Chrome inspector).
Disable all full-text indexing
We can disable all full-text indexing activities and tune our search layer for performance on metadata-based searches, making use of a recent Alfresco feature called transactional metadata queries. To disable full-text search, we need to configure the workspace SpacesStore Solr core: edit the solrcore.properties file and set the following property:
alfresco.index.transformContent=false
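Relatedly, Alfresco can be told to answer metadata-only queries from the database whenever possible instead of going to Solr (this property is available from Alfresco 4.2 onwards; verify the value against your version's documentation):

```properties
# alfresco-global.properties: run queries transactionally against the DB
# when they can be answered from metadata alone, falling back to Solr otherwise
solr.query.fts.queryConsistency=TRANSACTIONAL_IF_POSSIBLE
```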
Disable archive core
If you are not planning to search for deleted content, you can safely disable the indexing of archived content. Alfresco never searches inside files that have been deleted/archived.
We can disable the indexing of archived content by going into solrHome and editing the solr.xml file that resides in the root of this folder. Comment out the archive core as shown in the XML below.
<?xml version='1.0' encoding='UTF-8'?>
<solr sharedLib="lib" persistent="true">
<cores adminPath="/admin/cores" adminHandler="org.alfresco.solr.AlfrescoCoreAdminHandler">
<!-- <core name="archive" instanceDir="archive-SpacesStore"/>-->
<core name="alfresco" instanceDir="workspace-SpacesStore"/>
</cores>
</solr>
Note that we are commenting out the Solr core named “archive”. This prevents Solr from indexing archived content, saving disk space, Solr memory, CPU during indexing, and overall resources.
We also need to disable the archive core backup scheduled task. We do this by setting its cron expression to a date far in the future. You should do this on every Alfresco node (including the new tracking instance):
# disabling the archive backup as we are not using archive search
solr.backup.archive.cronExpression=* * * * * ? 2199
solr.backup.archive.numberToKeep=0
Optimize your ACL policy: re-use your permissions, use inheritance, and use groups. Don't set up specific permissions for users or groups at folder level. Try to re-use your ACLs.
ramBufferSizeMB
ramBufferSizeMB sets the amount of RAM that Solr indexing may use for buffering added documents and deletions before they are flushed to disk.
Increasing this to 64 or even 128 MB has generally improved performance, but it depends on the amount of free memory you have available.
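As a sketch, the relevant settings in solrconfig.xml might look like this (the exact element placement varies between Solr versions; the values are starting points, not universal recommendations):

```xml
<!-- solrconfig.xml: per-core index tuning -->
<mergeFactor>25</mergeFactor>         <!-- 25 for heavy indexing; drop to 2 for search -->
<ramBufferSizeMB>64</ramBufferSizeMB> <!-- up from the 32 MB default -->
```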
Analyze Indexing process
During the indexing process, plug in a monitoring tool (e.g. YourKit) to check repository health. During indexing, the repository layer sometimes executes heavy, I/O-, CPU- and memory-intensive operations, such as transforming content to text in order to send it to Solr for indexing. This can become a bottleneck when, for example, the transformations are not working properly or the GC cycles are taking a long time.
GC Tuning
Solr operations are memory-intensive, so tuning the garbage collector is an important step towards good performance. jClarity's Censum tool is really good for analyzing GC logs, but there are others.
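As a starting point, GC logging can be enabled with JVM flags such as the following (Java 7/8-era flags; the log path is illustrative only):

```shell
# Append to the Solr JVM options, e.g. in Tomcat's setenv.sh
JAVA_OPTS="$JAVA_OPTS -verbose:gc -XX:+PrintGCDetails -XX:+PrintGCTimeStamps \
  -XX:+PrintGCApplicationStoppedTime -Xloggc:/var/log/alfresco/solr_gc.log"
```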
Tracking frequency
Consider whether you really need tracking to happen every 15 seconds (the default). This is configured in the Solr configuration files via the cron frequency property:
alfresco.cron=0/15 * * * * ? *
This property can heavily affect performance, for example during bulk injection of documents or during a Lucene-to-Solr migration. You can change this to 30 seconds or more when you are re-indexing.
This allows more time for the indexing threads to perform their work before more arrives on their queue.
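For example, to poll every 30 seconds during a re-index (same Quartz cron syntax as the default above):

```properties
# solrcore.properties: track the repository every 30 seconds instead of 15
alfresco.cron=0/30 * * * * ? *
```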
Increase your index batch counts to get more results from your indexing webscript on the repository side. In each core's solrcore.properties, raise the batch count to 2000 or more:
alfresco.batch.count=2000
For index updates, Solr relies on fast bulk reads and writes. One way to satisfy these requirements is to ensure that a large disk cache is available. Use local indexes and the fastest disks possible.
In a nutshell, you want to have enough memory available in the OS disk cache so that the important parts of your index, or ideally your entire index, will fit into the cache. Let’s say that you have a Solr index size of 8GB. If your OS, Solr’s Java heap, and all other running programs require 4GB of memory, then an ideal memory size for that server is at least 12GB. You might be able to make it work with 8GB total memory (leaving 4GB for disk cache), but that also might NOT be enough.
Troubleshooting common Indexing problems
Database – If it’s a database performance issue, adding more connections to the connection pool can normally increase performance.
I/O – If it’s an I/O problem (these typically occur in virtualized environments), use hdparm to check read/write disk speed if you are running on a Linux-based system; there are also some variations for Windows. For example: sudo hdparm -Tt /dev/sda
The rule for troubleshooting involves testing and measuring initial performance, applying some tuning and parameter changes, then retesting and measuring again until we reach the necessary performance. I strongly advise plugging a profiling tool such as YourKit into both the repository and Solr servers to help with the troubleshooting.
Make sure you are using only one transformation subsystem. Check alfresco-global.properties and see whether you are using either OOoDirect or JodConverter; never enable both subsystems.
Typical issues with Searching
It can happen that you are searching and indexing at the same time; this causes concurrent access to the indexes, which is known to cause performance issues. There are some workarounds for this situation.
To start, you should plug in a profiler and search for commit issues (I/O locks); this will allow you to check whether you are facing this problem.
Solr makes some statistics available by default. Analyzing those statistics can help you tune your Solr caches for your use case.
The way to read these results (analyzing the caches) is the following:
If you have many evictions, you should look into increasing that cache so all elements can fit (but don't overdo it; adjust it and see what fits your setup).
It is likewise a good idea to decrease the size of caches that have a lot of unused slots.
The goal should be to get the hit rate as close to 1.00 as possible (1.00 being a 100% hit ratio).
If your project relies on the Share client offered by Alfresco, you should know that tuning your Solr indexing and search performance will positively affect the overall Share performance.