Columbro - Alfresco 4 Scalability and Benchmarking

Alfresco Scalability Benchmarking

Before telling how cool Alfresco is, you better prove it!

Few items we’ll talk about…

• Scalability and benchmarking in the ECM context

• Alfresco 4 is rocket scalable. And we got proofs

• If your software scales, your BM framework must scale more!

Scalability and benchmarking

What do you know about it?

One size fits all…

How scalable is your ECM?

“ECM systems can be scalable or they can fail to scale well. They can have modular architectures that allow you to simply add more

elements as required, rather than multiply the entire system as things expand. They can be scalable in that they have built in high availability, automatic

failover support, run on enterprise grade application servers and databases. They can be scalable because they have been tested and proven to handle very

high volumes (hundreds of millions of documents) in the repository and/or tested and proven to handle very high throughput rates (tens of thousands per

hour or minute). There are many ways in which an ECM system can scale or not. But the biggest element determining whether the system can scale

is your usage of it”

http://www.realstorygroup.com/Blog/1403-Scalable-ECM

Alfresco ECM Solutions

Factors at Alfresco Scalability

Just so we talk the same language:

What you need to know about Alfresco 4?

Key scalability changes:

The game changer(s) – Node Creation

Alfresco 3.x

Game changer(s) – Node Creation

Alfresco 4.x

NOTE: Solr will index with a default period of 15s

Game changer(s) – Node Search

Alfresco 3.x

The game changer(s) – Node Search

Alfresco 4.x

NOTE: Solr needs to track ACLs!

Alfresco 4 Scalability Benchmarks

The journey to the ultimate knowledge of Alfresco

Why benchmarking?

Verba volant, scripta manent

• Alfresco Product• Validation of Alfresco 4 scalability• Product bottlenecks

• Alfresco Field• Sizing• Achieve even more stellar use-cases

• Customers & partners• Provide reference milestones• Allow contextualized benchmarks

Benchmark projects overview

BM-0001 – Alfresco 4 Scalability – 4.0.0BM-0002 - 3.4 vs. 4.0 (Share Scenario - 360 u – 1 node)– 3.4.8 4.0.0BM-0003 - SOLR vs. Lucene (Share Scenario) – 4.0.0BM-0004 - Uncover repository limit using simple Site-based data – 4.0.2BM-0005 - Measure Alfresco Cloud signup rates up to 125K users - CloudBM-0006 - Measure Activiti workflow service performance – 4.0.2BM-0007 - Measure Alfresco workflow API performance – 4.0.2BM-0008 - Simulate 125K users on Alfresco Cloud – Cloud

BM-0009 - Define optimal tuning and extrapolate sizing information for large scale Share Enterprise deployment - 4.1.1.x

BM-0001 and BM-0009

• Common factors• Benchmark Lab• Enterprise Collaboration Scenario• Technologies (Alfresco Enterprise + Jmeter)

• Differences• Load testing scripts• Database tier• Repository content

• Both provided useful insight!

The benchmark lab (HW)

BM-0001 – Alfresco 4 Scalability benchmark

• You can never forget your first time!

• Objective• Statistic analysis of Alfresco Scalability • Best effort pre-tuning• Scenario• Search intensive Collaboration scenario• 10s think time • Implemented with Jmeter

• Async requests• Memory intensive

BM-0001 – Scenario

http://svn.alfresco.com/repos/alfresco-open-mirror/benchmark/scripts/SHARE/share-0001/V4.0.0/

BM-0001 – Scalability data points

BM-0001 – Architecture

BM-0001 - Software

BM-0001 – Scalability results

In other words…

BM-0001 take-aways

• 1100 concurrent users on 10M docs! • With high search %, load is mostly on Solr• Share is lightweight, repo not loaded• Solr can be memory intensive• Make sure you give enough memory!• Scale out when needed!

• Dedicated Alfresco for Solr tracking beneficial

The “Alfresco 4 Scalability blueprint”

• Broad intro to Alfresco 4 Scalability • Scalability analysis of BM-0001 results• Architectural options• Tuning and configuration details• Load analysis• Sizing and performance reference• Available for Enterprise Customers &

Partners at http://support.alfresco.com

BM-0009 – Real life collaboration

• Errare humanum est, perseverare diabolicum!• Objective• Create a real repository and scenario

• 2*Alfresco + 2*Solr• Find optimal tuning / sizing for large concurrency

• Scenario• Much less search intensive than BM-0001• 15s think time • Implemented with Jmeter

• More Async requests• Less Memory intensive

BM-0009 – Scenario

https://svn.alfresco.com/repos/alfresco-enterprise/benchmark/scripts/SHARE/share-0002/4.0.2/

BM-0009 – Repository details

• Share Sites: 10 000 • Avg members per site: 100 + 3 random

groups• #files per site: 10 2MB docs + 1k docs of

1kB • 3 level deep folder structure, each folder of the

hierarchy contains 5 documents and 5 folders

• Users in repo: 50 000• Groups: 30 000 hierarchical (depth=7)• Storage used: 1TB • Number of nodes: >10M• Number of assocs: >10M child assocs

BM-0009 – Architecture

BM-0009 - Software

BM-0009 – Scalability results

Once again…

BM-0009 take-aways

• Did I already say Alfresco ROCKS? • Even on a high end realistic scenario, avg

time 1.2s with 500 concurrent users (with 2*Alfresco and 2*Solr)

• Dedicated Alfresco tracking beneficial mostly for operational purposes (similar performance)

• Adding Solr nodes allows further degrees of scalability

Lessons learned (tuning tips)

As we are not in marketing…numbers are not enough!

Make sure house is clean!

•Allow enough Open files

•Allow enough connections to your DB

•Allow enough Tomcat threads (but don’t exaggerate!)

•Disable your anti-virus!

•Check your balancer, and first test without it!

Areas you might want to tune

•Alfresco Repository•db.pool.* (and DB connections accordingly)•(only if your know what you’re doing) L2-caches and transactional caches •solr.maxConnections

•Solr•Solr caches in solrcore.properties•#tracking threads (default=3). I/O bound so don’t exaggerate!•alfresco.maxConnections•mergeFactor in solrconfig.xml

Dedicated vs. shared tracking

Benchmark gotchas

• Jmeter is very memory intensive• 30G to scale to 1100 users!• Not fit for cloud-scale• Jmeter does not parse Javascript• Complex hacked implementation to mimic Share• Full emulation is impossible!

• We need a scalable, distributed, flexible framework to run cloud-scale benchmarks!

Alfresco Benchmark framework

I.e. cool people doing even cooler stuff to prove they are cool!

Derek Hulley, Repository Lead Engineer

Gabriele ColumbroPrincipal Architect Consulting Services

@mindthegabz

Derek HulleyFounding Engineer and Repository Team Lead

Columbro - Alfresco 4 Scalability and Benchmarking

Documents

Transcript of Columbro - Alfresco 4 Scalability and Benchmarking

Alfresco 5.0 Alfresco One QuickStart · Alfresco One QuickStart 4 Alfresco 5.0 Personalizing Alfresco Having installed and signed into Alfresco, the first thing you can do is to personalize

Alfresco Day Roma 2015: Alfresco One

Zaizi Alfresco Solutions - Alfresco Implementation Best Practice

Alfresco - Présentation Alfresco 3 Enterprise

Alfresco Content Services 5.2 Reference Architecture · Alfresco Content Services overview 4 Alfresco Content Services 5.2 Reference Architecture Alfresco Content Services overview

Alfresco WCM For High Scalability

Ricoh and Alfresco. Alfresco Summit 2013

Alfresco Summit 2013: eSignatures and Alfresco - Manuel Reyes

[Informatique][Alfresco] Alfresco Simple Records Management August 2006

Alfresco One (Enterprise) vs Alfresco Community 2014

Alfresco Workflow - Student Guide - v1.1 - Alfresco University

Alfresco One Product Overviewpages.alfresco.com/rs/alfresco/images/02 Alfresco...• Product Overview • Alfresco One Enterprise Demo • Alfresco Enterprise v.s. Community ... Application

Alfresco weddings - Metro Hotel · Alfresco weddings Metro Mirage Hotel Newport. w elcome ... Alfresco wedding reception Reception Inclusions: • 4 hour reception in the alfresco

Alfresco JLAN Server Programmers Guide - Alfresco SVN Repository

Alfresco Presentation Templatepages.alfresco.com/rs/alfresco/images/Paris Alfresco Day product update.pdf · + Alfresco One 4.2 + Records Management 2.x •Les nouveautés Alfresco

Alfresco Day Roma 2015: Alfresco Activiti

201511 - Alfresco Day - Platform Update and Roadmap - Gabriele Columbro - Boston

Zia Consulting & Alfresco ALFRESCO SOLUTIONS OFFERED ...€¦ · Ephesoft to Alfresco • UI Services: Customized Alfresco Share, Liferay Portal, OpenWorkdesk • Enterprise Integration

Alfresco Day Milano 2016 - Alfresco in Italia

Unlocking the Secrets of Alfresco Authentication - Alfresco Devcon