How to tune MMBase for production

Author: Nico Klasens
Date: 2005-12-09

This software is OSI Certified Open Source Software. OSI Certified is a certification mark of the Open Source Initiative.

The license (Mozilla version 1.0) can be read at the MMBase site. See http://www.mmbase.org/license

Abstract

How to for fine tuning MMBase

Table of Contents

1. Introduction

2. Scope

3. Identify bottlenecks

4. Browser segment

5. Database segment

6. Application server segment

6.1. Operating system and applications
6.2. Java Virtual Machine
6.3. Memory
6.4. Application server

7. MMBase optimization

7.1. Caches

8. Performance tuning sequence

1. Introduction

Like every application, MMBase is not optimized for every production environment. Out of the box it is suitable for small sites with low traffic. This document will describe how to change the configuration of MMBase and what other things are involved to boost performance of a site.

2. Scope

The experience of an user that a site is slow is usually not only the fault of a web application. The request from a browser passes many systems before the response is returned. Every passed system will add a delay to the response. The web application is just a part of the cycle. This document describes how to tune MMBase, but it is only worth the effort when MMBase is the problem.

Several systems in a production environment which can affect performance are:

Internet connection to Internet Service Provider
FIrewalls which block and filter traffic
Network with physical data limits and routing
Webserver or proxy (e.g. Apache http server or IIS)
Operating system (I/O and network settings)
Other running applications on the system
Database server
Database settings
Java Virtual Machine settings

3. Identify bottlenecks

The first step, in identifying which systems perform poorly, is to divide the production environment in segments. Every segment represent a part of the response time.

Browser - webserver
Webserver and application server
Database server

Every system on a segment boundary should log the response time of the request to find the segment with the highest delay

Browser

Logging of the response time on the client can be done with several load test tools. A freely available one is Jmeter http://jakarta.apache.org/jmeter

Webserver or application server

All frequently used servers can log in the access.log response time information.

The apache http server uses the mod_log_config module which support custom log formats. See http://httpd.apache.org/docs/2.0/mod/mod_log_config.html for more information on LogFormat and CUstomLog

Apache Tomcat uses an Access Log Valve create a similar logfile as the apache http server does. For more information see http://tomcat.apache.org/tomcat-5.5-doc/config/valve.html

Database server

Most databases can monitor the time a query requires to execute. This is different for every database and difficult to map to a request of one user. MMBase can also log the query response time by enabling the debug level on the Sqlhandlers. The log configuration is default in the log4j.xml in the mmbase.jar or /WEB-INF/config/log/ directory.

<logger name="org.mmbase.storage.implementation.database.DatabaseStorageManager" 
        additivity="false"> <level class="&mmlevel;" value ="debug" />
  <appender-ref ref="sqllog" />
</logger>

4. Browser segment

If the response time is not acceptable is this segment then the problem is somewhere in:

Internet connection to Internet Service Provider
FIrewalls which block and filter traffic
Network with physical data limits and routing

How to fine tune this segment is out of the scope of this document.

5. Database segment

The most common error in the database segment is that the indexes on the database are not present. indexes on the following columns should always be present

_object - number
_insrel - snumber
_insrel - dnumber
_insrel - rnumber
_typerel - snumber
_typerel - dnumber
_typerel - rnumber
_versions - name
_oalias - name
_syncnodes - exportsource
_syncnodes - exportnumber

Other tables could also benefit from indexes. Mysql requires indexes on all snumber, dnumber and rnumber columns in tables containing relations. This is also the case for Postgresql. Postgresql extends the tables, but does not inherit the indexes of the parent table.

Every web application build on top of MMBase has it own required indexes to perform well. The sql statements log as shown above will help identify which columns need an index. All columns mentioned in WHERE parts of the query are candidates for an index.

See the documentation of your database how to do maintenance. A database can easily become a bottleneck over time.

6. Application server segment

If the response time in this segment is very bad then the problem can be in several parts:

Operating system (I/O and network settings)
Other running applications on the system
Java Virtual Machine settings
JSP code
Custom application code
MMBase code

The first thing to check is which resources on the server are overloaded and which processes are causing it: Monitor the cpu load, virtual memory usage and I/O operations statistics. When the cpu load or memory usage is not caused by the application server process then the problem is somewhere in other applications or the OS.

-server -XX:NewRatio=2 -XX:SurvivorRatio=6

The jvm has to run in server mode and not client mode. In client mode the ratio between old and new is -XX:NewRatio=8. This means that the new generation in client mode will be 1/8 of the heap and the old generation 7/8. In server mode the ratio is -XX:NewRatio=2. The new generation will then be 1/3 of the heap. The -XX:NewRatio=2 above just makes it explicit. The advantages are that more new instance can be created and die before a small GC is preformed.

The -XX:SurvivorRatio is default 25. The reason to increase the Survivor spaces is to prevent that there are too many new instances for the survivor space. If it doesn't fit then the rest will go to the old generation right away. This will happen when the editwizards are installed. A user keeps a stack of wizards with a lot of instances on the server. A very big wizard can occupy 4MB of memory. All instances die when the wizard is closed. You want to keep these objects in the new generation. With a default server mode jvm with 1G the Survivor spaces are 12M. If you have multiple editors then the stacks with wizards can easily be more then 12M and instances will go to the old generation with a small GC and can only be cleared with a Full GC.

To emphasize, in an ideal run you want your survivor space to have objects of different ages. That means you have enough space there to not instantly promote live objects to the old generation. This means less pollution of the old generation.

Default setup with 1G 
Old generation 682mb (client mode: 910) 
New geneartion 341mb (client mode: 113) 
eden 315 (client mode: 104) 
Survivors 12 (client mode:4)

Setup with 1G and -server -XX:NewRatio=2 -XX:SurvivorRatio=6 
Old generation 682mb 
New geneartion 341mb 
eden 255
Survivors 42

When you want to monitor the heap usage on a production server then you could use -Xloggc:/log/gc.log. The GC statistics will be written to this file with no overhead. GCViewer can generate a nice graph of the logfile (http://www.tagtraum.com/gcviewer.html)

Another tool which is very handy, and maybe even an absolute necessity, is jvmstat. Jvmstat shows you graphically how your memory is filled between permanent, old and young generation, It also shows how eden and survivor spaces are filled.

There are many articles on the Internet with more information on memory and heap size tuning on different platforms. See for example http://java.sun.com/performance/

6.4. Application server

Every application server contains a number of OOTB (out-of-the-box) performance-related parameters that can be fine-tuned depending on your environment and applications. Tuning these parameters based on your system requirements (rather than running with default settings) can greatly improve performance and the scalability of an application.

Modify the value of the http listener threads.

Application servers can service multiple simultaneous requests and because thread creation is expensive application servers have to maintain a pool of threads that handle each request. Some application servers break this thread pool into two: one to handle the incoming requests and place those in a queue and one to take the threads from the queue and do the actual work requested by the caller. Regardless of the implementation, the size of the thread pool limits the amount of work your application server can do; the tradeoff is that there is a point at which the context-switching (giving the CPU to each of the threads in turn) becomes so costly that performance degrades.

Some other things you might consider:

Turn off excessive logging, because this could significantly slow system performance.
Disable checks for JSP page checks and servlet reloading.
Precompile JSPs

7. MMBase optimization

7.1. Caches

MMBase relies heavily on its caches. Most caches are configurable in the caches.xml, which is in the root of the MMBase configuration. There are a few which have a large effect on memory usage and database round trips. The MMBase admin has a page where cache statistics are shown. This page shows how efficient a cache is.

The default implementation of all caches is a Least Recently Used Hashtable. Another implementation can be plugged in when this is not sufficient.

Before explaining these caches it is important to understand the two different types of nodes MMbase uses. MMBase has virtual and real nodes. Real nodes contain fields which are defined in a builder. Virtual nodes are usually a result of a multi level query. Fields in virtual nodes are original from multiple builders. The field name is always prefixed with the builder name. Real nodes represent objects like news items. Virtual nodes represent multiple parts of different objects. A virtual node can contain a news item title and the authors full name. Virtual nodes do not have a nodenumber.

'Nodes' cache. This cache is used to store individual real nodes. The key of this cache is the nodenumber. All real nodes requested are saved in this one.
'NodeList' cache which stores the results of real node list requests. This cache can become very big rapidly. All real nodes requested by a list are also stored in the 'Nodes' cache. This cache usually keeps references to nodes which are already flushed by the 'Nodes' cache.
'Multilevel' cache. This cache stores all results of virtual node list requests. This cache stores the results for all complicated SearchQueries.
'AggregatedResult' cache. All results from queries with min/max/count fields are stored in this.
'RelatedNodes' cache. The results in this cache contain nodes which are related to another node.
'Related' cache. The results in this cache are relations which belong to a node. The key of the cache is a nodenumber.

How the caches are used, depends on how the application on top of MMbase is coded. An application which uses a lot of getRelatedNodes request requires a large 'RelatedNodes' cache. An application with a lot of complicated queries with multiple tables involved requires a larger 'Multilevel' cache.

8. Performance tuning sequence

We recommend that you tune the production environment in the following sequence:

Tuning Your Application and MMBase
Tuning Application Server
Tuning the Java Runtime System
Tuning the Operating System
Tuning Database Servers

When you are done with tuniong thenreverse the order and do it all again. Changes in one part will require changes in others.

This is part of the MMBase documentation.

For questions and remarks about this documentation mail to: [email protected]