Table of Contents
When we started building Nuxeo EP, we defined several goals to achieve. Because these goals have a structural impact on Nuxeo EP platform it is important to understand them: it helps understanding the logic behind the platform.
An ECM platform like Nuxeo EP can be used in a lot of different cases.
The deployment of Nuxeo EP must be adapted to all these different cases:
Standard ECM web application
This is the most standard use case. The web browser is used to navigate in content repositories.
All services are deployed on an Application server
In order to be easily hostable, the platform need to be compatible with several application servers
Complex edition or rich media manipulation
In this case having a rich client that seamlessly communicates with other desktop applications and offers a rich GUI is more comfortable than a simple web browser.
Interface is deployed on a rich client on the user desktop
Services and storage are handled on the server
Offline usage
In some cases, it is useful to be able to manipulate and contribute content without needing a network connection.
GUI and some services (like a local repository) need to be deployed on the client side
Server hosts the collaborative services (like the workflow) and the central repository
Distributed architecture
In order to be able to address the needs of large decentralized organizations, Nuxeo EP must provide a way to be deployed on several servers on several locations
Scale out on several servers
Dedicate servers to some specific services
Have one unique Web application accessing several decentralized repositories
Use Nuxeo EP components from another application
When building an business application, it can be useful to integrate services from Nuxeo EP in order to address all content oriented needs of the application.
Provide web service API to access generic ECM services (including repository)
Provide EJB3 remoting API to access generic ECM services (including repository)
Provide POJO API to generic ECM services
There are certainly a lot of other use cases, but mainly the constraints are:
Be able to choose the deployment platform: POJO vs Java EE
As first deployment targets we choose
Eclipse RCP: a rich client solution that uses a POJO (OSGi) component model
JBoss Application Server: a Java EE 5 compliant application server
Be able to choose the deployment location of each component: client side vs server side
The idea is to be able to deploy a component on the server side or on the client side without having to change its code or its packaging
Before building Nuxeo EP we worked during several years on the Zope platform with the CPS solution. CPS was deployed for a lot different use cases and we learned a lot of good practices and design patterns. Even if Nuxeo EP is a full rewrite of our ECM platform, we want to keep as much as possible of CPS good concepts.
Concept of schemas and documents
Inside CPS most of the data manipulated was represented by a document object with a structure based on schemas.
This concept is very interesting:
Schemas enforce structure constraints and data integrity but also permit some flexibility.
When defining a schema you can specify what fields are compulsory, what are their data type, but you can also define some flexible part of the schema.
Share API and UI components for Documents, Users, Records ...
Because the Document/Schema model is very flexible it can be used to manipulate different types of data: like Users, Records and standards documents.
From the developer's perspective this permit using the same API and be able to reuse some UI components
From the user's perspective it give the application some consistency: because the same features and GUI can be used for all the data he manipulates.
Actions and Views
Because CPS was very pluggable, it was possible to easily define different views for each document type and also to let additional components contribute new actions or views on existing documents.
Nuxeo EP has a similar concept of views and actions, even if technically speaking the technologies are different.
Lazy fetching and caching
Because ECM applications make a very intensive use of the repository and often need to fetch a lot of different documents to construct each page, the way the document retrieval is handled if very important to have a scalable application.
With CPS we worked a lot on caching and lazy fetching.
With Nuxeo EP we incorporated this requirement from the beginning:
Distributed caching
Lazy fetching on schemas and fields
CPS was constructed as a set of pluggable components relying on a common platform. This modularity has been maintained in the new platform. Deploying a new feature or component on the new platform is as simple as it was on the old one.
This requirement as a huge impact on the platform because the Java packaging model and the Java EE constraints are not directly compatible with it.
Adding a new component should be as simple as dropping a file or an archive in some directory without having to rebuild nor repackage the application.
This is important from the administrator point of view: be able to easily deploy new features.
This is also very important from the support point of view: be able to deploy customized components without taking the risk of forking the platform and maintain the possibility to upgrade the standards components.
The CPS framework was powerful but we know it was very complex to use. Not only because of the unusual CMF/Zope/Python programming model, but also because there was a lot of different concepts and you had to understand them all to be able to leverage the platform when building a new application on top of it.
Nuxeo EP aims at simplifying the task of the developer
Clearly separate each layer
The idea is to clearly separate presentation, processing and storage so that developers can concentrate on their task.
Offer plugin API and SPI
Nuxeo EP is constructed as a set of plugins so you can modify the behavior of the application by just contributing a new plugin. This is simpler because for common tasks we will offer a simple plugin API and the developer just has to implement the given interface without having to understand each part of the platform.
Rely on JAVA standards
We try to follow as much as possible all the Java standards when they are applicable. This will allow experienced Java developers to quickly contribute to the Nuxeo EP platform.
We know what it's like to have to build and maintain an entire framework starting from the application server. With the switch to the Java technology, we will use as much as possible existing open source components and focus on integrating them seamlessly in the ECM platform. Nuxeo EP is a complete integrated solution for building an ECM application, but Nuxeo won't write all infrastructure components. This approach will also make the platform more standards compliant.
Thus developers can optimize their Java/JEE and open source experience to use Nuxeo EP.
Because ECM applications often need to be deeply integrated into the existing SI, Nuxeo EP will be easily integrable
API for each reusable service or component
Depending on the components, this API could be POJO, EJB3, or WebService, and in most cases it will be available in the three formats.
Pluggable hooks into Nuxeo EP
This mainly means synchronous or asynchronous events listener that are a great place to handle communication and synchronization between applications.
The Nuxeo EP platform was rewritten from the ground with the switch to Java. But we don't plan to do this kind of work every couple of years, it wont be efficient neither for us, nor for the users of the platform. For that reason, we choose innovative Java technologies like OSGi, EJB3, JSF, Seam ....
All the design goals explained just before have a huge impact on the Nuxeo EP architecture. Before going into more details, here are the main concepts of Nuxeo EP architecture.
Nuxeo EP is built of several layers, following at least the 3 tiers standard architecture
Presentation layer
Handles GUI interactions (in HTML, SWT ...)
Service layer
Service stack that offers all generic ECM services like workflow, relations, annotations, record management...
Storage layer
Handles all storage-oriented services like document storage, versioning, life cycle ....
Depending on the components, their complexity and the needed pluggability, there can be more that 3 layers.
This layering of all the components brings Nuxeo EP the following advantages
Choose the deployment target for each part of a component
By separating clearly the different parts of a feature, you can choose what part to deploy on the client and what part to deploy on a server.
Clear API separation
Each layer will provide its own API stack
Components are easier to reuse
Because the service and storage layers are not bound to a GUI, they are more generic and then more reusable
Thanks to this separation in component families you can easily extract from Nuxeo EP the components you need for your application.
If you need to include Document storage facilities into your application you can just use Nuxeo EP Core: It will offer you all the needed feature to store, version and retrieve documents (or any structured but flexible dataset). If you also need process management and workflow you can also use Nuxeo EP Workflow service. And finally, if you want to have a Web application to browse and manage your data, you can reuse the Nuxeo EP Web layer.
The targeted platform do not provide the same mechanism to handle all the deployment tasks:
Packaging (Java EE vs OSGi)
Dependency management
Extension management
Because of these differences, Nuxeo EP provides a unified deployment service that hides the specificity of the target platform. This is also a way to add a pluggable component deployment system to some platform that don't handle this (like Java EE).
This is one of the motivation for the Nuxeo Runtime that will be quickly introduce later in this document.
In Nuxeo EP, an ECM application is seen as an assembly of components.
This assembly will include:
Existing generic Nuxeo EP Components
Extensions or configurations contributing to generic Nuxeo EP components
Specific components and configuration
Inside Nuxeo EP each feature is implemented by a one or several reusable components and services. A feature may be implemented completely at storage level, or may require a dedicated service and a dedicated GUI.
Nuxeo EP Web application is a default distribution of a set of ECM components. This can be used "as is" or can be the base for making a business ECM application.
If you need to remove a feature
Just remove the component or deploy a configuration for disabling it.
If you need to change the default behavior of one component
You can deploy a new configuration for the component .
Declare a new Schema or define a document type
Configure the versioning policy
Deploy new workflow
...
This configuration may use an extension point to contribute the new behavior.
Contribute a new security policy
Contribute a new event handler
Deploy a new View on a document
...
If you need to add a completely new feature you can make your own component.
First check that there is no generic Nuxeo EP component available that could help you in your task (all components are not deployed in the default webapp).
Here is a quick list of the Java technology we use inside Nuxeo EP platform:
Java 5
Java EE 5: JSF and EJB3
OSGi component model
A lot a innovative open source projects
JBoss Seam, Trinidad and Ajax4JSF on the web layer
jBPM for the default workflow engine implementation
Lucene for the default search engine implementation
Jackrabbit JSR-170 repository for the default storage back end implementation
JenaRDF for the relation framework
...
Building the Nuxeo Runtime was one of the first task we started. This is one of the main infrastructure component or Nuxeo EP architecture.
This paragraph will give you a quick overview of the Nuxeo Runtime, a more detailed technical presentation can be found in an other chapter of this book.
Because most of Nuxeo EP components are shared by Nuxeo RCP (OSGI/RCP) and Nuxeo EP (Java EE), an abstraction layer is required so the components can use transparently the components services independently from the underlying infrastructure.
Nuxeo Runtime provides an abstraction layer on top of the target host platform. Depending on the target host platform, this Runtime layer may be very thin.
Nuxeo Runtime already supports Equinox (Eclipse RCP OSGi layer) and JBoss 4.x (JMX). The port of Nuxeo Runtime to other Java EE application server is in progress, we already have a part of Nuxeo EP components that can be deployed on top of SUN Glassfish application server. Technically speaking, the port of Nuxeo Runtime could be done on any JEE5 compliant platform and will be almost straightforward for any platform that supports natively the OSGi component model.
Java EE is a great standard, but it was not designed for a component based framework: it is not modular at all.
Java EE deployment model limitations
Most Java EE deployment descriptors are monolithic
For example, the web.xml descriptor is a unique XML file. If you want to deploy an additional component that needs to declare a new Java module you are stuck. You have to make one version of the web.xml for your custom configuration. For Nuxeo EP platform, this constraint is not possible:
Components don't know each other
Because there are a lot of optional components, we can't have a fixed configuration that fits all.
We can make a version of the web.xml for each possible distribution
There are too many optional components to build one static web.xml for each possible combination.
This problem with the web.xml is of course also true for a lot of standard descriptors (application.xml, faces-config.xml, persistence.xml, ejb-jar.xml ....)
One archive for one web application
We have here the exact same problem than with the web.xml. additional components can contribute new web pages, new web components ... We can have a monolithic web archive.
No dependency declaration
Inside Java EE there is no standard way to declare the dependency between components.
Because Nuxeo EP is extensible and has a plugin model, we need that feature. A contribution is dependent on the component it contribute to:
Contribution is only activated if/when the target component is activated
The contribution must be deployed after the target component as it may override some configuration
Java EE component model limitations
Unable to deploy a new component without rebuilding the whole package
If you take a .ear archive and want to add a new component, you have to rebuild a new ear.
No support for versionned components
Nuxeo Runtime provides an extensible component model that supports all these feature. It also handles the deployment of these components on the target host platform.
Nuxeo Runtime provides the component model for the platform.
This component model is heavily based on OSGi and provides the following features:
Platform agnostic component model
Can be deployed on POJO and Java EE platforms
Supports dependencies management
Components explicitly declare their requirements and are deployed and activated by respecting the inferred dependency chain.
Includes a plugin model
To let you easily configure and contribute to deployed components
A POJO test environment
Nuxeo Runtime components can be unit tested using JUnit without the need of a specific container.
OSGi ( Open Services Gateway initiative ) is a great standard for components based Java architecture.
OSGi provides out of the box the following features:
Dependencies declaration and management
A component gets activated only when the needed requirements are fulfilled
Modular deployment system
Manage bundles
Manage fragments (sub parts of a master bundle)
an OSGi bundle can define one or several services
A system to identify and lookup for a component
For Nuxeo EP, OSGi standard provides a lot of the needed features. This is the reason why Nuxeo Runtime is based on OSGi, in fact Nuxeo Runtime component model is a subset of OSGi specification.
To ensure platform transparency, Nuxeo Runtime provides adapters for each target platform to help it support OSGi components.
This adopter layer is very thin on Equinox (Eclipse RCP) since the underlying platform is already OSGi compliant.
This adapter may be more complex for platform that are not aware of OSGi (JBoss 4.x or Glassfish)
In this case, the runtime adapter will handle all OSGi logic and deploy the components as native platform components. For examples, on JBoss 4.x, Runtime components are deployed as JMX MBeans.
OSGi does not define a plugin model, but the Eclipse implementation (Equinox) does provide an extension point system.
Because we used a lot the Eclipse Extension Point system and we liked it, Nuxeo Runtime also includes an Extension Point system.
Basically every Nuxeo Component can:
declare its dependencies
The component will also be activated after all needed components
declare exposed extension points
Each components can define extension points that other components can use to contribute configuration or code.
declare contribution to other components
These declarations are handled by the OSGi deployment descriptor (MANIFEST.MF)
Manifest-Version: 1.0 Bundle-ManifestVersion: 2 Bundle-Name: Nuxeo ECM Core Bundle-SymbolicName: org.nuxeo.ecm.core;singleton:=true Bundle-Version: 1.0.0 Bundle-Vendor: Nuxeo Bundle-Localization: bundle Bundle-Activator: org.nuxeo.ecm.core.NXCoreActivator Bundle-ClassPath: ., lib/xsom.jar, lib/connector-api.jar, lib/java-cup-v11a.jar Export-Package: org.nuxeo.ecm.core, org.nuxeo.ecm.core.api, org.nuxeo.ecm.core.api.local, org.nuxeo.ecm.core.jca, org.nuxeo.ecm.core.lifecycle, org.nuxeo.ecm.core.model Require-Bundle: org.nuxeo.ecm.core.api, org.nuxeo.runtime Nuxeo-Component: OSGI-INF/CoreService.xml, OSGI-INF/TypeService.xml, OSGI-INF/RepositoryService.xml, OSGI-INF/CoreExtensions.xml, OSGI-INF/SecurityService.xml
For example, this descriptor defines that
this bundle depends on
org.nuxeo.ecm.core.api
this bundles contains Nuxeo components like
CoreServices.xml
The XML descriptor will be used to define new extension points or contribute to existing one.
An extension point is a way to declare that your component can be customized from the outside:
Contribute configuration
Activate or deactivate a component. Define resources for a given service.
Contribute code and behavior
Extension points also give you the possibility to register plugins
Extension points and contribution to extension points are defined
using a XML descriptor that has to be referenced in the
MANIFEST.MF
.
Here is a simple descriptor example:
<component name="org.nuxeo.ecm.core.listener.CoreEventListenerService"> <require>org.nuxeo.ecm.core.repository.RepositoryService</require> <implementation class="org.nuxeo.ecm.core.listener.impl.CoreEventListenerServiceImpl"/> <extension-point name="listener"> <object class="org.nuxeo.ecm.core.listener.extensions.CoreEventListenerDescriptor"/> </extension-point> <extension target="org.nuxeo.ecm.core.listener.CoreEventListenerService" point="listener"> <listener name="nxruntimelistener" class="org.nuxeo.ecm.core.listener.impl.NXRuntimeEventListener" /> </extension> <extension target="org.nuxeo.ecm.core.listener.CoreEventListenerService" point="listener"> <listener name="lifecyclelistener" class="org.nuxeo.ecm.core.lifecycle.impl.LifeCycleListener" /> </extension> </component>
This fragment depends on the Repository Service
This fragment won't be loaded until a Nuxeo Repository is setup
This fragment declares an extension point named listener
This extension point let register plugins that will be invoked when a core event occurs.
This extension point use
CoreEventListenerDescriptor
for
descriptor.
This fragment registers two contributions to the listener extension point
The contributions have to follow the descriptor defined by the target Extension Point. The descriptor defines what tags can be used when contributing.
The descriptor is simply defined by a Java class that uses annotations to defines how the XML descriptor will be used to create an Object Descriptor instance to pass to the extension point registration.
Nuxeo Runtime also provides deployment services to manage how components are deployed and contribute to each other
Dependencies management
The dependencies are declared in the
MANIFEST.MF
and can also be defined in XML
descriptors that hold contributions.
The Nuxeo Runtime orders the component deployment in order to be sure the dependencies are respected. Components that have unresolved dependencies are simply not deployed
Extension point contributions
XML descriptors are referenced in the
MANIFEST.MF
. These descriptors make
contributions to existing extension points or declare new extension
points.
Each component has its own deployment-fragment
The deployment fragment defines
Contribution to configuration files
For example contribute a navigation rule to
faces-config.xml
or a module declaration
to web.xml
.
Nuxeo Runtime let you declare template files (like
web.xml
,
persistence.xml
) and let other component
contribute to these files.
Installation instructions
Some resources contributions (like i18n files or web pages) need more complex installation instructions because they need archives and files manipulations. Nuxeo Runtime provide basic commands to define how your components should be deployed
Here is a simple example of a deployment-fragment.
<fragment> <extension target="application#MODULE"> <module> <ejb>${bundle.fileName}</ejb> </module> </extension> <extension target="faces-config#VALIDATOR"> <validator> <validator-id>dueDateValidator</validator-id> <validator-class>org.nuxeo.ecm.platform.workflow.web.ui.jsf.DueDateValidator</validator-class> </validator> </extension> <install> <!-- unzip the war template --> <unzip from="${bundle.fileName}" to="/"> <include>nuxeo.war/**</include> </unzip> <!-- create a temp dir --> <!-- be sure no directory with that name exists --> <delete path="nxworkflow-client.tmp" /> <mkdir path="nxworkflow-client.tmp" /> <unzip from="${bundle.fileName}" to="nxworkflow-client.tmp"> <include>OSGI-INF/l10n/**</include> </unzip> <append from="nxworkflow-client.tmp/OSGI-INF/l10n/messages.properties" to="nuxeo.war/WEB-INF/classes/messages.properties" addNewLine="true" /> <append from="nxworkflow-client.tmp/OSGI-INF/l10n/messages_en.properties" to="nuxeo.war/WEB-INF/classes/messages_en.properties" addNewLine="true" /> <append from="nxworkflow-client.tmp/OSGI-INF/l10n/messages_fr.properties" to="nuxeo.war/WEB-INF/classes/messages_fr.properties" addNewLine="true" /> <append from="nxworkflow-client.tmp/OSGI-INF/l10n/messages_de.properties" to="nuxeo.war/WEB-INF/classes/messages_de.properties" addNewLine="true" /> <append from="nxworkflow-client.tmp/OSGI-INF/l10n/messages_it.properties" to="nuxeo.war/WEB-INF/classes/messages_it.properties" addNewLine="true" /> <delete path="nxworkflow-client.tmp" /> </install> </fragment>
Inside Nuxeo 5, extension points are used each time a behavior or a component needs to be configurable or pluggable.
Here are some examples of extension points used inside the Nuxeo 5 platform.
Schemas and document types
Inside Nuxeo 5 a document structure is defined by a set of XSD schemas. Schemas and Document Types are defined using an extension point.
Storage repository
Nuxeo core stores documents according to their type but independently of the low level storage back-end. The default back-end is Jackrabbit JCR implementation. Nuxeo Core exposes an extension point to define the storage back-end. We are working on an other repository implementation that will be pure SQL based.
Security Management
Nuxeo include a security manager that checks access rights on each single operation. The extension point system allow to have different implementation of the security manager depending on the project requirements:
Enforce data integrity: store security descriptors directly inside the document
Performance: store security descriptors in an externalized index
Corporate security policy: implement a specific security manager that will enforce business rules
Event handlers
Nuxeo platform lets you define custom Event handler for a very large variety of events related to content or processes. The event handler extension mechanism gives a powerful way to add new behavior to an existing application
You can modify the behavior of the application without changing its code
The development model is easy because you have a very simple Interface to implement and you can use Nuxeo Core API to manipulate the data
Event handlers can be synchronous or asynchronous
Nuxeo 5 itself uses the Event Handler system for a lot of generic and configurable service
Automatic versioning: create a new version when a document is modified according to business rules
Meta-data automatic update: update contributor lists, last modification date ...
Meta-data extraction / synchronization: extract Meta-Data from MS-Office file, Picture ....
Nuxeo 5 development model is heavily based on the usage of extension points. When a project requires specific features we try as much of possible to include it as an extension of the existing framework rather than writing separated specific component. This means make existing services more generic and more configurable and implement the project specific needs as a configuration or a plugin of a generic component using Extension Points.
Nuxeo EP components are separated in 3 main layers: Core / Service / UI
From the logical point of view each layer is a group of components that provide the same nature of service:
Storage oriented services: Nuxeo Core
Nuxeo core provides all the storage services for managing documents
Repository service
Versioning service
Security service
Lifecycle service
Records storage (directories)
...
Content and process oriented services: Nuxeo Platform
Nuxeo provides a stack of generic services that handle documents and provide content and process management features. Depending on the project requirement only a part of the existing services can be deployed.
Typical Nuxeo EP platform services are:
Workflow management service
Relation management service
Archive management service
Notification service
...
Presentation service: UI Layer
The UI layer is responsible for providing presentation services like
Displaying a view of a document
Displaying available actions according to context
Managing page flow on a process driven operation
These services can be very generic (like the action manager) but can also be directly tied to a type of client (like the View generation can be bound to JSF/facelets for the web implementation)
The layer organization can also be seen as a deployment strategy
Thanks to the Nuxeo Runtime remoting features it is very easy to split the components on several JVM. But splitting some services can have a very bad effect on the global system performance.
Because of that, all the storage oriented services are inside the core. All components that have extensive usage of the repository and need multiple synchronous interaction with it are located in the core. This is especially true for all synchronous event handlers.
The services layer can itself be split in multiple deployment unit on multiple JVMs.
On the UI side all the services are logically deployed inside the same JVM. At least each JVM must have the minimum set of services to handle user interaction for the given application.
The components are also grouped by layers according to their dependencies.
Core Modules can depend on Core Internal API.
Generic ECM services can depend on Core external API and can depend on external optional library (like jBPM, Jena, OpenOffice.org ...).
UI services can rely on a client side API (like Servlet API) and share a common state associated to the user session.
Layers are also organized according to deployment target.
The Core layer is a POJO Layer with an optional EJB facade. The core can be embed in a client application.
The services are mostly implemented as POJO services so that they can be used as an embedded library. But some of them can depend on typical Application Server infrastructure (like JMS or EJB).
Inside the UI Layer most service are dedicated to a target platform: web (JSF/Seam), Eclipse RCP or other.
Because the layer organization has several constraints, the implementation of a unique feature is spread across several layers.
Typically a lot of transversal services is split in several sub-components in each layer in order to comply to deployment constraint and also to provide better reusability. For example, the Audit service is made of 3 main parts:
Core Event <=> JMS bridge (Core Layer)
Forwards core events to JMS Bridge according to configuration.
JMS Listener and JPA Logger (Service Layer)
Message driven bean that writes logs in DB via JPA.
Audit View (UI Layer)
Generates HTML fragment that displays all events that occurred on a document.
The layer organization can also be seen in the API.
Most of the components forming the core are exposed via the
DocumentManager
/
CoreSession
interface. The interfaces
and dependencies needed to access the Core services are packaged in a
API package: even if there are several Core component, you have only
one dependency and API package.
The idea is that for accessing the core, you will only need to use
the DocumentManager
to manipulate
DocumentModel
s (the document object
artifact). Some core services can be directly accessed via the
DocumentModel
(like the life cycle or
security data).
Each service exposes its own API and then has its own API package. Service related data (like process data, relation data) are not directly hosted by the DocumentModel object but can be associated to it via adapters and facets.
The web layer can be very specific to the target application. Nuxeo EP provides a default web application and a set of base classes, utility classes and pluggable services to handle web navigation inside the content repository.
Most features are made of several Java project and generate several Maven 2 artifact.
Nuxeo packaging and deployment system (Nuxeo Runtime, Platform API, Maven ...) leverage this separation to help you distributing the needed deployment unit according to your target physical platform.
Nuxeo core provides all the storage services for managing documents.
Schema service
Lets you register XSD schemas and document types based on schemas.
Repository service
Lets you define one or more repository for storing your documents.
Versioning service
Lets you configure how to store versions.
Security service
Manages data level security checks
Lifecycle service
Manages life cycle state of each document
The repository service lets you define new document repositories. Defining separated repositories for your documents is pretty much like defining separated Databases for your records.
Because Nuxeo Core defines a SPI on repository, you can configure how you want the repository to be implemented. For now, default implementation uses JSR-170 (Java Content Repository) reference implementation: Apache Jack Rabbit. In the future, we may provide other implementation of the Repository SPI (like native SQL DB or Object Oriented DB).
Even if for now there is only one Repository implementation available, using JCR implementation, you can configure how your data will be persisted: filesystem, xml or SQL Database. Please see "How to"s about repository configuration.
When defining a new repository, you can configure:
The name.
The configuration file
For JCR, it lets you define persistence manager.
The security manager
Defines how security descriptors are stored in the repository (for now: org.nuxeo.ecm.core.repository.jcr.JCRSecurityManager)
The repository factory
Defines how the repository is created (for now: org.nuxeo.ecm.core.repository.jcr.JCRRepositoryFactory)
The repository enforces data integrity and consistency based on Document types definition.
Each document type is defined by:
A name.
An optional super document type (inheritance)
A list of XSD schemas
Defines storage structure
A list of facets
Simple markers used to define document behavior.
Here is a simple DocumentType declaration:
<extension target="org.nuxeo.ecm.core.schema.TypeService" point="doctype"> <documentation>The core document types</documentation> <doctype name="Folder" extends="Document"> <schema name="common" /> <schema name="dublincore" /> <facet name="Folderish" /> </doctype> </extension>
For further explanation on Schemas and Document types, please see the dedicated section in this document.
Inside Nuxeo repository each document may be associated with a life-cycle. The life-cycle defines the states a document may have and the possible transitions between these states. Here we are not talking about workflow or process, we just define the possible states of a document inside the system.
The Nuxeo Core contains a LifeCycleManager service that exposes several extension points:
one for contribution Life-Cycle management engine
(default one is called JCRLifeCycleManager and stores life-cycle related information directly inside the JSR 170 repository)
one for contributing life-cycle definition
This includes states and transitions.
<lifecycle name="default" lifecyclemanager="jcrlifecyclemanager" initial="project"> <transitions> <transition name="approve" destinationState="approved"> <description>Approve the content</description> </transition> <transition name="obsolete" destinationState="obsolete"> <description>Content becomes obsolete</description> </transition> <transition name="delete" destinationState="deleted"> <description>Move document to trash (temporary delete)</description> </transition> <transition name="undelete" destinationState="project"> <description>Recover the document from trash</description> </transition> <transition name="backToProject" destinationState="project"> <description>Recover the document from trash</description> </transition> </transitions> <states> <state name="project" description="Default state"> <transitions> <transition>approve</transition> <transition>obsolete</transition> <transition>delete</transition> </transitions> </state> <state name="approved" description="Content has been validated"> <transitions> <transition>delete</transition> <transition>backToProject</transition> </transitions> </state> <state name="obsolete" description="Content is obsolete"> <transitions> <transition>delete</transition> <transition>backToProject</transition> </transitions> </state> <state name="deleted" description="Document is deleted"> <transitions> <transition>undelete</transition> </transitions> </state> </states> </lifecycle> </extension>
one for binding life-cycle to document-types
Here is an example
<lifecycle name="default" lifecyclemanager="jcrlifecyclemanager" initial="project"> <transitions> <transition name="approve" destinationState="approved"> <description>Approve the content</description> </transition> <transition name="obsolete" destinationState="obsolete"> <description>Content becomes obsolete</description> </transition> <transition name="delete" destinationState="deleted"> <description>Move document to trash (temporary delete)</description> </transition> <transition name="undelete" destinationState="project"> <description>Recover the document from trash</description> </transition> <transition name="backToProject" destinationState="project"> <description>Recover the document from trash</description> </transition> </transitions> <states> <state name="project" description="Default state"> <transitions> <transition>approve</transition> <transition>obsolete</transition> <transition>delete</transition> </transitions> </state> <state name="approved" description="Content has been validated"> <transitions> <transition>delete</transition> <transition>backToProject</transition> </transitions> </state> <state name="obsolete" description="Content is obsolete"> <transitions> <transition>delete</transition> <transition>backToProject</transition> </transitions> </state> <state name="deleted" description="Document is deleted"> <transitions> <transition>undelete</transition> </transitions> </state> </states> </lifecycle> </extension>
Life-Cycle service is detailed later in this document.
Inside Nuxeo Repository security is always checked when accessing a document.
Nuxeo security model includes :
Permissions
(Read, Write, AddChildren, ...).
Permissions management is hierarchical (there are groups of permissions)
ACE: Access Control Entry
An ACE grants or denies a permission to a user or a group of users.
ACL: Access Control List
An ACL is a list of ACE.
ACP: Access Control Policy
An ACP is a stack of ACL. We use ACP because security can be bound to multiples rules: there can be a static ACL, an ACL that is driven by the workflow, and another one that is driven by business rules.
Separating ACLs allows to easily reset the ACP when a process or a rules does not apply any more.
Inside the repository each single document can have an ACP. By default security descriptors are inherited from parent, but inheritance can be blocked when needed.
Security engine also lets you contribute custom policy services so that security management can include business rules.
Security model and policy service are described in details later in this document.
When an event happens inside the repository (document creation, document modification, etc...), an event is sent to the event service that dispatches the notification to its listeners. Listeners can perform whatever action when receiving an event, this includes modifying the document on the fly.
As an example, part of the dublincore management logic is implemented as a CoreEvent listener: whenever a document is created or modified, creation date, modification date, author and contributors fields are automatically updated by a CoreEvent Listener.
Core Events system is explained in more details later in this document.
The Repository support a Query API to extract Documents using a SQL like query.
NXQL (the associated Query Language) is presented later in this document.
The documents in the repository can be versionned.
Nuxeo Core provides:
A pluggable version storage manager
This lets you define how versions and stored and what operations can be done on versions
A pluggable versionning policy
This lets you define rules and logic that drives when new versions must be created and how versions numbers are incremented.
The versionning system is explained in details later in this document.
Nuxeo Core exposes a repository API on top of Jackrabbit JSR170 repository.
Nuxeo repository is implemented using a SPI and extension point model: this basically means that a non JCR based repository plugin can be contributed. In fact, we have already started a native SQL repository implementation (that is not yet finished because we have no direct requirement for such a repository).
Nuxeo core can server several repository: it provides a extension point to declare additional repository: this means a single web application can use several document repository.
Inside Nuxeo EP and especially inside the Core API the main data object is a Document.
Inside Nuxeo Core API, the object artifact used to represent a Document is called a DocumentModel.
The DocumentModel artifact encapsulates several useful features:
Data Access over the network
the DocumentModel encapsulate all access to Document internal fields, the DocumentModel can be sent over the network
DocumentModel support lazy loading
When fetched from the Core, a DocumentModel does not carries all document related information. Some data (called prefetch data) are always present, other data will be loaded (locally or remotely) from the core when needed.
This feature is very important to reduce network and disk I/O when manipulating Document that contains a lot of big blob files (like video, music, images ...) .
DocumentModel uses Core Streaming Service
For files above 1 MB the DocumentModel uses the Core Streaming service.
DocumentModel carries the security descriptors
ACE/ACL/ACP are embedded inside the DocumentModel
DocumentModels support an adapter service
In addition of the data oriented interface, a DocumentModel can be associated with one or several Adapters that will expose a business oriented interface.
DocumentModels embed lifecycle service access
DocumentModels can have facets
Facets are used to declare a behavior (Versionnable, HiddenInNavigation, Commentable...)
A DocumentModel can be located inside the repository using a DocumentRef. DocumentRef can be an IdRef (UUID in the case of the JCR Repository Implementation) or PathRef (absolute path).
DocumentModels also holds information about the Type of the Document and a set of flags to define some useful characteristics of the Document:
isFolder
Defines if the targeted document is a container
isVersion
Defines if the targeted document is an historic version
isProxy
Defines if the targeted document is a Proxy (see below)
A Proxy is a DocumentModel that points to a another one: very much like a symbolic link between 2 files.
Proxies are used when the same document must be accessible from several locations (paths) in the repository. This is typically the case when doing publishing: the same document can be visible in several sections. In order to avoid duplicating the data, we use proxies that point to same document.
A proxy can point to a checked out document (not yet associated to a version label) or to a versionned version (typical use-case of the publishing).
The proxy does not store document data: all data access are forwarded to the source document. But the Proxy does holds:
its own security descriptors
its own lifecycle information
its own DocumentRef
The service layer is an ECM services stack on top of the Nuxeo Repository. In a sense, the Repository itself is very much like any service of this layer, it just plays a central role.
This service layer is used for :
adding new generic ECM services (Workflow, Relations, Audit ...)
adding connectors to existing external services
adding dedicated projects specific components when the requirements can not be integrated into a generic component
This service layer provides the API used by client applications (Webapp or RCP based application) to do their work.
This means that in this layer, services don't care about UI, navigation or pageflows: they simply explode an API to achieve document oriented tasks.
Nuxeo platform provides a lot of different services, but they all fellow the same implementation pattern. This basically means that once you understand how works one service, you almost know how they all work.
As everything in the Nuxeo Platform the services use the Nuxeo Runtime component model.
A generic service will be composed of the following packages :
A API package (usually called nuxeo-platform-XXX-api)
Contains all interfaces and remotable objects.
This package is required to be able to call the service from the same JVM or from a remote JVM.
A POJO Runtime component (usually called nuxeo-platform-XXX-core)
The Runtime component will implement the service business logic (ie: implement the API) and also expose Extensions Points.
All the extensibility and pluggability is handled at runtime component level. This for example means that the API can be partially implemented via plugins.
A EJB3 Facade (usually called nuxeo-platform-XXX-facade)
The facade exposes the same API as the POJO component.
The target of this facade is :
provide EJB3 remoting access to the API
integrate the service into JEE managed environment (JTA and JAAS)
leverage some additional features of the application server like JMS and Message Driven Bean
provide state management via Stateful Session Beans when needed
Typically, the POJO module will be a Nuxeo Runtime Component that inherit from DefaultComponent, provide extension points and implement a Service Interface.
public class RelationService extends DefaultComponent implements RelationManager { ...}
The deployment descriptor associated to the component will register the component, declare it as service provider and it may also declare extension points
<?xml version="1.0"?> <component name="org.nuxeo.ecm.platform.relations.services.RelationService"> <implementation class="org.nuxeo.ecm.platform.relations.services.RelationService" /> <service> <provide interface="org.nuxeo.ecm.platform.relations.api.RelationManager" /> </service> <!-- declare here extension points --> </component>
The facade will declare a EJB that implement the same service interface. In simple cases, the implementation simply delegates calls to the POJO component.
The facade package will also contain a contribution to the Runtime Service management to declare the service implementation.
<?xml version="1.0" encoding="UTF-8"?> <component name="org.nuxeo.ecm.platform.relation.service.binding.contrib"> <extension target="org.nuxeo.runtime.api.ServiceManagement" point="services"> <documentation> Define the Relation bean as a platform service. </documentation> <service class="org.nuxeo.ecm.platform.relations.api.RelationManager" group="platform/relations"> <locator>%RelationManagerBean</locator> </service> </extension> </component>
Thanks to this declaration the POJO and the EJB Facade can now be used for providing the same interface based on a configuration of the framework and not on the client code.
This configuration is used when deploying Nuxeo components on several servers: platform configuration provides a way to define service groups and to bind them on physical servers.
Each service provides its own API composed of a main interface and of the other interfaces and types that can be accessed.
The API package is unique for a given service, access to a remote EJB3 based service is the same as accessing the POJO service.
From the client point of view, accessing a service is very simple and independent from service location and implementation: this means not manual JNDI call. Everything is encapsulated in the Framework.getService runtime API.
RelationManager rm = Framework.getService(RelationManager.class);
The framework.getService
will return the
interface of the required service:
This can be the POJO service (ie: Runtime Component based Service)
This can be the local interface of the EJB3 service (using call by ref in JBoss)
This can be the remote interface of the EJB3 service (using full network marshaling)
The choice of the implementation to return is left to the Nuxeo Runtime that will take the decision based on the platform configuration.
The client can explicitly ask for the POJO service via the Framework.getLocalService() API: this is typically used in the EJB Facade to delegate calls to the POJO implementation.
DocumentModel adapters are a way to adapt the DocumentModel interface (that is purely data oriented) to a more business logic oriented interface.
In the pure Service logic, adding a comment to a document would look like this:
CommentManager cm = Framework.getService(CommentManager.class); cm.createComment(doc,"my comment"); List<DocumentModel> comments = cm.getComments(doc);
DocumentModel adapter give the possibility to have a more object oriented API:
CommentableDoc cDoc = doc.getAdapter(CommentableDoc); cDoc.addComment("my comment"); List<DocumentModel> comments = cDoc.getComments();
The difference may seem small, but documentModel adapter can be very handy
to have a more clean and clear code
to handle to caching at DocumentModel level
to implement behavior and logic associating a Document and a Service
DocumenModelAdapters are declared using an extension point that defines the interface provided by the adapter and the factory class. DocumentModelAdapters can be associated to a Facet of the DocumentModel.
The requirements for the Nuxeo Web Framework are :
A Powerful templating system that supports composition
A modern MVC model that provides Widgets, Validators and Controllers
A standard framework
A set of Widgets libraries that allow reusing existing components
Support for AJAX integration
Nuxeo Web Layer uses JSF (SUN RI 1.2) and Facelets as presentation layer: JSF is standard and very pluggable, Facelets is much more flexible and adapted to JSF than JSP.
NXThemes provides a flexible Theme and composition engine based on JSF and Facelets.
In the 5.1 version of the platform, Nuxeo Web Layer uses Apache Tomahawk and trinidad as components library and AJAX4JSF for Ajax integration. In the 5.2 version we will move to Rich Faces.
Nuxeo Web Layer also uses Seam Web Framework to handle all the ActionListeners.
Using Seam provides :
Simplifications and helpers on JSF usage
A context management framework
Dependency injection
Remoting to access ActionsListeners from JavaScript
A built-in event system
Nuxeo Web Layer comes on top of a set of pluggable service.
Because this stack of services is very modular, so must be the web layer.
This basically mean that depending on the set of deployed services and on the configuration the web framework must provide a way
to add, remove or customize views
for example, if you don't need relations, you may want to remove the relations tab that is by default available on document
to add or remove a action button or link
the typical use case is removing actions that are bound to non deployed services or add new actions that are specific to your project
to override an action listener
you may want to change how some actions are handled by just overriding Nuxeo defaults
to add or customize forms
Adding fields or customizing forms used to display document is very useful
In order to fullfill these requirements, the key points of Nuxeo Web Framefulfill
Context management to let components share some state
Event system and dependency injection to let loosely coupled component collaborate
A deployment system to let several components make one unique WebApp
A set of pluggable services to configure the web application
Inside the web framework, each component will need to know at least
what is the current navigation context
This includes current document, current container, current Workspace, current Domain.
This information is necessary because most of the service will display a view on the current document, and can fetch some configuration from the content hierarchy.
who is the current user
This includes identity and roles, but also its preferences and the set of documents he choose to work on
In some cases, this context information may be huge, and it's time consuming to recompute all this information at each request.
Inside Nuxeo Web Framework, Seam context management is used to store these data. Depending on the lifecycle of the data Session, Conversation or Event context are used.
At some point the components of the web framework need to interact with each other. But because components can be present or not depending on the deployment scenario, they can't call each other directly.
For that matter, the Nuxeo Web Framework uses a lot of Seam features:
Seam's context is used to share some state between the components
Seam's event system is used to let components notify each other
Seam's dependency injection and Factory system is used to let component pull some data from each other without having to know each other
In order to facilitate Nuxeo Services integration into the web framework, we use the Seam Unwrap pattern to wrap Nuxeo Service into Seam Components that can be injected and manipulated as a standard Seam component.
The Web Layer is composed of several components.
The main components are webapp-core (default webapp base) and ui-web (web framework). On top of these base components dedicated web components are deployed for each specific service.
For example, the workflow-service has its own web components package, so do relation-service, audit-service, comment-service and so on.
Each package contains a set of views, actions, and ActionsListeners that are dedicated to one service and integrate this service into the base webapp.
Because JEE standards require the webapp to be mono-bloc, we use the Nuxeo Runtime deployment service to assemble the target webapp at deployment time.
This deployment framework let you: override resources, contribute
XML descriptors like web.xml
from several
components and manage deployment order.