Batch processing is about repetitive actions - either as a simple
optimization, or as part of a job. To strategize and generalize the
repetition as well as to provide what amounts to an iterator framework,
Spring Batch has the RepeatOperations
interface.
The RepeatOperations
interface looks like
this:
public interface RepeatOperations { RepeatStatus iterate(RepeatCallback callback) throws RepeatException; }
The callback is a simple interface that allows you to insert some business logic to be repeated:
public interface RepeatCallback { RepeatStatus doInIteration(RepeatContext context) throws Exception; }
The callback is executed repeatedly until the implementation
decides that the iteration should end. The return value in these
interfaces is an enumeration that can either be
RepeatStatus.CONTINUABLE
or
RepeatStatus.FINISHED
. A RepeatStatus
conveys information to the caller of the repeat operations about whether
there is any more work to do. Generally speaking, implementations of
RepeatOperations
should inspect the
RepeatStatus
and use it as part of the decision to
end the iteration. Any callback that wishes to signal to the caller that
there is no more work to do can return
RepeatStatus.FINISHED
.
The simplest general purpose implementation of
RepeatOperations
is
RepeatTemplate
. It could be used like this:
RepeatTemplate template = new RepeatTemplate(); template.setCompletionPolicy(new FixedChunkSizeCompletionPolicy(2)); template.iterate(new RepeatCallback() { public ExitStatus doInIteration(RepeatContext context) { // Do stuff in batch... return ExitStatus.CONTINUABLE; } });
In the example we return RepeatStatus.CONTINUABLE
to
show that there is more work to do. The callback can also return
ExitStatus.FINISHED
if it wants to signal to the caller that
there is no more work to do. Some iterations can be terminated by
considerations intrinsic to the work being done in the callback, others
are effectively infinite loops as far as the callback is concerned and the
completion decision is delegated to an external policy as in the case
above.
The method parameter for the RepeatCallback
is a RepeatContext
. Many callbacks will simply
ignore the context, but if necessary it can be used as an attribute bag
to store transient data for the duration of the iteration. After the
iterate
method returns, the context will no
longer exist.
A RepeatContext
will have a parent context
if there is a nested iteration in progress. The parent context is
occasionally useful for storing data that need to be shared between
calls to iterate
. This is the case for instance
if you want to count the number of occurrences of an event in the
iteration and remember it across subsequent calls.
RepeatStatus
is an enumeration used by
Spring Batch to indicate whether processing has finished. These are
possible RepeatStatus
values:
Table 8.1. ExitStatus Properties
Value | Description |
CONTINUABLE | There is more work to do. |
FINISHED | No more repetitions should take place. |
RepeatStatus
values can also be combined
with a logical AND operation using the and
()
method in RepeatStatus
. The effect of this is to
do a logical AND on the continuable flag. In other words, if either
status is FINISHED
, then the result will be
FINISHED
.
Inside a RepeatTemplate
the termination of
the loop in the iterate
method is determined by a
CompletionPolicy
which is also a factory for the
RepeatContext
. The
RepeatTemplate
has the responsibility to use the
current policy to create a RepeatContext
and pass
that in to the RepeatCallback
at every stage in the
iteration. After a callback completes its
doInIteration
, the
RepeatTemplate
has to make a call to the
CompletionPolicy
to ask it to update its state
(which will be stored in the RepeatContext
). Then
it asks the policy if the iteration is complete.
Spring Batch provides some simple general purpose implementations of
CompletionPolicy
. The
SimpleCompletionPolicy
just allows an execution up
to a fixed number of times (with RepeatStatus.FINISHED
forcing early completion at any time).
Users might need to implement their own completion policies for more complicated decisions. For example, a batch processing window that prevents batch jobs from executing once the online systems are in use would require a custom policy.
If there is an exception thrown inside a
RepeatCallback
, the
RepeatTemplate
consults an
ExceptionHandler
which can decide whether or not to
re-throw the exception.
public interface ExceptionHandler { void handleException(RepeatContext context, Throwable throwable) throws RuntimeException; }
A common use case is to count the number of exceptions of a
given type, and fail when a limit is reached. For this purpose Spring
Batch provides the SimpleLimitExceptionHandler
and
slightly more flexible
RethrowOnThresholdExceptionHandler
. The
SimpleLimitExceptionHandler
has a limit property
and an exception type that should be compared with the current exception -
all subclasses of the provided type are also counted. Exceptions of the
given type are ignored until the limit is reached, and then rethrown.
Those of other types are always rethrown.
An important optional property of the
SimpleLimitExceptionHandler
is the boolean flag
useParent
. It is false by default, so the limit is only
accounted for in the current RepeatContext
. When
set to true, the limit is kept across sibling contexts in a nested
iteration (e.g. a set of chunks inside a step).
Often it is useful to be able to receive additional callbacks for
cross cutting concerns across a number of different iterations. For this
purpose Spring Batch provides the RepeatListener
interface. The RepeatTemplate
allows users to
register RepeatListener
s, and they will be given
callbacks with the RepeatContext
and
RepeatStatus
where available during the
iteration.
The interface looks like this:
public interface RepeatListener { void before(RepeatContext context); void after(RepeatContext context, RepeatStatus result); void open(RepeatContext context); void onError(RepeatContext context, Throwable e); void close(RepeatContext context); }
The open
and
close
callbacks come before and after the entire
iteration. before
, after
and onError
apply to the individual
RepeatCallback calls.
Note that when there is more than one listener, they are in a list,
so there is an order. In this case open
and
before
are called in the same order while
after
, onError
and
close
are called in reverse order.
Implementations of RepeatOperations
are not
restricted to executing the callback sequentially. It is quite important
that some implementations are able to execute their callbacks in parallel.
To this end, Spring Batch provides the
TaskExecutorRepeatTemplate
, which uses the Spring
TaskExecutor
strategy to run the
RepeatCallback
. The default is to use a
SynchronousTaskExecutor
, which has the effect of
executing the whole iteration in the same thread (the same as a normal
RepeatTemplate
).
Sometimes there is some business processing that you know you want
to repeat every time it happens. The classic example of this is the
optimization of a message pipeline - it is more efficient to process a
batch of messages, if they are arriving frequently, than to bear the cost
of a separate transaction for every message. Spring Batch provides an AOP
interceptor that wraps a method call in a
RepeatOperations
for just this purpose. The
RepeatOperationsInterceptor
executes the
intercepted method and repeats according to the
CompetionPolicy
in the provided
RepeatTemplate
.
Here is an example of declarative iteration using the Spring AOP
namespace to repeat a service call to a method called
processMessage
(for more detail on how to
configure AOP interceptors see the Spring User Guide):
<aop:config> <aop:pointcut id="transactional" expression="execution(* com...*Service.processMessage(..))" /> <aop:advisor pointcut-ref="transactional" advice-ref="retryAdvice" order="-1"/> </aop:config> <bean id="retryAdvice" class="org.spr...RepeatOperationsInterceptor"/>
The example above uses a default
RepeatTemplate
inside the interceptor. To change
the policies, listeners etc. you only need to inject an instance of
RepeatTemplate
into the interceptor.
If the intercepted method returns void
then the
interceptor always returns ExitStatus.CONTINUABLE (so there is a danger of
an infinite loop if the CompletionPolicy
does not
have a finite end point). Otherwise it returns
ExitStatus.CONTINUABLE
until the return value from the
intercepted method is null, at which point it returns
ExitStatus.FINISHED
. So the business logic inside the target
method can signal that there is no more work to do by returning
null
, or by throwing an exception that is re-thrown by the
ExceptionHandler
in the provided
RepeatTemplate
.