public class PoiHssfContentTransformer extends TikaPoweredContentTransformer
Uses Apache Tika and Apache POI to perform conversions from Excel spreadsheets.
Transforms Excel spreadsheets into HTML, XML or text (space or comma separated).
Handles all sheets in the file.
Modifier and Type | Class and Description |
---|---|
protected static class | PoiHssfContentTransformer.CsvContentHandler: A wrapper around the normal Tika BodyContentHandler, which causes things to be CSV encoded rather than tab separated |
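The difference between tab-separated and CSV-encoded output can be illustrated with the JDK's SAX classes alone. The sketch below is not the actual CsvContentHandler: the class name `CsvSketchHandler` and the helper `convert` are hypothetical, and it assumes the upstream text arrives as character events with a tab between cells and a newline between rows.

```java
import java.io.IOException;
import java.io.StringWriter;
import java.io.Writer;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical sketch, not Alfresco's CsvContentHandler.
// Assumption: cells are separated by '\t' and rows by '\n' in the character stream.
final class CsvSketchHandler extends DefaultHandler {
    private final Writer out;
    private final StringBuilder cell = new StringBuilder();
    private boolean rowHasCells = false;

    CsvSketchHandler(Writer out) {
        this.out = out;
    }

    @Override
    public void characters(char[] ch, int start, int length) throws SAXException {
        try {
            for (int i = start; i < start + length; i++) {
                char c = ch[i];
                if (c == '\t') {
                    flushCell();              // a tab ends the current cell
                } else if (c == '\n') {
                    flushCell();              // a newline ends the cell and the row
                    out.write('\n');
                    rowHasCells = false;
                } else {
                    cell.append(c);
                }
            }
        } catch (IOException e) {
            throw new SAXException(e);
        }
    }

    @Override
    public void endDocument() throws SAXException {
        try {
            if (cell.length() > 0) {
                flushCell();                  // emit a final cell that had no terminator
            }
            out.flush();
        } catch (IOException e) {
            throw new SAXException(e);
        }
    }

    // Writes the buffered cell quoted, doubling embedded quotes, with a comma before all but the first cell.
    private void flushCell() throws IOException {
        if (rowHasCells) {
            out.write(',');
        }
        out.write('"');
        out.write(cell.toString().replace("\"", "\"\""));
        out.write('"');
        cell.setLength(0);
        rowHasCells = true;
    }

    // Convenience helper for trying the handler on a tab-separated string.
    static String convert(String tabSeparated) {
        try {
            StringWriter writer = new StringWriter();
            CsvSketchHandler handler = new CsvSketchHandler(writer);
            char[] chars = tabSeparated.toCharArray();
            handler.startDocument();
            handler.characters(chars, 0, chars.length);
            handler.endDocument();
            return writer.toString();
        } catch (SAXException e) {
            throw new IllegalStateException(e);
        }
    }
}
```

Quoting every cell keeps the sketch short; a real encoder might quote only cells that contain commas, quotes or line breaks.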
Modifier and Type | Field and Description |
---|---|
static java.lang.String | WRONG_FORMAT_MESSAGE_ID: Error message to delegate to NodeInfoBean |
Inherited fields: LINE_BREAK, sourceMimeTypes, transformerDebug
Constructor and Description |
---|
PoiHssfContentTransformer() |
Modifier and Type | Method and Description |
---|---|
protected org.xml.sax.ContentHandler | getContentHandler(java.lang.String targetMimeType, java.io.Writer output): Returns an appropriate Tika ContentHandler for the requested content type. |
protected org.apache.tika.parser.Parser | getParser(): Returns the correct Tika Parser to process the document. |
boolean | isExplicitTransformation(java.lang.String sourceMimetype, java.lang.String targetMimetype, TransformationOptions options): Make sure we win over OpenOffice when it comes to producing HTML. |
boolean | isTransformableMimetype(java.lang.String sourceMimetype, java.lang.String targetMimetype, TransformationOptions options): Can we do the requested transformation via Tika? We support transforming to HTML, XML, Text or CSV. |
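The check described for isTransformableMimetype can be sketched as a simple source/target test. This is an illustration only: the class name `ExcelTransformCheckSketch` is hypothetical, the mimetype strings are written as plain literals rather than taken from Alfresco's own constants, and only the classic Excel mimetype is modelled as a source (the real class may accept others).

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;

// Hypothetical sketch of the mimetype check; not the Alfresco implementation.
final class ExcelTransformCheckSketch {
    // Assumption: only the classic Excel mimetype is treated as a valid source.
    static final String EXCEL = "application/vnd.ms-excel";

    // Targets named in the method description: HTML, XML, Text or CSV.
    static final Set<String> TARGETS = new HashSet<>(
            Arrays.asList("text/html", "text/xml", "text/plain", "text/csv"));

    static boolean isTransformableMimetype(String sourceMimetype, String targetMimetype) {
        // Both conditions must hold: an Excel source and a supported target.
        return EXCEL.equals(sourceMimetype) && TARGETS.contains(targetMimetype);
    }
}
```

For example, `isTransformableMimetype("application/vnd.ms-excel", "text/csv")` is accepted by this sketch, while a PDF source or an image target is rejected.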
Inherited methods:
buildParseContext, transformInternal
checkTransformable, getTransformationTime, recordTime, register, setProperties, setRegistry, toString, transform, transform, transform
getBeanName, getLimits, getLimits, getLimits, getMaxPages, getMaxSourceSizeKBytes, getMaxSourceSizeKBytes, getMimetypeLimits, getPageLimit, getReadLimitKBytes, getReadLimitTimeMs, getTimeoutMs, isPageLimitSupported, isTransformable, isTransformable, isTransformableSize, setBeanName, setLimits, setMaxPages, setMaxSourceSizeKBytes, setMimetypeLimits, setPageLimit, setPageLimitsSuported, setReaderLimits, setReadLimitKBytes, setReadLimitTimeMs, setTimeoutMs, setTransformerDebug
getMimetype, getMimetypeService, isSupportedTransformation, setExplicitTransformations, setMimetypeService, setSupportedTransformations
public static final java.lang.String WRONG_FORMAT_MESSAGE_ID
protected org.apache.tika.parser.Parser getParser()
Returns the correct Tika Parser to process the document (description from TikaPoweredContentTransformer). Compare TikaAutoContentTransformer, which makes use of the Tika auto-detection.
getParser in class TikaPoweredContentTransformer
public boolean isTransformableMimetype(java.lang.String sourceMimetype, java.lang.String targetMimetype, TransformationOptions options)
isTransformableMimetype in interface ContentTransformer
isTransformableMimetype in class TikaPoweredContentTransformer
Parameters:
sourceMimetype - the source mimetype
targetMimetype - the target mimetype
options - the transformation options
public boolean isExplicitTransformation(java.lang.String sourceMimetype, java.lang.String targetMimetype, TransformationOptions options)
isExplicitTransformation in interface ContentTransformer
isExplicitTransformation in class ContentTransformerHelper
Parameters:
sourceMimetype - the source mimetype
targetMimetype - the target mimetype
options - the transformation options
See also: ContentTransformer.isExplicitTransformation(java.lang.String, java.lang.String, org.alfresco.service.cmr.repository.TransformationOptions)
protected org.xml.sax.ContentHandler getContentHandler(java.lang.String targetMimeType, java.io.Writer output) throws javax.xml.transform.TransformerConfigurationException
Returns an appropriate Tika ContentHandler for the requested content type (description from TikaPoweredContentTransformer).
getContentHandler in class TikaPoweredContentTransformer
Throws: javax.xml.transform.TransformerConfigurationException
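Choosing a SAX ContentHandler by target mimetype, and why such a method can throw TransformerConfigurationException, can be sketched with JDK classes only. This is an assumption-laden illustration rather than the Alfresco or Tika code: `HandlerSelectionSketch` and `demo` are hypothetical names, HTML and XML are serialised through the JDK's `SAXTransformerFactory`, and every other target (including CSV) falls back to plain character output.

```java
import java.io.IOException;
import java.io.StringWriter;
import java.io.Writer;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.TransformerConfigurationException;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.sax.SAXTransformerFactory;
import javax.xml.transform.sax.TransformerHandler;
import javax.xml.transform.stream.StreamResult;
import org.xml.sax.ContentHandler;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.AttributesImpl;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical sketch of handler selection; not the Alfresco implementation.
final class HandlerSelectionSketch {

    static ContentHandler getContentHandler(String targetMimeType, Writer output)
            throws TransformerConfigurationException {
        boolean html = "text/html".equals(targetMimeType);
        boolean xml = "text/xml".equals(targetMimeType);
        if (html || xml) {
            // Creating the serialising handler is the step that can fail with
            // TransformerConfigurationException.
            SAXTransformerFactory factory =
                    (SAXTransformerFactory) TransformerFactory.newInstance();
            TransformerHandler handler = factory.newTransformerHandler();
            handler.getTransformer().setOutputProperty(OutputKeys.METHOD, html ? "html" : "xml");
            handler.setResult(new StreamResult(output));
            return handler;
        }
        // Assumption: any other target is written as plain text, ignoring markup events.
        return new DefaultHandler() {
            @Override
            public void characters(char[] ch, int start, int length) throws SAXException {
                try {
                    output.write(ch, start, length);
                } catch (IOException e) {
                    throw new SAXException(e);
                }
            }
        };
    }

    // Feeds a single <p>hi</p> element through the chosen handler and returns the output.
    static String demo(String targetMimeType) {
        try {
            StringWriter writer = new StringWriter();
            ContentHandler handler = getContentHandler(targetMimeType, writer);
            handler.startDocument();
            handler.startElement("", "p", "p", new AttributesImpl());
            handler.characters("hi".toCharArray(), 0, 2);
            handler.endElement("", "p", "p");
            handler.endDocument();
            return writer.toString();
        } catch (Exception e) {
            throw new IllegalStateException(e);
        }
    }
}
```

Calling `HandlerSelectionSketch.demo("text/plain")` yields only the character data, while the XML and HTML targets keep the element markup around it.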
Copyright © 2005 - 2010 Alfresco Software, Inc. All Rights Reserved.