public class TikaMetadataExtractor extends Object
Modifier and Type | Field and Description |
---|---|
static org.slf4j.Logger | LOGGER |
static String | METADATA_LIMIT_REACHED_MSG |
Constructor and Description |
---|
TikaMetadataExtractor(InputStream inputStream) Constructs a new Tika extractor that parses the provided input stream into a Tika Metadata object, the body text, and the metadata XML. |
TikaMetadataExtractor(InputStream inputStream, int maxBodyLength, int maxMetadataLength) Constructs a new Tika extractor that parses the provided input stream into a Tika Metadata object, the body text, and the metadata XML. The body text is truncated after maxBodyLength. |
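
The length-limited constructor is documented only as truncating the body text after a maximum length. The following is a minimal, standard-library-only sketch of that idea, not the class's actual implementation. The class name `TruncationSketch`, the method `readTruncated`, the UTF-8 decoding, and counting the limit in characters are all assumptions made for illustration; the source does not say how the limit is measured or whether `METADATA_LIMIT_REACHED_MSG` is appended.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of TikaMetadataExtractor.
public class TruncationSketch {

    /**
     * Reads at most maxLength characters from the stream.
     * Assumption: the stream is UTF-8 text and the limit counts chars.
     */
    public static String readTruncated(InputStream in, int maxLength) throws IOException {
        StringBuilder sb = new StringBuilder();
        Reader reader = new InputStreamReader(in, StandardCharsets.UTF_8);
        char[] buf = new char[1024];
        while (sb.length() < maxLength) {
            // Never request more than the remaining budget.
            int want = Math.min(buf.length, maxLength - sb.length());
            int n = reader.read(buf, 0, want);
            if (n < 0) {
                break; // end of stream before the limit was reached
            }
            sb.append(buf, 0, n);
        }
        return sb.toString();
    }

    public static void main(String[] args) throws IOException {
        InputStream in =
            new ByteArrayInputStream("hello world".getBytes(StandardCharsets.UTF_8));
        System.out.println(readTruncated(in, 5)); // prints "hello"
    }
}
```

Stopping the read once the budget is used up, rather than reading everything and cutting the string afterwards, keeps memory bounded for large inputs, which is presumably the motivation for exposing the two limits on the constructor.
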
Modifier and Type | Method and Description |
---|---|
String | getBodyText() |
org.apache.tika.metadata.Metadata | getMetadata() |
String | getMetadataXml() |
public static final org.slf4j.Logger LOGGER
public static final String METADATA_LIMIT_REACHED_MSG
public TikaMetadataExtractor(InputStream inputStream) throws org.apache.tika.exception.TikaException
Parameters:
inputStream - the input stream to be parsed
Throws:
org.apache.tika.exception.TikaException - if parsing fails
public TikaMetadataExtractor(InputStream inputStream, int maxBodyLength, int maxMetadataLength) throws org.apache.tika.exception.TikaException
Parameters:
inputStream - the input stream to be parsed
maxBodyLength - the max length of the parsed body text
maxMetadataLength - the max length of the parsed metadata
Throws:
org.apache.tika.exception.TikaException - if parsing fails

This work is licensed under a Creative Commons Attribution 4.0 International License.