public class TikaMetadataExtractor extends Object
| Modifier and Type | Field and Description |
|---|---|
static org.slf4j.Logger |
LOGGER |
static String |
METADATA_LIMIT_REACHED_MSG |
| Constructor and Description |
|---|
TikaMetadataExtractor(InputStream inputStream)
Constructs a new tika extractor which parses the provided input stream into a tika Metadata
object, the body text, and the metadata XML
|
TikaMetadataExtractor(InputStream inputStream,
int maxBodyLength,
int maxMetadataLength)
Constructs a new tika extractor which parses the provided input stream into a tika Metadata
object, the body text, and the metadata XML The body text is truncated after maxLength
|
| Modifier and Type | Method and Description |
|---|---|
String |
getBodyText() |
org.apache.tika.metadata.Metadata |
getMetadata() |
String |
getMetadataXml() |
public static final org.slf4j.Logger LOGGER
public static final String METADATA_LIMIT_REACHED_MSG
public TikaMetadataExtractor(InputStream inputStream) throws org.apache.tika.exception.TikaException
inputStream - - the input stream to be parsedorg.apache.tika.exception.TikaException - - if parsing failspublic TikaMetadataExtractor(InputStream inputStream, int maxBodyLength, int maxMetadataLength) throws org.apache.tika.exception.TikaException
inputStream - - the input stream to be parsedmaxBodyLength - - the max length of the parsed body textmaxMetadataLength - - the max length of the parsed metadata.org.apache.tika.exception.TikaException - - if parsing failsThis work is licensed under a Creative Commons Attribution 4.0 International License.