DefaultTikaReader Class Reference

Public Methods

void parse (InputStream in) throws IOException, SAXException, TikaException
Result run () throws IOException
String getText ()
ArrayList< TextFragment > getParagraphs ()
Metadata getMetaData ()
IDublinCoreMetadata getMetaData (InputStream in, String encoding)
boolean supportsContent (InputStream input)
void getPreview (InputStream input, OutputStream output, String encoding) throws IOException
TextFragment getRoot ()

Protected Methods

void analyzeDocument (TeslaDocument doc, InputStream in) throws IOException, SAXException, TikaException, UnsupportedEncodingException
Parser getParser ()
Result analyzeDocuments () throws IOException
void storeUrls (IOutputAdapter< IUrl > urlOut, List< TextFragment > urls, TeslaDocument doc)
void storeUrl (IOutputAdapter< IUrl > urlOut, TeslaDocument doc, int start, int end, String reference)
ContentHandler getHandler (TextFragment document)

Protected Attributes

IOutputAdapter< Paragraph > paraOut
ISignalOutputAdapter< String > signalWriter
ISignalInputAdapter sa
TextFragment document
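
Example

The public methods listed above suggest a check, parse, read sequence: supportsContent(InputStream), then parse(InputStream), then getText(). The following is a minimal sketch of that sequence only, not Tesla code. TextReader, PlainTextReader and extract are hypothetical stand-ins written for this illustration; the constructor of DefaultTikaReader is not documented on this page, the real parse also declares SAXException and TikaException, and whether supportsContent consumes bytes from the stream is an assumption. The sketch marks and resets the stream so that the same stream can be handed to parse afterwards.

```java
import java.io.BufferedInputStream;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

public class ReaderSketch {

    /** Hypothetical stand-in mirroring three of the public methods listed above. */
    interface TextReader {
        boolean supportsContent(InputStream input) throws IOException;
        void parse(InputStream in) throws IOException;
        String getText();
    }

    /** Toy implementation (an assumption, not Tesla code): treats any non-empty stream as UTF-8 text. */
    static class PlainTextReader implements TextReader {
        private String text = "";

        @Override
        public boolean supportsContent(InputStream input) throws IOException {
            // Peek at one byte, then rewind so parse() still sees the whole stream.
            input.mark(1);
            int first = input.read();
            input.reset();
            return first != -1;
        }

        @Override
        public void parse(InputStream in) throws IOException {
            // Read everything that is left in the stream and keep it for getText().
            text = new String(in.readAllBytes(), StandardCharsets.UTF_8);
        }

        @Override
        public String getText() {
            return text;
        }
    }

    /** Runs the check, parse, read sequence; returns null if the content is not supported. */
    static String extract(TextReader reader, byte[] content) throws IOException {
        // BufferedInputStream guarantees mark/reset support for the peek above.
        try (InputStream in = new BufferedInputStream(new ByteArrayInputStream(content))) {
            if (!reader.supportsContent(in)) {
                return null;
            }
            reader.parse(in);
            return reader.getText();
        }
    }

    public static void main(String[] args) throws IOException {
        byte[] content = "Hello, Tika".getBytes(StandardCharsets.UTF_8);
        System.out.println(extract(new PlainTextReader(), content));
    }
}
```

With the real class, a DefaultTikaReader instance would take the place of PlainTextReader, and the caller would additionally have to handle SAXException and TikaException from parse.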

Method Details

void analyzeDocument ( TeslaDocument doc, InputStream in ) throws IOException, SAXException, TikaException, UnsupportedEncodingException [protected]

Reimplemented from AbstractTikaReader.

Result analyzeDocuments ( ) throws IOException [protected, inherited]
ContentHandler getHandler ( TextFragment  document) [protected, inherited]
Metadata getMetaData ( ) [inherited]
IDublinCoreMetadata getMetaData ( InputStream in, String encoding ) [inherited]
ArrayList<TextFragment> getParagraphs ( ) [inherited]
Parser getParser ( ) [protected, virtual]

Implements AbstractTikaReader.

void getPreview ( InputStream input, OutputStream output, String encoding ) throws IOException [inherited]
TextFragment getRoot ( ) [inherited]
String getText ( ) [inherited]
void parse ( InputStream  in) throws IOException, SAXException, TikaException

Reimplemented from AbstractTikaReader.

Result run ( ) throws IOException [inherited]
void storeUrl ( IOutputAdapter< IUrl > urlOut, TeslaDocument doc, int start, int end, String reference ) [protected, inherited]
void storeUrls ( IOutputAdapter< IUrl > urlOut, List< TextFragment > urls, TeslaDocument doc ) [protected, inherited]
boolean supportsContent ( InputStream  input) [inherited]

Field Details

TextFragment document [protected, inherited]
IOutputAdapter<Paragraph> paraOut [protected, inherited]
ISignalInputAdapter sa [protected, inherited]
ISignalOutputAdapter<String> signalWriter [protected, inherited]