Package | Description |
---|---|
de.l3s.boilerpipe.sax | Classes related to parsing and producing HTML from/to Boilerpipe TextDocuments. |
Class and Description |
---|
BoilerpipeHTMLContentHandler: A simple SAX ContentHandler, used by BoilerpipeSAXInput. |
BoilerpipeHTMLParser: A simple SAX Parser, used by BoilerpipeSAXInput. |
HTMLDocument: An InputSourceable for HTMLFetcher. |
HTMLHighlighter: Highlights text blocks in an HTML document that have been marked as "content" in the corresponding TextDocument. |
InputSourceable: An InputSourceable can return an arbitrary number of new InputSources for a given document. |
TagAction: Defines an action that is to be performed whenever a particular tag occurs during HTML parsing. |
TagActionMap: Base class for defining a set of TagActions that are to be used for the HTML parsing process. |
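
The TagAction and TagActionMap entries describe a tag-to-action dispatch that runs during parsing. The following is a minimal sketch of that idea, built only on the JDK's SAX parser and assuming well-formed XHTML input. The names `SimpleTagAction`, `Handler`, `IGNORE`, `BLOCK`, and `extractText` are hypothetical and chosen for illustration; they are not Boilerpipe's actual classes or method signatures.

```java
import java.io.StringReader;
import java.util.HashMap;
import java.util.Map;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

class TagActionSketch {

    /** Hypothetical callback interface; NOT Boilerpipe's real TagAction signature. */
    interface SimpleTagAction {
        void start(Handler h);
        void end(Handler h);
    }

    /** Suppresses all text nested inside the tag (e.g. script, style). */
    static final SimpleTagAction IGNORE = new SimpleTagAction() {
        public void start(Handler h) { h.ignoreDepth++; }
        public void end(Handler h) { h.ignoreDepth--; }
    };

    /** Treats the tag as a block boundary: pending text becomes its own line. */
    static final SimpleTagAction BLOCK = new SimpleTagAction() {
        public void start(Handler h) { h.flushBlock(); }
        public void end(Handler h) { h.flushBlock(); }
    };

    /** SAX handler that looks up an action for each tag name it encounters. */
    static class Handler extends DefaultHandler {
        final Map<String, SimpleTagAction> actions;
        final StringBuilder out = new StringBuilder();
        final StringBuilder block = new StringBuilder();
        int ignoreDepth = 0;

        Handler(Map<String, SimpleTagAction> actions) { this.actions = actions; }

        /** Appends the current block to the output, one block per line. */
        void flushBlock() {
            String text = block.toString().trim();
            if (!text.isEmpty()) {
                if (out.length() > 0) out.append('\n');
                out.append(text);
            }
            block.setLength(0);
        }

        @Override
        public void startElement(String uri, String local, String qName, Attributes atts) {
            SimpleTagAction action = actions.get(qName.toLowerCase());
            if (action != null) action.start(this);
        }

        @Override
        public void endElement(String uri, String local, String qName) {
            SimpleTagAction action = actions.get(qName.toLowerCase());
            if (action != null) action.end(this);
        }

        @Override
        public void characters(char[] ch, int start, int length) {
            if (ignoreDepth == 0) block.append(ch, start, length);
        }

        @Override
        public void endDocument() { flushBlock(); }
    }

    /** Parses well-formed XHTML and returns the visible text, one block per line. */
    public static String extractText(String xhtml) throws Exception {
        Map<String, SimpleTagAction> actions = new HashMap<>();
        actions.put("script", IGNORE);
        actions.put("style", IGNORE);
        actions.put("p", BLOCK);
        actions.put("div", BLOCK);
        Handler handler = new Handler(actions);
        SAXParserFactory.newInstance().newSAXParser()
                .parse(new InputSource(new StringReader(xhtml)), handler);
        return handler.out.toString();
    }

    public static void main(String[] args) throws Exception {
        String doc = "<html><body><style>p{}</style><p>First block</p>"
                + "<script>var x = 1;</script><div>Second block</div></body></html>";
        System.out.println(extractText(doc));
    }
}
```

Keeping the tag-to-action mapping in a plain map means the set of handled tags can be changed without touching the handler itself, which is the role the table attributes to TagActionMap. Real-world HTML is often not well-formed, so an actual implementation would need a more tolerant parser than the strict XML parser used in this sketch.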