org.pdfbox.util
Class PDFHighlighter
public class PDFHighlighter
Supports highlighting of words in a PDF document by generating an XML highlight file.
Methods inherited from class org.pdfbox.util.PDFTextStripper:
endDocument, endPage, endParagraph, flushText, getCharactersByArticle, getCurrentPageNo, getEndBookmark, getEndPage, getLineSeparator, getOutput, getPageSeparator, getStartBookmark, getStartPage, getText, getText, getWordSeparator, processPage, processPages, setEndBookmark, setEndPage, setLineSeparator, setPageSeparator, setShouldSeparateByBeads, setSortByPosition, setStartBookmark, setStartPage, setSuppressDuplicateOverlappingText, setWordSeparator, shouldSeparateByBeads, shouldSortByPosition, shouldSuppressDuplicateOverlappingText, showCharacter, startDocument, startPage, startParagraph, writeCharacters, writeText, writeText
Methods inherited from class org.pdfbox.util.PDFStreamEngine:
getColorSpaces, getCurrentPage, getFonts, getGraphicsStack, getGraphicsState, getGraphicsStates, getResources, getTextLineMatrix, getTextMatrix, getXObjects, processOperator, processOperator, processStream, processSubStream, registerOperatorProcessor, resetEngine, setColorSpaces, setFonts, setGraphicsStack, setGraphicsState, setGraphicsStates, setTextLineMatrix, setTextMatrix, showCharacter, showString
PDFHighlighter
public PDFHighlighter()
throws IOException
Default constructor.
generateXMLHighlight
public void generateXMLHighlight(PDDocument pdDocument,
String highlightWord,
Writer xmlOutput)
throws IOException
Generate an XML highlight string based on the PDF.
Parameters:
pdDocument - The PDF to find words in.
highlightWord - The word to search for.
xmlOutput - The resulting output xml file.
generateXMLHighlight
public void generateXMLHighlight(PDDocument pdDocument,
String[] sWords,
Writer xmlOutput)
throws IOException
Generate an XML highlight string based on the PDF.
Parameters:
pdDocument - The PDF to find words in.
sWords - The words to search for.
xmlOutput - The resulting output xml file.
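Both generateXMLHighlight overloads write to a java.io.Writer rather than to a file path, so the caller decides where the XML goes. The sketch below illustrates one way to capture such output in memory with a StringWriter. It uses only the standard library so that it compiles without PDFBox on the classpath: XmlHighlightJob and HighlightCapture are hypothetical names introduced for this illustration, and the placeholder text written in main is not the real highlight format. In actual use, the job body would presumably be a call such as highlighter.generateXMLHighlight(pdDocument, sWords, out).

```java
import java.io.IOException;
import java.io.StringWriter;
import java.io.Writer;

// Hypothetical helper, not part of PDFBox.
class HighlightCapture {

    /**
     * Hypothetical stand-in for any code that writes highlight XML to a Writer,
     * for example a call to PDFHighlighter.generateXMLHighlight(doc, words, out).
     */
    interface XmlHighlightJob {
        void writeTo(Writer xmlOutput) throws IOException;
    }

    /** Runs the job against an in-memory Writer and returns everything it wrote. */
    static String capture(XmlHighlightJob job) throws IOException {
        StringWriter buffer = new StringWriter();
        job.writeTo(buffer);
        buffer.flush();
        return buffer.toString();
    }

    public static void main(String[] args) throws IOException {
        // Placeholder content only; with PDFBox available the lambda body would
        // instead call generateXMLHighlight with a PDDocument and the search words.
        String xml = capture(out -> out.write("<placeholder/>"));
        System.out.println(xml);
    }
}
```

Passing a FileWriter instead of the StringWriter would send the same output straight to disk; the in-memory variant is convenient when the XML is to be returned from a servlet or inspected in a test.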
main
public static void main(String[] args)
throws IOException
Command line application.
Parameters:
args - The command line arguments to the application.