Package nu.validator.htmlparser.io
Class Driver
java.lang.Object
    nu.validator.htmlparser.io.Driver
-
- All Implemented Interfaces:
EncodingDeclarationHandler
public class Driver extends java.lang.Object implements EncodingDeclarationHandler
-
-
Method Summary
void addCharacterHandler(CharacterHandler characterHandler)
protected Encoding encodingFromExternalDeclaration(java.lang.String encoding)
    Initializes a decoder from external decl.
java.lang.String getCharacterEncoding()
    Queries the environment for the encoding in use (for error reporting).
org.xml.sax.Locator getDocumentLocator()
boolean internalEncodingDeclaration(java.lang.String internalCharset)
    Indicates that the parser has found an internal encoding declaration with the charset value charset.
boolean isAllowRewinding()
    Returns the allowRewinding.
boolean isCheckingNormalization()
    Query if checking normalization.
void setAllowRewinding(boolean allowRewinding)
    Sets the allowRewinding.
void setCheckingNormalization(boolean enable)
    Turns NFC checking on or off.
void setCommentPolicy(XmlViolationPolicy commentPolicy)
void setContentNonXmlCharPolicy(XmlViolationPolicy contentNonXmlCharPolicy)
void setContentSpacePolicy(XmlViolationPolicy contentSpacePolicy)
void setEncoding(Encoding encoding, Confidence confidence)
void setErrorHandler(org.xml.sax.ErrorHandler eh)
void setHeuristics(Heuristics heuristics)
    Sets the encoding sniffing heuristics.
void setHtml4ModeCompatibleWithXhtml1Schemata(boolean html4ModeCompatibleWithXhtml1Schemata)
void setMappingLangToXmlLang(boolean mappingLangToXmlLang)
void setNamePolicy(XmlViolationPolicy namePolicy)
void setTransitionHandler(TransitionHandler transitionHandler)
void setXmlnsPolicy(XmlViolationPolicy xmlnsPolicy)
void tokenize(org.xml.sax.InputSource is)
    Runs the tokenization.
protected void warnWithoutLocation(java.lang.String message)
    Reports a warning without line/col
protected Encoding whineAboutEncodingAndReturnActual(java.lang.String encoding, Encoding cs)
-
-
-
Constructor Detail
-
Driver
public Driver(Tokenizer tokenizer)
-
-
Method Detail
-
isAllowRewinding
public boolean isAllowRewinding()
Returns the allowRewinding.
- Returns:
the allowRewinding
-
setAllowRewinding
public void setAllowRewinding(boolean allowRewinding)
Sets the allowRewinding.
- Parameters:
allowRewinding - the allowRewinding to set
-
setCheckingNormalization
public void setCheckingNormalization(boolean enable)
Turns NFC checking on or off.
- Parameters:
enable - true to turn checking on
-
addCharacterHandler
public void addCharacterHandler(CharacterHandler characterHandler)
-
isCheckingNormalization
public boolean isCheckingNormalization()
Queries whether normalization checking is on.
- Returns:
true if checking is on
-
tokenize
public void tokenize(org.xml.sax.InputSource is) throws org.xml.sax.SAXException, java.io.IOException
Runs the tokenization. This is the main entry point.
- Parameters:
is - the input source
- Throws:
org.xml.sax.SAXException - on fatal error (if configured to treat XML violations as fatal) or if the token handler threw
java.io.IOException - if the stream threw
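Since tokenize takes a SAX InputSource, a caller has to wrap its HTML before calling it. The following is a minimal sketch using only JDK classes; the InputSources class and its fromUtf8 method are hypothetical names, not part of the library. It assumes that Driver follows the usual SAX convention of reading the byte stream and treating InputSource.getEncoding() as an external encoding declaration, which this page does not spell out.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import org.xml.sax.InputSource;

/** Hypothetical helper for building the argument to Driver.tokenize. */
class InputSources {
    /**
     * Wraps HTML text as a byte-stream InputSource and labels it as UTF-8.
     * The system ID is only an identifier that error reports can refer to.
     */
    static InputSource fromUtf8(String html, String systemId) {
        byte[] bytes = html.getBytes(StandardCharsets.UTF_8);
        InputSource is = new InputSource(new ByteArrayInputStream(bytes));
        is.setEncoding("UTF-8");   // assumed to act as the external declaration
        is.setSystemId(systemId);
        return is;
    }
}
```

With a Driver instance named driver, the call would then be driver.tokenize(InputSources.fromUtf8(html, "example.html")), with SAXException and IOException handled by the caller.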
-
setEncoding
public void setEncoding(Encoding encoding, Confidence confidence)
-
internalEncodingDeclaration
public boolean internalEncodingDeclaration(java.lang.String internalCharset) throws org.xml.sax.SAXException
Description copied from interface: EncodingDeclarationHandler
Indicates that the parser has found an internal encoding declaration with the charset value charset.
- Specified by:
internalEncodingDeclaration in interface EncodingDeclarationHandler
- Parameters:
internalCharset - the charset name found.
- Returns:
true if the value of charset was an encoding name for a supported ASCII-superset encoding.
- Throws:
org.xml.sax.SAXException - if something went wrong
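To make the phrase "supported ASCII-superset encoding" concrete, here is a rough sketch of one way to test that property with java.nio.charset. It is not the check Driver performs, which this page does not describe; the AsciiSupersetCheck class and its method are hypothetical names used for illustration only.

```java
import java.nio.charset.Charset;
import java.nio.charset.IllegalCharsetNameException;

/** Hypothetical illustration; not the library's actual logic. */
class AsciiSupersetCheck {
    /**
     * Returns true if the JVM supports the named charset and the charset
     * decodes each byte 0x00..0x7F to the code point of the same value.
     */
    static boolean isSupportedAsciiSuperset(String name) {
        if (name == null) {
            return false;
        }
        try {
            if (!Charset.isSupported(name)) {
                return false;
            }
        } catch (IllegalCharsetNameException e) {
            return false; // syntactically invalid charset name
        }
        Charset cs = Charset.forName(name);
        byte[] ascii = new byte[0x80];
        for (int i = 0; i < ascii.length; i++) {
            ascii[i] = (byte) i;
        }
        String decoded = new String(ascii, cs);
        if (decoded.length() != ascii.length) {
            return false; // e.g. UTF-16 pairs the bytes up
        }
        for (int i = 0; i < ascii.length; i++) {
            if (decoded.charAt(i) != i) {
                return false;
            }
        }
        return true;
    }
}
```

Under this test UTF-8 and ISO-8859-1 qualify, while UTF-16BE does not, because it decodes two bytes per character.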
-
setHeuristics
public void setHeuristics(Heuristics heuristics)
Sets the encoding sniffing heuristics.
- Parameters:
heuristics - the heuristics to set
-
warnWithoutLocation
protected void warnWithoutLocation(java.lang.String message) throws org.xml.sax.SAXException
Reports a warning without line/col
- Parameters:
message - the message
- Throws:
org.xml.sax.SAXException
-
encodingFromExternalDeclaration
protected Encoding encodingFromExternalDeclaration(java.lang.String encoding) throws org.xml.sax.SAXException
Initializes a decoder from an external declaration.
- Throws:
org.xml.sax.SAXException
-
whineAboutEncodingAndReturnActual
protected Encoding whineAboutEncodingAndReturnActual(java.lang.String encoding, Encoding cs) throws org.xml.sax.SAXException
- Parameters:
encoding -
cs -
- Returns:
- Throws:
org.xml.sax.SAXException
-
setCommentPolicy
public void setCommentPolicy(XmlViolationPolicy commentPolicy)
- Parameters:
commentPolicy -
- See Also:
Tokenizer.setCommentPolicy(nu.validator.htmlparser.common.XmlViolationPolicy)
-
setContentNonXmlCharPolicy
public void setContentNonXmlCharPolicy(XmlViolationPolicy contentNonXmlCharPolicy)
- Parameters:
contentNonXmlCharPolicy -
- See Also:
Tokenizer.setContentNonXmlCharPolicy(nu.validator.htmlparser.common.XmlViolationPolicy)
-
setContentSpacePolicy
public void setContentSpacePolicy(XmlViolationPolicy contentSpacePolicy)
- Parameters:
contentSpacePolicy -
- See Also:
Tokenizer.setContentSpacePolicy(nu.validator.htmlparser.common.XmlViolationPolicy)
-
setErrorHandler
public void setErrorHandler(org.xml.sax.ErrorHandler eh)
- Parameters:
eh -
- See Also:
Tokenizer.setErrorHandler(org.xml.sax.ErrorHandler)
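setErrorHandler accepts any org.xml.sax.ErrorHandler. As an illustration, the sketch below records reports in a list instead of throwing, which can be convenient when inspecting parse errors after tokenization. CollectingErrorHandler is a hypothetical class written for this example, not part of the library, and which reports Driver routes to each callback is not stated on this page.

```java
import java.util.ArrayList;
import java.util.List;
import org.xml.sax.ErrorHandler;
import org.xml.sax.SAXParseException;

/** Hypothetical handler that records every report instead of throwing. */
class CollectingErrorHandler implements ErrorHandler {
    private final List<String> messages = new ArrayList<>();

    @Override
    public void warning(SAXParseException e) {
        record("warning", e);
    }

    @Override
    public void error(SAXParseException e) {
        record("error", e);
    }

    @Override
    public void fatalError(SAXParseException e) {
        record("fatal", e);
    }

    // Formats one report as "<level> <line>:<column> <message>".
    // Line and column are -1 when the exception carries no location.
    private void record(String level, SAXParseException e) {
        messages.add(level + " " + e.getLineNumber() + ":"
                + e.getColumnNumber() + " " + e.getMessage());
    }

    /** Returns the reports collected so far, in arrival order. */
    List<String> messages() {
        return messages;
    }
}
```

It would be installed with driver.setErrorHandler(new CollectingErrorHandler()) before calling tokenize, keeping a reference to the handler so that messages() can be read afterwards.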
-
setTransitionHandler
public void setTransitionHandler(TransitionHandler transitionHandler)
-
setHtml4ModeCompatibleWithXhtml1Schemata
public void setHtml4ModeCompatibleWithXhtml1Schemata(boolean html4ModeCompatibleWithXhtml1Schemata)
- Parameters:
html4ModeCompatibleWithXhtml1Schemata -
- See Also:
Tokenizer.setHtml4ModeCompatibleWithXhtml1Schemata(boolean)
-
setMappingLangToXmlLang
public void setMappingLangToXmlLang(boolean mappingLangToXmlLang)
- Parameters:
mappingLangToXmlLang -
- See Also:
Tokenizer.setMappingLangToXmlLang(boolean)
-
setNamePolicy
public void setNamePolicy(XmlViolationPolicy namePolicy)
- Parameters:
namePolicy -
- See Also:
Tokenizer.setNamePolicy(nu.validator.htmlparser.common.XmlViolationPolicy)
-
setXmlnsPolicy
public void setXmlnsPolicy(XmlViolationPolicy xmlnsPolicy)
- Parameters:
xmlnsPolicy -
- See Also:
Tokenizer.setXmlnsPolicy(nu.validator.htmlparser.common.XmlViolationPolicy)
-
getCharacterEncoding
public java.lang.String getCharacterEncoding() throws org.xml.sax.SAXException
Description copied from interface: EncodingDeclarationHandler
Queries the environment for the encoding in use (for error reporting).
- Specified by:
getCharacterEncoding in interface EncodingDeclarationHandler
- Returns:
the encoding in use
- Throws:
org.xml.sax.SAXException - if something went wrong
-
getDocumentLocator
public org.xml.sax.Locator getDocumentLocator()
-
-