net.sf.saxon.charcode
Class UnicodeCharacterSet
java.lang.Object
  net.sf.saxon.charcode.UnicodeCharacterSet
All Implemented Interfaces:
- CharacterSet
public final class UnicodeCharacterSet
extends java.lang.Object
This class defines properties of the Unicode character set
decodeUTF8
public static int decodeUTF8(byte[] in,
int used)
throws IllegalArgumentException
Decode a UTF-8 character
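The page does not describe the parameters or the return value of decodeUTF8. The sketch below illustrates what decoding a single UTF-8 character involves. It rests on assumptions: that `in` holds the bytes of one character starting at index 0, that `used` is the number of those bytes, and that the result is the Unicode code point. The class `Utf8DecodeSketch` and its `decode` method are hypothetical names for illustration and are not Saxon's implementation. The sketch also does not reject overlong encodings or encoded surrogates.

```java
// Hypothetical illustration only; not the Saxon source.
final class Utf8DecodeSketch {
    private Utf8DecodeSketch() {}

    /**
     * Decodes one UTF-8 character.
     * Assumption: in[0..used-1] holds exactly one encoded character.
     */
    static int decode(byte[] in, int used) {
        if (used < 1 || used > 4 || used > in.length) {
            throw new IllegalArgumentException("Expected 1 to 4 available bytes, got " + used);
        }
        int lead = in[0] & 0xFF;
        int codePoint;
        int expected;
        if (lead < 0x80) {                 // 0xxxxxxx: one byte
            codePoint = lead;
            expected = 1;
        } else if ((lead & 0xE0) == 0xC0) { // 110xxxxx: two bytes
            codePoint = lead & 0x1F;
            expected = 2;
        } else if ((lead & 0xF0) == 0xE0) { // 1110xxxx: three bytes
            codePoint = lead & 0x0F;
            expected = 3;
        } else if ((lead & 0xF8) == 0xF0) { // 11110xxx: four bytes
            codePoint = lead & 0x07;
            expected = 4;
        } else {
            throw new IllegalArgumentException("Invalid UTF-8 lead byte");
        }
        if (expected != used) {
            throw new IllegalArgumentException("Lead byte implies " + expected + " bytes, got " + used);
        }
        // Each continuation byte must look like 10xxxxxx and contributes 6 bits.
        for (int i = 1; i < used; i++) {
            int next = in[i] & 0xFF;
            if ((next & 0xC0) != 0x80) {
                throw new IllegalArgumentException("Invalid UTF-8 continuation byte at index " + i);
            }
            codePoint = (codePoint << 6) | (next & 0x3F);
        }
        return codePoint;
    }
}
```

Under these assumptions, `Utf8DecodeSketch.decode(new byte[] {(byte) 0xE2, (byte) 0x82, (byte) 0xAC}, 3)` yields 0x20AC, the code point of the euro sign.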
getUTF8Encoding
public static int getUTF8Encoding(char in,
char in2,
byte[] out)
Static method to generate the UTF-8 representation of a Unicode character
Parameters:
in - the Unicode character, or the high half of a surrogate pair
in2 - the low half of a surrogate pair (ignored unless the first argument is in the range for a surrogate pair)
out - an array of at least 4 bytes to hold the UTF-8 representation
Returns:
- the number of bytes in the UTF-8 representation
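To make the parameter contract concrete, here is a minimal sketch of an encoder with the same shape: it takes a character, an optional low surrogate, and an output array of at least 4 bytes, and returns the byte count. The class `Utf8EncodeSketch` and its `encode` method are hypothetical names, and the body is a standard UTF-8 encoding routine written for illustration, not Saxon's code. One assumption to note: when the first argument is a high surrogate but the second is not a low surrogate, this sketch encodes the lone surrogate as three bytes, which may differ from what the library does.

```java
// Hypothetical illustration only; not the Saxon source.
final class Utf8EncodeSketch {
    private Utf8EncodeSketch() {}

    /**
     * Writes the UTF-8 form of one character into out and returns the byte count.
     * in2 is consulted only when in is a high surrogate.
     */
    static int encode(char in, char in2, byte[] out) {
        int codePoint = in;
        if (Character.isHighSurrogate(in) && Character.isLowSurrogate(in2)) {
            // Combine the surrogate pair into a supplementary code point.
            codePoint = Character.toCodePoint(in, in2);
        }
        if (codePoint < 0x80) {
            out[0] = (byte) codePoint;
            return 1;
        } else if (codePoint < 0x800) {
            out[0] = (byte) (0xC0 | (codePoint >> 6));
            out[1] = (byte) (0x80 | (codePoint & 0x3F));
            return 2;
        } else if (codePoint < 0x10000) {
            out[0] = (byte) (0xE0 | (codePoint >> 12));
            out[1] = (byte) (0x80 | ((codePoint >> 6) & 0x3F));
            out[2] = (byte) (0x80 | (codePoint & 0x3F));
            return 3;
        } else {
            out[0] = (byte) (0xF0 | (codePoint >> 18));
            out[1] = (byte) (0x80 | ((codePoint >> 12) & 0x3F));
            out[2] = (byte) (0x80 | ((codePoint >> 6) & 0x3F));
            out[3] = (byte) (0x80 | (codePoint & 0x3F));
            return 4;
        }
    }
}
```

A caller would allocate `new byte[4]` once and reuse it, reading only as many bytes as the return value indicates. For example, encoding the pair '\uD83D', '\uDE00' (U+1F600) fills the buffer with F0 9F 98 80 and returns 4.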
inCharset
public boolean inCharset(int c)
Determine if a character is present in the character set
Specified by:
- inCharset in interface CharacterSet