Logo Search packages:      
Sourcecode: jericho-html version File versions  Download package

au::id::jericho::lib::html::CharacterEntityReference Class Reference

Inheritance diagram for au::id::jericho::lib::html::CharacterEntityReference:

au::id::jericho::lib::html::CharacterReference au::id::jericho::lib::html::Segment

List of all members.


Detailed Description

Represents an HTML Character Entity Reference.

Click here to scroll down to the method summary.

The full list of HTML character entity references can be found at the following URL:
http://www.w3.org/TR/REC-html40/sgml/entities.html.

There are a total of 253 HTML character entity references, ranging from codepoints U+0022 to U+2666.

Static methods to encode(CharSequence) encode} and decode(CharSequence) decode} strings and single characters can be found in the CharacterReference superclass.

The ' entity reference is not defined for use in HTML. It is defined in the XHTML Special Characters Entity Set, and is the only one that is not included in both HTML and XHTML. For this reason, the ' entity reference is recognised by this library in decoding functions, but in encoding functions the numeric character reference ' is used instead. Most modern browsers support it in both XHTML and HTML, with the notable exception of Microsoft Internet Explorer 6.0, which doesn't support it in either.

CharacterEntityReference instances are obtained using one of the following methods:

See also:
CharacterReference

NumericCharacterReference

Definition at line 57 of file CharacterEntityReference.java.


Public Member Functions

final char charAt (final int index)
int compareTo (final Object o)
final boolean encloses (final int pos)
final boolean encloses (final Segment segment)
final boolean equals (final Object object)
String extractText (final boolean includeAttributes)
String extractText ()
List findAllCharacterReferences ()
List findAllComments ()
List findAllElements (final StartTagType startTagType)
List findAllElements (String name)
List findAllElements ()
List findAllStartTags (final String attributeName, final String value, final boolean valueCaseSensitive)
List findAllStartTags (String name)
List findAllStartTags ()
List findAllTags (final TagType tagType)
List findAllTags ()
List findFormControls ()
FormFields findFormFields ()
final List findWords ()
final int getBegin ()
char getChar ()
String getCharacterReferenceString ()
List getChildElements ()
int getCodePoint ()
String getDebugInfo ()
String getDecimalCharacterReferenceString ()
final int getEnd ()
String getHexadecimalCharacterReferenceString ()
String getName ()
String getSourceText ()
final String getSourceTextNoWhitespace ()
String getUnicodeText ()
int hashCode ()
void ignoreWhenParsing ()
boolean isComment ()
boolean isTerminated ()
final boolean isWhiteSpace ()
final int length ()
Attributes parseAttributes ()
final CharSequence subSequence (final int beginIndex, final int endIndex)
String toString ()

Static Public Member Functions

static String decode (final CharSequence encodedText, final boolean insideAttributeValue)
static String decode (final CharSequence encodedText)
static String decodeCollapseWhiteSpace (final CharSequence text)
static String encode (final CharSequence unencodedText)
static String encodeWithWhiteSpaceFormatting (final CharSequence unencodedText)
static String getCharacterReferenceString (final int codePoint)
static int getCodePointFromCharacterReferenceString (final CharSequence characterReferenceText)
static int getCodePointFromName (final String name)
static String getDecimalCharacterReferenceString (final int codePoint)
static String getHexadecimalCharacterReferenceString (final int codePoint)
static String getName (final int codePoint)
static String getName (final char ch)
static Map getNameToCodePointMap ()
static String getUnicodeText (final int codePoint)
static final boolean isWhiteSpace (final char ch)
static CharacterReference parse (final CharSequence characterReferenceText)
static String reencode (final CharSequence encodedText)
static final boolean requiresEncoding (final char ch)

Static Public Attributes

static final char _aacute = '\u00E1'
static final char _Aacute = '\u00C1'
static final char _acirc = '\u00E2'
static final char _Acirc = '\u00C2'
static final char _acute = '\u00B4'
static final char _aelig = '\u00E6'
static final char _AElig = '\u00C6'
static final char _agrave = '\u00E0'
static final char _Agrave = '\u00C0'
static final char _alefsym = '\u2135'
static final char _alpha = '\u03B1'
static final char _Alpha = '\u0391'
static final char _amp = '\u0026'
static final char _and = '\u2227'
static final char _ang = '\u2220'
static final char _apos = '\''
static final char _aring = '\u00E5'
static final char _Aring = '\u00C5'
static final char _asymp = '\u2248'
static final char _atilde = '\u00E3'
static final char _Atilde = '\u00C3'
static final char _auml = '\u00E4'
static final char _Auml = '\u00C4'
static final char _bdquo = '\u201E'
static final char _beta = '\u03B2'
static final char _Beta = '\u0392'
static final char _brvbar = '\u00A6'
static final char _bull = '\u2022'
static final char _cap = '\u2229'
static final char _ccedil = '\u00E7'
static final char _Ccedil = '\u00C7'
static final char _cedil = '\u00B8'
static final char _cent = '\u00A2'
static final char _chi = '\u03C7'
static final char _Chi = '\u03A7'
static final char _circ = '\u02C6'
static final char _clubs = '\u2663'
static final char _cong = '\u2245'
static final char _copy = '\u00A9'
static final char _crarr = '\u21B5'
static final char _cup = '\u222A'
static final char _curren = '\u00A4'
static final char _Dagger = '\u2021'
static final char _dagger = '\u2020'
static final char _dArr = '\u21D3'
static final char _darr = '\u2193'
static final char _deg = '\u00B0'
static final char _delta = '\u03B4'
static final char _Delta = '\u0394'
static final char _diams = '\u2666'
static final char _divide = '\u00F7'
static final char _eacute = '\u00E9'
static final char _Eacute = '\u00C9'
static final char _ecirc = '\u00EA'
static final char _Ecirc = '\u00CA'
static final char _egrave = '\u00E8'
static final char _Egrave = '\u00C8'
static final char _empty = '\u2205'
static final char _emsp = '\u2003'
static final char _ensp = '\u2002'
static final char _epsilon = '\u03B5'
static final char _Epsilon = '\u0395'
static final char _equiv = '\u2261'
static final char _eta = '\u03B7'
static final char _Eta = '\u0397'
static final char _eth = '\u00F0'
static final char _ETH = '\u00D0'
static final char _euml = '\u00EB'
static final char _Euml = '\u00CB'
static final char _euro = '\u20AC'
static final char _exist = '\u2203'
static final char _fnof = '\u0192'
static final char _forall = '\u2200'
static final char _frac12 = '\u00BD'
static final char _frac14 = '\u00BC'
static final char _frac34 = '\u00BE'
static final char _frasl = '\u2044'
static final char _gamma = '\u03B3'
static final char _Gamma = '\u0393'
static final char _ge = '\u2265'
static final char _gt = '\u003E'
static final char _hArr = '\u21D4'
static final char _harr = '\u2194'
static final char _hearts = '\u2665'
static final char _hellip = '\u2026'
static final char _iacute = '\u00ED'
static final char _Iacute = '\u00CD'
static final char _icirc = '\u00EE'
static final char _Icirc = '\u00CE'
static final char _iexcl = '\u00A1'
static final char _igrave = '\u00EC'
static final char _Igrave = '\u00CC'
static final char _image = '\u2111'
static final char _infin = '\u221E'
static final char _int = '\u222B'
static final char _iota = '\u03B9'
static final char _Iota = '\u0399'
static final char _iquest = '\u00BF'
static final char _isin = '\u2208'
static final char _iuml = '\u00EF'
static final char _Iuml = '\u00CF'
static final char _kappa = '\u03BA'
static final char _Kappa = '\u039A'
static final char _lambda = '\u03BB'
static final char _Lambda = '\u039B'
static final char _lang = '\u2329'
static final char _laquo = '\u00AB'
static final char _lArr = '\u21D0'
static final char _larr = '\u2190'
static final char _lceil = '\u2308'
static final char _ldquo = '\u201C'
static final char _le = '\u2264'
static final char _lfloor = '\u230A'
static final char _lowast = '\u2217'
static final char _loz = '\u25CA'
static final char _lrm = '\u200E'
static final char _lsaquo = '\u2039'
static final char _lsquo = '\u2018'
static final char _lt = '\u003C'
static final char _macr = '\u00AF'
static final char _mdash = '\u2014'
static final char _micro = '\u00B5'
static final char _middot = '\u00B7'
static final char _minus = '\u2212'
static final char _mu = '\u03BC'
static final char _Mu = '\u039C'
static final char _nabla = '\u2207'
static final char _nbsp = '\u00A0'
static final char _ndash = '\u2013'
static final char _ne = '\u2260'
static final char _ni = '\u220B'
static final char _not = '\u00AC'
static final char _notin = '\u2209'
static final char _nsub = '\u2284'
static final char _ntilde = '\u00F1'
static final char _Ntilde = '\u00D1'
static final char _nu = '\u03BD'
static final char _Nu = '\u039D'
static final char _oacute = '\u00F3'
static final char _Oacute = '\u00D3'
static final char _ocirc = '\u00F4'
static final char _Ocirc = '\u00D4'
static final char _oelig = '\u0153'
static final char _OElig = '\u0152'
static final char _ograve = '\u00F2'
static final char _Ograve = '\u00D2'
static final char _oline = '\u203E'
static final char _omega = '\u03C9'
static final char _Omega = '\u03A9'
static final char _omicron = '\u03BF'
static final char _Omicron = '\u039F'
static final char _oplus = '\u2295'
static final char _or = '\u2228'
static final char _ordf = '\u00AA'
static final char _ordm = '\u00BA'
static final char _oslash = '\u00F8'
static final char _Oslash = '\u00D8'
static final char _otilde = '\u00F5'
static final char _Otilde = '\u00D5'
static final char _otimes = '\u2297'
static final char _ouml = '\u00F6'
static final char _Ouml = '\u00D6'
static final char _para = '\u00B6'
static final char _part = '\u2202'
static final char _permil = '\u2030'
static final char _perp = '\u22A5'
static final char _phi = '\u03C6'
static final char _Phi = '\u03A6'
static final char _pi = '\u03C0'
static final char _Pi = '\u03A0'
static final char _piv = '\u03D6'
static final char _plusmn = '\u00B1'
static final char _pound = '\u00A3'
static final char _Prime = '\u2033'
static final char _prime = '\u2032'
static final char _prod = '\u220F'
static final char _prop = '\u221D'
static final char _psi = '\u03C8'
static final char _Psi = '\u03A8'
static final char _quot = '\u0022'
static final char _radic = '\u221A'
static final char _rang = '\u232A'
static final char _raquo = '\u00BB'
static final char _rArr = '\u21D2'
static final char _rarr = '\u2192'
static final char _rceil = '\u2309'
static final char _rdquo = '\u201D'
static final char _real = '\u211C'
static final char _reg = '\u00AE'
static final char _rfloor = '\u230B'
static final char _rho = '\u03C1'
static final char _Rho = '\u03A1'
static final char _rlm = '\u200F'
static final char _rsaquo = '\u203A'
static final char _rsquo = '\u2019'
static final char _sbquo = '\u201A'
static final char _scaron = '\u0161'
static final char _Scaron = '\u0160'
static final char _sdot = '\u22C5'
static final char _sect = '\u00A7'
static final char _shy = '\u00AD'
static final char _sigma = '\u03C3'
static final char _Sigma = '\u03A3'
static final char _sigmaf = '\u03C2'
static final char _sim = '\u223C'
static final char _spades = '\u2660'
static final char _sub = '\u2282'
static final char _sube = '\u2286'
static final char _sum = '\u2211'
static final char _sup = '\u2283'
static final char _sup1 = '\u00B9'
static final char _sup2 = '\u00B2'
static final char _sup3 = '\u00B3'
static final char _supe = '\u2287'
static final char _szlig = '\u00DF'
static final char _tau = '\u03C4'
static final char _Tau = '\u03A4'
static final char _there4 = '\u2234'
static final char _theta = '\u03B8'
static final char _Theta = '\u0398'
static final char _thetasym = '\u03D1'
static final char _thinsp = '\u2009'
static final char _thorn = '\u00FE'
static final char _THORN = '\u00DE'
static final char _tilde = '\u02DC'
static final char _times = '\u00D7'
static final char _trade = '\u2122'
static final char _uacute = '\u00FA'
static final char _Uacute = '\u00DA'
static final char _uArr = '\u21D1'
static final char _uarr = '\u2191'
static final char _ucirc = '\u00FB'
static final char _Ucirc = '\u00DB'
static final char _ugrave = '\u00F9'
static final char _Ugrave = '\u00D9'
static final char _uml = '\u00A8'
static final char _upsih = '\u03D2'
static final char _upsilon = '\u03C5'
static final char _Upsilon = '\u03A5'
static final char _uuml = '\u00FC'
static final char _Uuml = '\u00DC'
static final char _weierp = '\u2118'
static final char _xi = '\u03BE'
static final char _Xi = '\u039E'
static final char _yacute = '\u00FD'
static final char _Yacute = '\u00DD'
static final char _yen = '\u00A5'
static final char _Yuml = '\u0178'
static final char _yuml = '\u00FF'
static final char _zeta = '\u03B6'
static final char _Zeta = '\u0396'
static final char _zwj = '\u200D'
static final char _zwnj = '\u200C'
static final int INVALID_CODE_POINT = -1

Static Package Functions

 [static initializer]
static final StringBuffer appendCharacterReferenceString (final StringBuffer sb, final String name)
static final StringBuffer appendCollapseWhiteSpace (final StringBuffer sb, final CharSequence text)
static final StringBuffer appendDecimalCharacterReferenceString (final StringBuffer sb, final int codePoint)
static StringBuffer appendEncode (final StringBuffer sb, CharSequence unencodedText, final boolean whiteSpaceFormatting)
static final StringBuffer appendHexadecimalCharacterReferenceString (final StringBuffer sb, final int codePoint)
static final StringBuffer appendUnicodeText (final StringBuffer sb, final int codePoint)
static CharacterReference construct (final Source source, final int begin, final int unterminatedMaxCodePoint)
static CharacterReference findPreviousOrNext (final Source source, final int pos, final boolean previous)

Package Attributes

final int begin
List childElements = null
int codePoint
final int end
final Source source

Static Package Attributes

static final int MAX_CODE_POINT = 0x10FFFF

Private Member Functions

 CharacterEntityReference (final Source source, final int begin, final int end, final int codePoint)

Static Private Member Functions

static String getCharacterReferenceString (final String name)
static final boolean isValidReferenceNameChar (final char ch)

Private Attributes

String name

Static Private Attributes

static IntStringHashMap CODE_POINT_TO_NAME_MAP
static int MAX_NAME_LENGTH = 0
static Map NAME_TO_CODE_POINT_MAP = new HashMap(512,1.0F)

The documentation for this class was generated from the following file:

Generated by  Doxygen 1.6.0   Back to index