Class HtmlUtils
Escapes and unescapes based on the W3C HTML 4.01 recommendation, handling character entity references.
Reference: https://www.w3.org/TR/html4/charset.html
For a comprehensive set of String escaping utilities, consider
Apache Commons Text
and its StringEscapeUtils class. We do not use that class here in order
to avoid a runtime dependency on Commons Text just for HTML escaping. Furthermore,
Spring's HTML escaping is more flexible and 100% HTML 4.0 compliant.
- Since:
- 01.03.2003
- Author:
- Juergen Hoeller, Martin Kersten, Craig Andrews
- 
Constructor SummaryConstructors
- 
Method SummaryModifier and TypeMethodDescriptionstatic StringhtmlEscape(String input) Turn special characters into HTML character references.static StringhtmlEscape(String input, String encoding) Turn special characters into HTML character references.static StringhtmlEscapeDecimal(String input) Turn special characters into HTML character references.static StringhtmlEscapeDecimal(String input, String encoding) Turn special characters into HTML character references.static StringhtmlEscapeHex(String input) Turn special characters into HTML character references.static StringhtmlEscapeHex(String input, String encoding) Turn special characters into HTML character references.static StringhtmlUnescape(String input) Turn HTML character references into their plain text UNICODE equivalent.
- 
Constructor Details- 
HtmlUtilspublic HtmlUtils()
 
- 
- 
Method Details- 
htmlEscapeTurn special characters into HTML character references.Handles the complete character set defined in the HTML 4.01 recommendation. Escapes all special characters to their corresponding entity reference (for example, <).- Parameters:
- input- the (unescaped) input string
- Returns:
- the escaped string
 
- 
htmlEscapeTurn special characters into HTML character references.Handles the complete character set defined in the HTML 4.01 recommendation. Escapes all special characters to their corresponding entity reference (for example, <) at least as required by the specified encoding. In other words, if a special character does not have to be escaped for the given encoding, it may not be.- Parameters:
- input- the (unescaped) input string
- encoding- the name of a supported- charset
- Returns:
- the escaped string
- Since:
- 4.1.2
 
- 
htmlEscapeDecimalTurn special characters into HTML character references.Handles the complete character set defined in the HTML 4.01 recommendation. Escapes all special characters to their corresponding numeric reference in decimal format (&#Decimal;). - Parameters:
- input- the (unescaped) input string
- Returns:
- the escaped string
 
- 
htmlEscapeDecimalTurn special characters into HTML character references.Handles the complete character set defined in the HTML 4.01 recommendation. Escapes all special characters to their corresponding numeric reference in decimal format (&#Decimal;) at least as required by the specified encoding. In other words, if a special character does not have to be escaped for the given encoding, it may not be. - Parameters:
- input- the (unescaped) input string
- encoding- the name of a supported- charset
- Returns:
- the escaped string
- Since:
- 4.1.2
 
- 
htmlEscapeHexTurn special characters into HTML character references.Handles the complete character set defined in the HTML 4.01 recommendation. Escapes all special characters to their corresponding numeric reference in hex format (&#xHex;). - Parameters:
- input- the (unescaped) input string
- Returns:
- the escaped string
 
- 
htmlEscapeHexTurn special characters into HTML character references.Handles the complete character set defined in the HTML 4.01 recommendation. Escapes all special characters to their corresponding numeric reference in hex format (&#xHex;) at least as required by the specified encoding. In other words, if a special character does not have to be escaped for the given encoding, it may not be. - Parameters:
- input- the (unescaped) input string
- encoding- the name of a supported- charset
- Returns:
- the escaped string
- Since:
- 4.1.2
 
- 
htmlUnescapeTurn HTML character references into their plain text UNICODE equivalent.Handles complete character set defined in HTML 4.01 recommendation and all reference types (decimal, hex, and entity). Correctly converts the following formats: &#Entity; - (Example: &) case sensitive &#Decimal; - (Example: D) 
 &#xHex; - (Example: å) case insensitive
 Gracefully handles malformed character references by copying original characters as is when encountered. - Parameters:
- input- the (escaped) input string
- Returns:
- the unescaped string
 
 
-