Xerces 1.0.1
java.lang.Object
  |
  +--org.apache.xml.serialize.BaseMarkupSerializer
        |
        +--org.apache.xml.serialize.HTMLSerializer
Implements an HTML/XHTML serializer supporting both DOM and SAX
pretty serializing. The HTML/XHTML mode is determined in the
constructor. For usage instructions, see Serializer.
If an output stream is used, the encoding is taken from the output format (defaults to UTF-8). If a writer is used, make sure the writer uses the same encoding (if applicable) as specified in the output format.
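The writer case is the one that needs care, because the writer's charset is fixed when the writer is created. The following is a minimal sketch using only java.io; the class EncodingMatch and its encode method are illustrative names, not part of Xerces, and the encoding string is assumed to be the same one given to the output format.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStreamWriter;
import java.io.UncheckedIOException;
import java.io.Writer;
import java.nio.charset.Charset;

// Hypothetical helper, not part of Xerces: creates a writer whose charset
// matches the encoding that the output format is assumed to declare.
class EncodingMatch {

    // Writes the text through a writer created with the given encoding name
    // and returns the bytes that reached the underlying stream.
    static byte[] encode(String text, String encoding) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (Writer writer = new OutputStreamWriter(bytes, Charset.forName(encoding))) {
            writer.write(text);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return bytes.toByteArray();
    }

    public static void main(String[] args) {
        String encoding = "UTF-8"; // assumption: equals the output format's encoding
        byte[] out = encode("caf\u00e9", encoding);
        System.out.println(out.length + " bytes written as " + encoding);
    }
}
```

If the writer were created with a different charset than the one the output format declares, the declared encoding and the bytes actually written would disagree for any non-ASCII character.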
The serializer supports both DOM and SAX. DOM serializing is done
by calling BaseMarkupSerializer.serialize(org.w3c.dom.Element)
and SAX serializing is done by firing
SAX events and using the serializer as a document handler.
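The SAX path can be pictured with the SAX 1 interfaces that ship with the JDK. In this sketch, EchoHandler is a hypothetical stand-in for the serializer (it is not the Xerces class), and SaxDriver shows the order in which events would be fired at any DocumentHandler.

```java
import org.xml.sax.AttributeList;
import org.xml.sax.DocumentHandler;
import org.xml.sax.HandlerBase;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.AttributeListImpl;

// Stand-in for the serializer (hypothetical): records each SAX 1 event as
// markup so that the call order is visible.
@SuppressWarnings("deprecation")
class EchoHandler extends HandlerBase {
    final StringBuilder out = new StringBuilder();

    @Override
    public void startElement(String tagName, AttributeList attrs) {
        out.append('<').append(tagName);
        for (int i = 0; i < attrs.getLength(); i++) {
            out.append(' ').append(attrs.getName(i))
               .append("=\"").append(attrs.getValue(i)).append('"');
        }
        out.append('>');
    }

    @Override
    public void characters(char[] ch, int start, int length) {
        out.append(ch, start, length);
    }

    @Override
    public void endElement(String tagName) {
        out.append("</").append(tagName).append('>');
    }
}

@SuppressWarnings("deprecation")
class SaxDriver {
    // Fires the events for <p class="note">hi</p> at any document handler.
    static void fire(DocumentHandler handler) throws SAXException {
        AttributeListImpl attrs = new AttributeListImpl();
        attrs.addAttribute("class", "CDATA", "note");
        char[] text = "hi".toCharArray();

        handler.startDocument();
        handler.startElement("p", attrs);
        handler.characters(text, 0, text.length);
        handler.endElement("p");
        handler.endDocument();
    }

    // Runs the event sequence against the stand-in handler and returns its output.
    static String demo() {
        EchoHandler handler = new EchoHandler();
        try {
            fire(handler);
        } catch (SAXException e) {
            throw new IllegalStateException(e);
        }
        return handler.out.toString();
    }

    public static void main(String[] args) {
        System.out.println(demo());
    }
}
```

With the real serializer, the handler passed to fire would presumably be the object returned by asDocumentHandler, which is listed among the inherited methods; that wiring is not shown here because it depends on the Xerces classes.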
If an I/O exception occurs while serializing, the serializer
does not throw it immediately. The exception is thrown only
at the end of serializing (either at the end of DOM serializing or in SAX's DocumentHandler.endDocument()).
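One way to picture this deferred behaviour is a wrapper that remembers the first failure and rethrows it when serializing finishes. The sketch below is an illustration only: DeferredWriter, print and finish are hypothetical names, and skipping later writes after a failure is a choice made in this sketch, not something the documentation states.

```java
import java.io.IOException;
import java.io.Writer;

// Hypothetical sketch of deferred exception reporting; the names are
// illustrative and are not taken from the Xerces source.
class DeferredWriter {
    private final Writer out;
    private IOException firstFailure; // remembered, not thrown immediately

    DeferredWriter(Writer out) {
        this.out = out;
    }

    // Never throws: the first I/O failure is stored and later writes are skipped.
    void print(String text) {
        if (firstFailure != null) {
            return;
        }
        try {
            out.write(text);
        } catch (IOException e) {
            firstFailure = e;
        }
    }

    // Called once at the end of serializing; surfaces the stored failure, if any.
    void finish() throws IOException {
        if (firstFailure != null) {
            throw firstFailure;
        }
        out.flush();
    }
}

class DeferredDemo {
    // A writer that always fails, used to trigger the deferred path.
    static Writer failing() {
        return new Writer() {
            @Override
            public void write(char[] buf, int off, int len) throws IOException {
                throw new IOException("disk full");
            }

            @Override
            public void flush() {
            }

            @Override
            public void close() {
            }
        };
    }

    // Returns the message of the exception seen at finish(), or null if none.
    static String run(Writer target) {
        DeferredWriter w = new DeferredWriter(target);
        w.print("<html>");  // may fail, but does not throw here
        w.print("</html>"); // skipped if the first write failed
        try {
            w.finish();
            return null;
        } catch (IOException e) {
            return e.getMessage();
        }
    }
}
```

The practical consequence for callers is that a successful return from an intermediate call says nothing about whether output was written; only the final call reports the failure.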
For elements that are not specified as whitespace preserving, the serializer may break long text lines at space boundaries, indent lines, and serialize elements on separate lines. Line terminators are treated as spaces, and spaces at the beginning of a line are stripped.
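The whitespace handling described above can be sketched as a small word-wrapping routine. LineBreaker and its width and indent parameters are hypothetical and only illustrate the rule; the actual serializer takes its settings from the output format and may break lines differently.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of breaking text at space boundaries with indentation.
class LineBreaker {

    // Wraps text so that no line exceeds width (unless a single word is
    // longer), prefixing each line with indent spaces.
    static List<String> wrap(String text, int width, int indent) {
        StringBuilder padBuilder = new StringBuilder();
        for (int i = 0; i < indent; i++) {
            padBuilder.append(' ');
        }
        String pad = padBuilder.toString();

        List<String> lines = new ArrayList<>();
        StringBuilder line = new StringBuilder();
        // Splitting on any whitespace run treats line terminators as spaces
        // and drops spaces at the beginning of a line.
        for (String word : text.trim().split("\\s+")) {
            if (word.isEmpty()) {
                continue;
            }
            boolean overflows = indent + line.length() + 1 + word.length() > width;
            if (line.length() > 0 && overflows) {
                lines.add(pad + line);
                line.setLength(0);
            }
            if (line.length() > 0) {
                line.append(' ');
            }
            line.append(word);
        }
        if (line.length() > 0) {
            lines.add(pad + line);
        }
        return lines;
    }

    public static void main(String[] args) {
        for (String l : wrap("  one two\nthree four", 10, 2)) {
            System.out.println("[" + l + "]");
        }
    }
}
```

Text inside whitespace-preserving elements would bypass a routine like this entirely and be written as is.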
XHTML serializing differs slightly from HTML serializing.

See Also: Serializer
Fields inherited from class org.apache.xml.serialize.BaseMarkupSerializer
    _exception, _format, _started
Constructor Summary

HTMLSerializer()
    Constructs a new serializer.
protected HTMLSerializer(boolean xhtml, OutputFormat format)
    Constructs a new HTML/XHTML serializer depending on the value of xhtml.
HTMLSerializer(OutputFormat format)
    Constructs a new serializer.
HTMLSerializer(java.io.OutputStream output, OutputFormat format)
    Constructs a new serializer that writes to the specified output stream using the specified output format.
HTMLSerializer(java.io.Writer writer, OutputFormat format)
    Constructs a new serializer that writes to the specified writer using the specified output format.
Method Summary

protected void characters(java.lang.String text, boolean cdata, boolean unescaped)
    Called to print the text contents in the prevailing element format.
void endElement(java.lang.String tagName)
    Receive notification of the end of an element.
protected java.lang.String escapeURI(java.lang.String uri)
protected java.lang.String getEntityRef(char ch)
    Returns the suitable entity reference for this character value, or null if no such entity exists.
protected void serializeElement(Element elem)
    Called to serialize a DOM element.
void setOutputFormat(OutputFormat format)
    Specifies an output format for this serializer.
void startDocument()
    Receive notification of the beginning of a document.
protected void startDocument(java.lang.String rootTagName)
    Called to serialize the document's DOCTYPE by the root element.
void startElement(java.lang.String tagName, AttributeList attrs)
    Receive notification of the beginning of an element.
Methods inherited from class org.apache.xml.serialize.BaseMarkupSerializer
    asDocumentHandler, asDOMSerializer, attributeDecl, breakLine, characters, comment, comment, content,
    elementDecl, endCDATA, endDocument, endDTD, endEntity, enterDTD, enterElementState, escape,
    externalEntityDecl, flush, getElementState, ignorableWhitespace, indent, internalEntityDecl, leaveDTD,
    leaveElementState, notationDecl, printDoctypeURL, printSpace, printText, printText, printText, printText,
    processingInstruction, reset, serialize, serialize, serialize, serializeNode, serializePreRoot,
    setDocumentLocator, setOutputByteStream, setOutputCharStream, startCDATA, startDTD, startEntity,
    unindent, unparsedEntityDecl
Methods inherited from class java.lang.Object
    clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
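Constructor Detail

protected HTMLSerializer(boolean xhtml, OutputFormat format)
    Constructs a new HTML/XHTML serializer depending on the value of xhtml. #init must be called first, before the serializer is used.
    Parameters:
        xhtml - True if XHTML serializing

public HTMLSerializer()
    Constructs a new serializer. BaseMarkupSerializer.setOutputCharStream(java.io.Writer) or BaseMarkupSerializer.setOutputByteStream(java.io.OutputStream) must be called first, before the serializer is used.

public HTMLSerializer(OutputFormat format)
    Constructs a new serializer. BaseMarkupSerializer.setOutputCharStream(java.io.Writer) or BaseMarkupSerializer.setOutputByteStream(java.io.OutputStream) must be called first, before the serializer is used.

public HTMLSerializer(java.io.Writer writer, OutputFormat format)
    Constructs a new serializer that writes to the specified writer using the specified output format.
    Parameters:
        writer - The writer to use
        format - The output format to use, null for the default

public HTMLSerializer(java.io.OutputStream output, OutputFormat format)
    Constructs a new serializer that writes to the specified output stream using the specified output format.
    Parameters:
        output - The output stream to use
        format - The output format to use, null for the default

Method Detail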
public void setOutputFormat(OutputFormat format)
    Specifies an output format for this serializer.
    Parameters:
        format - The output format to use

public void startDocument()
    Receive notification of the beginning of a document.
    The SAX parser will invoke this method only once, before any other methods in this interface or in DTDHandler (except for setDocumentLocator).

public void startElement(java.lang.String tagName, AttributeList attrs)
    Receive notification of the beginning of an element.
    The parser will invoke this method at the beginning of every element in the XML document; there will be a corresponding endElement() event for every startElement() event (even when the element is empty). All of the element's content will be reported, in order, before the corresponding endElement() event.
    If the element name has a namespace prefix, the prefix will still be attached. Note that the attribute list provided will contain only attributes with explicit values (specified or defaulted): #IMPLIED attributes will be omitted.
    Parameters:
        tagName - The element type name.
        attrs - The attributes attached to the element, if any.
    See Also:
        DocumentHandler.endElement(java.lang.String), AttributeList

public void endElement(java.lang.String tagName)
    Receive notification of the end of an element.
    The SAX parser will invoke this method at the end of every element in the XML document; there will be a corresponding startElement() event for every endElement() event (even when the element is empty).
    If the element name has a namespace prefix, the prefix will still be attached to the name.
    Parameters:
        tagName - The element type name

protected void startDocument(java.lang.String rootTagName)
    Called to serialize the document's DOCTYPE by the root element.
    This method checks that it has not been called before (see BaseMarkupSerializer._started), serializes the document type declaration, and serializes all pre-root comments and PIs that were accumulated in the document (see BaseMarkupSerializer.serializePreRoot()). Pre-root content will be serialized even if this is not the first root element of the document.

protected void serializeElement(Element elem)
    Called to serialize a DOM element.
    Equivalent to calling startElement(java.lang.String, org.xml.sax.AttributeList), endElement(java.lang.String) and serializing everything in between, but better optimized.
    Parameters:
        elem - The element to serialize

protected void characters(java.lang.String text, boolean cdata, boolean unescaped)
    Called to print the text contents in the prevailing element format.
    Parameters:
        text - The text to print
        cdata - True if the text should be printed as CDATA
        unescaped - True if the text should be printed unescaped

protected java.lang.String getEntityRef(char ch)
    Returns the suitable entity reference for this character value, or null if no such entity exists.
    Parameters:
        ch - Character value

protected java.lang.String escapeURI(java.lang.String uri)
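
The getEntityRef contract above (an entity reference for a character, or null when none exists) can be pictured as a table lookup. The sketch below is hypothetical: EntityTable, its five entries, and the choice to return the bare entity name without the surrounding & and ; are all assumptions made for illustration, not the Xerces implementation.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical lookup table illustrating the "reference or null" contract.
class EntityTable {
    private static final Map<Character, String> NAMES = new HashMap<>();

    static {
        NAMES.put('<', "lt");
        NAMES.put('>', "gt");
        NAMES.put('&', "amp");
        NAMES.put('"', "quot");
        NAMES.put('\u00A0', "nbsp");
    }

    // Returns the entity name for ch, or null if the table has no entry.
    static String entityRef(char ch) {
        return NAMES.get(ch);
    }

    // Shows how a caller might use the null result: characters without an
    // entity are copied through unchanged.
    static String escape(String text) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char ch = text.charAt(i);
            String name = entityRef(ch);
            if (name != null) {
                sb.append('&').append(name).append(';');
            } else {
                sb.append(ch);
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        System.out.println(escape("a<b&c"));
    }
}
```

Returning null rather than an empty string lets the caller distinguish "no entity defined" from any valid name, which is why the escape loop tests for null explicitly.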