0% found this document useful (0 votes)

46 views30 pages

5.XML Processing

The document discusses two dominant XML parsing standards: DOM and SAX. DOM builds a tree representation of the entire XML document in memory before providing it to the application. SAX is event-based and passes XML elements to the application as they are parsed without building an in-memory tree representation, making it more memory efficient than DOM.

Uploaded by

Nivethitha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

46 views30 pages

5.XML Processing

Uploaded by

Nivethitha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 30

XML and Web Services

XML DOM & SAX

XML Processing - Parser to

Application Communication

Two dominant standards:

Document Object Model (DOM):

Tree-based model passes complete picture of
document to application at processing conclusion,
Java, JavaScript, IDL descriptions; Perl
implementation in independent development,
Simple API for XML (SAX):
Event-based model reads document to application
handlers,
Supported by nearly all Java XML parsers
2

R. LOGAMBIGAI, TA

December 8, 2016

DOM
DOM (Document Object Model)
It is an object model for representing XML documents
in your code.
Using DOM we can create or modify an XML document
programmatically.
The DOM defines theobjects and propertiesof all
document elements, and themethods(interface) to
access them.

R. LOGAMBIGAI, TA

December 8, 2016

Parsing XML - The View from the

Application

<?xml?>

XML Document

Loads

document

Parses declarations

Builds DTD

Interprets document against DTD

May validate
May build DOM tree
May provide XSL or XLink Services

Application
4

R. LOGAMBIGAI, TA

December 8, 2016

DOM Levels
Core DOM - standard model for any structured document
XML DOM - standard model for XML documents
HTML DOM - standard model for HTML documents

R. LOGAMBIGAI, TA

December 8, 2016

XML DOM
The XML DOM is:
A standard object model for XML
A standard programming interface for XML
Platform- and language-independent
A W3C standard
The XML DOM defines theobjects and propertiesof all
XML elements, and themethods(interface) to access
them.

R. LOGAMBIGAI, TA

December 8, 2016

DOM Nodes

DOM, everything in an XML document is anode.

The DOM says:
The entire document is a document node
Every XML element is an element node
The text in the XML elements are text nodes
Every attribute is an attribute node
Comments are comment nodes

R. LOGAMBIGAI, TA

December 8, 2016

Generic Form

XML
Document

R. LOGAMBIGAI, TA

Parser

DOM

Application
Programme

December 8, 2016

Parent, Children and Siblings

The nodes in the node tree have a hierarchical
relationship to each other.
The terms parent, child, and sibling are used to describe
the relationships.
In a node tree,
The top node is called the root
Every node, except the root, has exactly one parent
node
A node can have any number of children
A leaf is a node with no children
Siblings are nodes with the same parent
9

R. LOGAMBIGAI, TA

December 8, 2016

XML Document
Information to be represented in the DOM
structure.
<?xml version="1.0" encoding="UTF-8"?>
<entry id="Baker2005">
<author>Mark Baker and Amy W. Apon and Clayton Ferner and Jeff
Brown</author>
<title>Emerging Grid Standards</title>
<journal>IEEE Computer</journal>
<year>2005</year>
<volume>38</volume>
<pages>43-50</pages>
<number>4</number>
</entry>

R. LOGAMBIGAI, TA

December 8, 2016

Information To Be Represented
Document
Attributes

<citation.xml>

Root Element

<entry>

R. LOGAMBIGAI, TA

December 8, 2016

DOM Example
<?xml version=1.0?>
Node

addressbook
Node

<name>John Doe</name>
<email>jdoe@yahoo.com</email>
</person>
<person>
<name>Jane Doe</name>
<email>jdoe@mail.com</email>

XML
Parser
Node

person
Node

Name=John Doe

Node

email=jdoe@yahoo.com

person
Node

Name=John Doe

Node

email=jdoe@yahoo.com

</person>
</addressbook>

R. LOGAMBIGAI, TA

December 8, 2016

DOM Representation
Document
Node

Document Root

NodeList

<Child id=123>Text here</Child>

Element
Node

</Parent>

NodeList
Element
Node

<Child>
NamedNodeMap
Attribute
Node

<id=123>

NodeList
Text CDATA
Node

R. LOGAMBIGAI, TA

Text here
December 8, 2016

Common DOM Methods

Node.getNodeType()- the type of the

underlying object, e.g.

Node.ELEMENT_NODE.
Node.getNodeName() - value of this node,
depending on its type, e.g. for elements its
tag name, for text nodes always string
#text.
Node.getFirstChild() and
Node.getLastChild()- the first or last child
of a given node.
Node.getNextSibling() and
Node.getPreviousSibling()- the next or
previous sibling of a given node.
Node.getAttributes()R. LOGAMBIGAI, TA
14
collection December 8, 2016

Common DOM methods (2)

Node.getNodeValue()- value of this node,

depending on its type, e.g. value of an

attribute but null in case of an element node.
Node.getChildNodes()- collection that
contains all children of this node.
Node.getParentNode()- parent of this node.
Element.getAttribute(name)- an attribute
value by name.
Element.getTagName()- name of the
element.
Element.getElementsByTagName()collection
of
all
descendant
Elements
with
a
R. LOGAMBIGAI, TA
December 8, 2016
15
given tag name.

Common DOM methods (3)

Element.setAttribute(name,value)- adds

a new attribute, if an attribute with that

name is already present in the element, its
value is changed.
Attr.getValue()- the value of the
attribute.
Attr.getName()- the name of this attribute.
Document.getDocumentElement()- allows
direct access to the child node that is the
root element of the document.
Document.createElement(tagName)creates an element of the type specified.
16

R. LOGAMBIGAI, TA

December 8, 2016

Advantages & Disadvantages

Advantage:

(1) It is good when random access to

widely
separated parts of a document is
required
(2) It supports both read and write
operations
Disadvantage:
17

(1) It is memory inefficient

R. LOGAMBIGAI, TA
December 8, 2016
(2) It seems complicated, although

Simple API for XML (SAX)

Event driven processing of XML documents.
Parser sends events to programmers code (start and

end of every component).

Programmer decides what to do with every event.
SAX parser does not create any objects at all, it
simply delivers events.

R. LOGAMBIGAI, TA

December 8, 2016

SAX features
SAX API acts like a data stream.
Stateless.
Events are not permanent.
Data not stored in memory.
Impossible to move backward in XML data.
Impossible to modify document structure.
Fastest and least memory intensive way of working

with XML.

R. LOGAMBIGAI, TA

December 8, 2016

Basic SAX events

startDocument receives notification of

the beginning of a document.

endDocument receives notification of the
end of a document.
startElement gives the name of the tag
and any attributes it might have.
endElement receives notification of the
end of an element.
characters parser will call this method to
report each chunk of character data.
20

R. LOGAMBIGAI, TA

December 8, 2016

Additional SAX events

ignorableWhitespace allows to react

(ignore) whitespace in element content.

warning reports conditions that are not
errors or fatal errors as defined by the XML
1.0 recommendation, e.g. if an element is
defined twice in a DTD.
error non-fatal error occurs when an
XML document fails a validity constraint.
fatalError a non-recoverable error e.g.
the violation of a well-formed-ness
constraint; the document is unusable after
the parser has invoked this method.
21

R. LOGAMBIGAI, TA

December 8, 2016

SAX events in a simple example

<?xml version="1.0"?>

startDocument()

startElement(): xmlExample

<heading>
This is a simple
example.

characters():

This is a simple example

endElement(): heading

</heading>

characters(): That is all folks

That is all folks.

endElement(): xmlExample

</xmlExample>

startElement(): heading

R. LOGAMBIGAI, TA

endDocument()

December 8, 2016

SAX2 Handlers Interfaces

ContentHandler - receives notification of

the logical content of a document

(startDocument, startElement,
characters etc.).
ErrorHandler - for XML processing errors
generates events (warning, error,
fatalError) instead of throwing exception
(this decision is up to the programmer).
DTDHandler - receives notification of
basic DTD-related events, reports notation
and unparsed entity declarations.
EntityResolver
handles the external
R. LOGAMBIGAI, TA
December 8, 2016
23
entities.

DefaultHandler class
Class

org.xml.sax.helpers.DefaultHandler:
Implements all four handle interfaces with
null methods,
Programmer can derive from
DefaultHandler his own class and pass its
instance to a parser,
Programmer can override only methods
responsible for some events and ignore the
rest.
24

R. LOGAMBIGAI, TA

December 8, 2016

How Does SAX work?

XML Document

SAX Objects

<?xml version=1.0?>

Parser

startDocument

Parser

startElement

Parser

startElement & characters

<email>jdoe@yahoo.com</email>

Parser

startElement & characters

</person>

Parser

endElement

Parser

startElement

Parser

startElement & characters

Parser

startElement & characters

Parser

endElement

Parser

endElement & endDocument

</person>
</addressbook>

R. LOGAMBIGAI, TA

December 8, 2016

Advantages & Disadvantages

Advantage:

(1) It is simple
(2) It is memory efficient
(3) It works well in stream application
Disadvantage:
The data is broken into pieces and
clients never have all the information as
a whole unless they create their own
data structure
26

R. LOGAMBIGAI, TA

December 8, 2016

SAX vs. DOM

DOM

More information about

structure of the document,

Allows to create or modify
documents.

SAX

You need to use the

information in the document

only once,
Less memory use.

R. LOGAMBIGAI, TA

December 8, 2016

SAX vs. DOM

SAX Parser:

A SAX (SimpleAPI forXML) parser does not create any

internal structure. Instead, it takes the occurrences of
components of an input documentas events, and tells the client
what it reads as it reads through the input document

A SAX parser serves the client application always only with

pieces of the document at any given time.
A SAX parser, however, is much more space efficient in case of
a big input document (because it creates no internal structure).
Whats more, it runs faster and is easier to learn than DOM parser
because its API is really simple. But from the functionality point
of view, it provides a fewer functions, which means that the users
themselves have to take care of more, such as creating their own
data structures.
28

R. LOGAMBIGAI, TA

December 8, 2016

SAX vs. DOM

DOM Parser
A DOM (Document Object Model) parser creates a tree
structure in memory from an input document and then waits for
requests from client.

A DOM parser always serves the client application with the

entire document no matter how much is actually needed by the client.
A DOM parser is rich in functionality. It creates a DOM tree in
memory and allows you to access any part of the document repeatedly
and allows you to modify the DOM tree. But it is space inefficient
when the document is huge, and it takes a little bit longer to learn
how to work with it.

R. LOGAMBIGAI, TA

December 8, 2016

R. LOGAMBIGAI, TA

December 8, 2016

XML Processors
No ratings yet
XML Processors
4 pages
XML Parsers: When A Software Program Reads An XML Document and Takes Actions
No ratings yet
XML Parsers: When A Software Program Reads An XML Document and Takes Actions
7 pages
Understanding XML: Basics & Parsing
No ratings yet
Understanding XML: Basics & Parsing
5 pages
07 Java API For XML Processing Jaxp
No ratings yet
07 Java API For XML Processing Jaxp
140 pages
Chapter 5 XML With Java - Tan
No ratings yet
Chapter 5 XML With Java - Tan
45 pages
XML Parsers: SAX vs DOM Guide
No ratings yet
XML Parsers: SAX vs DOM Guide
20 pages
Mern Previous Papers
No ratings yet
Mern Previous Papers
59 pages
SAP PI 7.3 XML Parsing Guide
50% (4)
SAP PI 7.3 XML Parsing Guide
153 pages
SAP PI 7.3 XML Parsing Guide
No ratings yet
SAP PI 7.3 XML Parsing Guide
153 pages
XML (Extensible Markup Language) UNIT-4: DR Anupama Jha
No ratings yet
XML (Extensible Markup Language) UNIT-4: DR Anupama Jha
43 pages
Lec12 XMLCS
No ratings yet
Lec12 XMLCS
60 pages
TCP Lec06
No ratings yet
TCP Lec06
39 pages
SAX DOM: 1. Which Parser Can Get Better Speed, DOM or SAX Parsers?
No ratings yet
SAX DOM: 1. Which Parser Can Get Better Speed, DOM or SAX Parsers?
51 pages
Unit4 - Ccs375-Webtechnologies
No ratings yet
Unit4 - Ccs375-Webtechnologies
48 pages
Lecture 6
No ratings yet
Lecture 6
39 pages
SAX (Simple API For XML)
No ratings yet
SAX (Simple API For XML)
16 pages
XML Parsing Techniques in Java
No ratings yet
XML Parsing Techniques in Java
44 pages
SAX DOMpresentation
No ratings yet
SAX DOMpresentation
19 pages
JAXP for Java XML Developers
No ratings yet
JAXP for Java XML Developers
8 pages
XML Question
No ratings yet
XML Question
5 pages
4th Question
No ratings yet
4th Question
1 page
XML Parsers (Dom Sax)
No ratings yet
XML Parsers (Dom Sax)
20 pages
XML Dom
No ratings yet
XML Dom
2 pages
XML Subjective Questions and Answers
No ratings yet
XML Subjective Questions and Answers
6 pages
XML Document Object Model
No ratings yet
XML Document Object Model
33 pages
Java XML Parsers for Developers
No ratings yet
Java XML Parsers for Developers
23 pages
A.M.Senthilkumar: Changepond Technologies LTD
No ratings yet
A.M.Senthilkumar: Changepond Technologies LTD
15 pages
XML Dom
No ratings yet
XML Dom
12 pages
14 XML
No ratings yet
14 XML
4 pages
J2EE Guide for Developers
100% (1)
J2EE Guide for Developers
118 pages
Two Types of XML Parsers
No ratings yet
Two Types of XML Parsers
6 pages
XML Basics for Developers
No ratings yet
XML Basics for Developers
6 pages
Parsing XML With SAX, DOM & JDOM: Hicham Qaissi
No ratings yet
Parsing XML With SAX, DOM & JDOM: Hicham Qaissi
16 pages
Untitled Document
No ratings yet
Untitled Document
19 pages
XML Parser
No ratings yet
XML Parser
66 pages
Web Technology-UNIT-2
No ratings yet
Web Technology-UNIT-2
4 pages
Fundamental XML For Developers: Dr. Timothy M. Chester Texas A&M University
No ratings yet
Fundamental XML For Developers: Dr. Timothy M. Chester Texas A&M University
82 pages
Web Sematics Mid
No ratings yet
Web Sematics Mid
14 pages
X Cert1423 A4
No ratings yet
X Cert1423 A4
38 pages
XML Parsing for Python Developers
No ratings yet
XML Parsing for Python Developers
42 pages
Unit III
No ratings yet
Unit III
39 pages
Untitled Document
No ratings yet
Untitled Document
4 pages
03a XML
No ratings yet
03a XML
57 pages
Web Data: XML
No ratings yet
Web Data: XML
13 pages
XML Interview Questions
No ratings yet
XML Interview Questions
6 pages
XML APIs: SAX, DOM, JAXP, StAX Overview
No ratings yet
XML APIs: SAX, DOM, JAXP, StAX Overview
30 pages
Java XML
No ratings yet
Java XML
59 pages
Understanding AWS Core Services - Services List
No ratings yet
Understanding AWS Core Services - Services List
3 pages
Snoopy
0% (1)
Snoopy
24 pages
Oracle Applications - Usefull XML Tags in Bi Publisher
No ratings yet
Oracle Applications - Usefull XML Tags in Bi Publisher
6 pages
Unit 3 Java Beans
No ratings yet
Unit 3 Java Beans
32 pages
80838581
No ratings yet
80838581
9 pages
SEO-Optimized E-Learning Tools Overview
No ratings yet
SEO-Optimized E-Learning Tools Overview
7 pages
Lab Exercise
No ratings yet
Lab Exercise
50 pages
Pmi Acp Resources
No ratings yet
Pmi Acp Resources
2 pages
Exercise - 13 (Applet) : B) To Display Analog Clock Using Applet
No ratings yet
Exercise - 13 (Applet) : B) To Display Analog Clock Using Applet
3 pages
Chapter 9 Lab More Classes and Objects Lab Objectives
No ratings yet
Chapter 9 Lab More Classes and Objects Lab Objectives
6 pages
Software Engineering Past Papers
No ratings yet
Software Engineering Past Papers
17 pages
Cloud Foundation - Presentation
No ratings yet
Cloud Foundation - Presentation
68 pages
Continuous Deployment
No ratings yet
Continuous Deployment
6 pages
Docs Nestjs Com Websockets Adapter...
No ratings yet
Docs Nestjs Com Websockets Adapter...
7 pages
Paper Class 12 Computer Science - 045937
No ratings yet
Paper Class 12 Computer Science - 045937
3 pages
Azure Data Factory Interview Questions and Aswers
No ratings yet
Azure Data Factory Interview Questions and Aswers
5 pages
Answers To Testing Throughout The Software Life Cycle Section
No ratings yet
Answers To Testing Throughout The Software Life Cycle Section
4 pages
DHTML Basics for Web Developers
No ratings yet
DHTML Basics for Web Developers
4 pages
Visual Basic .NET Basics & IDE Guide
No ratings yet
Visual Basic .NET Basics & IDE Guide
16 pages
Spring Cloud Sleuth Guide
No ratings yet
Spring Cloud Sleuth Guide
31 pages
ETL Course
No ratings yet
ETL Course
4 pages
AirohaLog 20230110 165501
No ratings yet
AirohaLog 20230110 165501
28 pages
Tutorial Open Flash Chart Line Chart
No ratings yet
Tutorial Open Flash Chart Line Chart
12 pages
Apa
No ratings yet
Apa
22 pages
1.write A Program in C++ For Defining Class and Object #Includeiostream Using Namespace STD - Defining A Class Class Car ( Private Members (Data Encapsulation) Private String Brand - Int Year - Pu
No ratings yet
1.write A Program in C++ For Defining Class and Object #Includeiostream Using Namespace STD - Defining A Class Class Car ( Private Members (Data Encapsulation) Private String Brand - Int Year - Pu
15 pages
KSP Reference Manual
No ratings yet
KSP Reference Manual
379 pages
PUSLHCI-Assessment Brief
No ratings yet
PUSLHCI-Assessment Brief
12 pages
Project Report of Gym Website
No ratings yet
Project Report of Gym Website
30 pages
Help For The W3C Markup Validation Service
No ratings yet
Help For The W3C Markup Validation Service
10 pages
Requirements Engineering Guide
No ratings yet
Requirements Engineering Guide
54 pages

5.XML Processing

Uploaded by

5.XML Processing

Uploaded by

XML and Web Services

XML DOM & SAX

XML Processing - Parser to

Two dominant standards:

Document Object Model (DOM):

Parsing XML - The View from the

Interprets document against DTD

DOM, everything in an XML document is anode.

Parent, Children and Siblings

<Child id=123>Text here</Child>

Common DOM Methods

underlying object, e.g.

Common DOM methods (2)

depending on its type, e.g. value of an

Common DOM methods (3)

a new attribute, if an attribute with that

Advantages & Disadvantages

(1) It is good when random access to

(1) It is memory inefficient

Simple API for XML (SAX)

end of every component).

Basic SAX events

the beginning of a document.

Additional SAX events

(ignore) whitespace in element content.

SAX events in a simple example

This is a simple example

characters(): That is all folks

That is all folks.

SAX2 Handlers Interfaces

the logical content of a document

How Does SAX work?

startElement & characters

startElement & characters

startElement & characters

startElement & characters

endElement & endDocument

Advantages & Disadvantages

SAX vs. DOM

More information about

structure of the document,

You need to use the

information in the document

SAX vs. DOM

A SAX (SimpleAPI forXML) parser does not create any

A SAX parser serves the client application always only with

SAX vs. DOM

A DOM parser always serves the client application with the

You might also like