0% found this document useful (0 votes)

173 views13 pages

DTD and XML

This document provides an introduction to DTDs and their use in XML documents. It explains that a DTD defines the structure of an XML document by listing allowed elements and attributes. DTDs provide documentation of a document's structure and enable validation to ensure elements are used correctly. The document then covers internal and external DTD declarations as well as the different element types that can be defined such as empty elements, text-only elements, elements with child elements, and more. It also discusses attributes, entities, PCDATA and CDATA.

Uploaded by

Murugananthan Ramadoss

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

173 views13 pages

DTD and XML

Uploaded by

Murugananthan Ramadoss

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 13

DTD and XML, Part 1 Introduction

The purpose of this assignment was to: Learn what DTD is. Learn how and why to put constrains on an XML document by using a DTD. What is a DTD? A DTD(Document Type Definition) defines the the structure of a document with a list of allowed elements and attributes. Why should/could a DTD be used? There are several advantages to using DTDs that become very obvious as the size and complexity of the XML code increases. Because almost all non-trivial software that use XML benefit from a DTD, it's essential for document authors to understand how to write them.

There are two main reasons for XML authors to use DTDs for their XML documents:

Documentation. A developer can look at the DTD of a XML document and immediately understand it's structure. This makes it easy for independent groups to agree opon a common DTD for interchanging data.

Validation. The process of document validation involves passing an XML document through a XML parser that parses/reads the DTD and compares with the XML markup to ensure that elements appear in correct order, that mandatory elements and attributes are in place, and that no undefined elements or attributes have been inserted where they shouldn't have been.

Working with validated data makes life much easier for a developer. If data is known to be valid, it's completely predictable. There's no longer any need to clutter the code with error checks or assertions; if the document validates it can be taken for granted that the data will be there in the format it should be. DTD Declaration

A DTD can be declared as an internal reference (i.e. inline in your XML document), or as an external reference (points to a separate file). Internal DOCTYPE declaration

If a DTD is included directly in the XML document, a DOCTYPE definition with the following syntax should be used:

<!DOCTYPE root-element [element-declarations]>

Example of a XML document with an internal DTD declaration:

<?xml version="1.0"?> <!DOCTYPE message [ <!ELEMENT message (receiver,sender,subject,content)> <!ELEMENT receiver (#PCDATA)>

<!ELEMENT sender (#PCDATA)> <!ELEMENT subject (#PCDATA)> <!ELEMENT content (#PCDATA)> ]>

<message> <receiver >Buck</receiver> <sender>Lenny</sender> <subject>Welcome</subject> <content>Welcome Buck!</content> </message>

The DTD is interpreted by a XML parser like this:

!DOCTYPE message (second row) defines that this is message document .

!ELEMENT message (third row) defines the message element to have these four elements:receiver, sender, subject, content

!ELEMENT receiver (fourth row) defines the receiver element to be of the type "#PCDATA".

!ELEMENT sender (fifth row) defines the sender element to be of the type "#PCDATA".

!ELEMENT subject (sixth row) defines the subject element to be of the type "#PCDATA".

!ELEMENT content (seventh row) defines the content element to be of the type "#PCDATA" External DOCTYPE declaration

If the DTD is included from a separate .dtd file(external), a DOCTYPE definition with the following syntax should be used:

<!DOCTYPE root-element SYSTEM "URI/URL or System path to .dtd file">

<!DOCTYPE root-element PUBLIC "Path Description" "URI/URL or System path to .dtd file">

Same XML document as above, but now with an external DTD:

<?xml version="1.0"?> <!DOCTYPE message SYSTEM "message.dtd"> <message> <receiver >Buck</receiver> <sender>Lenny</sender> <subject>Welcome</subject> <content>Welcome Buck!</content> </message>

And this is a copy of the external .dtd file "message.dtd", containing the DTD:

<!ELEMENT message (receiver,sender,subject,content)> <!ELEMENT receiver (#PCDATA)>

<!ELEMENT sender (#PCDATA)> <!ELEMENT subject (#PCDATA)> <!ELEMENT content (#PCDATA)>

This was part 1 of the DTD and XML assignment. In part 2 you will learn about the components of XML documents seen from a DTD perspective, and how to use them for the markup declarations in the DTD.

DTD and XML, Part Two

Components of XML documents from a DTD perspective

From a DTD perspective, XML documents are constructed by these five components: Elements Attributes Entities PCDATA CDATA Elements

Elements are the main components of XML documents as well as HTML and XHTML documents.

"message", "subject", "sender","receiver" and "content"from the message example in DTD and XML, Part 1. are examples of XML elements.

Elements can be empty, have text, or other elements as their content. Declaring an Element

XML elements are declared with a DTD element declaration inside the DTD.

This is the syntax for an element declaration:

<!ELEMENT element-name content-keyword> or <!ELEMENT element-name (element-content)> Empty elements

Elements types with empty content are declared using the content keyword EMPTY:

<!ELEMENT element-name EMPTY>

For example:

<!ELEMENT break EMPTY>

In XML document: <break />

As the example show empty elements have no content between it's start tag and it's end tag. This is referred to as having empty content.

The "img" "br" elements are examples of empty elements from HTML and XHTML. Elements with only pure text

Element types with text (character data) only are declared using the content keyword #PCDATA inside round brackets, like this:

<!ELEMENT element-name (#PCDATA)> example:

<!ELEMENT sender (#PCDATA)> Elements types with any content

Element types declared using the keyword ANY , have no constraints on its content. It may contain subelements of any type and number.

<!ELEMENT element-name ANY> example: <!ELEMENT message ANY> Element types with child elements

Element types with one or more child elements are declared in a sequence using the name of the child elements inside round brackets:

<!ELEMENT element-name (child-element-name)> or <!ELEMENT element-name (child-element-name,another-child-element-name,.....)>

example: <!ELEMENT message (receiver,sender,subject,content)>

When child elements(subelements) are declared in a sequence separated by commas, the children must occur in the same sequence in the XML document. In a complete declaration, the children, and all those childrens children... and so on...must be declared as well.

The complete declaration of the "message" element would be:

<!ELEMENT message (receiver,sender,subject,content)> <!ELEMENT receiver (#PCDATA)>

<!ELEMENT sender (#PCDATA)> <!ELEMENT subject (#PCDATA)> <!ELEMENT content (#PCDATA)> Element types that can occur only once

<!ELEMENT element-name (child-name)>

example:<!ELEMENT message (content)>

In the example declaration above the child element(i.e. the content element)is constrained to occur only once inside the "message" element. Element types that must occur at least once

<!ELEMENT element-name (child-name+)> example: <!ELEMENT message (content+)>

The + sign in the example above declares that the child element(i.e. the content element) must occur at least once inside the "message" element. (I.e. a one to many constrain) Element types that doesn't have to occur, but could occur many times

<!ELEMENT element-name (child-name*)>

example:<!ELEMENT message (content*)>

The * sign in the example above declares that the child element(i.e. the content element) doesn't have to occur - but can occur many times - within the "message" element. (I.e. a zero to many constrain) Element types that doesn't have to occur, but could occur one time

<!ELEMENT element-name (child-name?)>

example:

<!ELEMENT message (content?)>

The ? sign in the example above declares that the child element(i.e. the content element) can occur zero or one time within the "message" element. Element types with either this or that content

example: <!ELEMENT message (receiver,sender,subject,(content|announcement))>

The example above declares that the "message" element must contain a "receiver" element, a "sender"element, a "subject" element, and either a "content" element or a "announcement" element. Element types with mixed content

example: <!ELEMENT message (#PCDATA|receiver|sender|subject|content)*>

The example above declares that the "message" element can contain zero or more occurrences of text content(parsed character), "receiver elements", "sender elements", "subject elements", or "content" elements. Attributes

An attribute is used to give extra information about an element.

Attributes are inserted within an elements start tag. An Attribute have a attribute name and an attribute value. The img element in HTML and XHTML, for example, use the src attribute to give extra information:

<img src="hacker.jpg" />.

The element name is "img". The attribute name is "src", and the attribute value is "hacker.jpg". The element itself, however, is empty.(has empty content) In XML, XHTML and stricter versions of HTML empty elements are closed by a " /" in the end tag of the element. Entities

Entities are variables for defining shortcuts/macros to text.

They can be declared as: Internal Entities(shortcuts/macros for associating an arbitrary piece of text) External Entities(incorporation of content from other files, i.e. XML files)

You probably know the HTML entity reference: " "(No Breaking SPace), which is used in HTML to insert an extra space in a a document. Entities like " " are expanded when a document is parsed by a parser.

You can define your own entities within the DTD, but some common entities are already definded in XML:

<!ENTITY lt <!ENTITY gt

"&"> ">">

<!ENTITY amp "&"> <!ENTITY apos "'"> <!ENTITY quot """> PCDATA

PCDATA stands for Parsed Character DATA.

Character data is the text between the start tag end tag of an XML element. This text will be parsed by a parser. CDATA

CDATA is character data(text) that will NOT be parsed by a parser. Assignment Description

Put constrains on the cv-template.xml file from XML Basics Assignment by using a DTD. The document should be well-formed(have correct XML syntax) and validate(follow the rules set up in the DTD). My solution, Assignment Files cv-template.xml To validate cv-template.xml you can do so here with the W3C Markup Validator: W3C Validation Service

Introduction To DTD
No ratings yet
Introduction To DTD
24 pages
DTD Home
No ratings yet
DTD Home
21 pages
DTD Tutorial2
No ratings yet
DTD Tutorial2
14 pages
Part 2: Legal Building Blocks of XML
No ratings yet
Part 2: Legal Building Blocks of XML
29 pages
XML DTD
No ratings yet
XML DTD
35 pages
DTD Tutorials
No ratings yet
DTD Tutorials
19 pages
Tutorial: Internal DTD Declaration
No ratings yet
Tutorial: Internal DTD Declaration
16 pages
# Lecture-21 Document Type Definition: Internal DTD Declaration
No ratings yet
# Lecture-21 Document Type Definition: Internal DTD Declaration
8 pages
Tutorial DTD
No ratings yet
Tutorial DTD
19 pages
DTD
No ratings yet
DTD
26 pages
TCP Lec03
No ratings yet
TCP Lec03
44 pages
Document Type Declarations
No ratings yet
Document Type Declarations
25 pages
WT Unit-Ii
No ratings yet
WT Unit-Ii
85 pages
Applications of XML
No ratings yet
Applications of XML
19 pages
Document Type Definition
No ratings yet
Document Type Definition
48 pages
XML DTD Xmlschemas XSLT Json Dom
No ratings yet
XML DTD Xmlschemas XSLT Json Dom
68 pages
XML DTD & Schema Guide
No ratings yet
XML DTD & Schema Guide
200 pages
Web Technologies UNIT-1 XML
No ratings yet
Web Technologies UNIT-1 XML
34 pages
XML DTD
No ratings yet
XML DTD
15 pages
A.M.Senthilkumar: Changepond Technologies LTD
No ratings yet
A.M.Senthilkumar: Changepond Technologies LTD
15 pages
DTD
No ratings yet
DTD
4 pages
WT Unit 3 Notes
No ratings yet
WT Unit 3 Notes
33 pages
Unit Ii
No ratings yet
Unit Ii
106 pages
XML DTD Basics for Developers
No ratings yet
XML DTD Basics for Developers
23 pages
XML
No ratings yet
XML
27 pages
Unit-1 XML To RWD
No ratings yet
Unit-1 XML To RWD
103 pages
Document Type Definition
No ratings yet
Document Type Definition
6 pages
Chapter 4 XML
No ratings yet
Chapter 4 XML
52 pages
Module 5
No ratings yet
Module 5
29 pages
Unit-5 Web Technology
No ratings yet
Unit-5 Web Technology
17 pages
Document Type Definition
No ratings yet
Document Type Definition
28 pages
XML Documents - Xquery Xpath
No ratings yet
XML Documents - Xquery Xpath
11 pages
XML Schema
No ratings yet
XML Schema
58 pages
Ch-2 - Defining SOAP Messages With WSDL
No ratings yet
Ch-2 - Defining SOAP Messages With WSDL
49 pages
WP Unit5
No ratings yet
WP Unit5
17 pages
Document Type Definition Dtds
No ratings yet
Document Type Definition Dtds
38 pages
XML Notes
No ratings yet
XML Notes
11 pages
08 DTD
No ratings yet
08 DTD
26 pages
SOA - Module 1 - PPT
No ratings yet
SOA - Module 1 - PPT
64 pages
What Is XML?: Week 1
No ratings yet
What Is XML?: Week 1
15 pages
Lecture 3
No ratings yet
Lecture 3
39 pages
Introduction To XML
No ratings yet
Introduction To XML
49 pages
XML DTD Basics for Developers
No ratings yet
XML DTD Basics for Developers
5 pages
Unit 1: Benefits of XML 1.structured Document
No ratings yet
Unit 1: Benefits of XML 1.structured Document
26 pages
DTD - Overview
No ratings yet
DTD - Overview
25 pages
Note PDF
No ratings yet
Note PDF
52 pages
Lecture 6
No ratings yet
Lecture 6
25 pages
Chapter4 CEF482
No ratings yet
Chapter4 CEF482
13 pages
Iwt 4 Unit
No ratings yet
Iwt 4 Unit
30 pages
Unit 3 - XML
No ratings yet
Unit 3 - XML
44 pages
SOA-ppt Xml-Technologies
No ratings yet
SOA-ppt Xml-Technologies
46 pages
XML
No ratings yet
XML
7 pages
Unit 3 - XML
No ratings yet
Unit 3 - XML
44 pages
XML - Unit3
No ratings yet
XML - Unit3
30 pages
XML & JSON: Syntax, Structure, and Comparison
No ratings yet
XML & JSON: Syntax, Structure, and Comparison
80 pages
Week 7
No ratings yet
Week 7
18 pages
Unit 9 Java and XML
No ratings yet
Unit 9 Java and XML
29 pages
Lecture 1 XML Introduction
No ratings yet
Lecture 1 XML Introduction
64 pages
SAP Scholarship Test: Be A Part of Today's Technical Revolution
No ratings yet
SAP Scholarship Test: Be A Part of Today's Technical Revolution
1 page
6th Central Pay Commission Salary Calculator
100% (436)
6th Central Pay Commission Salary Calculator
15 pages
Sap Tables List
100% (1)
Sap Tables List
2 pages
Sappress Abap Objects Application
0% (1)
Sappress Abap Objects Application
56 pages
Regulation - 2013 (Syllabus) : Dpsapc1
No ratings yet
Regulation - 2013 (Syllabus) : Dpsapc1
20 pages
Verb Conjugation To Eat
No ratings yet
Verb Conjugation To Eat
2 pages
Blended Consonants GR
No ratings yet
Blended Consonants GR
2 pages
Business Report Writing Skills
100% (1)
Business Report Writing Skills
78 pages
Taw10 2-Schedule3
No ratings yet
Taw10 2-Schedule3
5 pages
Using Abbreviations For Days of The Week
No ratings yet
Using Abbreviations For Days of The Week
2 pages
Verb Tense Stories
100% (5)
Verb Tense Stories
2 pages
Exclamatory Sentences
0% (1)
Exclamatory Sentences
2 pages
Sort Words That Start With D
No ratings yet
Sort Words That Start With D
1 page
Sort Words That Start With G
No ratings yet
Sort Words That Start With G
1 page
Color by Number Vowel Sounds
No ratings yet
Color by Number Vowel Sounds
2 pages
Advanced Linking Verbs
No ratings yet
Advanced Linking Verbs
2 pages
Subject and Verb Agreement
No ratings yet
Subject and Verb Agreement
2 pages
Blended Consonants BR
No ratings yet
Blended Consonants BR
2 pages
Add A Letter
No ratings yet
Add A Letter
2 pages
Misused Verbs Will Would
No ratings yet
Misused Verbs Will Would
2 pages
Sentence Missing Verbs
100% (3)
Sentence Missing Verbs
2 pages
Action Verbs: Annie Writes On The Board. The Puppy Ran Down The Road
No ratings yet
Action Verbs: Annie Writes On The Board. The Puppy Ran Down The Road
2 pages
The Verb To Be
100% (1)
The Verb To Be
2 pages
Log
No ratings yet
Log
90 pages
Chapter 17
No ratings yet
Chapter 17
32 pages
Client-Side Web Development
No ratings yet
Client-Side Web Development
19 pages
3 6书源
No ratings yet
3 6书源
1,386 pages
GC 2024 09 08
No ratings yet
GC 2024 09 08
16 pages
CS3 Backup ٢٠٢٥ ٠٦ ١٨ ٠١ ٢٩
No ratings yet
CS3 Backup ٢٠٢٥ ٠٦ ١٨ ٠١ ٢٩
41 pages
XSL & XSLT Guide for Web Dev Students
No ratings yet
XSL & XSLT Guide for Web Dev Students
5 pages
XML Notes
No ratings yet
XML Notes
48 pages
AJAX in Web Programming Guide
No ratings yet
AJAX in Web Programming Guide
25 pages
Web Navigation Automation Guide
No ratings yet
Web Navigation Automation Guide
3 pages
XML and PHP Basics for Web Tech
No ratings yet
XML and PHP Basics for Web Tech
14 pages
7 Oral Question Bank Te Computer
No ratings yet
7 Oral Question Bank Te Computer
8 pages
BCS-502 (Web Technology)
No ratings yet
BCS-502 (Web Technology)
2 pages
jQuery Traversing Techniques
No ratings yet
jQuery Traversing Techniques
12 pages
JavaScript HTML DOM Guide
No ratings yet
JavaScript HTML DOM Guide
20 pages
PHP School Management System
No ratings yet
PHP School Management System
38 pages
Javascript HTML Dom Elements
No ratings yet
Javascript HTML Dom Elements
26 pages
Bug Inject
No ratings yet
Bug Inject
20 pages
Cal 1
No ratings yet
Cal 1
3 pages
Scripts para Unbounce
No ratings yet
Scripts para Unbounce
10 pages
Oracle SOA - Fault Handling in File Adapter For CSV and XML Files in SOA 11G
No ratings yet
Oracle SOA - Fault Handling in File Adapter For CSV and XML Files in SOA 11G
6 pages
CS2358 - Internet Programming Lab - AJAX
No ratings yet
CS2358 - Internet Programming Lab - AJAX
10 pages
Chapter 11
No ratings yet
Chapter 11
73 pages
GC 2024 12 31
No ratings yet
GC 2024 12 31
7 pages
CSS Notes
No ratings yet
CSS Notes
5 pages
Chapter 1 Introduction To HTML5
No ratings yet
Chapter 1 Introduction To HTML5
46 pages
B.Tech Web Tech Exam Paper
No ratings yet
B.Tech Web Tech Exam Paper
2 pages
HTML - CSS Interview Questions
No ratings yet
HTML - CSS Interview Questions
65 pages
CSS Basics for Web Design
100% (1)
CSS Basics for Web Design
28 pages
ForceTV Log: HTTP Command Handling
No ratings yet
ForceTV Log: HTTP Command Handling
5 pages

DTD and XML

Uploaded by

DTD and XML

Uploaded by

DTD and XML, Part 1 Introduction

<!DOCTYPE root-element [element-declarations]>

Example of a XML document with an internal DTD declaration:

<message> <receiver >Buck</receiver> <sender>Lenny</sender> <subject>Welcome</subject> <content>Welcome Buck!</content> </message>

The DTD is interpreted by a XML parser like this:

!DOCTYPE message (second row) defines that this is message document .

<!DOCTYPE root-element SYSTEM "URI/URL or System path to .dtd file">

Same XML document as above, but now with an external DTD:

<!ELEMENT message (receiver,sender,subject,content)> <!ELEMENT receiver (#PCDATA)>

<!ELEMENT sender (#PCDATA)> <!ELEMENT subject (#PCDATA)> <!ELEMENT content (#PCDATA)>

DTD and XML, Part Two

This is the syntax for an element declaration:

<!ELEMENT element-name content-keyword> or <!ELEMENT element-name (element-content)> Empty elements

<!ELEMENT element-name EMPTY>

<!ELEMENT break EMPTY>

In XML document: <break />

<!ELEMENT element-name (#PCDATA)> example:

<!ELEMENT sender (#PCDATA)> Elements types with any content

<!ELEMENT element-name (child-element-name)> or <!ELEMENT element-name (child-element-name,another-child-element-name,.....)>

example: <!ELEMENT message (receiver,sender,subject,content)>

The complete declaration of the "message" element would be:

<!ELEMENT message (receiver,sender,subject,content)> <!ELEMENT receiver (#PCDATA)>

<!ELEMENT element-name (child-name)>

example:<!ELEMENT message (content)>

<!ELEMENT element-name (child-name+)> example: <!ELEMENT message (content+)>

<!ELEMENT element-name (child-name*)>

example:<!ELEMENT message (content*)>

<!ELEMENT element-name (child-name?)>

<!ELEMENT message (content?)>

example: <!ELEMENT message (receiver,sender,subject,(content|announcement))>

example: <!ELEMENT message (#PCDATA|receiver|sender|subject|content)*>

An attribute is used to give extra information about an element.

<img src="hacker.jpg" />.

Entities are variables for defining shortcuts/macros to text.

PCDATA stands for Parsed Character DATA.

You might also like