Geospatial Data
Geospatial Data
DATA
Structure
4.1 Introduction
Objectives
4.2 Data and Information
What is Data?
What is Information?
Data versus Information
Precision versus Accuracy
4.3 Kinds of Data
Spatial Data
Non-spatial /Attribute Data
Temporal Data
Metadata
Measured Data
Inferred Data
Imported (Converted) Data
4.4 Data Processing
Data Acquisition
Data Retrieval
Data Analysis
Data Presentation
4.5 Summary
4.6 Unit End Questions
4.7 References
4.8 Further/Suggested Reading
4.9 Answers
4.1 INTRODUCTION
You have been introduced to the basic concepts of geoinformatics and its
components in Block 1 Introduction of Geoinformatics of MGY-001 Overview
of Geoinformatics. The development of geographical or spatial awareness can
be traced back to the earliest civilisations when ancient people were interested
to know about the places that were suitable or unsuitable for different
purposes, like urban planning, agriculture, etc., and also to know where they
themselves were located in relation to these features and to other tribes or
social groups. Although most of the societies in the world have clearly moved
on a long way from primitive times, the basic concerns of modern societies’
spatial questions are much the same. These questions are still fundamental in
enabling people to navigate between different places, to organise their lives
and to utilise the resources around them. At the core of these activities lies the
use of geospatial data and information. In this unit, you will be introduced to
the concepts of geospatial data, its types and processing.
5
Concepts of Geospatial Objectives
Data
After studying this unit you should be able to:
• differentiate between data and information;
• define geospatial data and information;
• discuss about different kinds of data; and
• recognise major stages in data processing.
The ability of GIS to Data which are geographically referenced describe both the location and
handle and process characteristics of spatial features on the Earth’s surface. For example, to
geographically referenced describe a tourist place, we need to refer to its location (i.e. where it is?) and
data distinguishes it from
its characteristics (i.e. name, classification, number of tourists visiting that
other information systems.
location, etc.). Therefore, geoinformatics involves two components of
geographic data. First component is spatial data, which relate to the geometry
of spatial features, and the second component is non-spatial or attributes data,
which provide information about spatial features.
‘Data’ or datum in the singular is derived from Latin word, which means
‘having been’ or that which is ‘given’. A similar concept is contained in the
French translation of data as ‘donnée’, which also means ‘given’. In other
words, data are those things on which understanding and explanations are
based. Data contains information which is represented in the format of digit,
letter and symbol that is used to describe status, behaviour and the outcome of
geographical objects. Let us put it more simply. Data are those things on
which understanding, explanations and inferences are based.
6
Let us now understand what are geographic, geospatial and spatial data. The Introduction to Geospatial
Data
three terms are used interchangeably referring to data having a spatial
component. As you know ‘geographic’ refers to the Earth’s surface and near Spatial is a generic term
that refers to any
surface hence geographic data would refer to any data related to Earth’s
3dimensional space.
surface and near surface. ‘Geospatial’ specifically refers to location relative to Geospatial is a specific
the Earth’s hence, ‘geospatial data’ refers to any data related to any features term referring to Earth’s
and phenomenon related to Earth and has location as one of its attributes. space whereas geographic
Further, it does not necessarily refer to the surface of the Earth but also above data refers to any data
which has an explicitly
(as in weather) or below (as in ground water) the Earth. The term ‘spatial’ is a
geographic component,
broad term which refers not only to the space of Earth’s surface but to any ranging from spatial data
space. Hence, ‘spatial data’ may refer to any data related to any planet, to non-spatial data with
cosmos and even of human body also. Thus, ‘geospatial data’ is a subset of site locations.
‘spatial data’.
The terms ‘geospatial data’ and ‘geographic data’ are often used Expression of geospatial
interchangeably. However, ‘geospatial data’ is considered more precise in data in a geoinformatics
many contexts than ‘geographic data’ because it is also used in ways that do tool involves conversion
not necessarily involve a graphic representation of the information. of raw data into symbols
arranged in a manner,
The relation and difference between data and information will be discussed in which appears meaning-
next section. ful to human eye.
You will better understand this by following example. A retail chain may
collect data, such as where its customers live and what they purchase when
they visit one of the chain’s stores. This may result in a list of postal codes and
purchase amounts. Of course, these data are generally not useful until they
have been analysed and plotted on a map. Transforming these data into
information indicates that the majority of purchases are made by people from a
particular area. Now you see, this is how data can easily be converted into
information by a geoinformatics tool that can be used to more precisely target
the next marketing campaign.
Not precise
Spatial Data
10 Fig. 4.2: Types of spatial data and examples of raster and vector data types
a) Vector Data Introduction to Geospatial
Data
In the vector data, spatial locations of features are defined on the basis of
coordinate pairs. In this data format, real world features are represented in
the form of:
• lines (arc or line data)
• points (point or node data), and
• polygons (closed boundary encompassing area).
Points (or nodes) are the fundamental building block of spatial data. They
refer to a specific place, generally in a two-dimensional space.
Points are used to record the locations of objects, such as tourist place or
overhead bridge. Vectors are lines between the points, and are generally
stored as an ordered series of two or more nodes. Linear features, such as
pipelines, roads and rivers are usually represented in as vector data.
Polygon is a closed figure made from several vector lines, where the first
and last node in the series is the same point. Polygons are used to
represent features having areal extent, such as lakes, political boundaries,
soil type, etc., as shown in Fig. 4.3. The figure depicts how the real world
is seen as vector data.
b) Raster Data
In raster data type, real world features are represented as grids. Raster You will study in more
data uses a fixed grid dimension and record information about each grid. detail about the raster and
vector data in detail in
One or more features are associated with each grid cell. One set of cell
Unit 5 Data Types and
and associated value is known as a layer, as shown in Fig. 4.3. Raster Sources.
data, which often comes from remote sensing also include thematic maps
or scanned maps, as shown in Fig. 4.2. Digital satellite images are one of
the commonly used forms of raster data. If you are familiar with digital
photography, you will be able to recognise the raster graphics pixel as the
smallest individual grid unit which is a building block of an image.
Raster Data
Vector Data
Fig. 4.3: Representation of real world features as vector and raster data
(source: www.gis-mapping.org/basics/how-gis-works.html) 11
Concepts of Geospatial 4.3.2 Non-spatial/Attribute Data
Data
Non-spatial data generally records information about the objects represented
as the spatial data, such as material used to construct a building, the type of
rain gauge used to measure rainfall intensity, or land use. Non-spatial data
also known as attribute (tabular) data are the descriptive data that are linked
to spatial data as shown in Fig. 4.4. Attribute data are collected and compiled
for specific project work, like districts in the states, census tracts, name of
cities, and so on, and often comes packaged with map data.
We will now discuss in the following section about the other names, which are
also used to describe data.
A good example for change detection is the land use change of India as
depicted in Fig. 4.5. The first map has been prepared using 2004-05 data, and
12
the last map using 2008-09 data. You can see the changes in land use pattern Introduction to Geospatial
Data
over a period of 6 years. Preparation of such maps is essential in managing
and restricting the development activities to ensure sustainable growth.
Fig. 4.5: Maps showing changes in land use/land cover in India during 2004-2009
(source: NRSC, 2010)
4.3.4 Metadata
Metadata are information about the data, their contents, sources, accuracy,
methods of collection, and other descriptions that help ensure the data are
collected appropriately. Metadata allows the transfer of important information
about the data along with a data set. Information, such as the method of data
acquisition, source of attributes recorded with the data, and similar details are
all essential information about the data to be used in geoinformatics.
The geoinformatics community has created standards for metadata. These
standards include structured fields (absolutely essential for the software to
read the data) and unstructured metadata (that is designed for humans to read
and evaluate the suitability of the dataset for their needs). The Federal
Geographic Data Committee (FGDC) established one metadata standard
called the Content Standard for Digital Geospatial Metadata (CSDGM). A
similar (and mostly compatible) ISO 19115 standard for geographic
information metadata is also widely used.
4.5 SUMMARY
In this unit, you have studied about the data, information, its precision and
accuracy. You have learned that-
• Data are what you collect through observation, measurement, and
inference
• Information is the data presented in a organised form.
• Precision of data relates to repeatability of data obtained, while accuracy
is the measurement of data.
• Any data can be categorised into primary and secondary data based on its
genesis.
• Data can be labelled as spatial data, non-spatial data, temporal data,
measured data, metadata, inferred data and imported data.
• Data processing is used to create useful information from data. Major
stages of data processing are acquisition, retrieval, analysis and
presentation.
15
Concepts of Geospatial
Data Spend 4.6 UNIT END QUESTIONS
30 mins
1) What do you understand by spatial data? Write its characteristics.
2) What is temporal data? Mention its importance.
3) What are primary and secondary geospatial data?
4) Briefly explain the stages in data processing cycle.
4.7 REFERENCES
• NRSC (2010), National land use and land cover mapping using multi-
temporal AWiFS data, Fifth cycle (2008-09) report- change analysis of
five cycles, NRSC/LULC/1:250K/2009, National Remote Sensing
Centre, ISRO, DOS, Hyderabad, 51p. http://applications.nrsc.gov.in/
images/ lulc_report_0809.pdf.
• www.gis-mapping.org/basics/how-gis-works.html.
Data from above websites was retrieved between 1st June to 15th July 2011
4.9 ANSWERS
Check Your Progress I
1) Refer to section 4.2.1
2) Refer to section 4.2.3
3) Refer to section 4.2.4
Check Your Progress II
1) Refer to the introduction of section 4.3
2) Refer to Fig. 4.2
16