Académique Documents
Professionnel Documents
Culture Documents
DATA MODEL
5.1 INTRODUCTION
The human eye is highly efficient at recognizing shapes and forms, but computer need to be
instructed exactly how spatial patterns should be handled and displayed (Burrough, 1993).
Geographical variations in the real world is infinitely complex the closure it can be looked the
more detail can be seen almost without any limit. It would take an infinitely large database to
capture the real world precisely. So data must be somehow reduced to a finite and manageable
quantity by a process of generalization or abstraction. Geographical variation must be
represented in terms of discrete elements or objects (Goodchild and Kemp et al. 1990).
Despite the heterogeneity of the information that can be stored in a GIS, there are only a few
common methods of representing spatial information in a GIS database. In developing a GIS
application, real world features need to be translated into simplified representations that can be
stored and manipulated in a computer. Two data models raster and vector are mainly dominating
current commercial GIS software.
While the concept of the data model is used in a variety of ways by numerous disciplines, a
digital geographic data model is generally defined as an information structure which allows the
user to store specific phenomena as distinct representations, and enables the user to manipulate
the phenomena when held in the system as data (Raper and Maguire, 1992,
http://www.ncgia.ucsb.edu/~curtin /nonplanar.html).
The data model represents a set of guidelines to convert the real world (called entity) to the
digitally and logically represented spatial objects consisting of the attributes and geometry. The
attributes are managed by thematic or semantic structure while the geometry is represented by
geometric-topological structure (Shunji 1999).
A GIS typically possesses two important characteristics. It provides a close linkage between
digital cartographic information and an associated database. GIS are able to integrate data from a
verity of sources into a common geographic framework, although they all do not use the same
logic to achieve this. Most GIS use one of two basic spatial data models to represent the real
world, namely the vector model and the tessellation model (Burrough 1986, Aronoff 1989).
TOPOLOGICAL MODEL
The topological model is the most widely used method of encoding spatial relationships in a
vector based GIS (Peuquet 1984). Topology is that branch of mathematics used to define spatial
relationships between entities (ESRI 1992). For example, an area or polygon is defined by a set
of lines which makes up its boundaries. In this case the line is the border between two polygons.
Each line can represent part of a path connecting such other paths. For example, lines can be
used to represent streets and the routes which pass along them. The connectivity or contiguity of
these features is referred to as their topology structure (ESRI 1992). By sorting information
about the location of a feature relative to other features, topology provides the basis for many
kinds of geographic analysis without having access to the absolute locations held in the
coordinate files (ESRI 1992).
Topology is the mathematical method used to define spatial relationships. The model is termed
Arc-Node data model.
Arc the basic logical entity, a series of point that starts and end at a node.
Node is an intersection point where two or more arcs meet. A node can also occur at
the end of a dangling arc i.e. and arc that is not connected to another arc such as the
end of a dead-end street.
Polygon is comprised of a closed chain of arcs that represents the boundary of the
area.
Point is encoded as a single XY co-ordinate pair. Point is considered as the polygon
with no area.
Figure 5.2: Topological Model
TIN is a vector-based topological data model that is used to represent terrain data. A TIN
represents the terrain surface as a set of interconnected triangular facets. For each of the three
vertices, the XY (geographic location) and the (elevation) Z values are encoded.
1. Grid/Lunr/Magi
2. Imgrid GIS
3. Map Analysis Package (MAP)
GRID/LUNR/MAGI
In this model each grid cell is referenced or addressed individually and is associated with
identically positioned grid cells in all other coverages, rather than like a vertical column of grid
cells, each dealing with a separate theme. Comparisons between coverages are therefore
performed on a single column at a time. Soil attributes in one coverage can be compared with
vegetation attributes in a second coverage. Each soil grid cell in one coverage can be compared
with a vegetation grid cell in the second coverage. The advantage of this data structure is that it
facilitates the multiple coverage analysis for single cells. However, this limits the examination of
spatial relationships between entire groups or themes in different coverages.
(http://www.olemiss.edu/depts/geology/courses/ge470/RasterDataModel.htm#grid)
IMGRID GIS
To represent a thematic map of land use that contains four categories: recreation, agriculture,
industry and residence, each of these features have to be separated out as an individual layer. In
the layer that represents agriculture 1 or 0 will represent the presence or absence of crops
respectively. The rest of layer will be represented in the same way, with each variable referenced
directly. The major advantage of IMGRID is its two-dimensional array of numbers resembling a
map-like structure. The binary character of the information in each coverage simplifies long
computations and eliminates the need for complex map legends. Since each coverage feature is
uniquely identified, there is no limitation of assigning a single attribute value to a single grid cell.
On the other side, the main problem related to information storage in an IMGRID structure is the
excessive volume of data stored. Each grid cell will contain more than 1 or 0 values from more
than one coverage and a large number of coverages are needed to store different types of
information. (http://www.olemiss.edu/ depts/geology/courses/ge
470/RasterDataModel.htm#mgrid)
TESSELLATION MODEL
In the tessellation model the basic logical unit is a single cell or unit of space in a mesh (Star and
Estes 1990). The location of geographic objects is defined by the row and column position of the
cells that they occupy (Burrough 1986). The area that each cell represents defines the spatial
resolution. The value assigned in the cell indicates the value of the attribute it represents. Hence,
the essential property of such a data model is that spatial relations between logical units are
implicit in the tessellation (Aronoff 1989).