GIS Slides
Transcript of GIS Slides
Chapter 1 - Introduction
Why Does GIS Matter? Everything that happens, happens somewhere…
Location is an issue in many of the problems that society must solve…
Some of these problems are routine, others are monumental in scope…
Examples of Geographic ProblemsHealth CareDelivery CompaniesTransportationForestryGovernmentand MANY more...
Geographic Problems
Bases for ClassificationLevel of geographic detail or ScaleIntent or Purpose
Practical Objectives (Minimize cost, etc.)Driven by human curiosity (Science)
Use the same tools…Time Scale
Operational (day-to-day)transactional databases
Tactical (medium term)Strategic (long term)
Why is geographic information special?
It is Multidimensional (at least two coordinates required--x, y)
It can be voluminous (some geodatabases = terabyte)
It must be projected onto a flat surface
It requires special methods for its analysis
It can be very time consuming to integrate various types
It can be complex and expensive to update
It can require large amounts of data retrieval for simple tasks
from just Data to elusive Wisdom
“Information Systems help us to manage what we know, by making it easy to organize and store, access and retrieve, manipulate and synthesize, and apply to solutions…”
DATA = numbers, text, symbols INFORMATION = data serving some purpose--implies some degree of selection, organization, preparation, interpretationEVIDENCE = multiplicity of information from different sources, related to a specific problem with a validated consistencyKNOWLEDGE = information to which value has been added by interpretation based on a particular context (a book is read)WISDOM = used in the context of decisions made or advice given--based on all the evidence and knowledge available
Knowledge about how the world works vs. knowledge about how the world looks….
“How it works” knowledge is most valuable => prediction
The software of a GIS captures and implements general knowledge, while the database represents specific information
an example of general knowledge = classificationmore sophisticated forms = rule sets
Everybody has their own definition of GIS
a container of maps in digital forma computerized tool for solving geographic problemsa spatial decision support system
a mechanized inventory of geographically distributed features and facilitiesa tool for revealing what is otherwise invisible in geographic informationa tool for performing operations on geographic data that are too tedious or expensive or inaccurate if performed by hand
General publicdecision-makers, community groups, plannersmanagement scientists, operations researchersutility, transportation, and resource managers
scientists, investigators
resource managers, planners, cartographers
Brief History of GIS
1st GIS = Canada GIS, 1960s (map measuring system)U.S. Bureau of the Census - 1970 census of population1970s - USGS, DMA, etc. (map creation/editing)1970s - digital remote sensing satellites (LANDSAT), GPS1980’s = price of sufficiently powerful computers falls below critical
threshold, commercial software companies established1990s = digital data sets become widely available
(DCW, 1992)1993 = Xerox PARC center publishes first web-based
interactive maps1996 = Commercial Internet Map Servers introduced (MapQuest)2000+ = GIServices (location-based services, g-commerce)
“The Era of Exploitation”
Components of a GIS
Hardware
Software
Data
People (SAPs)
Procedures
Network
GIScience and others’ terms
Systematic study of the fundamental issues arising from the creation, handling, storage, and use of geographic information, as a well-defined class of information in general.
Other terms currently being used:geomaticsgeoinformaticsspatial information sciencesgeocomputationgeo-information engineering
These terms have different roots and emphasize different ways of thinking about problems.
Chapter 2 - Gallery of Applications
“One Day of Life with GIS…” GIS affects each of us, every day….
Why GIS is becoming so widespread...
Wider availability of GIS through the Internet
Reductions in the price of hardware and software
Greater awareness of the geographic dimension of decision-making, etc.
Easier to use interfaces
Better technology to support applications (data visualization, data management, linkage to other software).
Proliferation of geo-referenced digital data (GPS, VARs, etc.)
“Commercial off-the-shelf” (COTS) applications
Accumulated experience in the industry (making it work…)
Typical Goals of Scientific Applications
Rational, effective, and efficient allocation of resources
Monitoring and understanding observed spatial distributions
Understanding the difference that place makes…
Understanding processes in the natural and economic systems
Prescription of strategies for environmental conservation
The five “M’s” of GIS Applications
MappingMeasurementMonitoringModelingManagement
Lambert’s versionMapping, Monitoring, Management, Analysis, Planning
Local Government Applications
Of the tasks undertaken by local governments, 70 - 80% are geographically related.
GoalsImprove the quality of products, processes, and services Protect the health, safety, and welfare of citizens
Inventory Applications (roads, parcels, facilities, etc.)
Policy Analysis Applications (resource demand, potential capacity, etc.)
Management/Policy-making Applications (facility siting, routing, allocation, projections, etc.)
Business and Service Planning (Retail) Applications
Geodemographics is a shorthand term for composite indicators of consumer behavior that are available at a small-area level (e.g., census tracts, postal zone, etc.)
basis for market area analysisfastest growing application of GIS
Operationalday-to-day processing of routine transactions and inventory analysis (ex: stock management)
Tacticalallocation of resources to address specific (usually short-term) problems (ex: store sales promotions)
Strategicsupport for planning to achieve long-term goals
(ex: opening new stores)
Logistics and Transportation Applications
Deals with the movement of goods and people from one place to another, and the infrastructure that moves them…
Each application requires two parts:Static - fixed infrastructure (highways, railroads, etc.)Dynamic - the goods and people that are moving around
GPS provides the technology to track vehicles, etc.
Recently, applications include dynamically updated maps on the InternetMany applications involve optimization methodsThese applications have provided substantial savings over traditional manual methods...
Environmental Applications
Earliest applications of GIS were in the environmental fields
Competition between alternative uses of land has driven many
applications of GIS
Satellite remote sensing used to monitor land use change
Dynamic simulation models
Links to non-spatial models
Professional standard in most fields today...
Chapter 3 - Representing Geography
Representations or Modelshelp us assemble far more knowledge about the Earth than is possible on our own.Are reinforced by the rules and laws that we have learned to apply to the unobserved world…
Toblers’ First Law of Geography: Everything is related to everything else, but near things are more related than those far apart.
Example: Spatial interpolation methodsSince the world is so complex, revealing more detail the closer we look, it is necessary to make choices about
what to represent, at what level of detail, over what period of time
Digital Representation of Geography
Binary representation
Every item of useful information about the Earth’s surface is ultimately reduced to some combination of 0s and 1s.
The representation itself (0s and 1s) is rarely seen by the user, instead, the user sees a view designed to present the contents in a meaningful form.
Geographic representations are among the most ancient…
Sketches in the dirt or on cave walls probably preceded language that could relate equivalent information…
Effective media for communication between members of a small group
Invention of the printing press in the 15th centuryKnowledge could be the common property of humanity…Major restriction: representation had to be flat…
The “Age of Discovery” (15th century)Henry the Navigator of Portugal and other explorersMaps became the most valuable medium for
Establishing new discoveriesAdministering colonial empires
A Key Issue: What to represent and how to represent it…
Any application of GIS requires clear attention to questions of what to represent, and how.
There is a multitude of possible ways of representing the geographic world in digital form,
none of which is perfect,
and none of which is ideal for all applications.
Place, Time, and Attributes…
Geographic data link place, time, and attributes.Time is optional in GIS
Time can be omitted in many casesExample: elevation
Time is essential in some casesExample: atmospheric temperature
Attributes are classified as:Nominal
Serves to identify one entity from anotherExamples: place namesCan include: Numbers, letters, colors, names, etc.IMPORTANT: It makes no sense to apply arithmetic operations to this class of data!
Attributes, Continued
OrdinalValues have a natural orderExample: Class 1 is best, Class 2 is not as good, etc.IMPORTANT: Adding, averaging, or taking ratios makes no sense with this class of data!Medians are legitimate and can be a useful value
IntervalDifference between values makes sense.Example: Celsius temperature scale
It makes sense to say that 30 and 20 areas different as 20 and 10
However, 20 is not twice as hot as 10Any scale data with an arbitrary zero point
Attributes, Continued
RatioRatios between values makes sense...Example: Weight (100 lb is twice as heavy as 50 lb)
Cyclic, or DirectionalSpecial type of data with special problems
Example: Compass Directions0 degrees to 359 degreesCannot average all values
(ex: (359 + 1)/2 = 180)Example: flow directions on linear segments
Degrees, Minutes, Seconds VS. Decimal Degrees
The Fundamental Problem with representations
The world is infinitely complex, but computer systems are finite. Therefore, representation is all about the choices that are made in capturing the knowledge about the world.
A representation must be partial…Limit level of detailIgnore changes through timeIgnore certain attributesSimplify by classifying into ranges, etc.
Example: spatial resolution for representation of worldwide elevation
the Earth’s surface covers 500 million sq km, therefore, 10km resolution = 5 million cells of info, 1km resolution = 500 million cells of info, and 1 meter resolution = 500 trillion cells of info
Two Conceptual Schemes for Representation of Geography
A fundamental choice has to be made between representing the geographic world as:
Discrete ObjectsOr
Fields
The Discrete Object View
Represents the world as objects with well-defined boundaries in empty space.
Objects are instances of generally recognized categories.Objects can be counted
Good Examples: biological organisms, manufactured objects
Messy Examples: mountains (where does it start/end?)
Offers powerful way of linking attribute information about each object (row in a data table corresponds to each object and the columns hold attribute data on each object)
Dimensionality in the Object View
Two-dimensional objectsareas (most often referred to as polygons)Examples: lakes, administrative regions
One-dimensional objectsLinesExamples: roads, rivers
Zero-dimensional objectsPointsExamples: individual animals, buildings
Limitations of the Object View
In reality, all objects are 3-dimensional, so representation in fewer dimensions is just an approximation.
Ability of GIS software to handle true 3-D objects as volumes is very limited.
Can assign height attributes to each coordinate that describes the object (sometimes called “2.5D”)
2-Dimensional networks handle overpasses by assigning turning options at each intersection (no turns= overpass)
Works poorly for continuous surfaces
The Field View
Continuous surfaces are represented better in the field view.
In this view the geographic world can be described by a finite number of variables, each measurable at any point on the Earth’s surface, and changing in value across the surface.
Fields are distinguished by what varies, and how smoothly.
Field or Object View?
Fields can represent continuous variation across space OR lines
Examples that may also be represented in the object view:ElevationPopulation densityLand useSoil typeTraffic density along a road networkLakes (degrees of “lakeness” – assign every point a value describing the condition….dry, sometimes flooded, etc.)
Raster and Vector Data Models
Fields and Discrete Objects are conceptual views.Two methods are used to reduce geographic phenomena to forms that can be coded into computer databases, called Raster and Vector data models.
Raster Representation ModelGeographic space is divided into an array of cells (or pixels)The cells are usually squareVariation is expressed by assigning values to cells
Vector Representation ModelPoints, Lines, Areas (Polygons)
Raster Data
Spatial resolution of raster data = length of a cell side
Square cells do not fit together neatly on a curved surface.
When information is presented in raster form all detail about variation within cells is lost, and instead the cell is given a single value.
Must establish rules for assigning cell valuesMajority or plurality methodCentral point of the cell method
Common form of raster data is satellite imagery.
Vector Data
Points are captured as x and y coordinates.
Lines are captured as points connected by precisely straight lines (also called polylines when representing a curved line)
Areas captured by as a series of points (vertices) connected by straight lines (also called polygons).
Raster vs. Vector
“Raster is vaster, and vector is correcter”Raster model requires very small cells sizes to accurately represent the location of features (lots of cells with attribute data for every cell).Vector model just requires that the vertices that make up the line or polygon be stored.
Apparent precision of vector is often an unreasonable representation of the spatial accuracy of the data.Issues:
Volume of Data Sources of dataApplicationsSoftwareResolution
Paper Maps
Analog representation, or physical model of the world scaled to fit the size of the paperLimitations include: static, hard to update, limited 3-D
Scale or Representative Fraction is defined as the ratio of the distance on the map to the distance on the Earth’s surface.Example:1:24,000 (1 inch on the map represents 24,000 inches on the surface of the Earth, AND, 1 foot on the map represents 24,000 feet on the surface, or 1 meter:24,000 meters, etc.)Scale for a digital geographic database refers to the scale of the source paper map.Large scale vs. small scale maps
1:2000 map is a “larger scale” map than a 1:100,000 map
Chapter 5 - Georeferencing
“Atomic element” of geographic informationLocation (essential)Time (optional)Attributes (usually included)
Without locations, data is non-spatial (or aspatial)
The act of assigning locations to informationCommon Terms: georeference, geolocate, geocode
Primary Requirements
A georeference must be uniqueOnly one location associated with a given georeferenceNo confusion about the location that is referencedMeaning must be shared among all that work with the infoCan link different kinds of info to based on a common location
To be most useful, georeferences should stay constant through time.
Georeferencing Concepts
Every georeference has an associated spatial resolution equal to the size of the area that is assigned.
note: mailing address, state, zip, etc. can vary in size
Many systems of georeferencing are unique only within an area or domain of the Earth’s surface.
Examples: City Names…there is a Jacksonville in Florida and in North Carolina…and street names…there are many Main Streets, but only one per city….
Metric Georeferences
Based on measurements instead of namesExamples: latitude and longitude, UTM coordinates, etc.
Biggest Advantage: potential for infinitely fine spatial resolution limited only be the measuring devices that we have…
Another advantage: from measurements of two or more locations, it is possible to compute distances
Note: other types of georeference systems, like street addresses, only order locations
Placenames
Simplest, most ancient method
Language extends the usefulness for georeferencingEx: “between” two places or “near” a place or 1 mile north of some place
But, usefulness is limited because:Meanings vary between people and within context usedCoarse resolution (ex: “within Asia” is vague)Placenames can vary with time
Postal Addresses and Postal Codes
Assumptions:Every dwelling and office is a potential destination for mailDwellings and offices are arrayed along roads and numbered accordinglyRoads have names that are unique within local areasLocal areas have names that are unique within larger regionsRegions have names that are unique within countries
Zip codes vary in size (and are changed frequently), but are useful for mapping summarized data
Postal Addresses don’t always work…
Don’t work for natural features
Don’t work when dwellings are not numbered consecutively along streets (such as in Japan, where the number reflects date of construction).
Don’t work well in high-rise buildings where many dwellings occupy the same horizontal space…
Linear Referencing Systems
Often used for managing transportation infrastructure
Defines location on a network by measuring the distance from a defined point of reference along a defined path in the network.
Closely related to the street address system, but provides a more explicit measurement of distance
Cadasters
The Cadaster is the map of land ownership in an area
The property appraiser maintains the cadastral map for purposes of taxing land and keeping public records of land ownership…
Parcels of land have unique numbers or codes that are consistent through time, but hard to remember!
Public Land Survey System (PLSS)
Used to survey the vast areas beyond the “original colonies” beginning in the early 19th CenturyA metric system of georeferencing (with some problems due to curvature of the earth…)Description of property location by section/township/range
Prime Meridian (north-south line)Ranges (rows that are six miles apart and perpendicular to the prime meridian line, N or S)Townships (columns that are six miles apart form blocks that are on either side (E or W) of the prime meridian)Sections (each township is divided into 36 sections, roughly 640 acres each (5280 ft/mi and 43560 sq. ft. / acre)Describe smaller parcels with quarter sections, etc.
Latitude and Longitude
The most comprehensive systemPotential for very fine spatial resolutionCan compute distances between locationsSupports other forms of spatial analysis
Often called the geographic system of coordinates
Longitude Defined
Based on the axis of the Earth’s rotationCenter of mass lies on this axis
The plane through the center of mass perpendicular to the axis of rotation is the equator
Slices through the Earth parallel to the axis and perpendicular to the plane of the equator define lines of constant longitude.
A line of constant longitude is called a meridian.
Zero longitude (the prime meridian) goes through a line at the Royal Observatory in Greenwich, England
Measuring Longitude
All longitude slices are measured as angles from the prime meridian
360 total degrees 180 degrees West or East
East longitudes are stored in computers as positive numbersWest longitudes are stored as negative numbers
60 minutes in a degree60 seconds in a minuteUse decimal degrees to accommodate computersDecimal Degrees = Degrees + minutes/60 + seconds/3600
The Earth is not a perfect sphere…
The term ellipsoid or spheroid is used to describe the shape…The Earth is slightly flattened, such that the distance between the Poles is about 1/300 less than the diameter at the Equator.
Many ellipsoids have been used over the years…The WGS84 ellipsoid (the World Geodetic System of 1984) is the basis for most new mapping
Or the North American Datum of 1983 (NAD83, NAD83 HARN, etc.)…
Others are still used by other countries—be careful….
Latitude Defined
If we draw a line through any point on the Earth’s surface that is perpendicular to the ellipsoid at that location, then measure the angle made by this line with the plane of the Equator, this angle defines the latitude of that point.
Angles vary from 90 degrees North to 90 degrees South.North angles are stored in computers as positive numbers and South angles as negative numbers.
Lines of constant latitude are called parallels
Distances in longitude and latitude
Latitude distances are constantTwo points on the same meridian, separated by one degree of latitude are about 111 km apart (1/360th)One minute of latitude ~= 1.86 km (a nautical mile)One second of latitude ~= 30 meters
Longitude distances vary Meridians converge on the poles, henceLines are farthest apart at the equator (111 km)
One degree of longitude ~= 96 km at 30 degrees North or South (~58 miles)
~ 78 km at 45 degrees~ 55 km at 60 degrees
Map Projections
Much work in GIS deals with a flattened or projected EarthPaper is flat, rasters are flat, etc.
A map projection transforms a position identified by latitude and longitude into a position in Cartesian coordinates (x,y)
It is very important to know the map projection of a data set.Projections distort at least one of these properties
Shape AreaDistanceDirection
Projection Properties and Classes
A map projection can have either property (but not both):Conformal – preserves local shape
(useful for navigation)Equal area – preserves area measurements
(useful for analysis)Other projections are termed equidistant and true-direction
Three major classes of projectionsCylindricalAzimuthal or planarConic
Graticule shows how the lat/long lines map onto the projection
Using Unprojected Data
Be careful using a GIS to analyze data in latitude and longitude rather than projected coordinates, because serious distortions in distance and area may result…
“Projection on-the-fly”
Common Coordinate Systems
Universal Transverse Mercator (UTM)Global or national mapping use60 zones (6 degrees of longitude for each zone)Problems:
Maps don’t fit together across zonesArbitrary definition of zones
Conformal (shape is preserved) and scale is same in all directionsStandard coordinate system used Coordinates are in meters
500,000m Easting
Common Coordinate Systems (cont.)
State Plane Coordinate SystemMore accurate than UTMCoordinates in feetNo problem with arbitrary zonesEstablished in the 1930’s, each state adopted its own map projection based on minimizing distortion
Some large states have internal zones to minimize distortion even more
Most GIS software have the named projections stored so that it is relatively easy to convert between the most common map projections.
Chapter 4 - The Nature of Geographic Data
Reminder: The fundamental problem of GIS is that of selecting what to leave in and what to leave out of digital representations of the real world.
Smoothness and irregularity are important distinguishing characteristics of geographic data.
The scale or level of detail may determine whether spatial and temporal phenomena appear regular or irregular.
Spatial heterogeneity = the tendency of geographic places and regions to be different from each other.
Spatial data tend to exhibit an increasing range of values, or increased heterogeneity, with increased distance.
Isopleth and Choropleth Maps
Isopleth maps are used to visualize phenomena that are conceptualized as fields and measured on interval or ratio scales. (see Figure 4.9)
An Isoline connects points of equal attribute values…created from sample points measurements
Choropleth maps are constructed from values describing properties of non-overlapping areas.
Areas are shaded/colored to show value of variablespatially extensive variables - values are true only for entire areas (ex: total population)
Important: Can be misleading…see Figure 4.10spatially intensive variables - values that could potentially be true for every part of an area (if it is homogeneous), (ex: densities, rates, proportions)
The Lengths of Geographic Objects?
Coastline Exampleas you measure with more precision, the length gets longer…
any approximation is scale-dependent length is indeterminate
where small deviations resemble larger deviations in form, the coastline is self-similaras the path of the coast traverses space, its intricate structure comes to fill up more space than a one-dimensional straight line but less space than a two-dimensional area….as such it is said to be of fractional dimension (a fractal) between 1(a line) and 2 (an area).
Chapter 8 - Geographic Data Modeling
Decisions about the type of model to be adopted are vital to the success of a GIS project.
Levels of abstraction (see Figure 8.2)reality
(buildings, streets, wells, etc.)conceptual
discrete object or field modelslogical
raster or vector modelsdiagrams
physicalactual format of files or database tablesDependent on your selected software, etc.
Phases of GIS Modeling
First phasedefine main types of real objects to be representedchoose conceptual model
Second Phasecreation of diagrams, lists of attributes, etc. for each object to be modeleda logical model is independent of the software used
Final Phasecreate a model showing how objects will be digitally implemented in a specific GIS software packagephysical models describe the exact files or tables used
Types of Data Models Used in GIS
Computer-aided design (CAD)GraphicalImageRaster/GridVector/geo-relational topologicNetworkTriangulated irregular network (TIN)Object
Raster Data Model
Stored as an array of cells (often called a grid)Satellite images are stored as stacks of arrays representing each spectral band in the imageair photos are usually just one arrayassociated with the field conceptual modelInteger or floating point values for cells
Integer values can be associated with an attribute tableMetadata about the array often held in a file header
geographic coordinate of the upper-left corner of arraycell sizenumber of rows and columns
Primary operational problem = large size of raw datasets
Raster Compression Techniques
Compression techniques (see Box 8.1)Lossless
run-length encoding (encoding row cells with a pair of values---no. of cells with same value, value)block encoding (quadtree data structure)
lossywavelet (remove information recursively by examining patterns in the data at different scales)
highest level of data compressiononly useful for satellite images, air photos, etc.MrSID (Multiresolution Seamless Image Database)fast viewing at different scales with appropriate amounts of detail for the scale (.sid extension)
Vector Data Model
Associated with the discrete object conceptual modelpoint, line, and polygon objectssee figure 8.7
2, 3, or 4 dimensions associated with each coordinate3-D = height, 4-D for time, offsets, etc.
geographic entities are called featuresfeature table - each feature occupies a row and each property or attribute of the feature occupies a column
Two types of featuressimpletopologic
Simple Features
Also called spaghetti features lines and polygons can overlapthere are no relationships between any of the objectsExample: ESRI shapefiles
Advantageseasy to create and storedraws quickly on the screen
Disadvantageslack of any connectivity relationships (limits network and polygon adjacency analysis methods)inefficient for modeling phenomena conceptualized as fields because adjacent boundary coords are stored twicepotential for overlap can cause problems (ex: ownership)
Topologic Features
Simple features that are structured using topologic rules
Topology is the science and mathematics of relationships used to validate the geometry of vector entities, and for operations such as network tracing and tests of polygon adjacency
Relationships are non-metric (qualitative) properties of geographic objects that remain constant when geographic space is distorted.
Example: When a map is “stretched” , properties like distancechange, but topological properties like adjacency do not.
Line data are also sometimes called 1-cell, arc, edge, or linkpolygon data sometimes called 2-cell, area, or face
Topology and Line Features
Example uses related to line featuresforce all line ends that are within a user-defined distance to be snapped together so that they have the same coordinate values and share a single node (for instance, snapping tolerance can be set during digitizing)where there are overlapping lines, place a node at the point of intersectioncan have attributes associated with each node
ex: turn tables (no left turn, etc.)maintain direction info with each line (tracing analysis)
info on “from node” and “to node” is maintained
Topology and Polygon Features
In a topologically structured polygon data layer, each polygon is defined as a collection of lines, that in turn are made up of an ordered list of coordinates (nodes and vertices).SEE Figure 8.8
polygon-line list table and line coordinate list table a line number may appear more than once in the polygon-line list table, but the actual coordinates for the line are only stored once in the line coordinate table
advantages: avoids gaps (slivers) and overlapsfewer coordinates are stored compared to “simple features” model
primary disadvantage is that drawing time is slower
Planar Enforcement and Contiguity
“Planar enforcement” implies that:all space on a map must be filledany point on a map must fall in only one polygon, in other words, there can be no overlapsphenomenon is conceptualized as a fieldspecial cases include “islands” and the “universal polygon”
“Contiguity” or adjacency between polygonsalso defined during the process of topologic structuringlist of polygons on the left and right-hand side of each line, in the direction defined by the list of coordinatesSEE Figure 8.9
Chapter 10 - Creating and Maintaining Geographic Databases
Standard DBMS (Data Base Management Systems) functionsstandard data models data loading toolstools to index standard database data typesstructured query language (SQL)security (controlled access to parts of database)controlled updating (mutli-user transaction management)backup and recovery utilitiesdatabase administration toolsapplications for creating, using, and maintainingprogrammable API (application programming interface)
Chapter 11 - METADATA
“Data about data”
We need metadata to be able to:automate the process of search and discovery over distributed archives…(ex: www.geographynetwork.com)
similar to library’s catalog, but can also search for data based on location
determine whether a dataset, once discovered, will satisfy the user’s requirements
sufficient spatial resolution?Date of creation?Spatial extent?
METADATA
Provides the information needed to handle the dataset effectively.
Tech specs on formatmap projection
provides useful information about the contents of the dataset
attribute definitions, etc.
Metadata generation can be expensive and time-consuming
METADATA Standards
To be most useful, standards are useful
FGDC StandardsUS Federal Geographic Data Committee’s Content Standards for Digital Geospatial Metadata, 1993
Content standard - describes the items that should be included but not the exact format or structuresee Box 11.2 for a list of types of content
Metadata LITE - limited set of properties that is cheaper to produce but still useful for search/discovery
see Box 11.3
Chapter 13 - Visualization and User Interaction
Maps can be used as decision support toolsto create, support, or reinforce a particular message
GIS (or just maps) can mislead viewers
Historically, many maps were created to support national interests (warfare, inventory of territories, trading routes, etc.)
time frame for changes was relatively slowToday, time frame for change is rapid
Limitations of Paper Maps
Paper Maps are of fixed scalegeneralization of detail is not recoverableGIS can allow different levels of detail at different scales
Paper Maps are of fixed extentGIS can be seamless (no adjoining maps to deal with)
Paper Maps present a static viewGIS can be linked to dynamically changing data
Paper Maps are flatGIS can include 2.5D data and visualization
Paper Maps are limited to what information is on the mapGIS allows supplementation with further data
Paper Maps provide only one map-producer centric view of the world, GIS users can create their own view
Attribute Representation
Many conventions for symbols exist Ex: highway shields, etc. (symbol sets in ArcMap)
Graphic primitivesgraduated size of symbols
principally used for ordinal or interval/ratio datavalue and saturation of color
variation in attribute valueshue - use of colors
discriminate between nominal categoriesshape or orientation of symbols
used to relate some value of attributearrangement, texture, focus
within/between-symbol properties, patterns
Attribute Representation (cont.)
Standard labeling method issuescentroid placement, splines, overlap, alignment
Issues in choropleth mappingvisual implication of implied within-zone uniformity
Dot Density maps represent relative density of zonally averaged data (not location of point events)
Proportionally-sized circlesproblem of overlapping circles in “busy areas” of map
Classification of Interval and Ratio Data
Interval = difference between values makes senseex: temperature
Ratio = ratios between values makes senseex: weight
Natural Breaksapparently natural groupings of data values
deductive assignment - based on known breakpointsex: arid, semi-arid, temperate, humid, etc.
inductive assignment - software finds relatively large jumps in the data values
Classification of Interval/Ratio Data (cont)
Quantile Breakseach of the predetermined number of classes contains an equal number of observationscan be problematic when values with widely different values get placed in the same class, etc.can have as many classes as required
Equal Interval Breaksbest used where data ranges are familiar to user
Standard Deviationdistance of observation from the mean valuetwo-color ramps help to visualize the above/below ranges
Cartograms and Dasymetric Mapping
CartogramsDistort area or distance in the interest of some objective
Examples: subway maps (see Figure 13.9) typical objectives:make patterns more obviouspromote legibility
Dasymetric mappingancillary data sources used to improve the model of a spatial distributionSee example in Figure 13.12 - allocation of population figures to areas smaller than census tracts
Chapter 17 and 18 - GIS Management
GIS has two obvious relationships to management:
First - GIS can help manage many types of projects so as to produce a more effective, more efficient, more equitable, or more productive outcome.
Many decisions have consequences which are geographically and existing geographic variations can influence key decisions…
Second - GIS projects themselves need to be managedspecification of needsselection and procurement of toolstraining of staffetc.
Management is best when it evolves
GIS implementations should evolve to take advantage of:
new technologies internet map servershandheld GISwireless communications
organizational developments
serendipity
“Dave Says”...
To be a “Good” GIS manager, you must:stay up-to-date on technology
attend conferencesjoin local or regional User Groupssubscribe to journalsbookmark GIS websites and visit them often
attempt to understand the business objectives and processes in all departments of the company or institution
look for opportunities to “add GIS” educate others about GIS
demonstrate to anyone that will take the timepass around articles describing how others have used GIS to their advantage
always consider yourself part of a teamGIS does not “stand alone” for long….
Potential Benefits that “The Boss” will relate to...
Tell your boss that GIS can:Provide “factual information” about the location of resourcesCompute derived “facts” (…such as, the fastest route, changes in customer distribution, etc.)Help select, compress, and visualize complex information to facilitate better understanding and hence, better decision-makingCan help search for patterns and correlates of geographic distributionsCan link information from different sources in one “more intuitive” map-centric interfaceCan help predict future events that are geographically distributed
Geographic Information (GI) from a management perspective...
In the past, most GI was produced by and for governments, increasingly, GI is being developed by businesses
sometimes, GI is just “value-added” government datait’s not going to be free anymore (and no sharing…)
GI is an “experience good” that consumers have to experience in order to appreciate and value
GI does not wear out through use, though it may diminish in value as time passes…
However much it costs to collect/create, or update, the first instance of GI, the marginal costs to copy and distribute it (especially by the Internet) is negligible….
Special characteristics of the GI market
Since GI has traditionally been supplied by govt. agencies for free, many consumers are not yet used to paying for data
Some detailed GI has some of the characteristics of a natural monopoly
ex: an organization has developed a very detailed GI dataset that was very expensive to create…and they are unwilling to distribute for free…and all costs are “sunk”
“Geographic Framework Information” = it may be in the best interest of everybody to use a standardized data set
Many GI data sets don’t change very fast…consumers may not need to buy updated data very often…
Special characteristics (cont.)
The value of GI also depends on the skills of the user and the available of softwareValue of a particular dataset varies with the userBecause metadata was not a priority in the past, we often know very little about the fitness of a dataset for a purpose.The legal implications of data quality have not yet been fully tested in court…Many “value-added” GI products are centered around providing standardized data sets and analysis methods that are appropriate for the data provided…since many consumers don’t have the skills, or the time, or the knowledgeThe GI market is not efficient…consumers do not fully understand the market and there are major distortions due to subsidies, legal constraints, etc. that vary by country, etc.
Advantages of the WWW for GIS Managers
The ease of setting up and using “information location tools”clearinghouses or geolibraries
The possibility to preview simplified versions of datasetsto determine suitability for purpose (quality, etc.)
The capacity for customizing applications for the needs of a specific market or group
inexpensive way to distribute wide array of applicationsIts ability to transfer data at a very low cost
share data without requiring staff to duplicate, mail, etc.The ability to transfer costs to the user from the producer
let them make their own maps!Efficient business method (charging for data/services, etc.)Familiarity of web interface for users--less training needed
G-Business
The rise of the commercial sectoruntil recently most GI was produced by governmentmany agencies now outsource data creationsome governments now charge for data
ISSUE - cost of reproduction vs. cost of productionGI as a business asset
can be protected by copyrightoften encapsulated with software to give it an advantagede facto data standards as a result of major vendors
Information Ownership?
Can geographic data, information, and knowledge be regarded as property?
Yes, but who owns it can be difficult to defineex: personal data on spending habits
Can geographic “facts” properly by protected?in USA, factual information collected by “ the sweat of the brow” -- as opposed to original, creative activities --is not protectable by copyright law.ISSUE - each representation of the same fact involves some art in the creation (each version is different…)
what is it about a compilation of data that is sufficiently original to merit copyright protection?
Information Ownership?
Is GI collected directly by a machine, such as a satellite sensor, protectable?
Very expensive to collect, hence few suppliersStrongest and easiest enforcement of copyrights
What is the “half-life” of GI?Varies by information type and applicationhistorical information can have valuecyclic transiency of value
ex: Census data loses value over 10 years until the next set of data is available, then it is valuable again for comparison analysis
Information Ownership?
How can you prove theft of your data or information?Watermarking
obvious - like watermark in paper (can be removed)non-visible - a series of groups of small numbers or colored pixels scattered apparently randomly throughout a raster dataset (“salting”) can be effective (need good documentation…)
Finger-printingex: add the occasional fictitious road, place name
Who owns information derived by adding new material to source information produced by another party?
Both you and the originator of the first dataset(s)common issue in GIS
People as a Business Asset
“Survey Says”…most GIS industry people think of themselves as technical experts…
unfortunately, this attitude means that many GIS people don’t make it into higher-level management positions
Many GIS “managers” have had little formal GIS education, instead having learned on the job over the years or by going to industry training courses
Accreditation issues are controversial and not resolved
Some Legal Issues for GIS Managers To Consider
Privacy
Government information access laws
Liability for bad decisions based on poor quality data
Geographic Framework Information
Forms the base or template for all other data sets
In the USA, the FGDC defines the following information as the US framework:
geodetic controlelevationhydrographypublic land cadaster informationdigital orthoimagerytransportationthe geography of governmental/administrative units
The End