“ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of...

26
DuplicateEntries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara
  • date post

    15-Jan-2016
  • Category

    Documents

  • view

    216
  • download

    0

Transcript of “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of...

Page 1: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.

“Duplicate” Entries in Gazetteers

jordan HastingsDepartment of Geography

University of CaliforniaSanta Barbara

Page 2: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.
Page 3: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.
Page 4: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.

Gazetteer “Duplicates”Names & Features (1)

Naming Features in the Environment Linguistic Necessity Identity and Ownership Navigation and Wayfinding

Features Cover a Large Territory Crisp or Diffuse Compact or Extended Tangible or Abstract

Page 5: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.
Page 6: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.

Gazetteer “Duplicates”Names & Features (2)

Locations are Numerous & Various Multiscale Generalized Dis-coordinated Time-variant

Page 7: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.
Page 8: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.

Gazetteer “Duplicates”Names & Features (3)

Names are Numerous & Various Polynymous Mis-spelled Multilingual Time-variant

Page 9: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.

Gazetteer “Duplicates”Names & Features (4)

Lake Bigler, thru 1920s Lake Bonpland (also Bondland), thru

1890s Da-ow-a-ga, thru 1850s

Page 10: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.
Page 11: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.

$

$

$

$

$

$

$

$

$

$

$

$

$

$

$

$

$

Dollar Point

Kings Beach

South Lake Tahoe

Sunnyside-Tahoe City

Tahoe Vista

Carson

Incline Village-Crystal Bay

Indian Hills

Johnson Lane

Kingsbury

MindenStateline

Zephyr Cove-Round Hill Village

Page 12: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.

Gazetteer “Duplicates”Feature Types (1)

Dependable Type System Because Features are “Objects” Because Human Mind Categorizes

Types present in Taxonomy Hierarchy is Natural in Environment Because Human Mind Categorizes

Page 13: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.

Gazetteer “Duplicates”Feature Types (2) – Examples

Cultural Environment Nations -> States -> Provinces -> Districts

Page 14: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.

Gazetteer “Duplicates”Feature Types (2) - Examples

Physical Environment Watersources:

Springs-->Seeps Watercourses:

Rivers-->Streams-->Creeks Waterbodies:

Lakes-->Ponds-->Sloughs ?Glaciers

Page 15: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.

Gazetteer “Duplicates”Fundaments (1)

Definition: GazetteerA spatial dictionary of named & typed features in the environment

Implications Features uniquely identified Searchable by name and type Also searchable geospatially

Page 16: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.

Gazetteer “Duplicates”Fundaments (2)

Duplicates: An approximate notion Firm types, ±close in hierarchy Locations ±close dependent on scale Names ±close dependent on

language … or not at all

All aspects variant in time

Page 17: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.

Gazetteer “Duplicates”Fundaments (3)

Database Implications / Support Custom Datatypes

Hierarchy Geometry

Multiple Attribution (unlimited) Names Locations

Efficient Geospatial Processing

Page 18: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.

Gazetteer “Duplicates”Approach (1)

Independent Measures of Duplicates 1. Type Thesaurus Metrics

Inter-feature: hierarchy, explicit linkages 2. Geospatial Metrics

Intra-feature: size, compactness, … Inter-feature: distance, overlap, …

3. Geonomial Metrics Intra-feature: NL translation [not considered

yet] Intra-feature: stemming, soundex,

substitution

Page 19: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.

Gazetteer “Duplicates”Approach (2)

Unified Assessment of Duplicates Weighted Combination of Measures

1 Type 2 Location(s) 3 Name(s)

Geographic Visualization, over Maps Final Authority of Human Cataloger

Page 20: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.

random features

grouped features

prep

rework

Gazetteer “Duplicates”Processing Cycle

Page 21: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.

random features

grouped features

prep

rework

Gazetteer “Duplicates”Processing Cycle

Page 22: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.

random features

grouped features

accepted suspended

prep

weigh

feature

database

Gazetteer “Duplicates”Processing Cycle

Page 23: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.

random features

grouped features

accepted suspended

prep

weigh

feature

database

Gazetteer “Duplicates”Processing Cycle

review

Page 24: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.

random features

grouped features

accepted suspended

trash

review

post

prep

weighrework

reject feature

database

Gazetteer “Duplicates”Processing Cycle

Page 25: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.

[end]

Page 26: “ Duplicate ” Entries in Gazetteers jordan Hastings Department of Geography University of California Santa Barbara.