The Getty Thesaurus of Geographic Names (TGN)

Metadata:


Identification_Information:
Citation:
Citation_Information:
Originator: The Getty Research Institute
Publication_Date: 2006
Title: The Getty Thesaurus of Geographic Names (TGN)
Geospatial_Data_Presentation_Form: vector digital data
Other_Citation_Details:
<http://www.getty.edu/research/conducting_research/vocabularies/tgn/about.html>
Online_Linkage: \\GIS-100\D$\GISData\BioGeomancer\geodb\TGN_gazeteer_entry.mdb
Description:
Abstract:
Scope and Structure: TGN is a structured vocabulary currently containing around 1,106,000 names and other information about places. Names for a place may include names in the vernacular language, English, and other languages, historical names, and names in natural order and inverted order. Among these names, one is flagged as the preferred name.

TGN is a thesaurus, compliant with ISO and NISO standards for thesaurus construction; it contains hierarchical, equivalence, and associative relationships. Note that TGN is not a GIS (Geographic Information System). While many records in TGN include coordinates, these coordinates are approximate and are intended for reference only.

The focus of each TGN record is a place. There are around 912,000 places in the TGN. In the database, each place record (also called a subject) is identified by a unique numeric ID. Linked to the record for the place are names, the place's parent or position in the hierarchy, other relationships, geographic coordinates, notes, sources for the data, and place types, which are terms describing the role of the place (e.g., inhabited place and state capital). The temporal coverage of the TGN ranges from prehistory to the present and the scope is global.

More about scope and structure: The TGN is a hierarchical database; its trees branch from a root called Top of the TGN hierarchies (Subject_ID: 1000000). Currently most of the TGN data is located under the facet World. Under the World, the places are generally arranged in hierarchies representing the current political and physical world, although some historical nations and empires are also included. There may be multiple broader contexts, making the TGN polyhierarchical.
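
As an illustration only, the polyhierarchical structure described above can be sketched in Python as a mapping from each subject ID to its list of broader contexts. Only the root ID 1000000 comes from the description; the `PARENTS` table, the other IDs, and the function name are hypothetical and do not reflect the actual TGN schema.

```python
# Hypothetical parent table: subject ID -> list of broader-context IDs.
# Only 1000000 (Top of the TGN hierarchies) is taken from the text above;
# every other ID here is made up for illustration.
ROOT_ID = 1000000

PARENTS = {
    1000000: [],
    2000001: [1000000],           # e.g. a facet such as World
    2000002: [2000001],           # a current nation
    2000003: [2000001],           # a historical empire
    2000004: [2000002, 2000003],  # a place with two broader contexts
}


def paths_to_root(subject_id, parents=PARENTS):
    """Return every path from subject_id up to the root, one list per path."""
    if not parents[subject_id]:
        return [[subject_id]]
    paths = []
    for parent_id in parents[subject_id]:
        for tail in paths_to_root(parent_id, parents):
            paths.append([subject_id] + tail)
    return paths


for path in paths_to_root(2000004):
    print(" > ".join(str(i) for i in path))
```

A place with two broader contexts yields two distinct paths to the root, which is what distinguishes a polyhierarchy from a simple tree.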

Coordinates: Geographic coordinates indicate the position of the place, expressed in degrees and minutes and in decimal fractions of degrees. Latitude (Lat.) is the angular distance north or south of the equator, measured along a meridian. Longitude (Long.) is the angular distance east or west of the Prime Meridian at Greenwich, England. Bounding coordinates and elevation may also be included. While many records in TGN include coordinates, these coordinates are approximate and are intended for reference.

Geographic coordinates in TGN typically represent a single point, corresponding to a point in or near the center of the inhabited place, political entity, or physical feature. For linear features such as rivers, the point represents the source of the feature.
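
The relationship between the degrees/minutes form and the decimal-degree form can be sketched as follows. This is a minimal Python illustration; the function name and the sample values are assumptions chosen for the example and are not taken from any TGN record.

```python
def dm_to_decimal(degrees, minutes, hemisphere):
    """Convert degrees and minutes plus a hemisphere letter to signed decimal degrees.

    South latitudes and west longitudes come out negative, matching the
    signed decimal-degree convention used for bounding coordinates.
    """
    if hemisphere not in ("N", "S", "E", "W"):
        raise ValueError("hemisphere must be one of N, S, E, W")
    if not 0 <= minutes < 60:
        raise ValueError("minutes must be in [0, 60)")
    value = abs(degrees) + minutes / 60.0
    return -value if hemisphere in ("S", "W") else value


# Illustrative values only (not copied from a TGN record).
lat = dm_to_decimal(35, 6, "N")
lon = dm_to_decimal(106, 39, "W")
print(lat, lon)
```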

Names: Names and appellations referring to the place, including a preferred name and variant names. All names in a record (i.e., all names linked by a single Subject ID) are considered equivalents (i.e., synonyms). A TGN record may contain the vernacular and English names of the place, variant names in other languages, and historical names. One name is flagged as the preferred name, which is the indexing form of the name most often found in scholarly or authoritative publications.

Purpose:
The Getty Thesaurus of Geographic Names ® (TGN), the Art & Architecture Thesaurus ® (AAT), and the Union List of Artist Names ® (ULAN) are structured vocabularies that can be used to improve access to information about art, architecture, and material culture.
Supplemental_Information:
<http://www.getty.edu/research/conducting_research/vocabularies/tgn/>
Time_Period_of_Content:
Time_Period_Information:
Single_Date/Time:
Calendar_Date: 20061030
Currentness_Reference: publication date
Status:
Progress: Complete
Maintenance_and_Update_Frequency: As needed
Spatial_Domain:
Bounding_Coordinates:
West_Bounding_Coordinate: -179.966700
East_Bounding_Coordinate: 180.000000
North_Bounding_Coordinate: 87.250000
South_Bounding_Coordinate: -86.000000
Keywords:
Theme:
Theme_Keyword_Thesaurus: none
Theme_Keyword: place names
Place:
Place_Keyword_Thesaurus: none
Place_Keyword: World
Access_Constraints: None.
Use_Constraints: Licensed to be used only within BioGeomancer.
Point_of_Contact:
Contact_Information:
Contact_Person_Primary:
Contact_Organization: BioGeomancer Working Group
Contact_Instructions: <http://www.biogeomancer.org>
Data_Set_Credit: The Getty Research Institute
Native_Data_Set_Environment:
Microsoft Windows XP Version 5.1 (Build 2600) Service Pack 2; ESRI ArcCatalog 9.1.0.722

Data_Quality_Information:
Lineage:
Process_Step:
Process_Description:
BioGeomancer Spatial Database (BGSD) General Process and Modifications to Datasets

While any number of fields may be included in datasets submitted for entry into the BGSD, only a subset is uploaded. Extraneous fields not necessary for the purposes of the BGSD are deleted and, in most cases, additional fields are added during processing of the dataset. There are three main fields in the BGSD upload file: "name", which holds the feature name; "term", which holds the feature type; and "geometry", which holds the spatial information. In addition, "time_period_id" is used to indicate whether the feature name is current, former, proposed, or historical, and "related_name" is used to hold a concatenated list of alternative names for the feature. Two fields are used to record the date of processing for a record set ("entryDate") or the date selected records were reprocessed ("modificationDate"). Another field relates each record to the dataset it came from ("g_coll_name"), and another to the metadata about that dataset ("g_coll_note").
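
The upload fields listed above can be pictured as a simple record type. The following Python sketch is illustrative only: the field names come from the description, but the Python types, the point representation of "geometry", the "; " separator used for "related_name", and all sample values are assumptions rather than documented BGSD behavior.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class BGSDRecord:
    name: str                      # feature name
    term: str                      # feature type
    geometry: Tuple[float, float]  # assumed here to be a (longitude, latitude) point
    time_period_id: int            # numeric code; the actual code values are not given here
    related_name: str              # concatenated alternative names
    entryDate: str                 # mm/dd/yyyy
    g_coll_name: str               # name of the source dataset
    g_coll_note: str               # file name of the dataset's FGDC metadata
    modificationDate: Optional[str] = None


def join_related_names(names, sep="; "):
    """Concatenate alternative names, dropping blanks and duplicates (order kept).

    The separator is an assumption; the source does not state which one is used.
    """
    seen = []
    for n in names:
        n = n.strip()
        if n and n not in seen:
            seen.append(n)
    return sep.join(seen)


# Hypothetical record; every value below is for illustration.
record = BGSDRecord(
    name="Albuquerque",
    term="populated places",
    geometry=(-106.65, 35.1),
    time_period_id=1,
    related_name=join_related_names(["Name A", " ", "Name A", "Name B"]),
    entryDate="10/30/2006",
    g_coll_name="example_dataset",
    g_coll_note="example_metadata.xml",
)
print(record.related_name)
```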

Importing data into the BioGeomancer Spatial Database involves a number of checks, modifications, and transformations. If metadata is not provided, it must be created de novo using whatever information is available. When the input file is a shapefile, it must first be checked for projection. The BGSD does not use projected data; rather, data are stored with latitude and longitude as coordinates, using the WGS84 (World Geodetic System 1984) spheroid for the horizontal datum. This provides accurate and uniform storage of feature coordinates for the world and is sometimes known as a Geographic Projection. If the input shapefile is in any other projection (e.g., UTM, Lambert Conformal Conic), it must be reprojected or converted to WGS84 geographic coordinates. If the feature is a polygon or a line, rather than a point, its geometry is then checked and, if necessary, repaired. The problems that could arise in the geometry of a feature include short segments, null geometries, incorrect ring orderings, incorrect segment orientations, self intersections, unclosed rings, and/or empty parts. These problems are repaired with a script in ArcGIS software ver. 9.1 (ESRI, Redlands, CA, USA). If the input file is text, it must first be converted to a GIS layer using the spatial information contained in the file (X, Y) and its associated metadata.
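
To make a few of the listed geometry problems concrete, the sketch below detects null geometries, unclosed rings, and short segments in a single polygon ring, and flags coordinates outside the valid latitude/longitude range (a hint that the data may still be projected). It is a plain-Python illustration, not the ArcGIS repair script mentioned above; the function name and the `min_segment` tolerance are assumptions, and it only reports problems rather than repairing them.

```python
import math


def find_ring_problems(ring, min_segment=1e-6):
    """Return a list of problem labels for one polygon ring.

    ring is a list of (longitude, latitude) tuples in decimal degrees.
    min_segment is an assumed tolerance in degrees, not a BGSD value.
    """
    if not ring:
        return ["null geometry"]
    problems = []
    # A ring must end on the vertex it started from.
    if ring[0] != ring[-1]:
        problems.append("unclosed ring")
    # Flag the first segment shorter than the tolerance.
    for (x0, y0), (x1, y1) in zip(ring, ring[1:]):
        if math.hypot(x1 - x0, y1 - y0) < min_segment:
            problems.append("short segment")
            break
    # Values outside these ranges cannot be geographic coordinates.
    for lon, lat in ring:
        if not (-180.0 <= lon <= 180.0 and -90.0 <= lat <= 90.0):
            problems.append("coordinate out of range")
            break
    return problems


print(find_ring_problems([(0, 0), (1, 0), (1, 1)]))
```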

The input records are assigned feature types, and all data are imported into an ArcGIS Personal Geodatabase. Incoming datasets may include their own typing system, but in almost all cases it will differ from the one used by the BGSD, which is based on that of the Alexandria Digital Library. During processing, a feature type field, "term", is added to the dataset. If an incoming dataset is heterogeneous and has a field for feature type, a cross-reference table is created to convert from the dataset feature type lexicon to that of the BGSD. The use of feature types is integral to the proper functioning of the BioGeomancer Spatial Lookup Module, because it allows a default extent for the feature type to be used in cases where an extent for the specific feature is not available. The feature types are also useful for indexing and for assigning relative uncertainty measures where geographic feature extents are unknown.
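
The cross-reference step and the default-extent fallback can be sketched as two lookups. Everything in this Python example is hypothetical: the source type codes, the mapping, the extent values, and the function names are invented for illustration and are not the BGSD cross-reference tables.

```python
# Hypothetical cross-reference from a source dataset's type codes to BGSD terms.
CROSSWALK = {
    "PPL": "populated places",
    "STM": "streams",
    "MT": "mountains",
}

# Hypothetical default extents in kilometres; these are not BGSD values.
DEFAULT_EXTENT_KM = {
    "populated places": 5.0,
    "streams": 10.0,
    "mountains": 3.0,
}


def assign_term(source_type, crosswalk=CROSSWALK):
    """Map a source feature type to a BGSD term; None means it is unmapped."""
    return crosswalk.get(source_type.strip().upper())


def extent_for(term, known_extent_km=None, defaults=DEFAULT_EXTENT_KM):
    """Prefer the feature's own extent; fall back to the default for its type."""
    if known_extent_km is not None:
        return known_extent_km
    return defaults.get(term)


term = assign_term(" ppl ")
print(term, extent_for(term))
```

Returning None for an unmapped type, rather than guessing, leaves it to a reviewer to extend the cross-reference table before the record is loaded.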

Where possible, we include the original metadata information for the dataset.

Process_Date: 20061030
Process_Contact:
Contact_Information:
Contact_Person_Primary:
Contact_Person: C. Frazier, T. Neville, T. Giermakowski
Contact_Organization: Museum of Southwestern Biology, University of New Mexico
Contact_Address:
Address_Type: mailing address
Address: MSC03 2020, 1 University of New Mexico
City: Albuquerque
State_or_Province: New Mexico
Postal_Code: 87131-0001
Country: U.S.A.
Contact_Voice_Telephone: 505-277-1360
Process_Step:
Process_Description: Metadata imported.
Source_Used_Citation_Abbreviation:
D:\GISData\BioGeomancer\metadata\FinalMetadata\BGMetadataTemplate.xml

Spatial_Data_Organization_Information:
Direct_Spatial_Reference_Method: Vector
Point_and_Vector_Object_Information:
SDTS_Terms_Description:
SDTS_Point_and_Vector_Object_Type: Entity point
Point_and_Vector_Object_Count: 868365

Spatial_Reference_Information:
Horizontal_Coordinate_System_Definition:
Geographic:
Latitude_Resolution: 0.000001
Longitude_Resolution: 0.000001
Geographic_Coordinate_Units: Decimal degrees
Geodetic_Model:
Horizontal_Datum_Name: D_WGS_1984
Ellipsoid_Name: WGS_1984
Semi-major_Axis: 6378137.000000
Denominator_of_Flattening_Ratio: 298.257224

Entity_and_Attribute_Information:
Detailed_Description:
Entity_Type:
Entity_Type_Label: TGN_BG
Attribute:
Attribute_Label: OBJECTID
Attribute_Definition: Internal feature number.
Attribute_Definition_Source: ESRI
Attribute_Domain_Values:
Unrepresentable_Domain:
Sequential unique whole numbers that are automatically generated.
Attribute:
Attribute_Label: SHAPE
Attribute_Definition: Feature geometry.
Attribute_Definition_Source: ESRI
Attribute_Domain_Values:
Unrepresentable_Domain: Coordinates defining the features.
Attribute:
Attribute_Label: name
Attribute_Definition: Name of the feature
Attribute_Definition_Source: BioGeomancer Working Group
Attribute:
Attribute_Label: schemeName
Attribute_Definition:
The scheme used to assign a "feature type" to the feature. In all cases, the ADL Feature Type Thesaurus is used.
Attribute_Definition_Source: BioGeomancer Working Group
Attribute:
Attribute_Label: schemeVersion
Attribute_Definition:
The version number of the ADL Feature Type Thesaurus; typically the value is 2.
Attribute_Definition_Source: BioGeomancer Working Group
Attribute:
Attribute_Label: entryDate
Attribute_Definition:
The calendar date (mm/dd/yyyy) the dataset was processed for inclusion into the BGSD.
Attribute_Definition_Source: BioGeomancer Working Group
Attribute:
Attribute_Label: modificationDate
Attribute_Definition: The calendar date (mm/dd/yyyy) the dataset was modified.
Attribute_Definition_Source: BioGeomancer Working Group
Attribute:
Attribute_Label: g_coll_name
Attribute_Definition:
A name assigned to the dataset. Typically enough information is provided in the name to readily associate it with the original dataset prior to processing for inclusion into the BGSD.
Attribute_Definition_Source: BioGeomancer Working Group
Attribute:
Attribute_Label: g_coll_note
Attribute_Definition:
The file name that contains the Federal Geographic Data Committee (FGDC) metadata in XML format for the dataset.
Attribute_Definition_Source: BioGeomancer Working Group
Attribute:
Attribute_Label: term
Attribute_Definition:
The feature type assigned to the named feature from the list of types in the ADL Feature Type Thesaurus. The type is characteristic of a named feature; e.g., the named feature "Albuquerque" has the type "populated places".
Attribute_Definition_Source: BioGeomancer Working Group
Attribute:
Attribute_Label: scheme_term_id
Attribute_Definition: The associated numeric id for the term (feature type).
Attribute_Definition_Source: BioGeomancer Working Group
Attribute:
Attribute_Label: related_name
Attribute_Definition:
Other names that have been associated with the named feature. In some cases it may be an historic name of the feature or the name without diacritics.
Attribute_Definition_Source: BioGeomancer Working Group
Attribute:
Attribute_Label: time_period_id
Attribute_Definition:
The numeric id associated with the time_period_note field. See time_period_note explanation.
Attribute_Definition_Source: BioGeomancer Working Group
Attribute:
Attribute_Label: time_period_note
Attribute_Definition:
An indication of the temporal status of the named feature: current, former, proposed, or historical.
Attribute_Definition_Source: BioGeomancer Working Group
Attribute:
Attribute_Label: related_time_period_note
Overview_Description:
Entity_and_Attribute_Detail_Citation:
BioGeomancer Working Group <http://www.biogeomancer.org>

Distribution_Information:
Resource_Description: Downloadable Data

Metadata_Reference_Information:
Metadata_Date: 20061030
Metadata_Contact:
Contact_Information:
Contact_Organization_Primary:
Contact_Organization:
Natural Heritage New Mexico, Museum of Southwestern Biology, University of New Mexico
Contact_Person: Teri B. Neville
Contact_Position: GIS Coordinator
Contact_Address:
Address_Type: mailing address
Address: MSC03 2020, 1 University of New Mexico
City: Albuquerque
State_or_Province: New Mexico
Postal_Code: 87131-0001
Country: U.S.A.
Contact_Voice_Telephone: 505-277-3822 x230
Contact_Facsimile_Telephone: 505-277-3844
Contact_Electronic_Mail_Address: tneville@unm.edu
Hours_of_Service: 0900-1700 MST
Metadata_Standard_Name: FGDC Content Standards for Digital Geospatial Metadata
Metadata_Standard_Version: FGDC-STD-001-1998
Metadata_Time_Convention: local time
Metadata_Extensions:
Online_Linkage: <http://www.esri.com/metadata/esriprof80.html>
Profile_Name: ESRI Metadata Profile

Generated by mp version 2.8.6 on Mon Feb 12 14:21:57 2007