Metadata for LOGD International OGD Catalog (IOGDS)

Note (04 May 2011): See the relationship of this to the W3C eGov interest group's emerging DCAT Data Catalog Vocabulary

Namespaces used:
  • dgtwc: http://data-gov.tw.rpi.edu/2009/data-gov-twc.rdf#
  • dcterms: http://purl.org/dc/terms/
  • foaf: http://xmlns.com/foaf/0.1/
Catalog Metadata
(*** indicates that the predicate is not defined when its URI is resolved, as of 1 October 2011)
Name | Cardinality | Type | Description
dcterms:title (Title) | 1 | Literal | Name of the catalog
foaf:homepage (URL) | 1 | URI resource | URL of the web page for accessing the catalog. Unique identifier for a catalog
dcterms:description (Description) | 1 | Literal | Description of the catalog
dgtwc:number_of_datasets (Number of Dataset) | 1 | Integer | Estimated number of datasets listed in the catalog
dcterms:spatial (Spatial Region) | 0~1 | URI resource | The primary geographical region covered by all datasets in this catalog
*** dgtwc:spatial_granularity (Spatial Region Type) | 0~1 | Literal | use the following value: Worldwide, National, State, City, ...
dcterms:language (Language)* | 0~1 | Literal | The most frequently used language of the datasets listed in this catalog
***dgtwc:tbl_rating (TBL's 5-star Rating)* | 0~1 | Literal | use: 1-downloadable; 2-structured; 3-open-format; 4-rdf; 5-linked-data
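
The cardinality column of the catalog table can be checked mechanically. The following is a minimal sketch in Python, assuming a catalog record is held as a dictionary that maps each prefixed predicate name to a list of values. The function name check_catalog, the dictionary layout, and the example record are hypothetical and are not part of the LOGD tooling.

```python
# Cardinality rules transcribed from the catalog metadata table:
# predicate -> (minimum, maximum) number of values.
CATALOG_CARDINALITY = {
    "dcterms:title": (1, 1),
    "foaf:homepage": (1, 1),
    "dcterms:description": (1, 1),
    "dgtwc:number_of_datasets": (1, 1),
    "dcterms:spatial": (0, 1),
    "dgtwc:spatial_granularity": (0, 1),
    "dcterms:language": (0, 1),
    "dgtwc:tbl_rating": (0, 1),
}


def check_catalog(record):
    """Return a list of cardinality violations for a catalog record.

    `record` maps a prefixed predicate name to a list of values
    (an assumed in-memory layout, chosen for illustration).
    """
    problems = []
    for predicate, (lo, hi) in CATALOG_CARDINALITY.items():
        count = len(record.get(predicate, []))
        if count < lo:
            problems.append(f"{predicate}: expected at least {lo}, found {count}")
        elif count > hi:
            problems.append(f"{predicate}: expected at most {hi}, found {count}")
    return problems


# Hypothetical record, for illustration only.
example = {
    "dcterms:title": ["Example City Open Data"],
    "foaf:homepage": ["http://data.example.org/"],
    "dcterms:description": ["Open data published by an example city."],
    "dgtwc:number_of_datasets": [120],
    "dgtwc:spatial_granularity": ["City"],
    "dgtwc:tbl_rating": ["3-open-format", "4-rdf"],  # two values, but 0~1 allowed
}

print(check_catalog(example))
# ['dgtwc:tbl_rating: expected at most 1, found 2']
```

The same approach would extend to the dataset table by adding a second rule dictionary, with an unbounded maximum for the 0~many rows.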
 
Dataset Metadata:
Name | Cardinality | Type | Description
dcterms:title (Title) | 0~1 | Literal | Name of the dataset
foaf:homepage (URL) | 0~1 | URI Resource | URL of the web page for accessing the dataset. Unique identifier for a dataset
dcterms:identifier (ID) * | 0~1 | Literal | Unique string ID of the dataset
dcterms:description (Description) | 0~1 | Literal | Description of the dataset
dcterms:download_format (Format) | 0~many | Literal | Available formats of the dataset
foaf:page {a foaf:Document; dcterms:format ..} | 0~many | Resource | Links a dataset to the web URLs where users can download or access the raw data
***dgtwc:category (Category) | 0~many | Literal | The original category (asserted by the catalog) of this dataset
***dgtwc:categories (All Categories) | 0~1 | Literal | All categories (asserted by the catalog) of this dataset
dcterms:subject (Keyword) | 0~many | Literal | The original keywords (asserted by the catalog) of this dataset
***dgtwc:keywords (All keywords) | 0~1 | Literal | All keywords (or tags) of this dataset
dcterms:language (Language)* | 0~1 | Literal | Language used by the text cells in the dataset
dcterms:dateSubmitted (Date-Released)* | 0~1 | Literal | The date the dataset was released
***dgtwc:agency (Agency) | 0~1 | URI Resource | The actual publisher (typically a government agency) that curates and publishes the dataset
***dgtwc:catalog_title (Catalog Title) ** | 1 | Literal | The government website (Country/State/City/Organization) that owns the dataset catalog listing this dataset
***dgtwc:catalog_homepage (Catalog URL) ** | 1 | URI Resource | URL of the web page of the dataset catalog that lists this dataset. Unique identifier for a catalog
***dgtwc:catalog_subtitle (Catalog SubTitle) ** | 1 | Literal | For Data.gov's three sub-catalogs, namely Raw Data Catalog, Tool Catalog, and Geodata Catalog
***dgtwc:catalog_country ** | 1 | Literal | The country of the owner of the dataset catalog. Strong indicator of the spatial region covered by the listed datasets
* optional field
** to be added in metadata for enhancement (instead of included in column)

Sample SPARQL query 1. Users should expect to query only the conversion:MetaDataset graph.
# list 50 international datasets (please execute it at http://logd.tw.rpi.edu/sparql)
PREFIX rdfs:    <http://www.w3.org/2000/01/rdf-schema#>
PREFIX foaf:    <http://xmlns.com/foaf/0.1/>
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX dgtwc:   <http://data-gov.tw.rpi.edu/2009/data-gov-twc.rdf#>
PREFIX void:    <http://rdfs.org/ns/void#>
 
#SELECT ?dataset_title ?dataset_homepage ?catalog_title ?catalog_subtitle ?dataset_categories ?dataset_agency
SELECT  *
WHERE {
  GRAPH <http://purl.org/twc/vocab/conversion/MetaDataset> {
    # ?dataset, ?catalog_version and ?file are variables rather than blank-node
    # labels: standard SPARQL does not allow the same blank-node label to be
    # reused across the separate OPTIONAL and UNION blocks below.
    ?dataset dcterms:title ?dataset_title .
    ?dataset dcterms:isReferencedBy ?catalog_version .
    OPTIONAL { ?dataset dcterms:description ?dataset_description . }
    OPTIONAL { ?dataset foaf:homepage ?dataset_homepage . }
    OPTIONAL { ?dataset dgtwc:categories ?dataset_categories . }
    OPTIONAL { ?dataset dgtwc:agency [ rdfs:label ?dataset_agency ] . }
    ?catalog_version void:subset [ dgtwc:catalog_title ?catalog_title ;
                                   dgtwc:catalog_homepage ?catalog_homepage ;
                                   dgtwc:catalog_country ?catalog_country ] .
    OPTIONAL { ?dataset dgtwc:catalog_subtitle ?catalog_subtitle . }

    OPTIONAL { ?dataset dgtwc:category ?dataset_category . }
    # the download format is either stated directly or attached to a linked file
    OPTIONAL { { ?dataset dgtwc:download_format ?download_format . }
               UNION
               { ?dataset foaf:page ?file . ?file dcterms:format ?download_format . }
    }
  }
} 
limit 50
Note: for keyword search, you may compose a query over dcterms:description, dcterms:title, dgtwc:category, and dcterms:subject.
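
One way to compose such a keyword query is to generate it from the list of predicates. The sketch below, in Python, builds a query string with one UNION branch per predicate and a case-insensitive regex filter. The function names, the ?dataset and ?text variables, and the choice of regex matching are assumptions made for illustration; the keyword is interpreted as a regular expression, so regex metacharacters in it are not matched literally.

```python
# The four predicates suggested for keyword search.
KEYWORD_PREDICATES = [
    "dcterms:description",
    "dcterms:title",
    "dgtwc:category",
    "dcterms:subject",
]

PREFIXES = (
    "PREFIX dcterms: <http://purl.org/dc/terms/>\n"
    "PREFIX dgtwc: <http://data-gov.tw.rpi.edu/2009/data-gov-twc.rdf#>\n"
)


def escape_literal(text):
    """Escape backslashes and double quotes for a SPARQL string literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"')


def keyword_query(keyword, limit=50):
    """Build a SPARQL query matching `keyword` in any of the four predicates."""
    # One UNION branch per predicate, all binding the same ?text variable.
    branches = "\n      UNION\n".join(
        f"      {{ ?dataset {p} ?text . }}" for p in KEYWORD_PREDICATES
    )
    needle = escape_literal(keyword)
    return (
        f"{PREFIXES}"
        "SELECT DISTINCT ?dataset\n"
        "WHERE {\n"
        "  GRAPH <http://purl.org/twc/vocab/conversion/MetaDataset> {\n"
        f"{branches}\n"
        f'    FILTER regex(str(?text), "{needle}", "i")\n'
        "  }\n"
        "}\n"
        f"LIMIT {int(limit)}"
    )


print(keyword_query("water"))
```

The generated text can be pasted into the endpoint's query form. Binding every branch to the same ?text variable keeps the filter in one place instead of repeating it per predicate.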
Sample SPARQL query 2
# counting contributions by each dataset catalog
PREFIX rdfs:    <http://www.w3.org/2000/01/rdf-schema#>
PREFIX foaf:    <http://xmlns.com/foaf/0.1/>
PREFIX dcterms: <http://purl.org/dc/terms/>
PREFIX dgtwc:   <http://data-gov.tw.rpi.edu/2009/data-gov-twc.rdf#>
PREFIX void:    <http://rdfs.org/ns/void#>
# COUNT needs a variable (not a blank-node label) and an AS alias in standard SPARQL 1.1
SELECT ?catalog_title (COUNT(?dataset) AS ?dataset_count)
WHERE {
  GRAPH <http://purl.org/twc/vocab/conversion/MetaDataset> {
    ?dataset dcterms:title ?dataset_title .
    ?dataset dcterms:isReferencedBy ?catalog_version .
    ?catalog_version void:subset [ dgtwc:catalog_title ?catalog_title ; dgtwc:catalog_homepage ?catalog_homepage ] .
  }
} 
group by ?catalog_title
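
Queries like the two samples can also be sent from a script. The following Python sketch uses only the standard library and assumes the endpoint follows the standard SPARQL protocol (a query parameter on HTTP GET) and can return the SPARQL JSON results format. Whether the endpoint at http://logd.tw.rpi.edu/sparql is still online or honours this Accept header has not been verified here. The function names and the sample data, including the dataset_count column name, are hypothetical.

```python
import json
import urllib.parse
import urllib.request

# Endpoint named in sample query 1; it may no longer be online.
ENDPOINT = "http://logd.tw.rpi.edu/sparql"


def run_select(query, endpoint=ENDPOINT):
    """Send a SELECT query via HTTP GET and return the decoded JSON results."""
    url = endpoint + "?" + urllib.parse.urlencode({"query": query})
    request = urllib.request.Request(
        url, headers={"Accept": "application/sparql-results+json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def rows(results):
    """Flatten SPARQL JSON results into a list of {variable: value} dicts."""
    return [
        {name: cell["value"] for name, cell in binding.items()}
        for binding in results["results"]["bindings"]
    ]


# Hand-written sample in the SPARQL JSON results layout (not real endpoint output).
sample = {
    "head": {"vars": ["catalog_title", "dataset_count"]},
    "results": {
        "bindings": [
            {
                "catalog_title": {"type": "literal", "value": "Example Catalog"},
                "dataset_count": {"type": "literal", "value": "42"},
            }
        ]
    },
}

print(rows(sample))
```

Variables left unbound by an OPTIONAL block are simply absent from a binding, so each row dictionary contains only the variables that were actually bound.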