diff --git a/metadata-readme/Die-Bombe-metadata-readme.md b/metadata-readme/Die-Bombe-metadata-readme.md index 9e6281a5a0ebfc328e7835b678bd8352024f6a74..6c3d35a45ed9e12fab9db1c00cde9303459f33fe 100644 --- a/metadata-readme/Die-Bombe-metadata-readme.md +++ b/metadata-readme/Die-Bombe-metadata-readme.md @@ -1,23 +1,25 @@ -This Readme offers explanations for all attributes of the metadata csv table to the dataset "Botanical Illustrations" +This Readme offers explanations for all attributes of the metadata csv table to the dataset "Die Bombe". +The frequency attribute can have the value "all", "most", "some" or "none". It gives you an overview of how often a value is to be expected for the attribute explained in relation to the whole dataset. +The CSV is an export of keys and values of the Relational Database (MySQL) used for ANNO. | attribute | explanation | frequency | datatype | | ---- | ---- | ---- | ---- | -| manifest_id | | some | string | -| aid | | some | string | -| year | | some | string | -| day | | some | string | -| dc_title | | some | string | -| dc_title_additional | | some | string | -| subjects | | some | string | -| place_of_publications | | some | string | -| languages | | some | string | -| dc_type | | some | string | -| meta_type | | some | string | -| ini_type | | some | string | -| modification_datetime | | some | string | -| longer_page_id | | some | string | -| dc_date | | some | string | -| link_pdf | | some | string | -| link_old | | some | string | -| has_ocr | | some | string | -| page_count | | some | string | \ No newline at end of file +| manifest_id | ID for the IIIF manifest in the form of \{aidaaaammdd} | all | string | +| aid | ID for issues contained in the ANNO database, usually an abbreviation for the title | all | string | +| year | year of publication | all | integer | +| day | date of publication | all | string | +| dc_title | title of the newspaper | all | string | +| dc_title_additional | subtitle or any other additional title of the resource | none | string | +| subjects | subject terms (German), not linked to authority file; multiple values are separated by a hyphen | all | string | +| place_of_publications | placename (German) of publication place, not linked to authority file | all | string | +| languages | language of the document in the form of ISO-639-1 | all | string | +| dc_type | type of the document (English); default is "newspaper" | all | string | +| meta_type | type of document (German); default is "zeitungen" | all | string | +| ini_type | value for database; default is "anno" | all | string | +| modification_datetime | timestamp for date of last modification of the database entry | all | datetime | +| longer_page_id | value is either "O" or "1" | all | integer | +| dc_date | date of publication in ISO-form "aaaa-mm-dd"| all | date | +| link_pdf | URL to PDF download of issue | all | string | +| link_old | URL to issue in the ANNO interface | all | string | +| has_ocr | value is "1" if the document has OCR, else value is "0" | all | integer | +| page_count | number of pages for each issue | all | integer | diff --git a/metadata-readme/Historical-postcards-metadata-readme.md b/metadata-readme/Historical-postcards-metadata-readme.md index 242930d4d4514fc16f768d2665cec7f840c4cbfe..f1442d1a58dfc7751f095de667b1e9d6bd48dec7 100644 --- a/metadata-readme/Historical-postcards-metadata-readme.md +++ b/metadata-readme/Historical-postcards-metadata-readme.md @@ -1,35 +1,37 @@ -This Readme offers explanations for all attributes of the metadata csv table to the dataset "Historical Postcards" +This Readme offers explanations for all attributes of the metadata csv table to the dataset "Historical Postcards". +The frequency attribute can have the value "all", "most", "some" or "none". It gives you an overview of how often a value is to be expected for the attribute explained in relation to the whole dataset. +Please note that the CSV is an export of keys and values of the Relational Database Kawan used for editing the metadata. Some values may have a false attribution. Some data cleaning may be necessary. | attribute | explanation | frequency | datatype | | ---- | ---- | ---- | ---- | -| akon_id | id (shelfmark) of the document | all | string | +| akon_id | id of the document; please refer to this id | all | string | | id | internal numerical id | all | integer | | altitude | geolocation altitude | some | integer | -| building | has literal, if building was tagged in metadata editing | some | string | -| city | has literal, if city was tagged in metadata editing | most | string | +| building | name of a building (human artifact) | some | string | +| city | placename (city) | most | string | | color | True for color postcards | all | boolean | -| comment | has literal for | most | -| mountain | has literal, if mountain was tagged in metadata editing | some | string | -| other | has literal for any other tags | some | string | -| photographer | has literal, if photographer was identified. Name usually in the form \{forename surname\}. Not linked to GND | some | string | -| publisher | has literal, if publisher was identified. Name usually in the form \{forename surname\} OR \{surname\}. Not linked to GND | some | string | -| publisher_place | has literal, if publication place was identified. Not linked to Geonames or GND | some | string | -| region | | some | string | -| water_body | | some | string | -| year | | some | string | -| inventory_number | | some | string | -| signature | | some | string | -| revision_date | | some | string | -| date | | some | string | -| feature_class | | some | string | -| feature_code | | some | string | -| geoname_id | | some | string | -| latitude | | some | string | -| longitude | | some | string | -| name | | some | string | -| country_id | | some | string | +| comment | date or determined date; "date gel" stands for "date gelaufen", "gelaufen" means that year the postcards was sent, usually marked by a postal stemp; "v. date" stands for "vor ..." (engl. before), means a date ante quem | most | string | +| mountain | name of mountain, not linked to authority file | some | string | +| other | any other tags | some | string | +| photographer | personal name or corporate name, not linked to authority file. Name usually in the form \{forename surname\} | some | string | +| publisher | personal name or corporate name, not linked to authority file. Name usually in the form \{forename surname\} OR \{surname\} | some | string | +| publisher_place | placename, not linked to authority file (GND, Geonames) | some | string | +| region | | some | string | +| water_body | name of water_body, not linked to authority file (GND, Geonames) | some | string | +| year | exact year the postcard can be dated. If no exact date is available, a value can be found for the "comment" attribute | some | integer | +| inventory_number | an internal inventory number usually in the form of \{number/number section\} | some | string | +| signature | informations on attributions to former collections or subcollections; may contain information on provenance; please note the abbrevations | some | string | +| revision_date | timestamp of last revision of metadata in the RDB Kawan | most | date | +| date | combination of year and comment, "gelaufen" and "vor" are usually not abbreviated in this field | most | string | +| feature_class | code of the GeoNames export codes https://www.geonames.org/export/codes.html | most | string | +| feature_code | code of the GeoNames export codes https://www.geonames.org/export/codes.html | most | string | +| geoname_id | GeoName identifier of placename (attribute: "name"), if a placename has been identified | most | float | +| latitude | geolocation latitude as provided by GeoNames | most | float | +| longitude | geolocation longitude as provided by GeoNames | most | float | +| name | placename of place identified (GeoName ID is in attribute "geoname_id") | most | string | +| country_id | code for country (modern) within which borders the identified placename lies today; the form is ISO-3166 Alpha-2 code | most | string | | admin_name_1 | | some | string | | admin_code_1 | | some | string | -| geo | | some | string | +| geo | Tuple of latitude comma longitude | most | string | | download_link | url to the full resolution image | all | string | -| download_link_256x256 | url to the thumbnail image with resolution 256x256 px | all | string | \ No newline at end of file +| download_link_256x256 | url to the thumbnail image with resolution 256x256 px | all | string | diff --git a/metadata-readme/Personal-documents-Berg-metadata-readme.md b/metadata-readme/Personal-documents-Berg-metadata-readme.md index e0296f6ab5da7ccff9a3e24b31ccf420c84bb81c..5ba72ce951b90141757ce04db614c25ded1c6ac4 100644 --- a/metadata-readme/Personal-documents-Berg-metadata-readme.md +++ b/metadata-readme/Personal-documents-Berg-metadata-readme.md @@ -1,35 +1,38 @@ -This Readme offers explanations for all attributes of the metadata csv table to the dataset "Personal Documents Alban Berg" +This Readme offers explanations for all attributes of the metadata csv table to the dataset "Historical Postcards". +The frequency attribute can have the value "all", "most", "some" or "none". It gives you an overview of how often a value is to be expected for the attribute explained in relation to the whole dataset. +The CSV is an export of keys and values of catalogue system. Some values may have a false attribution. Some data cleaning may be necessary. + | attribute | explanation | frequency | datatype | | ---- | ---- | ---- | ---- | -| manifest_id | | some | string | -| mms_id | | some | string | -| barcode | | some | string | -| signature | | some | string | -| location | | some | string | -| urns | | some | string | -| snISBN | | some | string | -| snISSN | | some | string | -| volumeNumber | | some | string | -| languages | | some | string | -| main_title | | some | string | -| sub_title | | some | string | -| related_title | | some | string | -| countryCodes | | some | string | -| persons | | some | string | -| corperations | | some | string | -| publishers | | some | string | -| placesOfPublication | | some | string | -| yearOfPublication | | some | string | -| dateOfPublication | | some | string | -| Subjects | | some | string | -| GND-Links | | some | string | +| manifest_id | ID in the Austrian National Library's digital repository, which is the document ID for the access link as well as the IIIF manifest | all | string | +| mms_id | ID of the data record in the integrated library system ALMA (Exlibris group); The ID is the identifier of the local data record (local zone); The MMS-ID may be a value needed for a SRU request | all | string | +| barcode | ID of the document; At the Austrian National Library every digitized document in this collection has a barcode as identifier for the digitized item, usually in the form \"\+Z\d{7}[0-9, A-Z]{1,2}" | all | string | +| signature | shelfmark of document in collection of the Austrian National Library, for documents of the estate Alban Berg "F21.Berg.*"| all | string | +| location | code for collection department at the Austrian National Library; "MUS MAG" stands for the Music Department | some | string | +| urns | ID in the form of a Uniform Resource Name | none | string | +| snISBN | International Standard Book Number | none | string | +| snISSN | International Standard Serial Number | none | string | +| volumeNumber | number of volume as part of a multivolumed resource | none | string | +| languages | code for language of the document in ISO-639 3 | some | string | +| main_title | title of the document, either as in the resource or a fictious title | all | string | +| sub_title | subtitle of the document, either as in the resource or a fictious subtitle | some | string | +| related_title | any other title related to the document | none | string | +| countryCodes | code for country, in which borders the place of creation lies at the time the catalogue entry was created; ISO-code preceeded by "XA" | some | string | +| persons | personal names of creators and other persons involved (e.g. addressees of letters) in the form \{surname, forename} OR \{surname, forename, dateOfBirth-dateOfDeath, [MARCRelatorsTerm]\}; The MARC relators terms are listed in the MARC documentary: https://www.loc.gov/marc/relators/relaterm.html; the values in this dataset are in German, "VerfasserIn" stands for "creator", "KomponistIn" for "composer" and "AdressatIn" for "adressee" | all | string | +| corperations | personal name of corperation involved | none | string | +| publishers | personal name of publisher if known | none | string | +| placesOfPublication | placename for place of creation or place of publication, not linked to authority file; "Ohne Ort" stands for "no placename known"; Cave: Those entries that do not have a value do not necessarely have no place of publication | some | string | +| yearOfPublication | either a year, a range of years separated by a hyphen or "[ohne Datumsangabe]" for documents without date | some | string | +| dateOfPublication | year of publication in integer format | none | integer | +| Subjects | subject terms in German as used in the RSWK (Regeln für Schlagwortkatalogisierung), usually in the form of the GND authority file; the subject term can also be a term for a document type (e.g. "Lebensdokument"); Please note that the subject terms are not fully consistent within the dataset | most | string | +| GND-Links | URLs (PIDs) for GND-referenced subject terms; multiple values are comma separated | most | string | | gndReferences | | some | string | -| classifications | | some | string | -| contentInformations | | some | string | -| physicalPages | | some | string | -| illustrations | | some | string | -| physicalFormat | | some | string | -| sequence | | some | string | -| urls | | some | string | -| idnr | | some | string | \ No newline at end of file +| classifications | NA | none | string | +| contentInformations | any information on the content of the document, e.g. if illustrated or the coverage | most | string | +| physicalPages | any information on the coverage of the document (Umfang); The value may be identical to the value in "contentInformations" | most | string | +| illustrations | information on the illustrations; please refer to the attribute "contentInformations" | none | string | +| physicalFormat | size of the document either in centimeters (width per height) or in sheet format (8° for Octavo, 4° for Quarto) | some | string | +| sequence | link to sequential documents | none | string | +| urls | sequence of URLs; "http://katzoom.onb.ac.at*" for images of the card catalogue entries, "http://data.onb.ac.at/mus*" for a former catalogue entry created at the Music Sammlung, "https://data.onb.ac.at/rep*" for the entry in ÖNB Digital | all | string | +| idnr | ID for the MARC record in the Austrian Union Catalogue; Please refer to this ID for the bibliographic description in the public catalogue (OPA) | all | string |