Historical Newspapers

This data set includes historical newspapers and magazines published between 1568 and 1877. In total, it comprises 218,000 issues with about 2.1 million pages, which are available as images, text and metadata. It is a subset of the digitised newspapers available in ANNO, the virtual reading room of the Austrian National Library. For ONB Labs, we selected those issues that are under a Public Domain Mark.

Data

Metadata for all newspaper issues available in the ONB Labs

Name | Description | Link
CSV title metadata | CSV file, one record per title | anno_labs_titles.csv.bz2
CSV issue metadata | CSV file, one record per issue | anno_labs_issues.csv.bz2
CSV page metadata | CSV file, one record per scanned page | anno_labs_ocr_pages.tsv.bz2
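
The metadata files listed above are bz2-compressed, so they can be read without unpacking them first. The following is a minimal sketch using only the Python standard library. It assumes the files have been downloaded to the working directory, are UTF-8 encoded and start with a header row; none of this is stated on this page. It also guesses the delimiter from the file name (tab for `.tsv`, comma otherwise). The helper names `delimiter_for`, `iter_records` and `preview` are hypothetical and not part of any ONB Labs tooling.

```python
import bz2
import csv
from itertools import islice
from pathlib import Path


def delimiter_for(path):
    """Guess the delimiter from the file name (assumption): tab for .tsv, comma otherwise."""
    return "\t" if ".tsv" in Path(path).suffixes else ","


def iter_records(path, encoding="utf-8"):
    """Yield one dict per record, decompressing the bz2 file on the fly.

    Assumes the first row holds the column names and that the file is
    UTF-8 encoded; adjust `encoding` if that turns out to be wrong.
    """
    with bz2.open(path, mode="rt", encoding=encoding, newline="") as handle:
        yield from csv.DictReader(handle, delimiter=delimiter_for(path))


def preview(path, n=5):
    """Return the first n records as a list of dicts, without reading the whole file."""
    return list(islice(iter_records(path), n))


if __name__ == "__main__":
    # File name taken from the table above; it must exist locally.
    for record in preview("anno_labs_titles.csv.bz2", n=3):
        print(record)
```

Because the records are streamed, this approach also works for the page-level file, which is likely to be much larger than the title and issue files. Printing the first few records is a simple way to discover the actual column names before writing any further analysis.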

Tutorials

Download Jupyter Notebooks

Name | Description | Link
Explore metadata | Example code for working with the available metadata | anno-experiments