Bibliotheca Eugeniana Digital

“Bibliotheca Eugeniana Digital” (BED) is a cooperation between the Austrian National Library and the University for Continuing Education Krems, funded by the Austrian Academy of Sciences under the “go!digital 3.0” program. The project runs for two years, from November 2022 to November 2024.

Project Description

The aim of the project Bibliotheca Eugeniana Digital (BED) is the digital reconstruction and visual representation of Prince Eugene’s book collection (UNESCO “Memory of Austria”), one of the most famous collections of the Baroque period. Since 1738, the collection has been part of the Habsburg Court Library, today the Austrian National Library (ONB). To this day, the exact composition, extent, and locations of the printed books in the ONB’s collections have not been analyzed, as the task has been considered too vast and complex for traditional methods. The digitization of sources, combined with new digital approaches, now enables new ways to open up large cultural collections such as the “Bibliotheca Eugeniana”.

The project applies tools and methods from Digital Humanities and Data Science to digitally reconstruct and visually explore the library in a systematic way, examining its composition and history through a variety of sources.

Digital Reconstruction

Most of the books of the Bibliotheca Eugeniana were digitized as part of the project “Austrian Books Online” (ABO). The majority of its bound volumes are uniformly decorated on the front and back covers with Prince Eugene’s coat of arms (see Figure 1). These bindings, here referred to as “supralibros bindings”, will be analyzed in the project using Machine Learning (ML) to detect visual features. In addition, the historical handwritten catalog of the Bibliotheca Eugeniana as well as archival material on the transformation of this library in the 19th century will be digitally processed using ML for handwritten text recognition (HTR) and published in the Austrian National Library’s digital editions infrastructure.

All of these data will be merged with the metadata from the Austrian National Library’s public catalog. Titles from the digital edition and full texts from ABO will in turn be classified into subject groups using ML and natural language processing (NLP) algorithms. This classification will provide new insights into the internal structure of the library and its relationship to the color system of the supralibros bindings.

Visual Exploration

The University for Continuing Education Krems (UWK) will develop a set of coordinated visualizations based on the multilayered historical collection data, enabling analysis and research into the structure, transformation, and localization of the Bibliotheca Eugeniana collection. For public communication of the project outcomes, complementary narrative visualizations will be created. BED will publish the results in a variety of formats tailored to both experts and a general audience.

All data generated within the project will be made available via the ONB Labs and shared with European research infrastructures in line with the FAIR principles. As a collaboration between a cultural heritage institution and a research institution, BED contributes to the strategy “DH Austria 2021” by promoting knowledge transfer between the two sectors.

Methods

To obtain as much information as possible for the reconstruction of the book collection, various approaches are combined:

Image Classification

In this step, ML-based image classification models are used to identify provenance markers. In a pilot study of the ONB, this method was successfully applied using CNN models for binary classification of Bibliotheca Eugeniana Supralibros from the ABO corpus. In the BED project, the approach will be revised and expanded by comparing different types of CNN architectures and configurations (e.g., varying network depths). A two-step model will be pursued. In the first step, the provenance marker, called the supralibros, is detected, and a cropped image of it is returned.

Figure 1: Showpiece of a volume with coat-of-arms supralibros from the Bibliotheca Eugeniana.

In the second step, the cropped image is processed by a binary classifier designed to preserve the optical information that would otherwise be lost through image scaling. Particular emphasis will be placed on building the training corpus appropriately, both in terms of size and quality, so that the different types of supralibros can be structurally distinguished. This will make it possible to develop a multi-class classification model. For true positive attributions, the corresponding descriptions will be integrated into the ONB’s publicly accessible catalog.
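The two-step approach described above can be sketched as follows. This is only a minimal illustration, not the project's implementation: the detector and classifier are passed in as plain functions, and the toy stand-ins (`toy_detect`, `toy_classify`) as well as the list-of-lists image format are assumptions made so that the example runs without any ML library. The point is that the classifier only ever sees the cropped region at its original resolution.

```python
from typing import Callable, List, Optional, Tuple

Image = List[List[int]]          # grayscale image as rows of pixel values
Box = Tuple[int, int, int, int]  # (top, left, bottom, right), end-exclusive


def crop(image: Image, box: Box) -> Image:
    """Cut the region described by `box` out of `image` without rescaling."""
    top, left, bottom, right = box
    return [row[left:right] for row in image[top:bottom]]


def classify_cover(
    image: Image,
    detect: Callable[[Image], Optional[Box]],
    classify: Callable[[Image], bool],
) -> bool:
    """Step 1: locate the supralibros. Step 2: classify only the cropped region."""
    box = detect(image)
    if box is None:  # no provenance marker found on this cover
        return False
    return classify(crop(image, box))


# Toy stand-ins for trained models (hypothetical, for illustration only).
def toy_detect(image: Image) -> Optional[Box]:
    """Return the bounding box of all non-zero pixels, or None if there are none."""
    rows = [r for r, row in enumerate(image) if any(row)]
    cols = [c for row in image for c, value in enumerate(row) if value]
    if not rows:
        return None
    return (min(rows), min(cols), max(rows) + 1, max(cols) + 1)


def toy_classify(patch: Image) -> bool:
    """Accept a patch if more than half of its pixels are non-zero."""
    pixels = [value for row in patch for value in row]
    return sum(1 for value in pixels if value) > len(pixels) / 2


cover = [
    [0, 0, 0, 0],
    [0, 9, 9, 0],
    [0, 9, 0, 0],
    [0, 0, 0, 0],
]
blank = [[0, 0], [0, 0]]

print(classify_cover(cover, toy_detect, toy_classify))
print(classify_cover(blank, toy_detect, toy_classify))
```

In a real setting, `detect` and `classify` would wrap the trained CNN models; keeping them as interchangeable functions makes it straightforward to compare different architectures in the second step.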

The Catalogs

The information in the five-volume handwritten historical collection catalog, which has been digitized at the ONB, will be extracted with an HTR model in Transkribus. NLP methods (e.g., named entity recognition) will then be applied for the (semi-)automatic tagging of authors and publication places. The entries will be mapped to TEI-XML elements on the basis of a schema already developed by the ONB for the digital edition of another historical library catalog. The XML files and page images will be published as a digital edition in the ONB’s sustainable infrastructure for digital editions (edition.onb.ac.at). This digital edition will contain indices with bibliographic information on all titles, persons, and publication places.
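As a rough illustration of the mapping step, the sketch below turns one tagged catalog entry into a TEI-style `bibl` element. The element names (`bibl`, `author`, `title`, `pubPlace`, `date`) exist in TEI, but their selection here, the field names of the input dictionary, and the sample values are assumptions; the ONB schema mentioned above is not reproduced.

```python
import xml.etree.ElementTree as ET
from typing import Dict

# TEI element names used for the (hypothetical) entry fields, in output order.
FIELD_TO_ELEMENT = (
    ("author", "author"),
    ("title", "title"),
    ("place", "pubPlace"),
    ("year", "date"),
)


def entry_to_tei(entry: Dict[str, str], number: str) -> ET.Element:
    """Map one tagged catalog entry to a TEI-style <bibl> element."""
    bibl = ET.Element("bibl", {"n": number})
    for field, tag in FIELD_TO_ELEMENT:
        value = entry.get(field, "").strip()
        if value:  # leave out fields the tagging step could not fill
            ET.SubElement(bibl, tag).text = value
    return bibl


# Invented sample values, not taken from the historical catalog.
example = {
    "author": "Example Author",
    "title": "Example Title",
    "place": "Vienna",
    "year": "1700",
}
print(ET.tostring(entry_to_tei(example, "1"), encoding="unicode"))
```

Fields that the tagging step leaves empty are simply omitted, so partially recognized entries still yield well-formed XML that can be completed manually later.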

Figure 2: Excerpt of a page from the handwritten catalog.

To identify those books that are still in the ONB today, the search API of the digital catalog will be used. With the aid of fuzzy string matching, the titles, places, and years of publication in the historical catalog will be compared with those in the modern catalog. Furthermore, titles and available full texts will be assigned to subject categories using Annif, a tool for automated subject indexing, to gain deeper insights into the classification and subject areas of the library. The results of the subject classification will be mapped to the subject areas in the modern library catalog and later integrated into the digital edition of the historical catalog to create an additional subject index. The descriptions of the identified volumes with supralibros bindings will be generated automatically on the basis of the image classification results and supplemented manually if necessary. The digital edition will be enriched with descriptions of the identified objects, links to named entities, and references to the open-access catalog of the ONB. This approach has already been tested.
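The comparison of historical and modern catalog records can be illustrated with a small fuzzy-matching sketch. It uses Python's standard `difflib` module rather than whatever library the project employs, and the field names, weights, and threshold are assumptions chosen purely for illustration.

```python
from difflib import SequenceMatcher
from typing import Dict, List, Optional

# Illustrative weights and threshold; the project's actual settings are not stated.
WEIGHTS = {"title": 0.6, "place": 0.2, "year": 0.2}
THRESHOLD = 0.8


def normalize(text: str) -> str:
    """Lower-case and collapse whitespace so trivial differences do not count."""
    return " ".join(text.lower().split())


def similarity(a: str, b: str) -> float:
    """Similarity in [0, 1] based on difflib's matching-block ratio."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def record_score(historical: Dict[str, str], modern: Dict[str, str]) -> float:
    """Weighted sum of the per-field similarities (weights add up to 1)."""
    return sum(
        weight * similarity(historical.get(field, ""), modern.get(field, ""))
        for field, weight in WEIGHTS.items()
    )


def best_match(
    historical: Dict[str, str],
    candidates: List[Dict[str, str]],
    threshold: float = THRESHOLD,
) -> Optional[Dict[str, str]]:
    """Return the best-scoring candidate, or None if none reaches the threshold."""
    best = max(candidates, key=lambda c: record_score(historical, c), default=None)
    if best is None or record_score(historical, best) < threshold:
        return None
    return best
```

Candidates would typically come from a catalog search for the historical title; records that fall below the threshold can be set aside for manual review instead of being linked automatically.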

The metadata of the bibliographic entries will be published as a Linked Open Data (LOD) set that follows the DINI schema for RDF representations of bibliographic resources and is aligned with the DARIAH collection description schema.
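To give an impression of what such an RDF representation could look like, the following sketch serializes one bibliographic entry as N-Triples. The choice of Dublin Core properties, the `example.org` subject URI, and the sample values are assumptions for illustration; they are not taken from the DINI recommendations or from the project's data.

```python
from typing import Dict, List

DCTERMS = "http://purl.org/dc/terms/"
DC = "http://purl.org/dc/elements/1.1/"


def escape_literal(value: str) -> str:
    """Escape backslashes, quotes, and line breaks for an N-Triples literal."""
    return (
        value.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def to_ntriples(subject_uri: str, properties: Dict[str, str]) -> List[str]:
    """Emit one N-Triples line per (property URI, literal value) pair."""
    return [
        f'<{subject_uri}> <{prop}> "{escape_literal(value)}" .'
        for prop, value in properties.items()
    ]


# Hypothetical entry with an invented identifier and invented values.
lines = to_ntriples(
    "https://example.org/bed/entry/1",
    {
        DCTERMS + "title": "Example Title",
        DC + "creator": "Example Author",
        DCTERMS + "issued": "1700",
    },
)
print("\n".join(lines))
```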

Visualization and Communication

Data visualizations are intended to enable the exploration, representation, and public communication of the collection. They will allow the Eugeniana collection, its metadata, and their quality assessment to be represented from different analytical perspectives. In this way, they are meant to visually support distant reading and exploration of the collection, making it easier to analyze questions of composition and provenance, and to identify relevant patterns and information for further analyses and close reading.

The development of the visualizations follows a user-centered, iterative data-user-task approach: in sessions with target users, the available data are examined and the most relevant options for visual analysis and exploration are defined collaboratively. This analysis will serve as the basis for defining user requirements for the subsequent design and implementation of relevant visual perspectives and possible interactions. To ensure that the visualizations sufficiently support the intended tasks, interaction with the novel visualizations will be observed in a small user study, and the design will be refined based on the evaluation results.

In addition, a visual storytelling approach will be applied to present the history and provenance of the Bibliotheca Eugeniana in an engaging way to the public. The storyboard will be enriched with (interactive) visualizations and implemented by the UWK team in the form of a web-based story. The interface will be tested with target users from the general public and adjusted on the basis of these evaluation results. The visualized story of the provenance of the Bibliotheca Eugeniana will be an important outcome for supporting the public communication of the project’s results.

Dissemination

Organized Workshops

30.04.2024: “Co-Design Workshop Bibliotheca Eugeniana Digital” at the Austrian National Library
29.02.2024: Panel discussion at DHd 2024 titled “DH – Cui bono? Zielgruppenerschließung für Digital Humanities und Cultural Heritage” (DH – Cui bono? Target Group Engagement for Digital Humanities and Cultural Heritage)

Scientific Lectures

15.05.2023: Dissertation seminar “Quelle im Fokus: Aktuelle Fragestellungen und Methoden der Kodikologie und der Material Studies” (Source in Focus: Current Issues and Methods of Codicology and Material Studies), Faculty of Philological and Cultural Studies, University of Vienna
13.06.2023: Class “Editionstechnik/Digitale Edition” (Editing Technique/Digital Edition), Institute of History, University of Vienna
10.11.2023: “Erzeugung von Sichtbarkeit im Angesicht von Unsicherheit: Visualisierungsstrategien für die Bibliotheca Eugeniana Digital” (Creating Visibility in the Face of Uncertainty: Visualization Strategies for the Bibliotheca Eugeniana Digital), workshop “Vom Erkunden zur Erkenntnis? Ansätze und Perspektiven digitaler Sammlungsvisualisierungen” (From Exploration to Insight? Approaches and Perspectives of Digital Collection Visualizations), Research Library Gotha, University of Erfurt
22.11.2023: “#digiRoundtable V – Projektreigen, Status und Zukunft” (Project Series, Status and Future), Museum of Applied Arts, Vienna
28.11.2023: “43. Treffen der Systembibliothekarinnen und Systembibliothekare” (43rd Meeting of Systems Librarians), Austrian National Library
28.02.2024: “Über die Ordnung von materiellen und digitalen Dingen: Zur multi-klassifikatorischen Visualisierung der Bibliotheca Eugeniana” (On the Order of Material and Digital Things: Towards a Multi-Classifier Visualization of the Bibliotheca Eugeniana), DHd 2024, Passau
17.04.2024: Presentation of DH methods in the BED project, MA program “Museum und Collection Studies”, University for Continuing Education Krems
10.06.2024: Lecture at the committee meeting of the Association of Austrian Librarians, Universal Library Vienna
12.09.2024: Hybrid lecture at the conference “Für ein digitales historisches Museum der Euregio” (For a Digital Historical Museum of the Euregio) titled “Digitales Vermitteln mit Sammlungsvisualisierungen” (Digital Mediation with Collection Visualizations)
25.09.2024: “Bibliotheca Eugeniana Digital - Unveiling and Visualizing the Treasures of Prince Eugene of Savoy's Library”, 28th International Conference on Theory and Practice of Digital Libraries (TPDL) 2024, Ljubljana
25.10.2024: “Bibliotheca Eugeniana Digital. Eine sammlungswissenschaftliche Aufarbeitung der Bibliothek des Prinz Eugen von Savoyen” (A Collection-Scientific Study of the Library of Prince Eugene of Savoy), 4th Heritage Science Austria Meeting, University for Continuing Education Krems
07.11.2024: “European Cultural Memory in its Digitalization – Inventing Cultural Memory in the 21st Century?”, University of Graz
20.11.2024: “#digiRoundtable”, Museum of Applied Arts, Vienna
26.11.2024: ÖNB Labs Symposium, “Bibliotheca Eugeniana: Using Machine Learning in DH Research”
29.11.2024: “10. Tagung Digitale Bibliothek: Zurück (und) in die Zukunft” (10th Digital Library Conference: Back (and) to the Future), University of Graz

Publications

Simon Mayer, Olja Janjuš, Matej Ďurčo, Sophie Hammer, and Florian Windhager (Feb. 2024). “DH – Cui bono? Zielgruppenerschließung für Digital Humanities und Cultural Heritage” (Target Group Engagement for Digital Humanities and Cultural Heritage). In: DHd 2024 #Quo Vadis DH? Passau, Germany. doi: 10.5281/zenodo.10698214
Florian Windhager, Annerose Tartler, Simon Mayer, Johannes Liem, and Eva Mayr (Feb. 2024). “Über die Ordnung von materiellen und digitalen Dingen: Zur multi-klassifikatorischen Visualisierung der Bibliotheca Eugeniana” (On the Order of Material and Digital Things: Towards a Multi-Classifier Visualization of the Bibliotheca Eugeniana). In: DHd 2024 #Quo Vadis DH? Passau, Germany. doi: 10.5281/zenodo.10698329
Eva Mayr, Annerose Tartler, Florian Windhager, Johannes Liem, Michael Smuc, Max Kaiser, Monika Kiegler-Griensteidl, and Simon Mayer (Sept. 2024). “Bibliotheca Eugeniana Digital—Unveiling and Visualizing the Treasures of Prince Eugene of Savoy’s Library”. In: Linking Theory and Practice of Digital Libraries. 28th International Conference on Theory and Practice of Digital Libraries, TPDL 2024, Ljubljana, Slovenia, September 24–27, 2024, Proceedings, Part I. Ed. by Apostolos Antonacopoulos et al. Vol. 15177. Lecture Notes in Computer Science, pp. 62–75. doi: 10.1007/978-3-031-72437-4_4. Preprint: doi: 10.5281/zenodo.13847701
Simon Mayer, Christoph Steindl, and Annerose Tartler, eds. (Nov. 11, 2024). Eugeniana Digital. Digitale Edition des handschriftlichen Katalogs der Bibliothek Prinz Eugens (Digital Edition of the Handwritten Catalog of Prince Eugene’s Library). Vienna: Austrian National Library. url: edition.onb.ac.at/context:eugeniana
Annerose Tartler, Eva Mayr, Florian Windhager, and Simon Mayer (2024). “Digitale Erschließung historischer Bibliotheken: Erkenntnisse und Perspektiven aus dem Projekt Bibliotheca Eugeniana Digital” (Digital Processing of Historical Libraries: Insights and Perspectives from the Bibliotheca Eugeniana Digital Project). In: Bibliothek – Forschung und Praxis, vol. 49, no. 2, 2025, pp. 193-200. doi: 10.1515/bfp-2024-0074
Eva Mayr, Annerose Tartler, Florian Windhager, and Simon Mayer (2024). “Sammlungen als Daten – Das Projekt Bibliotheca Eugeniana Digital als Use Case aus der Österreichischen Nationalbibliothek” (Collections as Data – The Bibliotheca Eugeniana Digital Project as a Use Case from the Austrian National Library). In: Mitteilungen der Vereinigung Österreichischer Bibliothekarinnen und Bibliothekare, 78(1). doi: 10.31263/voebm.v78i1.9165
Florian Windhager, Michael Smuc, Simon Mayer, Annerose Tartler, and Eva Mayr (2025). “To BE or not to BE: Visualizing Conceptual and Material Knowledge Spaces of the Bibliotheca Eugeniana”. In: Digital Scholarship in the Humanities, in preparation
Simon Mayer, Eva Mayr, and Florian Windhager (2025). “Bridging Past and Present: Reconstructing Prince Eugene’s Library through Fuzzy String Matching”. In: Journal of Digital History, in preparation
Simon Mayer, Florian Windhager, and Eva Mayr (2025). Die Wissensklassen der Universalbibliothek des Prinz Eugen von Savoyen (The Knowledge Classes of Prince Eugene of Savoy’s Universal Library). Forschungsblog der Österreichischen Nationalbibliothek, in preparation
Eva Mayr, Florian Windhager, Annerose Tartler, and Simon Mayer (2025). “Bibliotheca Eugeniana Digital—eine sammlungswissenschaftliche Aufarbeitung der Bibliothek des Prinz Eugen von Savoyen” (Bibliotheca Eugeniana Digital - A Collection-Scientific Study of the Library of Prince Eugene of Savoy). In: Das Erbe der Adels- und Klosterkultur. Heritage Science aus sammlungswissenschaftlicher Perspektive (The Heritage of Aristocratic and Monastic Culture. Heritage Science from a Collection-Scientific Perspective). Universität für Weiterbildung Krems, accepted

Reception

11.05.2024: “Auf der Suche nach Prinz Eugens verlorener Privatbibliothek” (In Search of Prince Eugene’s Lost Private Library). Article in “Der Standard” by Paul M. Horntrich

Results

A prototype already allows a first exploration of Prince Eugene’s historical holdings through a visualization of the State Hall.

Figure 3: First prototype of the visualization of the holdings of the Bibliotheca Eugeniana located in the central oval of the State Hall.

The digital edition of the handwritten catalog of the “Bibliotheca Eugeniana” has been published on the Austrian National Library’s platform for digital editions.

Figure 4: Preview of the digital edition of the handwritten catalog of the Bibliotheca Eugeniana.

The code and the data of the project can be viewed via the project’s open GitLab repository.

Project Team

Project staff:

Simon Mayer (simon.mayer@onb.ac.at)
Eva Mayr (eva.mayr@donau-uni.ac.at)
Michael Smuc (office@mindfactor.at)
Annerose Tartler (annerose.tartler@onb.ac.at)
Florian Windhager (florian.windhager@donau-uni.ac.at)

Advisors and collection experts:

Max Kaiser
Katharina Kaska
Monika Kiegler-Griensteidl
Martin Krickl
Christoph Steindl

Former project staff:

Johannes Liem (johannes.liem@donau-uni.ac.at)

Interns:

Pol Edinger (03.-14.07.2023)
Gabriel Fritzsche (12.-16.06.2023)
Lena Fuchs (08.-25.04.2024)
Tobias Goldberg (03.-21.07.2023)
Philipp Grabowski (21.05.-06.06.2024)
Angelika Rayer (03.10.-28.11.2023)

Contact

For questions or suggestions, please contact: bed-project@onb.ac.at.

Data Management

On request, information on the data generated in the project can be viewed in the data management plan.
