From ffb377f9daadecd425e5a7346ea56f91e2101d1c Mon Sep 17 00:00:00 2001
From: onb1259 <onb1259@onb.ac.at>
Date: Wed, 18 Oct 2023 14:18:18 +0200
Subject: [PATCH] added README

---
 README.md | 16 ++++++++++++++++
 1 file changed, 16 insertions(+)
 create mode 100644 README.md

diff --git a/README.md b/README.md
new file mode 100644
index 0000000..3297a18
--- /dev/null
+++ b/README.md
@@ -0,0 +1,16 @@
+# Extract figures by IIIF manifest
+
+[Open `extract_figures.ipynb` on Binder](https://mybinder.org/v2/git/https%3A%2F%2Flabs.onb.ac.at%2Fgitlab%2Fa.rabensteiner%2Fextract_figures_abo/HEAD?labpath=extract_figures.ipynb)
+
+This repository provides a Jupyter notebook that uses a YOLOv8 model to extract figures from a book, given the URL of its IIIF manifest.
+
+The model was trained on the following five books from the ABO corpus:
+- <http://data.onb.ac.at/ABO/%2BZ97792402>
+- <http://data.onb.ac.at/ABO/%2BZ155502807>
+- <http://data.onb.ac.at/ABO/%2BZ156318706>
+- <http://data.onb.ac.at/ABO/%2BZ164403901>
+- <http://data.onb.ac.at/ABO/%Z259182702>
+
+These books comprise approximately 1700 pages, 250 of which contain figures. The figures were annotated with bounding boxes using the image annotation web service [CVAT](https://www.cvat.ai/). The model was trained locally with the nano version of YOLOv8. The resulting figure detection model is provided as [model_extract_figures.pt](model_extract_figures.pt).
+
+
-- 
GitLab
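
The README describes the workflow only in prose, so here is a minimal sketch of how figure extraction from a IIIF manifest could look. It is not the code from `extract_figures.ipynb`. It assumes the manifest follows the IIIF Presentation API 2.x layout, that each page image exposes a single IIIF Image API service, and that the `ultralytics` package is installed for the detection step. The function names `load_manifest`, `image_services`, `region_url`, and `detect_figures` are hypothetical.

```python
import json
from urllib.request import urlopen


def load_manifest(url: str) -> dict:
    """Download and parse a IIIF manifest (needs network access)."""
    with urlopen(url) as response:
        return json.load(response)


def image_services(manifest: dict) -> list[str]:
    """Return the IIIF Image API base URL of every page image.

    Assumption: Presentation API 2.x layout
    (sequences -> canvases -> images -> resource -> service),
    with "service" given as a single object rather than a list.
    """
    services = []
    for sequence in manifest.get("sequences", []):
        for canvas in sequence.get("canvases", []):
            for image in canvas.get("images", []):
                service = image["resource"]["service"]["@id"]
                services.append(service.rstrip("/"))
    return services


def region_url(service: str, box: tuple[float, float, float, float]) -> str:
    """Build an Image API 2.x URL that crops the pixel box (x1, y1, x2, y2)."""
    x1, y1, x2, y2 = (round(v) for v in box)
    # URL pattern: {service}/{region}/{size}/{rotation}/{quality}.{format}
    return f"{service}/{x1},{y1},{x2 - x1},{y2 - y1}/full/0/default.jpg"


def detect_figures(image_path: str, weights: str = "model_extract_figures.pt"):
    """Return one (x1, y1, x2, y2) box per figure detected on a page image."""
    # Imported lazily so the helpers above work without ultralytics installed.
    from ultralytics import YOLO

    result = YOLO(weights)(image_path)[0]
    return [tuple(xyxy) for xyxy in result.boxes.xyxy.tolist()]
```

The boxes returned by `detect_figures` are in the pixel coordinates of the image passed to the model. If detection runs on a downscaled page image, the boxes would need to be scaled back to full resolution before being handed to `region_url`.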