diff --git a/README.md b/README.md
new file mode 100644
index 0000000000000000000000000000000000000000..3297a184a59b348289013d9db29827401172f10d
--- /dev/null
+++ b/README.md
@@ -0,0 +1,16 @@
+# Extract figures by IIIF manifest
+
+[Launch on Binder](https://mybinder.org/v2/git/https%3A%2F%2Flabs.onb.ac.at%2Fgitlab%2Fa.rabensteiner%2Fextract_figures_abo/HEAD?labpath=extract_figures.ipynb)
+
+This repository provides a Jupyter notebook that uses a YOLOv8 model to extract figures from a book, given the URL of its IIIF manifest.
+
+The model was trained on the following five books from the ABO corpus:
+- <http://data.onb.ac.at/ABO/%2BZ97792402>
+- <http://data.onb.ac.at/ABO/%2BZ155502807>
+- <http://data.onb.ac.at/ABO/%2BZ156318706>
+- <http://data.onb.ac.at/ABO/%2BZ164403901>
+- <http://data.onb.ac.at/ABO/%Z259182702>
+
+These books comprise approximately 1700 pages, of which 250 contain figures. The figures were annotated with bounding boxes using the image annotation web service [CVAT](https://www.cvat.ai/). Training was done locally with the nano version of YOLOv8. The resulting figure detection model is provided as [model_extract_figures.pt](model_extract_figures.pt).
+
+
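
The workflow the README describes (read a IIIF manifest, then run the figure detector on each page image) might look roughly like the sketch below. It is not the notebook's actual code. It assumes the manifest follows the IIIF Presentation API 2.x layout (`sequences` → `canvases` → `images`) and that the `ultralytics` package is installed for the detection step; the function names `image_urls_from_manifest` and `detect_figures` and the `min_confidence` default are hypothetical.

```python
from typing import Any


def image_urls_from_manifest(manifest: dict[str, Any]) -> list[str]:
    """Collect one image URL per canvas from a IIIF Presentation 2.x manifest.

    Assumption: each canvas lists its page image under
    images[0]["resource"]["@id"], as in the 2.x specification.
    """
    urls = []
    for sequence in manifest.get("sequences", []):
        for canvas in sequence.get("canvases", []):
            images = canvas.get("images", [])
            if not images:
                continue  # skip canvases that carry no image
            urls.append(images[0]["resource"]["@id"])
    return urls


def detect_figures(
    image_url: str,
    weights: str = "model_extract_figures.pt",
    min_confidence: float = 0.5,  # hypothetical threshold, tune as needed
) -> list[list[float]]:
    """Return [x1, y1, x2, y2] pixel boxes for figures detected on one page."""
    # Imported lazily so the manifest helper works without ultralytics.
    from ultralytics import YOLO

    # Loading the weights on every call is simple but slow; a real
    # pipeline would load the model once and reuse it for all pages.
    model = YOLO(weights)
    result = model.predict(image_url, conf=min_confidence)[0]
    return result.boxes.xyxy.tolist()
```

In use, the manifest JSON would be downloaded and parsed first, then each URL returned by `image_urls_from_manifest` passed to `detect_figures`; the returned boxes can be used to crop the figures out of the page image.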