diff --git a/1 - Overview.ipynb b/1 - Overview.ipynb index 8e31af8d7f8fa2fc196cc8e88866c86d786aed78..a4ccd972e3df50a7466defd1e2b5395e54f00102 100644 --- a/1 - Overview.ipynb +++ b/1 - Overview.ipynb @@ -16,6 +16,8 @@ "\n", "[https://labs.onb.ac.at](https://labs.onb.ac.at)\n", "\n", + "https://labs.onb.ac.at/gitlab/labs-team/pydays19\n", + "\n", "*Georg Petz, Stefan Karner - Austrian National Library*" ] }, @@ -29,7 +31,9 @@ "source": [ "# 1 - Overview\n", "\n", - "[https://labs.onb.ac.at](https://labs.onb.ac.at)" + "[https://labs.onb.ac.at](https://labs.onb.ac.at)\n", + "\n", + "[https://labs.onb.ac.at/gitlab/labs-team/pydays19](https://labs.onb.ac.at/gitlab/labs-team/pydays19)" ] }, { @@ -44,8 +48,9 @@ "\n", "* Part 1: Overview\n", " * What's this all about?\n", - " * Who are these clowns?\n", + " * Who are these people?\n", " * What do I need?\n", + " * How do we want to do this?\n", "* Part 2: Metadata and Catalogue\n", "* Part 3: Images and Text\n", "* Part 4: Webarchive" @@ -53,17 +58,127 @@ }, { "cell_type": "markdown", - "metadata": {}, + "metadata": { + "slideshow": { + "slide_type": "subslide" + } + }, + "source": [ + "### What's this all about?\n", + "\n", + "* The Austrian National Library offers data, free to use\n", + "* We want to show you roughly what data you can expect\n", + "* We want to show you how to work with the data interfaces" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "slideshow": { + "slide_type": "fragment" + } + }, + "source": [ + "#### Why?\n", + "\n", + "* If you ever need this kind of data\n", + "* For fun\n", + "* If you do something interesting with ONB's assets, there's a good chance you get to present it at the ONB Labs symposium on the 13th of June 2019 live, if you want that" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "slideshow": { + "slide_type": "subslide" + } + }, + "source": [ + "#### Data and Interfaces\n", + "\n", + "* **Metadata**: catalogue data, metadata for historic postcards and historic newspapers, SPARQL\n", + "* **Images and text**: Text for historic newspapers, images and text for historic newspapers and historic postcards\n", + "* **Webarchive**: Saved Austrian pages from roughly 2009 up until now; search: yes, view: no" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "slideshow": { + "slide_type": "subslide" + } + }, + "source": [ + "### Who are these people?" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "slideshow": { + "slide_type": "fragment" + } + }, + "source": [ + "* Presenters" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "slideshow": { + "slide_type": "fragment" + } + }, + "source": [ + "* Participants" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "slideshow": { + "slide_type": "fragment" + } + }, "source": [ - "**TODO**\n", + "#### What are you interested in?" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "slideshow": { + "slide_type": "subslide" + } + }, + "source": [ + "### What do I need?\n", "\n", - "* Im Workshop oder nach dem Workshop erschaffene Dinge beim Symposium präsentieren?\n", - "* Erwähnung \"Bring Your Project\"\n" + "* The repository at [https://labs.onb.ac.at/gitlab/labs-team/pydays19](https://labs.onb.ac.at/gitlab/labs-team/pydays19) in its freshest form\n", + "* A working Python3 installation\n", + "* A venv with the requirements installed\n", + "* A `jupyter notebook` running inside the venv" ] }, { "cell_type": "markdown", - "metadata": {}, + "metadata": { + "slideshow": { + "slide_type": "subslide" + } + }, + "source": [ + "### How do we want to do this?" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "slideshow": { + "slide_type": "slide" + } + }, "source": [ "## Used Libraries\n", "\n", @@ -72,15 +187,9 @@ "* **jsonpath** - xpath for json [https://github.com/h2non/jsonpath-ng](https://github.com/h2non/jsonpath-ng)\n", "* **lxml** - xml parser and xpath (version 1) implementation [https://lxml.de/](https://lxml.de/)\n", "* **sickle** - OAI-PMH for humans [https://sickle.readthedocs.io/en/latest/](https://sickle.readthedocs.io/en/latest/)\n", - "* **pyswagger** - dynamic OpenAPI / Swagger client [https://github.com/pyopenapi/pyswagger](https://github.com/pyopenapi/pyswagger)" + "* **pyswagger** - dynamic OpenAPI / Swagger client [https://github.com/pyopenapi/pyswagger](https://github.com/pyopenapi/pyswagger)\n", + "* **sparqlwrapper** - SPARQL endpoint interface to python [https://rdflib.github.io/sparqlwrapper/](https://rdflib.github.io/sparqlwrapper/)" ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": {}, - "outputs": [], - "source": [] } ], "metadata": { diff --git a/html-versions/1 - Overview.html b/html-versions/1 - Overview.html new file mode 100644 index 0000000000000000000000000000000000000000..e8269fe84d823f6dc66d701cc3e344ac03cabd1b --- /dev/null +++ b/html-versions/1 - Overview.html @@ -0,0 +1,13269 @@ + + +
+ +Use Images, Text, Webarchive and Catalogue Data from the Austrian National Library in Jupyter Notebooks
+Workshop 2019-05-03 - PyDays19
+ +https://labs.onb.ac.at/gitlab/labs-team/pydays19
+Georg Petz, Stefan Karner - Austrian National Library
+ +jupyter notebook
running inside the venv