diff --git a/README.md b/README.md index cf99afac6212f52ebfe6e24076adc6b47846c46e..7ba2d5ad39773ea6835ef2dc2e35eaaddfd73d40 100644 --- a/README.md +++ b/README.md @@ -6,6 +6,7 @@ The project aims to showcase the use of topic modelling, particularly the ETM, f This project aims to showcase how topic modelling can be used to compare historical newspapers. In the notebooks folder, there are notebooks concerning the tokenization, the preprocessing, the exploratory data analysis, the model fitting and the output analysis of the Wiener Zeitung and the Salzburger Intelligenzblatt. However, the code can easily be adapted to other needs. The repository also includes a requirements file, stating which packages are needed to run the whole code. For further reading of the specific steps, you can go to: [link]. Primary contribution by Thomas Kirchmair: +- project conception - data preprocessing - model training - model evaluation