We used Python==3.9.18, and we recommend using a virtual environment to install the required packages. We used a subset of the ArXiv dataset from Kaggle which ...