The repository contains all datasets and evaluations for the statbot.swiss benchmark.
-
conda create -n statbot python=3.11.4 -
conda activate statbot -
pip install -r requirements.txt
-
provide openai-api-key in
src/config.jsonfile. -
reproduce results based on gpt-3.5-turbo-16k run the command below.
-
For random shot selection, choose
randomand for the selection based on similarity choosesimilarityfor--shot-selection-strategy.
-
Command to run:
python src/main.py --shot-selection-strategy <random|similarity>
This work is the output of the INODE4StatBot.swiss project, funded by the Swiss Federal Statistical Office.
It is a collaboration between the Institute of Computer Science at ZHAW, the Competence Center for Data Science of the Federal Statistical Office, the Cantonal Statistical Office Zurich, and the Swiss Data Science Center.
This work is licensed under the MIT license.
Third-party software and data are subject to their respective licenses.
Please cite the paper as follows:
@article{statbot2024,
title={StatBot. Swiss: Bilingual Open Data Exploration in Natural Language},
author={Nooralahzadeh, Farhad and Zhang, Yi and Smith, Ellery and Maennel, Sabine and Matthey-Doret, Cyril and de Fondville, Rapha{\"e}l and Stockinger, Kurt},
journal={arXiv preprint arXiv:2406.03170},
year={2024}
}