ir_datasets: Catalog
ir_datasets provides a common interface to many IR ranking datasets.
Install with pip:
pip install ir_datasets==0.4.2
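As a rough illustration of what the common interface looks like in practice, the sketch below peeks at the first few documents, queries, and relevance judgments of a dataset. The helper names `preview` and `load_and_preview` are hypothetical and not part of the package; the dataset identifier "vaswani" is an assumption chosen as an example of a small, automatically downloadable collection. Not every dataset provides all three record types, so treat this as a starting point rather than a definitive recipe.

```python
from itertools import islice


def preview(dataset, n=3):
    """Collect the first n docs, queries, and qrels from a dataset object.

    `preview` is a hypothetical helper for illustration. It only assumes the
    object exposes docs_iter(), queries_iter(), and qrels_iter(), which is
    the iterator interface ir_datasets dataset objects provide when the
    corresponding data exists.
    """
    return {
        "docs": list(islice(dataset.docs_iter(), n)),
        "queries": list(islice(dataset.queries_iter(), n)),
        "qrels": list(islice(dataset.qrels_iter(), n)),
    }


def load_and_preview(dataset_id="vaswani", n=3):
    """Load a dataset by identifier and print a short preview.

    The default identifier "vaswani" is an assumption; substitute any
    identifier listed in the catalog. The first call may download data.
    """
    import ir_datasets  # imported here so `preview` stays usable without it

    dataset = ir_datasets.load(dataset_id)
    for kind, records in preview(dataset, n).items():
        print(kind)
        for record in records:
            print("  ", record)
```

Calling `load_and_preview()` after installation should print a handful of records of each type. Because every dataset is reached through the same iterator methods, the same helper can be pointed at a different identifier without other changes, provided that dataset includes documents, queries, and qrels.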
✅: Data available as automatic download
⚠️: Data available from a third party
When using datasets provided by this package, be sure to cite them properly. BibTeX for each dataset can be found on that dataset's documentation page.
If you use this tool, please cite our SIGIR resource paper:
@inproceedings{macavaney:sigir2021-irds,
  author = {MacAvaney, Sean and Yates, Andrew and Feldman, Sergey and Downey, Doug and Cohan, Arman and Goharian, Nazli},
  title = {Simplified Data Wrangling with ir_datasets},
  year = {2021},
  booktitle = {SIGIR}
}