ir_datasets: CSL
The CSL dataset, used for the TREC NeuCLIR technical document task.
Language: zh
Examples:
import ir_datasets
dataset = ir_datasets.load("csl")
for doc in dataset.docs_iter():
    doc  # namedtuple<doc_id, title, abstract, keywords, category, category_eng, discipline, discipline_eng>
You can find more details about the Python API here.
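As a rough illustration of working with the document fields listed above, the sketch below tallies documents by one field. It does not load the real collection: `CslDoc` is a hypothetical stand-in namedtuple that only copies the field names from the listing, `count_by_field` is an illustrative helper rather than part of ir_datasets, and the sample records are made-up placeholders, not CSL data.

```python
from collections import Counter, namedtuple

# Hypothetical stand-in for the namedtuple yielded by dataset.docs_iter();
# only the field names are taken from the listing above.
CslDoc = namedtuple(
    "CslDoc",
    ["doc_id", "title", "abstract", "keywords", "category",
     "category_eng", "discipline", "discipline_eng"],
)


def count_by_field(docs, field="category_eng"):
    """Count documents per value of the given namedtuple field."""
    return Counter(getattr(doc, field) for doc in docs)


# Made-up placeholder records (values and value types are assumptions).
sample_docs = [
    CslDoc("d1", "t1", "a1", "k1", "c1", "Engineering", "x1", "Computer Science"),
    CslDoc("d2", "t2", "a2", "k2", "c1", "Engineering", "x2", "Mechanics"),
    CslDoc("d3", "t3", "a3", "k3", "c2", "Medicine", "x3", "Pharmacy"),
]

counts = count_by_field(sample_docs)
print(counts.most_common())  # [('Engineering', 2), ('Medicine', 1)]
```

With the real dataset, the same helper could presumably be called as `count_by_field(dataset.docs_iter())`, which walks the whole collection once.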
The TREC NeuCLIR 2023 technical document task.
Language: multiple/other/unknown
Examples:
import ir_datasets
dataset = ir_datasets.load("csl/trec-2023")
for query in dataset.queries_iter():
    query  # namedtuple<query_id, title, description, narrative, ht_title, ht_description, ht_narrative, mt_title, mt_description, mt_narrative, translation_lang>
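The query namedtuple carries three parallel sets of text fields: unprefixed, `ht_`, and `mt_`. The sketch below shows one possible way to build a query string from a chosen set. `CslQuery` is a hypothetical stand-in that only copies the field names from the listing, `query_text` is an illustrative helper rather than part of ir_datasets, and the sample values are placeholders. What each prefix means is not stated here, so the code treats the prefix as an opaque string.

```python
from collections import namedtuple

# Hypothetical stand-in for the namedtuple yielded by dataset.queries_iter();
# only the field names are taken from the listing above.
CslQuery = namedtuple(
    "CslQuery",
    ["query_id", "title", "description", "narrative",
     "ht_title", "ht_description", "ht_narrative",
     "mt_title", "mt_description", "mt_narrative",
     "translation_lang"],
)


def query_text(query, prefix="", fields=("title", "description")):
    """Join the non-empty values of the chosen fields.

    prefix="" reads title/description; prefix="mt_" reads
    mt_title/mt_description, and so on.
    """
    parts = [getattr(query, prefix + name) for name in fields]
    return " ".join(part for part in parts if part)


# Made-up placeholder record, not a real topic.
sample_query = CslQuery(
    "q1",
    "title-a", "description-a", "narrative-a",
    "title-b", "description-b", "narrative-b",
    "title-c", "", "narrative-c",
    "zh",
)

print(query_text(sample_query))                # title-a description-a
print(query_text(sample_query, prefix="mt_"))  # title-c
```

Skipping empty fields keeps the joined string free of stray spaces when a variant is only partially filled in.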