ir_datasets: TREC Arabic
A collection of news articles in Arabic, used for multi-lingual evaluation in TREC 2001 and TREC 2002.
Document collection from LDC2001T55.
Arabic benchmark from TREC 2001.
Language: ar
Example
import ir_datasets
dataset = ir_datasets.load('trec-arabic/ar2001')
for query in dataset.queries_iter():
    query # namedtuple<query_id, title, description, narrative>
Arabic benchmark from TREC 2002.
Language: ar
Example
import ir_datasets
dataset = ir_datasets.load('trec-arabic/ar2002')
for query in dataset.queries_iter():
    query # namedtuple<query_id, title, description, narrative>
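Both examples yield query records with the fields query_id, title, description, and narrative. As a minimal sketch of working with those fields, the snippet below builds a lookup from query_id to title. So that it runs without downloading the collection, it uses a hypothetical stand-in namedtuple (TrecQuery) and placeholder records. Only the field names come from the examples; the helper titles_by_id is an illustrative name, not part of ir_datasets.

```python
from collections import namedtuple

# Hypothetical stand-in for the records yielded by dataset.queries_iter();
# only the field names are taken from the examples in this page.
TrecQuery = namedtuple("TrecQuery", ["query_id", "title", "description", "narrative"])


def titles_by_id(queries):
    """Map each query_id to its title, given any iterable of query records."""
    return {q.query_id: q.title for q in queries}


# Placeholder records for illustration only, not real TREC topics.
sample_queries = [
    TrecQuery("1", "sample title one", "sample description", "sample narrative"),
    TrecQuery("2", "sample title two", "sample description", "sample narrative"),
]

lookup = titles_by_id(sample_queries)
print(lookup["1"])
```

Assuming the package is installed and the topics are available locally, passing dataset.queries_iter() to titles_by_id should behave the same way, since the helper reads only the query_id and title attributes.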