ir_datasets: TREC Spanish
A collection of news articles in Spanish, used for multi-lingual evaluation in TREC 3 and TREC 4.
Document collection from LDC2000T51.
Language: es
Example
import ir_datasets
dataset = ir_datasets.load('trec-spanish')
for doc in dataset.docs_iter():
    doc  # namedtuple<doc_id, text, marked_up_doc>
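When exploring the documents, it can help to look at only the first few records rather than iterating over everything. The following is a minimal sketch, assuming only the documented fields (`doc_id`, `text`, `marked_up_doc`). The `Doc` namedtuple, the `peek` helper, and the sample records are hypothetical stand-ins so the snippet runs without the LDC data.

```python
from collections import namedtuple
from itertools import islice

# Hypothetical stand-in mirroring the documented doc fields; values are made up.
Doc = namedtuple("Doc", ["doc_id", "text", "marked_up_doc"])

def peek(docs, n=2):
    """Return the first n records from any iterable of docs."""
    return list(islice(docs, n))

sample_docs = [
    Doc("D1", "Primer artículo.", "<DOC>Primer artículo.</DOC>"),
    Doc("D2", "Segundo artículo.", "<DOC>Segundo artículo.</DOC>"),
    Doc("D3", "Tercer artículo.", "<DOC>Tercer artículo.</DOC>"),
]

first_docs = peek(iter(sample_docs), n=2)
first_ids = [doc.doc_id for doc in first_docs]
```

With the real collection available, `peek(dataset.docs_iter())` would take the place of the sample list.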
Spanish benchmark from TREC 3.
Language: multiple/other/unknown
Example
import ir_datasets
dataset = ir_datasets.load('trec-spanish/trec3')
for query in dataset.queries_iter():
    query  # namedtuple<query_id, title_es, title_en, description_es, description_en, narrative_es, narrative_en>
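Because each TREC 3 query carries Spanish and English versions of its fields, a retrieval experiment usually has to pick one language and one or more fields. The sketch below shows one way to do that, relying only on the field names listed above. The `Trec3Query` stand-in, the `query_text` helper, and the sample values are hypothetical.

```python
from collections import namedtuple

# Hypothetical stand-in with the documented field names; the values are made up.
Trec3Query = namedtuple(
    "Trec3Query",
    ["query_id", "title_es", "title_en", "description_es", "description_en",
     "narrative_es", "narrative_en"],
)

def query_text(query, lang="es", fields=("title", "description")):
    """Join the requested fields for one language into a single search string."""
    parts = [getattr(query, f"{field}_{lang}") for field in fields]
    # Skip empty fields so the result has no stray spaces.
    return " ".join(part for part in parts if part)

sample = Trec3Query(
    "SP1",
    "Ejemplo de título", "Example title",
    "Ejemplo de descripción.", "Example description.",
    "", "",
)

spanish = query_text(sample)
english = query_text(sample, lang="en", fields=("title",))
```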
Language: es
Example
import ir_datasets
dataset = ir_datasets.load('trec-spanish/trec3')
for doc in dataset.docs_iter():
    doc  # namedtuple<doc_id, text, marked_up_doc>
Relevance levels
Rel. | Definition |
---|---|
0 | not relevant |
1 | relevant |
Example
import ir_datasets
dataset = ir_datasets.load('trec-spanish/trec3')
for qrel in dataset.qrels_iter():
    qrel  # namedtuple<query_id, doc_id, relevance, iteration>
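For evaluation, the flat stream of judgments is often regrouped into a per-query set of relevant documents. A minimal sketch of that regrouping follows, using the relevance levels from the table above (0 = not relevant, 1 = relevant). The `Qrel` stand-in, the `relevant_docs_by_query` helper, and the sample judgments are hypothetical.

```python
from collections import defaultdict, namedtuple

# Hypothetical stand-in mirroring the documented qrel fields; values are made up.
Qrel = namedtuple("Qrel", ["query_id", "doc_id", "relevance", "iteration"])

def relevant_docs_by_query(qrels, min_relevance=1):
    """Map each query_id to the set of doc_ids judged at or above min_relevance."""
    relevant = defaultdict(set)
    for qrel in qrels:
        if qrel.relevance >= min_relevance:
            relevant[qrel.query_id].add(qrel.doc_id)
    return dict(relevant)

sample_qrels = [
    Qrel("1", "D1", 1, "0"),
    Qrel("1", "D2", 0, "0"),
    Qrel("2", "D3", 1, "0"),
]

judged = relevant_docs_by_query(sample_qrels)
```

With the real data, `relevant_docs_by_query(dataset.qrels_iter())` would be the equivalent call. Queries with no relevant documents do not appear in the result.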
Spanish benchmark from TREC 4.
Language: multiple/other/unknown
Example
import ir_datasets
dataset = ir_datasets.load('trec-spanish/trec4')
for query in dataset.queries_iter():
    query  # namedtuple<query_id, description_es1, description_en1, description_es2, description_en2>
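The TREC 4 query record has two numbered description fields per language. This page does not say how the two differ, so the sketch below only gathers whichever ones are non-empty for a chosen language. The `Trec4Query` stand-in, the `descriptions` helper, and the sample values are hypothetical.

```python
from collections import namedtuple

# Hypothetical stand-in with the documented field names; the values are made up.
Trec4Query = namedtuple(
    "Trec4Query",
    ["query_id", "description_es1", "description_en1",
     "description_es2", "description_en2"],
)

def descriptions(query, lang="es"):
    """Return the non-empty numbered description fields for one language, in order."""
    names = [f"description_{lang}{i}" for i in (1, 2)]
    return [getattr(query, name) for name in names if getattr(query, name)]

sample = Trec4Query(
    "SP26",
    "Primera descripción.", "First description.",
    "", "Second description.",
)

spanish_descriptions = descriptions(sample)
english_descriptions = descriptions(sample, lang="en")
```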
Language: es
Example
import ir_datasets
dataset = ir_datasets.load('trec-spanish/trec4')
for doc in dataset.docs_iter():
    doc  # namedtuple<doc_id, text, marked_up_doc>
Relevance levels
Rel. | Definition |
---|---|
0 | not relevant |
1 | relevant |
Example
import ir_datasets
dataset = ir_datasets.load('trec-spanish/trec4')
for qrel in dataset.qrels_iter():
    qrel  # namedtuple<query_id, doc_id, relevance, iteration>