ir_datasets
: mMARCOA version of the MS MARCO passage dataset (msmarco-passage) with the queries and documents automatically translated into several languages.
Bibtex:
@article{Bonifacio2021MMarco, title={{mMARCO}: A Multilingual Version of {MS MARCO} Passage Ranking Dataset}, author={Luiz Henrique Bonifacio and Israel Campiotti and Roberto Lotufo and Rodrigo Nogueira}, year={2021}, journal={arXiv:2108.13897} }Version of msmarco-passage, with documents translated into German.
Language: de
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/de")
for doc in dataset.docs_iter():
doc # namedtuple<doc_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into German.
Language: de
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/de/dev")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev/small, with queries and documents translated into German.
Language: de
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/de/dev/small")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/train, with queries and documents translated into German.
Language: de
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/de/train")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage, with documents translated into Spanish.
Language: es
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/es")
for doc in dataset.docs_iter():
doc # namedtuple<doc_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into Spanish.
Language: es
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/es/dev")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev/small, with queries and documents translated into Spanish.
Language: es
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/es/dev/small")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/train, with queries and documents translated into Spanish.
Language: es
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/es/train")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage, with documents translated into French.
Language: fr
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/fr")
for doc in dataset.docs_iter():
doc # namedtuple<doc_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into French.
Language: fr
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/fr/dev")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev/small, with queries and documents translated into French.
Language: fr
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/fr/dev/small")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/train, with queries and documents translated into French.
Language: fr
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/fr/train")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage, with documents translated into Indonesian.
Language: id
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/id")
for doc in dataset.docs_iter():
doc # namedtuple<doc_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into Indonesian.
Language: id
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/id/dev")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev/small, with queries and documents translated into Indonesian.
Language: id
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/id/dev/small")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/train, with queries and documents translated into Indonesian.
Language: id
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/id/train")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage, with documents translated into Italian.
Language: it
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/it")
for doc in dataset.docs_iter():
doc # namedtuple<doc_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into Italian.
Language: it
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/it/dev")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev/small, with queries and documents translated into Italian.
Language: it
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/it/dev/small")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/train, with queries and documents translated into Italian.
Language: it
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/it/train")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage, with documents translated into Portuguese.
Language: pt
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/pt")
for doc in dataset.docs_iter():
doc # namedtuple<doc_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into Portuguese.
Language: pt
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/pt/dev")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev/small, with queries and documents translated into Portuguese.
Language: pt
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/pt/dev/small")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into Portuguese.
Version 1.1 of this file includes manual corrections from the authorss of the translated files. See discussion here. It also removes some duplicated query IDs.
Language: pt
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/pt/dev/small/v1.1")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into Portuguese.
Version 1.1 of this file includes manual corrections from the authorss of the translated files. See discussion here. It also removes some duplicated query IDs.
Language: pt
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/pt/dev/v1.1")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/train, with queries and documents translated into Portuguese.
Language: pt
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/pt/train")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/train, with queries and documents translated into Portuguese.
Version 1.1 of this file includes manual corrections from the authorss of the translated files. See discussion here. It also removes some duplicated query IDs.
Language: pt
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/pt/train/v1.1")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage, with documents translated into Russian.
Language: ru
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/ru")
for doc in dataset.docs_iter():
doc # namedtuple<doc_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into Russian.
Language: ru
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/ru/dev")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev/small, with queries and documents translated into Russian.
Language: ru
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/ru/dev/small")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/train, with queries and documents translated into Russian.
Language: ru
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/ru/train")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage, with queries and documents translated into Arabic.
Language: ar
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/ar")
for doc in dataset.docs_iter():
doc # namedtuple<doc_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into Arabic.
Language: ar
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/ar/dev")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev/small, with queries and documents translated into Arabic.
Language: ar
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/ar/dev/small")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/train, with queries and documents translated into Arabic.
Language: ar
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/ar/train")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage, with queries and documents translated into German.
Language: de
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/de")
for doc in dataset.docs_iter():
doc # namedtuple<doc_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into German.
Language: de
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/de/dev")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev/small, with queries and documents translated into German.
Language: de
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/de/dev/small")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/train, with queries and documents translated into German.
Language: de
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/de/train")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage, with queries and documents translated into Dutch.
Language: dt
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/dt")
for doc in dataset.docs_iter():
doc # namedtuple<doc_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into Dutch.
Language: dt
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/dt/dev")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev/small, with queries and documents translated into Dutch.
Language: dt
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/dt/dev/small")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/train, with queries and documents translated into Dutch.
Language: dt
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/dt/train")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage, with queries and documents translated into Spanish.
Language: es
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/es")
for doc in dataset.docs_iter():
doc # namedtuple<doc_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into Spanish.
Language: es
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/es/dev")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev/small, with queries and documents translated into Spanish.
Language: es
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/es/dev/small")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/train, with queries and documents translated into Spanish.
Language: es
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/es/train")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage, with queries and documents translated into French.
Language: fr
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/fr")
for doc in dataset.docs_iter():
doc # namedtuple<doc_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into French.
Language: fr
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/fr/dev")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev/small, with queries and documents translated into French.
Language: fr
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/fr/dev/small")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/train, with queries and documents translated into French.
Language: fr
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/fr/train")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage, with queries and documents translated into Hindi.
Language: hi
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/hi")
for doc in dataset.docs_iter():
doc # namedtuple<doc_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into Hindi.
Language: hi
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/hi/dev")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev/small, with queries and documents translated into Hindi.
Language: hi
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/hi/dev/small")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/train, with queries and documents translated into Hindi.
Language: hi
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/hi/train")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage, with queries and documents translated into Indonesian.
Language: id
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/id")
for doc in dataset.docs_iter():
doc # namedtuple<doc_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into Indonesian.
Language: id
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/id/dev")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev/small, with queries and documents translated into Indonesian.
Language: id
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/id/dev/small")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/train, with queries and documents translated into Indonesian.
Language: id
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/id/train")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage, with queries and documents translated into Italian.
Language: it
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/it")
for doc in dataset.docs_iter():
doc # namedtuple<doc_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into Italian.
Language: it
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/it/dev")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev/small, with queries and documents translated into Italian.
Language: it
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/it/dev/small")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/train, with queries and documents translated into Italian.
Language: it
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/it/train")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage, with queries and documents translated into Japanese.
Language: ja
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/ja")
for doc in dataset.docs_iter():
doc # namedtuple<doc_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into Japanese.
Language: ja
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/ja/dev")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev/small, with queries and documents translated into Japanese.
Language: ja
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/ja/dev/small")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/train, with queries and documents translated into Japanese.
Language: ja
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/ja/train")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage, with queries and documents translated into Portuguese.
Language: pt
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/pt")
for doc in dataset.docs_iter():
doc # namedtuple<doc_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into Portuguese.
Language: pt
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/pt/dev")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev/small, with queries and documents translated into Portuguese.
Language: pt
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/pt/dev/small")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/train, with queries and documents translated into Portuguese.
Language: pt
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/pt/train")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage, with queries and documents translated into Russian.
Language: ru
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/ru")
for doc in dataset.docs_iter():
doc # namedtuple<doc_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into Russian.
Language: ru
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/ru/dev")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev/small, with queries and documents translated into Russian.
Language: ru
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/ru/dev/small")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/train, with queries and documents translated into Russian.
Language: ru
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/ru/train")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage, with queries and documents translated into Vietnamese.
Language: vi
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/vi")
for doc in dataset.docs_iter():
doc # namedtuple<doc_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into Vietnamese.
Language: vi
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/vi/dev")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev/small, with queries and documents translated into Vietnamese.
Language: vi
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/vi/dev/small")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/train, with queries and documents translated into Vietnamese.
Language: vi
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/vi/train")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage, with queries and documents translated into Chinese.
Language: zh
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/zh")
for doc in dataset.docs_iter():
doc # namedtuple<doc_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into Chinese.
Language: zh
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/zh/dev")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev/small, with queries and documents translated into Chinese.
Language: zh
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/zh/dev/small")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/train, with queries and documents translated into Chinese.
Language: zh
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/v2/zh/train")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage, with documents translated into Chinese.
Language: zh
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/zh")
for doc in dataset.docs_iter():
doc # namedtuple<doc_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into Chinese.
Language: zh
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/zh/dev")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev/small, with queries and documents translated into Chinese.
Language: zh
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/zh/dev/small")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into Chinese.
Version 1.1 of this file includes manual corrections from the authorss of the translated files. See discussion here.
Language: zh
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/zh/dev/small/v1.1")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/dev, with queries and documents translated into Chinese.
Version 1.1 of this file includes manual corrections from the authorss of the translated files. See discussion here.
Language: zh
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/zh/dev/v1.1")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.
Version of msmarco-passage/train, with queries and documents translated into Chinese.
Language: zh
Examples:
import ir_datasets
dataset = ir_datasets.load("mmarco/zh/train")
for query in dataset.queries_iter():
query # namedtuple<query_id, text>
You can find more details about the Python API here.