ir_datasets
: CodeSearchNetA benchmark for semantic code search. Uses
Language: multiple/other/unknown
Example
import ir_datasets
dataset = ir_datasets.load('codesearchnet')
for doc in dataset.docs_iter():
doc # namedtuple<doc_id, repo, path, func_name, code, language>
Official challenge set, with keyword queries and deep relevance assessments.
Official test set, using queries inferred from docstrings.
Official train set, using queries inferred from docstrings.
Official validation set, using queries inferred from docstrings.