Dataset is a collection of tractograms of 20 subjects, randomly selected from the HCP dataset, where artifactual streamlines are labelled as anatomically non plausible. Labelling was performed using several heuristic rules to exclude non plausible pathways according to the current knowledge of brain anatomy.