Download

nemotron-cc-10K-samplev1synSynthetic

Latest releasev1syn
LicenseCC-BY 4.0
Please cite the following article if you use the OPUS packages and downloads in your own work:
J. Tiedemann, 2012, Parallel Data, Tools and Interfaces in OPUS. In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012)

nemotron-cc-10K-sample's Numbers

LanguagesBitextsFilesTokensSentence fragments
37361850000940M31M

Language pairs in nemotron-cc-10K-samplev1syn

Click a bar to select a pair below.

Downloads

Pick a language pair to see available downloads.

Disclaimer

Notice and take down policy

Notice: Should you consider that our data contains material that is owned by you and should therefore not be reproduced here, please:

Take down: We will comply to legitimate requests by removing the affected sources from the next release of the corpus.