OPUS Publications

If you use OPUS resources, please cite the following publication:

News from OPUS - A Collection of Multilingual Parallel Corpora with Tools and Interfaces

In N. Nicolov and K. Bontcheva and G. Angelova and R. Mitkov (eds.) Recent Advances in Natural Language Processing (vol V), pages 237-248, John Benjamins, Amsterdam/Philadelphia

Jörg Tiedemann, 2011

PDF

OPUS-MT – Building open translation services for the World

In Proceedings of the 22nd Annual Conference of the European Association for Machine Translation

Jörg Tiedemann and Santhosh Thottingal, 2020

PDF

OpusFilter: A Configurable Parallel Corpus Filtering Toolbox

In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations

Mikko Aulamo, Sami Virpioja, Jörg Tiedemann, 2020

PDF

OpusTools and Parallel Corpus Diagnostics

In Proceedings of the 12th Language Resources and Evaluation Conference

Mikko Aulamo, Umut Sulubacak, Sami Virpioja, Jörg Tiedemann, 2020

PDF

OPUS - Parallel Corpora for Everyone

In Baltic Journal of Modern Computing (BJMC), Vol 4, No. 2, Special Issue: Proceedings of the 19th Annual Conference of the European Association for Machine Translation (EAMT), 2016

Jörg Tiedemann, 2016

PDF

Finding Alternative Translations in a Large Corpus of Movie Subtitles

In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC-2016)

Jörg Tiedemann, 2016

PDF

OpenSubtitles2016: Extracting Large Parallel Corpora from Movie and TV Subtitles

In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC-2016)

Pierre Lison and Jörg Tiedemann, 2016

PDF

Parallel Data, Tools and Interfaces in OPUS

In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC'2012)

Jörg Tiedemann, 2012

PDF

Bitext Alignment

Synthesis Lectures on Human Language Technologies, Morgan & Claypool Publishers

Jörg Tiedemann, 2011

PDF

Billions of Parallel Words for Free

In Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC'2014), Reykjavik, Iceland

Raivis Skadiņš, Jörg Tiedemann, Roberts Rozis and Daiga Deksne, 2014

PDF

Synchronizing Translated Movie Subtitles

In Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC'2008)

Jörg Tiedemann, 2008

PDF

Building a Multilingual Parallel Subtitle Corpus

In Proceedings of CLIN 17, Leuven, Belgium, 2007

Jörg Tiedemann, 2007

PDF

Improved Sentence Alignment for Movie Subtitles

In Proceedings of RANLP '07, Borovets, Bulgaria, 2007

Jörg Tiedemann, 2007

PDF

OPUS - an open source parallel corpus

In Proceedings of the 13th Nordic Conference on Computational Linguistics, University of Iceland, Reykjavik, 2003.

Jörg Tiedemann, 2003

PDF

The OPUS corpus - parallel & free

In Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC'04), Lisbon, Portugal, May 26-28.

Jörg Tiedemann, Lars Nygaard, 2004

PDF