
OPUS - an open source parallel corpus

Tools for processing OPUS corpora

Using OPUS corpora with Uplug is straightforward. Here is a small selection of simple tools for processing parallel corpora from OPUS:

Tools used for building OPUS

The following tools have been used for pre-processing, annotation, and alignment (not including standard GNU tools):

The following tools are used for data management: