opus-udpipe

Parse data in OPUS XML using UDPipe. Requires a proper installation of UDPipe and its Perl interface.

SYNOPSIS

        # run OPUS raw data through English UDPipe
        opus-udpipe -l en < raw/english.xml > parsed.xml

OPTIONS

        -l <langid> ......... language ID (ISO 639-1)
        -m <modeldir> ....... path to udpipe models
        -v <version> ........ model version
        -D .................. print model dir (and stop)
        -L .................. list supported languages
        -M .................. list UDPipe models
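As a rough sketch of how these options might be combined, the script below first queries the installation and then parses one file with an explicit model directory. It uses only the flags listed above. The model directory `models/udpipe` and the variable names are hypothetical placeholders, and it is an assumption that `-D`, `-L` and `-M` can be called without `-l`.

```shell
#!/bin/sh
# Sketch only: relies solely on the flags documented in OPTIONS.
# MODELDIR is a hypothetical path; adjust it to your own setup.

LANGID="en"                  # ISO 639-1 code passed to -l
MODELDIR="models/udpipe"     # hypothetical model directory passed to -m
INPUT="raw/english.xml"      # input file from the SYNOPSIS example
OUTPUT="parsed.xml"          # output file from the SYNOPSIS example

# Assemble the parse command as a string so it can be inspected before running.
PARSE_CMD="opus-udpipe -l $LANGID -m $MODELDIR"

if command -v opus-udpipe >/dev/null 2>&1; then
    opus-udpipe -D    # print the model directory (and stop)
    opus-udpipe -L    # list supported languages
    opus-udpipe -M    # list UDPipe models
    # Parse only if the input file is actually readable.
    if [ -r "$INPUT" ]; then
        $PARSE_CMD < "$INPUT" > "$OUTPUT"
    fi
else
    # Tool not installed: report what would have been run.
    echo "opus-udpipe not found in PATH; would run: $PARSE_CMD < $INPUT > $OUTPUT"
fi
```

Running `-D` and `-M` before parsing is a cheap way to check that the models are where the tool expects them. The `-v <version>` option is left out here because the expected version string is not documented on this page.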

TODO

Handle markup inside sentences. Currently, all sentence-internal markup is ignored and is not printed to the parsed output.

Last modified on Nov 16, 2017, 8:35:04 PM