wiki:Tools/OpusPt2dice

opus-pt2dice

Extract a rough bilingual dictionary with Dice scores and co-occurrence frequencies from a Moses-style phrase-table.

USAGE

zcat phrase-table.gz | opus-pt2dice [OPTIONS] > dictionary.txt

OPTIONS

 -d <thr> .... set Dice threshold (default = 0.01)
 -f <thr> .... set coccurrence threshold (default = 1)
 -s .......... disallow source language MWEs
 -t .......... disallow target language MWEs
 -m .......... disallow if both languages are MWEs
 -l .......... only lower-cased letter sequences
 -i .......... allow translations that are included in other options
Last modified 3 years ago Last modified on Nov 16, 2017, 8:29:09 PM