path: root/dictionaries/nb_wordlist.combined.gz
author Jean Chalard <jchalard@google.com> 2012-12-18 13:06:48 +0900
committer Jean Chalard <jchalard@google.com> 2012-12-18 13:06:48 +0900
commit 21dbe3701c0a0bbc8281becd818cfcae259bb483
tree 69490299f32dae609b24646357ca3f7c31aaffb9 /dictionaries/nb_wordlist.combined.gz
parent c5da4365fbe6ff23a8db381ee7de6fa43fd7086b
Update dictionaries

cs, da, de, el, es, fi, fr, hr, it, lt, lv, nb, nl, pl, pt_BR, pt_PT, sl, sr, sv, tr : rescale frequencies to match spec. This has no large effect in practice, except that the dictionary will become stronger vs. the spatial model (especially in lower-count corpora, like lt, lv, sr)
en* : Small changes (essentially rounding going the other way)
ru : the above rescaling, and remove the following words:
Дре, ОСТа, Планше, легкими, легком, легкому, легкости, легкую, нелегкие, нелегкий, нелегким, нелегкое, нелегкой, нелегкую, полулегком
and add нелёгкие, нелёгкое, нелёгкую; other accented forms were already in the dictionary.

Change-Id: I40386c2ebd4d2be38874e822bde89db7cb512ae6
Diffstat (limited to 'dictionaries/nb_wordlist.combined.gz')
-rw-r--r-- dictionaries/nb_wordlist.combined.gz bin 964442 -> 964815 bytes
1 files changed, 0 insertions, 0 deletions
diff --git a/dictionaries/nb_wordlist.combined.gz b/dictionaries/nb_wordlist.combined.gz
index 0644fc93a..d0d3d8be8 100644
--- a/dictionaries/nb_wordlist.combined.gz
+++ b/dictionaries/nb_wordlist.combined.gz
Binary files differ