aboutsummaryrefslogtreecommitdiffstats
path: root/dictionaries/cs_wordlist.combined.gz
diff options
context:
space:
mode:
authorJean Chalard <jchalard@google.com>2012-12-18 13:06:48 +0900
committerJean Chalard <jchalard@google.com>2012-12-18 13:06:48 +0900
commit21dbe3701c0a0bbc8281becd818cfcae259bb483 (patch)
tree69490299f32dae609b24646357ca3f7c31aaffb9 /dictionaries/cs_wordlist.combined.gz
parentc5da4365fbe6ff23a8db381ee7de6fa43fd7086b (diff)
downloadlatinime-21dbe3701c0a0bbc8281becd818cfcae259bb483.tar.gz
latinime-21dbe3701c0a0bbc8281becd818cfcae259bb483.tar.xz
latinime-21dbe3701c0a0bbc8281becd818cfcae259bb483.zip
Update dictionaries
cs, da, de, el, es, fi, fr, hr, it, lt, lv, nb, nl, pl, pt_BR, pt_PT, sl, sr, sv, tr : rescale frequencies to match spec. This has no large effect in the practice except the dictionary will become stronger vs spatial model (especially in lower count corpora, like lt, lv, sr) en* : Small changes (rounding going the other way essentially) ru : the above rescaling, and remove the following words: Дре, ОСТа, Планше, легкими, легком, легкому, легкости, легкую, нелегкие, нелегкий, нелегким, нелегкое, нелегкой, нелегкую, полулегком and add нелёгкие, нелёгкое, нелёгкую; other accented forms were already in the dictionary. Change-Id: I40386c2ebd4d2be38874e822bde89db7cb512ae6
Diffstat (limited to 'dictionaries/cs_wordlist.combined.gz')
-rw-r--r--dictionaries/cs_wordlist.combined.gzbin945721 -> 948225 bytes
1 files changed, 0 insertions, 0 deletions
diff --git a/dictionaries/cs_wordlist.combined.gz b/dictionaries/cs_wordlist.combined.gz
index 8cbf2e961..b8d4d60eb 100644
--- a/dictionaries/cs_wordlist.combined.gz
+++ b/dictionaries/cs_wordlist.combined.gz
Binary files differ