diff options
author | 2012-12-18 13:06:48 +0900 | |
---|---|---|
committer | 2012-12-18 13:06:48 +0900 | |
commit | 21dbe3701c0a0bbc8281becd818cfcae259bb483 (patch) | |
tree | 69490299f32dae609b24646357ca3f7c31aaffb9 /dictionaries/cs_wordlist.combined.gz | |
parent | c5da4365fbe6ff23a8db381ee7de6fa43fd7086b (diff) | |
download | latinime-21dbe3701c0a0bbc8281becd818cfcae259bb483.tar.gz latinime-21dbe3701c0a0bbc8281becd818cfcae259bb483.tar.xz latinime-21dbe3701c0a0bbc8281becd818cfcae259bb483.zip |
Update dictionaries
cs, da, de, el, es, fi, fr, hr, it, lt, lv, nb, nl, pl,
pt_BR, pt_PT, sl, sr, sv, tr : rescale frequencies to match
spec. This has no large effect in the practice except the
dictionary will become stronger vs spatial model (especially in
lower count corpora, like lt, lv, sr)
en* : Small changes (rounding going the other way essentially)
ru : the above rescaling, and remove the following words:
Дре, ОСТа, Планше, легкими, легком, легкому, легкости,
легкую, нелегкие, нелегкий, нелегким, нелегкое, нелегкой,
нелегкую, полулегком and add нелёгкие, нелёгкое, нелёгкую;
other accented forms were already in the dictionary.
Change-Id: I40386c2ebd4d2be38874e822bde89db7cb512ae6
Diffstat (limited to 'dictionaries/cs_wordlist.combined.gz')
-rw-r--r-- | dictionaries/cs_wordlist.combined.gz | bin | 945721 -> 948225 bytes |
1 files changed, 0 insertions, 0 deletions
diff --git a/dictionaries/cs_wordlist.combined.gz b/dictionaries/cs_wordlist.combined.gz Binary files differindex 8cbf2e961..b8d4d60eb 100644 --- a/dictionaries/cs_wordlist.combined.gz +++ b/dictionaries/cs_wordlist.combined.gz |