Commit message (Expand) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Add words to Portuguese•••>>> dictionaries/pt_BR_wordlist.combined.gz Header : date : 1355802839 <=> 1357790917 version : 29 <=> 30 Body : Added: à 30 Added: é 30 Added: ò 30 Added: ô 30 >>> dictionaries/pt_PT_wordlist.combined.gz Header : date : 1355802856 <=> 1357790930 version : 29 <=> 30 Body : Added: à 30 Added: é 30 Added: ò 30 Added: ô 30 >>> java/res/raw/main_pt_br.dict Header : date : 1355802839 <=> 1357790917 version : 29 <=> 30 Body : Added: à 30 Added: é 30 Added: ò 30 Added: ô 30 Bug: 7966948 Change-Id: I71c0986cf616d67926d0a6a0e53099b04b0427d5 | 2013-01-10 | 1 | -0/+0 | |
* | Update dictionaries•••cs, da, de, el, es, fi, fr, hr, it, lt, lv, nb, nl, pl, pt_BR, pt_PT, sl, sr, sv, tr : rescale frequencies to match spec. This has no large effect in the practice except the dictionary will become stronger vs spatial model (especially in lower count corpora, like lt, lv, sr) en* : Small changes (rounding going the other way essentially) ru : the above rescaling, and remove the following words: Дре, ОСТа, Планше, легкими, легком, легкому, легкости, легкую, нелегкие, нелегкий, нелегким, нелегкое, нелегкой, нелегкую, полулегком and add нелёгкие, нелёгкое, нелёгкую; other accented forms were already in the dictionary. Change-Id: I40386c2ebd4d2be38874e822bde89db7cb512ae6 | 2012-12-18 | 1 | -0/+0 | |
* | Update dictionaries and fix mistakes•••- Combined de dict : Remove digraph shortcuts that were in by mistake. - Combined en dict : Set freq of "baton" "batons" "mace" "puff" "puffs" and "tasers" to zero. They are offensive in en_GB. - Combined en_GB dict : Change freq of "il" to 0 and flag it "not a word". Still in the dict as a whitelist entry for "I'll"; for some reason it had freq 99. Add "milk:122" and "practice:143" - Combined fr dict : Add missing words : "Nostradamus:40" "défendais:30" "gmail:50" "générale:140" "hm:0" "hmm:0" "y'en:130" "l'apocalypse:31" "m'épuise:30" "recontacter:80" "t'annonce:30" Set freq of non-word shortcuts for digraphs to 1 instead of 0, allowing to gesture them. - Combined ru dict : Remove a lot of two-character non-words. - Binary de dict : Remove the obsolete "options" header, and add the "dictionary" header. - Binary en dict : Flag "hoe" "hoes" "il" "shel" as non-words. Also drop freq of "il" and "shel" to 0 Add the "locale" header that was missing. - Binary es dict : Add the "dictionary" header. - Binary fr dict : Add the same words as above. Non-word shortcuts were already set to 1. - Binary it dict : Add a "dictionary" header. Also change freq of "Šarapova" from 50 to 37; not sure why it was 50. - Binary pt_BR dict : Add a "dictionary" header. - Binary ru dict : Add a "dictionary" header and remove the same words as above. For all dictionaries : bump the version to 27. Change-Id: I94fe7f8f42b31fdad223085c00a94115e14d2276 | 2012-11-21 | 1 | -0/+0 | |
* | Use all Lexiteria sources and update existing directories.•••New dictionaries : - Danish - Greek - Finnish - Lithuanian - Latvian - Dutch - Polish - Russian - Slovene - Serbian - Swedish - Turkish Also, compress those files to reduce the footprint in the repository. Also, update and improve English and French dictionaries, and add the ligatures shortcut into the French dictionary. Finally, move the Russian binary dictionary here now that it can at last be open sourced. Bug: 5587752 Bug: 6775251 Bug: 6995793 Bug: 7149666 Change-Id: Iec9831d4dce425a2b5b0657571e4448436610525 | 2012-09-21 | 1 | -0/+0 | |
* | Fix the name of the resource to lower case•••Change-Id: Icbacf10702de20ef1a60d2648ee6440812d13f1d | 2012-05-25 | 1 | -0/+0 |