aboutsummaryrefslogtreecommitdiffstats
path: root/dictionaries/ru_wordlist.combined.gz (follow)
Commit message (Expand)AuthorAgeFilesLines
* Update dictionaries•••Bug: 10354668 Bug: 10188528 >>> dictionaries/fr_wordlist.combined.gz Header : date : 1374634549 <=> 1376888819 version : 36 <=> 37 Body : Deleted: color 78 Deleted: men 85 Deleted: o 115 Added: nationaux 120 >>> dictionaries/iw_wordlist.combined.gz Added. New dictionary. >>> dictionaries/pt_BR_wordlist.combined.gz Header : date : 1374634563 <=> 1376884524 version : 36 <=> 37 Body : Deleted: la 152 >>> dictionaries/pt_PT_wordlist.combined.gz Header : date : 1357790930 <=> 1376884536 version : 30 <=> 37 Body : Deleted: la 152 >>> dictionaries/ru_wordlist.combined.gz Header : date : 1372393835 <=> 1376897704 version : 35 <=> 37 Body : Freq changed: говно 68 -> 0 >>> java/res/raw/main_fr.dict Header : date : 1374634549 <=> 1376888819 version : 36 <=> 37 Body : Deleted: color 78 Deleted: men 85 Deleted: o 115 Added: nationaux 120 >>> java/res/raw/main_pt_br.dict Header : date : 1374634563 <=> 1376884524 version : 36 <=> 37 Body : Deleted: la 152 >>> java/res/raw/main_ru.dict Header : date : 1372393835 <=> 1376897704 version : 35 <=> 37 Body : Freq changed: говно 68 -> 0 Change-Id: I87a85571c61068ff46a32d291aa43becbb75598a Jean Chalard2013-08-191-0/+0
* Update dictionaries•••>>> dictionaries/cs_wordlist.combined.gz Header : date : 1355802831 <=> 1372393817 version : 29 <=> 35 Body : Added: LTE 25 >>> dictionaries/de_wordlist.combined.gz Header : date : 1355802835 <=> 1372393817 version : 29 <=> 35 Body : Added: LTE 25 >>> dictionaries/en_GB_wordlist.combined.gz Header : date : 1366272052 <=> 1372393817 version : 31 <=> 35 Body : Deleted: Sea 126 Added: LTE 25 >>> dictionaries/en_US_wordlist.combined.gz Header : date : 1366272093 <=> 1372393817 version : 31 <=> 35 Body : Added: LTE 25 >>> dictionaries/en_wordlist.combined.gz Header : date : 1366272977 <=> 1372393837 version : 31 <=> 35 Body : Deleted: Sea 126 Added: LTE 25 >>> dictionaries/es_wordlist.combined.gz Header : date : 1355802832 <=> 1372393817 version : 29 <=> 35 Body : Added: LTE 25 >>> dictionaries/fr_wordlist.combined.gz Header : date : 1366272255 <=> 1372393818 version : 31 <=> 35 Body : Deleted: R'n'B 95 Deleted: count 60 Deleted: d'Inti 34 Added: beurk 25 >>> dictionaries/hr_wordlist.combined.gz Header : date : 1355802836 <=> 1372393818 version : 29 <=> 35 Body : Added: LTE 25 >>> dictionaries/it_wordlist.combined.gz Header : date : 1355802836 <=> 1372393818 version : 29 <=> 35 Body : Added: LTE 25 >>> dictionaries/lt_wordlist.combined.gz Header : date : 1355802843 <=> 1372393818 version : 29 <=> 35 Body : Added: LTE 25 >>> dictionaries/lv_wordlist.combined.gz Header : date : 1355802843 <=> 1372393818 version : 29 <=> 35 Body : Added: LTE 25 >>> dictionaries/nb_wordlist.combined.gz Header : date : 1366003450 <=> 1372393818 version : 31 <=> 35 Body : Added: LTE 25 >>> dictionaries/nl_wordlist.combined.gz Header : date : 1355802844 <=> 1372393818 version : 29 <=> 35 Body : Added: LTE 25 >>> dictionaries/ru_wordlist.combined.gz Header : date : 1370244430 <=> 1372393835 version : 34 <=> 35 Body : Freq changed: связывание 93 -> 0 >>> dictionaries/sl_wordlist.combined.gz Header : date : 1355802835 <=> 1372393835 version : 29 <=> 35 Body : Added: LTE 25 >>> dictionaries/sr_wordlist.combined.gz Header : date : 1355802853 <=> 1372393835 version : 29 <=> 35 Body : Added: LTE 25 >>> dictionaries/sv_wordlist.combined.gz Header : date : 1366003804 <=> 1372393836 version : 31 <=> 35 Body : Added: LTE 25 >>> dictionaries/tr_wordlist.combined.gz Header : date : 1355802858 <=> 1372393837 version : 29 <=> 35 Body : Added: LTE 25 >>> java/res/raw/main_de.dict Header : date : 1355802835 <=> 1372393817 version : 29 <=> 35 Body : Added: LTE 25 >>> java/res/raw/main_en.dict Header : date : 1366272977 <=> 1372393837 version : 31 <=> 35 Body : Deleted: Sea 126 Added: LTE 25 >>> java/res/raw/main_es.dict Header : date : 1355802832 <=> 1372393817 version : 29 <=> 35 Body : Added: LTE 25 >>> java/res/raw/main_fr.dict Header : date : 1366272255 <=> 1372393818 version : 31 <=> 35 Body : Deleted: R'n'B 95 Deleted: count 60 Deleted: d'Inti 34 Added: beurk 25 >>> java/res/raw/main_it.dict Header : date : 1355802836 <=> 1372393818 version : 29 <=> 35 Body : Added: LTE 25 >>> java/res/raw/main_ru.dict Header : date : 1370244430 <=> 1372393835 version : 34 <=> 35 Body : Freq changed: связывание 93 -> 0 Bug: 9301610 Bug: 9607966 Change-Id: I1117ed85d97fbb0ee50f11bc31776f1970b56f12 Jean Chalard2013-06-281-0/+0
* Update dictionaries•••>>> dictionaries/ru_wordlist.combined.gz Header : date : 1366974711 <=> 1370244430 MULTIPLE_WORDS_DEMOTION_RATE : 0 <=> 50 version : 32 <=> 34 Body : Deleted: МДА 2 Freq changed: а 0 -> 60 Freq changed: в 0 -> 60 Deleted: возбужденные 0 Freq changed: гей 92 -> 0 Freq changed: жид 80 -> 0 Freq changed: зареган 0 -> 50 Freq changed: и 0 -> 60 Freq changed: к 0 -> 60 Deleted: клевом 0 Freq changed: куи 29 -> 0 Freq changed: лох 69 -> 0 Freq changed: о 0 -> 60 Freq changed: ребут 0 -> 50 Freq changed: с 0 -> 60 Freq changed: у 0 -> 60 Freq changed: хуй 77 -> 0 Freq changed: хукера 38 -> 0 Freq changed: широко 0 -> 144 Deleted: щеткой 70 Freq changed: щёткой 69 -> 70 Freq changed: я 0 -> 60 Added: жены 134 Added: звони 100 Added: клёвом 50 Added: мда 0 >>> java/res/raw/main_ru.dict Header : date : 1366974711 <=> 1370244430 version : 32 <=> 34 MULTIPLE_WORDS_DEMOTION_RATE : 0 <=> 50 Body : (same changes) Change-Id: Ie10bdd1f33cac43c5be35e99faef7cfdfe877d2b Jean Chalard2013-06-031-0/+0
* Update dictionaries•••>>> dictionaries/ru_wordlist.combined.gz Header : date : 1366957492 <=> 1366974711 Body : Added: ложись 100 Added: под 100 Added: посмотрю 100 Added: угу 100 Added: ух 100 >>> java/res/raw/main_ru.dict Header : date : 1366957492 <=> 1366974711 Body : Added: ложись 100 Added: под 100 Added: посмотрю 100 Added: угу 100 Added: ух 100 Change-Id: Ida39ea2cf25cd291554f3b2f3ce31f57dca24113 Jean Chalard2013-04-261-0/+0
* Update dictionaries•••Full diff too long: truncated Summary diff >>> dictionaries/ru_wordlist.combined.gz Header : date : 1366277083 <=> 1366957492 version : 31 <=> 32 Contents : - Reinstate 2- and 3- letter words that were demoted to avoid bad space insertion (343 entries) - Add missing words as per b/6341908 and b/5674314 (98 entries) This has zero effect on the regression tests Bug: 6341908 Bug: 5674314 Change-Id: Ifce268a7eab5edd264d963489187e975017f8b72 Jean Chalard2013-04-261-0/+0
* Update dictionaries•••>>> dictionaries/en_GB_wordlist.combined.gz Header : date : 1366021966 <=> 1366272052 Body : Added: yt 0 >>> dictionaries/en_US_wordlist.combined.gz Header : date : 1366021978 <=> 1366272093 Body : Added: yt 0 >>> dictionaries/en_wordlist.combined.gz Header : date : 1366021987 <=> 1366272977 Body : Added: yt 0 >>> dictionaries/fr_wordlist.combined.gz Header : date : 1366003217 <=> 1366272255 Body : Freq changed: cash 80 -> 20 >>> dictionaries/ru_wordlist.combined.gz Header : date : 1366003693 <=> 1366277083 Body : Deleted: толщ 76 >>> java/res/raw/main_en.dict Header : date : 1366021987 <=> 1366272977 Body : Added: yt 0 >>> java/res/raw/main_fr.dict Header : date : 1366003217 <=> 1366272255 Body : Freq changed: cash 80 -> 20 >>> java/res/raw/main_ru.dict Header : date : 1366003693 <=> 1366277083 Body : Deleted: толщ 76 Bug: 8635822 Change-Id: I44dc73bd010b125c994387894847a008276d69f7 Jean Chalard2013-04-181-0/+0
* Update dictionaries•••>>> dictionaries/en_GB_wordlist.combined.gz Header : date : 1355802832 <=> 1366003032 version : 29 <=> 31 Body : Deleted: HTTP 95 Deleted: WWW 72 Added: mm 135 >>> dictionaries/en_US_wordlist.combined.gz Header : date : 1355112451 <=> 1366003070 version : 28 <=> 31 Body : Deleted: HTTP 95 Deleted: WWW 71 Added: mm 135 >>> dictionaries/en_wordlist.combined.gz Header : date : 1355802851 <=> 1366003861 version : 29 <=> 31 Body : Deleted: HTTP 95 Deleted: WWW 71 Added: mm 135 >>> dictionaries/fr_wordlist.combined.gz Header : date : 1357617878 <=> 1366003217 version : 29 <=> 31 Body : Not a word: re false -> true Shortcut added: re le 15 >>> dictionaries/nb_wordlist.combined.gz Header : date : 1355802836 <=> 1366003450 version : 29 <=> 31 Body : Freq changed: iPhone 91 -> 30 Added: app 30 >>> dictionaries/ru_wordlist.combined.gz Header : date : 1358763720 <=> 1366003693 version : 30 <=> 31 Body : Freq changed: за 140 -> 181 Freq changed: не 140 -> 191 Freq changed: про 131 -> 151 Freq changed: эры 125 -> 140 >>> dictionaries/sv_wordlist.combined.gz Header : date : 1355802856 <=> 1366003804 version : 29 <=> 31 Body : Added: vi 180 >>> java/res/raw/main_en.dict Header : date : 1355802851 <=> 1366003861 version : 29 <=> 31 Body : Deleted: HTTP 95 Deleted: WWW 71 Added: mm 135 >>> java/res/raw/main_fr.dict Header : date : 1357617878 <=> 1366003217 version : 29 <=> 31 Body : Not a word: re false -> true Shortcut added: re le 15 >>> java/res/raw/main_ru.dict Header : date : 1358763720 <=> 1366003693 version : 30 <=> 31 Body : Freq changed: за 140 -> 181 Freq changed: не 140 -> 191 Freq changed: про 131 -> 151 Freq changed: эры 125 -> 140 Bug: 8560415 Bug: 7556679 Change-Id: If1c628edcb1cc5efd67e1715acf94f19c0eb4643 Jean Chalard2013-04-151-0/+0
* Update the Russian dictionary•••The point is to get as close as possible to having the golden Russian tests pass. >>> dictionaries/ru_wordlist.combined.gz Header : date : 1355818916 <=> 1358763720 version : 29 <=> 30 Body : Deleted: НКТ 14 Freq changed: без 0 -> 140 Freq changed: бонус 94 -> 130 Freq changed: за 0 -> 140 Freq changed: на 0 -> 180 Freq changed: не 0 -> 140 Freq changed: парка 133 -> 110 Freq changed: про 0 -> 131 Freq changed: ручьи 93 -> 80 Freq changed: ура 86 -> 100 Freq changed: юрты 86 -> 60 Added: вечерком 100 Added: задачки 100 Added: сорри 100 Added: узнай 100 Added: учти 100 >>> java/res/raw/main_ru.dict All the same above changes Change-Id: I8685c34d9ab1dcbf8ae8e23d2e26380059684c95 Jean Chalard2013-01-211-0/+0
* Update dictionaries•••>>> dictionaries/ru_wordlist.combined.gz Header : date : 1355802857 <=> 1355818916 Body : Freq changed: БД 18 -> 0 Freq changed: ГБ 14 -> 0 Freq changed: ЕС 44 -> 0 Freq changed: ЖД 3 -> 0 Freq changed: ЖЖ 8 -> 0 Freq changed: ЖК 3 -> 0 Freq changed: ИИ 21 -> 0 Freq changed: КБ 37 -> 0 Freq changed: МБ 19 -> 0 Freq changed: МО 26 -> 0 Freq changed: ОС 40 -> 0 Freq changed: РФ 65 -> 0 Freq changed: СБ 21 -> 0 Freq changed: СК 23 -> 0 Freq changed: ТВ 37 -> 0 Freq changed: УК 36 -> 0 Freq changed: ЦБ 11 -> 0 Freq changed: ЦК 59 -> 0 Deleted: бэ 0 Freq changed: дБ 92 -> 0 Deleted: йо 0 Freq changed: мм 149 -> 0 Freq changed: рН 104 -> 0 Deleted: ша 0 >>> java/res/raw/main_ru.dict Header : date : 1355802857 <=> 1355818916 Body : Freq changed: БД 18 -> 0 Freq changed: ГБ 14 -> 0 Freq changed: ЕС 44 -> 0 Freq changed: ЖД 3 -> 0 Freq changed: ЖЖ 8 -> 0 Freq changed: ЖК 3 -> 0 Freq changed: ИИ 21 -> 0 Freq changed: КБ 37 -> 0 Freq changed: МБ 19 -> 0 Freq changed: МО 26 -> 0 Freq changed: ОС 40 -> 0 Freq changed: РФ 65 -> 0 Freq changed: СБ 21 -> 0 Freq changed: СК 23 -> 0 Freq changed: ТВ 37 -> 0 Freq changed: УК 36 -> 0 Freq changed: ЦБ 11 -> 0 Freq changed: ЦК 59 -> 0 Deleted: бэ 0 Freq changed: дБ 92 -> 0 Deleted: йо 0 Freq changed: мм 149 -> 0 Freq changed: рН 104 -> 0 Deleted: ша 0 Change-Id: I03f0f4e8d03e0f77f5879e6dd5c424673466afca Jean Chalard2012-12-181-0/+0
* Update dictionaries•••cs, da, de, el, es, fi, fr, hr, it, lt, lv, nb, nl, pl, pt_BR, pt_PT, sl, sr, sv, tr : rescale frequencies to match spec. This has no large effect in the practice except the dictionary will become stronger vs spatial model (especially in lower count corpora, like lt, lv, sr) en* : Small changes (rounding going the other way essentially) ru : the above rescaling, and remove the following words: Дре, ОСТа, Планше, легкими, легком, легкому, легкости, легкую, нелегкие, нелегкий, нелегким, нелегкое, нелегкой, нелегкую, полулегком and add нелёгкие, нелёгкое, нелёгкую; other accented forms were already in the dictionary. Change-Id: I40386c2ebd4d2be38874e822bde89db7cb512ae6 Jean Chalard2012-12-181-0/+0
* Update dictionaries•••>>> dictionaries/en_GB_wordlist.combined.gz Header : date : 1353500789 <=> 1354870724 Body : Added: Dad 75 Added: Daddy 60 Added: Grandma 60 Added: Grandpa 55 Added: Mama 59 Added: Mom 77 Added: Papa 55 >>> dictionaries/en_US_wordlist.combined.gz Header : date : 1351675958 <=> 1354870736 version : 26 <=> 27 Body : Deleted: Rod's 46 Added: Dad 75 Added: Daddy 60 Added: Grandma 60 Added: Grandpa 55 Added: Mama 59 Added: Mom 77 Added: Papa 55 >>> dictionaries/en_wordlist.combined.gz Header : date : 1353500998 <=> 1354870744 Body : Deleted: Rod's 46 Added: Dad 75 Added: Daddy 60 Added: Grandma 60 Added: Grandpa 55 Added: Mama 59 Added: Mom 77 Added: Papa 55 >>> dictionaries/fr_wordlist.combined.gz Header : date : 1353500832 <=> 1354872988 Body : Deleted: noël 71 Deleted: po 73 Deleted: ti 73 Added: Noël 71 Added: lose 1 Added: y'a 130 >>> dictionaries/ru_wordlist.combined.gz Header : date : 1353567943 <=> 1354870130 Body : Demote all CAPS words by 80 Freq changed: модно 51 -> 20 >>> java/res/raw/main_en.dict Header : date : 1353500998 <=> 1354870744 Body : Deleted: Rod's 46 Added: Dad 75 Added: Daddy 60 Added: Grandma 60 Added: Grandpa 55 Added: Mama 59 Added: Mom 77 Added: Papa 55 >>> java/res/raw/main_fr.dict Header : date : 1353500832 <=> 1354872988 Body : Deleted: noël 71 Deleted: po 73 Deleted: ti 73 Added: Noël 71 Added: lose 1 Added: y'a 130 >>> java/res/raw/main_ru.dict Header : date : 1353567943 <=> 1354870130 Body : Demote all CAPS words by 80 Freq changed: модно 51 -> 20 Change-Id: I6f2d1c359d716535923b22c33d7fa4c3b0a330e4 Jean Chalard2012-12-071-0/+0
* Update RU dictionary header.•••>>> dictionaries/ru_wordlist.combined.gz >>> java/res/raw/main_ru.dict Header : date : 1353500945 <=> 1353567943 MULTIPLE_WORDS_DEMOTION_RATE : null <=> 0 Body : No differences Bug: 7540132 Change-Id: I837831b1e214da64962cf1bb68c840a3d4e6bf76 Jean Chalard2012-11-221-0/+0
* Update dictionaries and fix mistakes•••- Combined de dict : Remove digraph shortcuts that were in by mistake. - Combined en dict : Set freq of "baton" "batons" "mace" "puff" "puffs" and "tasers" to zero. They are offensive in en_GB. - Combined en_GB dict : Change freq of "il" to 0 and flag it "not a word". Still in the dict as a whitelist entry for "I'll"; for some reason it had freq 99. Add "milk:122" and "practice:143" - Combined fr dict : Add missing words : "Nostradamus:40" "défendais:30" "gmail:50" "générale:140" "hm:0" "hmm:0" "y'en:130" "l'apocalypse:31" "m'épuise:30" "recontacter:80" "t'annonce:30" Set freq of non-word shortcuts for digraphs to 1 instead of 0, allowing to gesture them. - Combined ru dict : Remove a lot of two-character non-words. - Binary de dict : Remove the obsolete "options" header, and add the "dictionary" header. - Binary en dict : Flag "hoe" "hoes" "il" "shel" as non-words. Also drop freq of "il" and "shel" to 0 Add the "locale" header that was missing. - Binary es dict : Add the "dictionary" header. - Binary fr dict : Add the same words as above. Non-word shortcuts were already set to 1. - Binary it dict : Add a "dictionary" header. Also change freq of "Šarapova" from 50 to 37; not sure why it was 50. - Binary pt_BR dict : Add a "dictionary" header. - Binary ru dict : Add a "dictionary" header and remove the same words as above. For all dictionaries : bump the version to 27. Change-Id: I94fe7f8f42b31fdad223085c00a94115e14d2276 Jean Chalard2012-11-211-0/+0
* Switch the AOSP word lists to the combined format.•••This will help with managing the word lists. Bug: 7388859 Change-Id: I89f049569b177d3027fe56d6c67eaca27d44dc7d Jean Chalard2012-10-311-0/+0