path: root/dictionaries/pt_PT_wordlist.combined.gz
Commit message | Author | Age | Files | Lines
* Update dictionaries (possibly_offensive flag)
  Encode possibly offensive words with their correct frequency and the possibly_offensive flag set. Keep encoding with zero frequency only distracters and words that should never come up.
  https://paste.googleplex.com/5167060875214848
  Bug: 11031090
  Change-Id: Ia394b1827f292ff8d4791cc2f3e6e50b5aff4cbe
  Adrian Velicu | 2014-10-31 | 1 | -0/+0
* Update all dicts to version 44.
  Bug: 13164302
  Change-Id: I8dc1a839c7dcfaa08a53e26cb6600e9f871447ce
  Jean Chalard | 2014-02-24 | 1 | -0/+0
* Update dictionaries
  Add KitKat to all dictionaries.
  Version
    da, fi, pl : 29 → 40
    cs, de, hr, it, lt, lv, nb, nl, sl, sr, sv, tr : 35 → 40
    es : 36 → 40
    en_gb, en_us, en, fr, pt_br, pt_pt : 39 → 40
  Bug: 10958192
  Change-Id: I14436616285ced5eb3b70b8c44b9243da94eed4f
  Jean Chalard | 2013-09-30 | 1 | -0/+0
* Update dictionaries
  >>> dictionaries/en_GB_wordlist.combined.gz
  Header :
    date : 1374721653 <=> 1380099152
    version : 36 <=> 39
  Body :
    Freq changed: gay 127 -> 10
    Added: draft 138
  >>> dictionaries/en_US_wordlist.combined.gz
  Header :
    date : 1374721654 <=> 1380099152
    version : 36 <=> 39
  Body :
    Freq changed: gay 127 -> 10
  >>> dictionaries/en_wordlist.combined.gz
  Header :
    date : 1374721663 <=> 1380099172
    version : 36 <=> 39
  Body :
    Freq changed: gay 127 -> 10
  >>> dictionaries/fr_wordlist.combined.gz
  Header :
    date : 1376888819 <=> 1380099153
    version : 37 <=> 39
  Body :
    Added: septembre 150
  >>> dictionaries/pt_BR_wordlist.combined.gz
  Header :
    date : 1376884524 <=> 1380099168
    version : 37 <=> 39
  Body :
    Freq changed: atras 87 -> 0
    Not a word: atras false -> true
    Shortcut added: atras atrás 15
    Shortcut added: cade cadê 15
    Shortcut added: cafe café 15
    Shortcut added: ferias férias 15
    Shortcut added: musica música 15
    Shortcut added: musicas músicas 15
  >>> dictionaries/pt_PT_wordlist.combined.gz
  Header :
    date : 1376884536 <=> 1380099168
    version : 37 <=> 39
  Body :
    Shortcut added: atras atrás 15
    Shortcut added: cade cadê 15
    Shortcut added: ferias férias 15
    Shortcut added: musica música 15
    Shortcut added: musicas músicas 15
    Added: cafe 0
  >>> java/res/raw/main_en.dict
  Header :
    date : 1374721663 <=> 1380099172
    version : 36 <=> 39
  Body :
    Freq changed: gay 127 -> 10
  >>> java/res/raw/main_fr.dict
  Header :
    date : 1376888819 <=> 1380099153
    version : 37 <=> 39
  Body :
    Added: septembre 150
  >>> java/res/raw/main_pt_br.dict
  Header :
    date : 1376884524 <=> 1380099168
    version : 37 <=> 39
  Body :
    Freq changed: atras 87 -> 0
    Not a word: atras false -> true
    Shortcut added: atras atrás 15
    Shortcut added: cade cadê 15
    Shortcut added: cafe café 15
    Shortcut added: ferias férias 15
    Shortcut added: musica música 15
    Shortcut added: musicas músicas 15
  Bug: 10504313
  Bug: 10507536
  Bug: 10561100
  Change-Id: I4267c76cf0de221a703523d5f2dd2befbaf020a0
  Jean Chalard | 2013-09-26 | 1 | -0/+0
* Update dictionaries
  Bug: 10354668
  Bug: 10188528
  >>> dictionaries/fr_wordlist.combined.gz
  Header :
    date : 1374634549 <=> 1376888819
    version : 36 <=> 37
  Body :
    Deleted: color 78
    Deleted: men 85
    Deleted: o 115
    Added: nationaux 120
  >>> dictionaries/iw_wordlist.combined.gz
  Added. New dictionary.
  >>> dictionaries/pt_BR_wordlist.combined.gz
  Header :
    date : 1374634563 <=> 1376884524
    version : 36 <=> 37
  Body :
    Deleted: la 152
  >>> dictionaries/pt_PT_wordlist.combined.gz
  Header :
    date : 1357790930 <=> 1376884536
    version : 30 <=> 37
  Body :
    Deleted: la 152
  >>> dictionaries/ru_wordlist.combined.gz
  Header :
    date : 1372393835 <=> 1376897704
    version : 35 <=> 37
  Body :
    Freq changed: говно 68 -> 0
  >>> java/res/raw/main_fr.dict
  Header :
    date : 1374634549 <=> 1376888819
    version : 36 <=> 37
  Body :
    Deleted: color 78
    Deleted: men 85
    Deleted: o 115
    Added: nationaux 120
  >>> java/res/raw/main_pt_br.dict
  Header :
    date : 1374634563 <=> 1376884524
    version : 36 <=> 37
  Body :
    Deleted: la 152
  >>> java/res/raw/main_ru.dict
  Header :
    date : 1372393835 <=> 1376897704
    version : 35 <=> 37
  Body :
    Freq changed: говно 68 -> 0
  Change-Id: I87a85571c61068ff46a32d291aa43becbb75598a
  Jean Chalard | 2013-08-19 | 1 | -0/+0
* Add words to Portuguese
  >>> dictionaries/pt_BR_wordlist.combined.gz
  Header :
    date : 1355802839 <=> 1357790917
    version : 29 <=> 30
  Body :
    Added: à 30
    Added: é 30
    Added: ò 30
    Added: ô 30
  >>> dictionaries/pt_PT_wordlist.combined.gz
  Header :
    date : 1355802856 <=> 1357790930
    version : 29 <=> 30
  Body :
    Added: à 30
    Added: é 30
    Added: ò 30
    Added: ô 30
  >>> java/res/raw/main_pt_br.dict
  Header :
    date : 1355802839 <=> 1357790917
    version : 29 <=> 30
  Body :
    Added: à 30
    Added: é 30
    Added: ò 30
    Added: ô 30
  Bug: 7966948
  Change-Id: I71c0986cf616d67926d0a6a0e53099b04b0427d5
  Jean Chalard | 2013-01-10 | 1 | -0/+0
* Update dictionaries
  cs, da, de, el, es, fi, fr, hr, it, lt, lv, nb, nl, pl, pt_BR, pt_PT, sl, sr, sv, tr : rescale frequencies to match spec. This has no large effect in practice, except that the dictionary becomes stronger relative to the spatial model (especially in lower-count corpora, like lt, lv, sr).
  en* : Small changes (essentially rounding going the other way).
  ru : the above rescaling, and remove the following words: Дре, ОСТа, Планше, легкими, легком, легкому, легкости, легкую, нелегкие, нелегкий, нелегким, нелегкое, нелегкой, нелегкую, полулегком and add нелёгкие, нелёгкое, нелёгкую; other accented forms were already in the dictionary.
  Change-Id: I40386c2ebd4d2be38874e822bde89db7cb512ae6
  Jean Chalard | 2012-12-18 | 1 | -0/+0
* Update dictionaries
  >>> dictionaries/en_GB_wordlist.combined.gz
  Header :
    date : 1354870724 <=> 1355112440
    version : 27 <=> 28
  Body :
    Deleted: DoCoMo 65
    Added: Docomo 65
    Added: KDDI 25
    Added: Softbank 25
  >>> dictionaries/en_US_wordlist.combined.gz
  Header :
    date : 1354870736 <=> 1355112451
    version : 27 <=> 28
  Body :
    Deleted: DoCoMo 65
    Added: Docomo 65
    Added: KDDI 25
    Added: Softbank 25
  >>> dictionaries/en_wordlist.combined.gz
  Header :
    date : 1354870744 <=> 1355112460
    version : 27 <=> 28
  Body :
    Deleted: DoCoMo 65
    Added: Docomo 65
    Added: KDDI 25
    Added: Softbank 25
  >>> dictionaries/es_wordlist.combined.gz
  Header :
    date : 1351676002 <=> 1355117676
    version : 26 <=> 28
  Body :
    Deleted: DoCoMo 40
    Added: Docomo 40
    Added: KDDI 25
    Added: Softbank 25
  >>> dictionaries/fi_wordlist.combined.gz
  Header :
    date : 1351676054 <=> 1355117691
    version : 26 <=> 28
  Body :
    Deleted: DoCoMo 28
    Added: Docomo 28
    Added: KDDI 25
    Added: Softbank 25
  >>> dictionaries/fr_wordlist.combined.gz
  Header :
    date : 1354872988 <=> 1355117708
    version : 27 <=> 28
  Body :
    Deleted: DoCoMo 52
    Added: Docomo 52
    Added: KDDI 25
    Added: Softbank 25
  >>> dictionaries/pt_PT_wordlist.combined.gz
  Header :
    date : 1351676510 <=> 1355117723
    version : 26 <=> 28
  Body :
    Deleted: DoCoMo 48
    Added: Docomo 48
    Added: Softbank 25
  >>> java/res/raw/main_en.dict
  Header :
    date : 1354870744 <=> 1355112460
    version : 27 <=> 28
  Body :
    Deleted: DoCoMo 65
    Added: Docomo 65
    Added: KDDI 25
    Added: Softbank 25
  >>> java/res/raw/main_es.dict
  Header :
    date : 1353500806 <=> 1355117676
    version : 27 <=> 28
  Body :
    Deleted: DoCoMo 40
    Added: Docomo 40
    Added: KDDI 25
    Added: Softbank 25
  >>> java/res/raw/main_fr.dict
  Header :
    date : 1354872988 <=> 1355117708
    version : 27 <=> 28
  Body :
    Deleted: DoCoMo 52
    Added: Docomo 52
    Added: KDDI 25
    Added: Softbank 25
  Change-Id: I3801cbe4535407f55ede8db327674d493a92d1ae
  Jean Chalard | 2012-12-10 | 1 | -0/+0
* Switch the AOSP word lists to the combined format.
  This will help with managing the word lists.
  Bug: 7388859
  Change-Id: I89f049569b177d3027fe56d6c67eaca27d44dc7d
  Jean Chalard | 2012-10-31 | 1 | -0/+0
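
Several of the commit messages above embed per-file summaries of the form `>>> <path> Header : date : <old> <=> <new> version : <old> <=> <new> Body : ...`. The following is a minimal sketch of how such a summary might be parsed, not a tool from the repository. It assumes the `date` fields are Unix timestamps in seconds (the values are consistent with the commit dates, but the log does not say so explicitly); the names `HEADER_RE` and `parse_headers` are hypothetical.

```python
import re
from datetime import datetime, timezone

# Assumed layout, inferred from the commit messages in this log:
#   >>> <path> Header : date : <old> <=> <new> version : <old> <=> <new> Body : ...
# \s+ and \s* are used so both the one-line and the multi-line forms match.
HEADER_RE = re.compile(
    r">>>\s+(?P<path>\S+)\s+Header\s*:\s*"
    r"date\s*:\s*(?P<old_date>\d+)\s*<=>\s*(?P<new_date>\d+)\s+"
    r"version\s*:\s*(?P<old_version>\d+)\s*<=>\s*(?P<new_version>\d+)"
)


def to_utc(seconds):
    """Interpret a header date as a Unix timestamp in seconds (an assumption)."""
    return datetime.fromtimestamp(int(seconds), tz=timezone.utc)


def parse_headers(message):
    """Return one dict per '>>> <path>' summary that has a Header section.

    Entries without a Header (for example a newly added dictionary) are skipped.
    """
    results = []
    for match in HEADER_RE.finditer(message):
        results.append({
            "path": match.group("path"),
            "old_date": to_utc(match.group("old_date")),
            "new_date": to_utc(match.group("new_date")),
            "old_version": int(match.group("old_version")),
            "new_version": int(match.group("new_version")),
        })
    return results


# Sample taken from the 2013-09-26 commit message.
sample = (
    ">>> dictionaries/pt_PT_wordlist.combined.gz Header : "
    "date : 1376884536 <=> 1380099168 version : 37 <=> 39 Body : "
    "Added: cafe 0"
)

for entry in parse_headers(sample):
    print(
        entry["path"],
        entry["old_version"], "->", entry["new_version"],
        entry["new_date"].date(),
    )
```

The sketch reads only the Header section; the Body lines (`Added:`, `Deleted:`, `Freq changed:`, `Shortcut added:`) would need their own patterns.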