aboutsummaryrefslogtreecommitdiffstats
path: root/native/jni/src (follow)
Commit message (Collapse)AuthorAgeFilesLines
* Use enum to specify ngram type.Keisuke Kuroyanagi2014-11-2514-251/+218
| | | | Change-Id: Ie28768ceadcd7a2d940c57eb30be7d4c364e509f
* Fix the base character of D with strokeJean Chalard2014-11-191-1/+6
| | | | | Bug: 18436480 Change-Id: Ic92cae7c85c07c8f62a5b6e69d2b71e204aff50d
* Implement IntArrayView::split for dicttoolkit.Keisuke Kuroyanagi2014-11-111-0/+23
| | | | | Bug: 10059681 Change-Id: Ic29e79d049bb532727cf5cb1e529fec5d35156ed
* Merge "Use reference instead of pointer for WordProperty()."Keisuke Kuroyanagi2014-11-104-11/+16
|\
| * Use reference instead of pointer for WordProperty().Keisuke Kuroyanagi2014-11-104-11/+16
| | | | | | | | Change-Id: Idf03e97661d64186c752e35964d641a5528be5b1
* | Make profiler use getTimeInMicroSec().Keisuke Kuroyanagi2014-11-113-73/+96
|/ | | | | Bug: 17797064 Change-Id: Ie992c9454edfc3bf93d5ea367c3a4427b513a205
* Fix: BoS prediction after inputting just once.Keisuke Kuroyanagi2014-11-011-2/+10
| | | | Change-Id: Ib69569ab6b6edfcc8c1d2c621b95de4127789ab6
* Merge "Update v4 format version from 402 to 403."Keisuke Kuroyanagi2014-10-316-21/+21
|\
| * Update v4 format version from 402 to 403.Keisuke Kuroyanagi2014-10-316-21/+21
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Without personalization: Total words: 1134774, Success Num: 899230, Success Percentage: 79.243% Bad Failures, with auto-correction (typed word == expected word, output word != expected word): 1871, Bad Failure Percentage: 0.165% Failures, with auto-correction (F-C): 29084, F-C Percentage: 2.563% Max Keystrokes: 6072959, Min Keystrokes: 4436090, Keystroke Saving Percentage:26.953% Before: Total words: 1134646, Success Num: 925194, Success Percentage: 81.540% Bad Failures, with auto-correction (typed word == expected word, output word != expected word): 1316, Bad Failure Percentage: 0.116% Failures, with auto-correction (F-C): 28288, F-C Percentage: 2.493% Max Keystrokes: 6072831, Min Keystrokes: 3946188, Keystroke Saving Percentage:35.019% After Total words: 1134659, Success Num: 944746, Success Percentage: 83.263% Bad Failures, with auto-correction (typed word == expected word, output word != expected word): 1258, Bad Failure Percentage: 0.111% Failures, with auto-correction (F-C): 28016, F-C Percentage: 2.469% Max Keystrokes: 6072844, Min Keystrokes: 3387333, Keystroke Saving Percentage:44.222% Change-Id: I3af42ec37a11847c0429c28616e726f6a339247f
* | Merge "Add hacks for better handling count value during migration."Keisuke Kuroyanagi2014-10-312-1/+4
|\|
| * Add hacks for better handling count value during migration.Keisuke Kuroyanagi2014-10-312-1/+4
| | | | | | | | | | Bug: 14425059 Change-Id: Ib050574aa7c4babd4285322a11c3af9be9fbab1e
* | Merge "Change v403 historical info format."Keisuke Kuroyanagi2014-10-314-3/+5
|\|
| * Change v403 historical info format.Keisuke Kuroyanagi2014-10-314-3/+5
| | | | | | | | | | | | count -> 2B, level -> 0B. Change-Id: I3b241126f56eb33cdf09cb1ebfed04f534e4ec48
* | Further fixes to treat 0-frequency wordsAdrian Velicu2014-10-317-12/+47
|/ | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Previously, when both legitimate 0-frequency words (such as distracters) and offensive words were encoded in the same way, distracters would never show up when the user blocked offensive words (the default setting, as well as the setting for regression tests). When b/11031090 was fixed and a separate encoding was used for offensive words, 0-frequency words would no longer be blocked when they were an "exact match" (where case mismatches and accent mismatches would be considered an "exact match"). The exact match boosting functionality meant that, for example, when the user typed "mt" they would be suggested the word "Mt", although they most probably meant to type "my". For this reason, we introduced this change, which does the following: * Defines the "perfect match" as a really exact match, with no room for case or accent mismatches * When the target word has probability zero (as "Mt" does, because it is a distracter), ONLY boost its score if it is a perfect match. By doing this, when the user types "mt", the word "Mt" will NOT be boosted, and they will get "my". However, if the user makes an explicit effort to type "Mt", we do boost the word "Mt" so that the user's input is not autocorrected to "My". Bug: 11031090 Change-Id: I92ee1b4e742645d52e2f7f8c4390920481e8fff0
* Block offensive words in multi-word suggestionsAdrian Velicu2014-10-313-13/+65
| | | | | | | | | If the user has chosen to block offensive words and types "aaaxbb", where "aaa" is an offensive word and "bb" is not, we should not suggest "aaa bb". Bug: 11031090 Change-Id: Ie23b8dd5d347bc26b1c046c3f5e8dfbc259bf528
* Using "blacklist" flag as "possibly offensive"Adrian Velicu2014-10-318-26/+26
| | | | | Bug: 11031090 Change-Id: I5cc0d006ab003656498eb82b0875eb9c051d331e
* Enable count based dynamic ngram language model for v403.Keisuke Kuroyanagi2014-10-306-89/+128
| | | | | | Bug: 14425059 Change-Id: Icc15e14cfd77d37cd75f75318fd0fa36f9ca7a5b
* Add DynamicLanguageModelProbabilityUtils.Keisuke Kuroyanagi2014-10-302-0/+151
| | | | | Bug: 14425059 Change-Id: Ia58ab3f0ead02798046d182a9464dcbd95f086bc
* Add method to encode probability.Keisuke Kuroyanagi2014-10-302-0/+38
| | | | | Bug: 14425059 Change-Id: I3e5d359ba5fa38f1669f0e98dfae792ff53efbf8
* Fix: TRT and ime-simulator bulid.Keisuke Kuroyanagi2014-10-301-0/+2
| | | | Change-Id: I1697a907562d1ed6aff2b001763d1594263ba0d3
* Merge "Add a class to have global counters for LanguageModelDictContent."Keisuke Kuroyanagi2014-10-296-7/+147
|\
| * Add a class to have global counters for LanguageModelDictContent.Keisuke Kuroyanagi2014-10-296-7/+147
| | | | | | | | | | Bug: 14425059 Change-Id: I08ec19903432356b6028853fd73b4eefce20218e
* | Merge "Improve space substitution error correction."Keisuke Kuroyanagi2014-10-286-11/+14
|\ \
| * | Improve space substitution error correction.Keisuke Kuroyanagi2014-10-286-11/+14
| |/ | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Bug: 17432052 [Category diff] +1 262 -1 93 +2 2 -2 18 +3 18 -3 2 +4 111 -4 148 +5 295 -5 217 +6 51 -6 276 +7 139 -7 124 [Weighted category diff] +1 276 -1 100 +2 4 -2 20 +3 20 -3 4 +4 118 -4 160 +5 309 -5 225 +6 52 -6 298 +7 163 -7 135 show diff for ./en_user_log_phones_2011_08.csv +1 173 -1 28 +2 2 -2 17 +3 17 -3 2 +4 63 -4 82 +5 120 -5 51 +6 24 -6 220 +7 88 -7 87 Change-Id: I9d673acb0ff632828ae2e0ead56e76e3a20411c6
* / Fix: deleted PtNode handling in v403.Keisuke Kuroyanagi2014-10-271-0/+3
|/ | | | | | | If a word is once deleted, the word never gets into the personalized dictionaries due to this bug. Change-Id: Ife4e3fe1ba0615b4135e6291d2151b0db7d3f940
* Fix: Personalized dicts suggest invalid words with v403.Keisuke Kuroyanagi2014-10-231-2/+3
| | | | | Bug: 14425059 Change-Id: I45ae00069dd3b7c461dd9a1f3558b96af0a1c975
* Use trigrams for personalization dict.Keisuke Kuroyanagi2014-10-231-0/+3
| | | | | 5Bug: 14425059 Change-Id: I73cf6904e569d60996a3b079f16ea6df0cb90f02
* Support dumping ngram entries.Keisuke Kuroyanagi2014-10-224-23/+47
| | | | | Bug: 14425059 Change-Id: Ib03a0c3d166ed6f1e60c67127b28006d55143b6b
* Support ngram entry migration.Keisuke Kuroyanagi2014-10-225-41/+130
| | | | | Bug: 14425059 Change-Id: I98cb9fa303af2d93a0a3512e8732231c564e3c5d
* Fix debug build.Keisuke Kuroyanagi2014-10-212-4/+4
| | | | Change-Id: Id94636714d04a8828718b87741c0ee62a14cb3b4
* Merge changes I210acb81,Ie9508788Keisuke Kuroyanagi2014-10-2113-139/+178
|\ | | | | | | | | | | * changes: Make NgramProperty have NgramContext. Create .cpp file for NgramContext.
| * Make NgramProperty have NgramContext.Keisuke Kuroyanagi2014-10-2111-33/+40
| | | | | | | | | | Bug: 14425059 Change-Id: I210acb816b122857dbbe1ee4dd6a35c5335bf2bf
| * Create .cpp file for NgramContext.Keisuke Kuroyanagi2014-10-212-106/+138
| | | | | | | | | | | | Bug: 14425059 Change-Id: Ie950878817b9c80cc9c970e1a84880c9b9ab228a
* | Merge "Use EntryCounters during GC."Keisuke Kuroyanagi2014-10-215-65/+54
|\|
| * Use EntryCounters during GC.Keisuke Kuroyanagi2014-10-215-65/+54
| | | | | | | | | | Bug: 14425059 Change-Id: I61eb798686dc753fb6c0fe99a0719c1732198f30
* | Merge "Renaming "blacklist" flag to "possibly offensive""Adrian Velicu2014-10-2113-52/+99
|\ \ | |/ |/|
| * Renaming "blacklist" flag to "possibly offensive"Adrian Velicu2014-10-2113-52/+99
| | | | | | | | | | | | | | | | | | No behaviour changes. Unified the overloaded FusionDictionary::add method to always take an isPossiblyOffensive argument. Bug: 11031090 Change-Id: I5741a023ca1ce842d2cf10d4f6c926b0efabaa78
* | Introduce EntryCounters to count entries in a dictionary.Keisuke Kuroyanagi2014-10-2116-89/+230
|/ | | | | | Bug: 14425059 Change-Id: Ic13ba827d96fa4a147485ba92fdb37e23e04e8e8
* Change entry count limit.Keisuke Kuroyanagi2014-10-202-3/+7
| | | | | | Unigram 10K, Bigram 30K, Trigram 30K. Change-Id: Ibd19c6a2b618499df1c70000bad7b47498187f0a
* Merge "Update useless n-gram entry detection logic during GC."Keisuke Kuroyanagi2014-10-152-21/+32
|\
| * Update useless n-gram entry detection logic during GC.Keisuke Kuroyanagi2014-10-152-21/+32
| | | | | | | | | | Bug: 14425059 Change-Id: Ib939deae5b60167751dee07965bb1ef1a43c4625
* | Merge "Use better conditional probability for ngram entries."Keisuke Kuroyanagi2014-10-152-11/+26
|\|
| * Use better conditional probability for ngram entries.Keisuke Kuroyanagi2014-10-152-11/+26
| | | | | | | | | | | | | | | | | | | | | | | | Old: P(W | W_prev) = f(W, W_prev) + C New: P(W | W_prev) = f(W, W_prev) / f(W_prev) Bug: 14425059 Bug: 16547409 Change-Id: I4d13be6de2c6bad6bad7fb22320a23ba4ecd361c
* | Merge "Move entry updating method to language model dict content."Keisuke Kuroyanagi2014-10-157-72/+115
|\|
| * Move entry updating method to language model dict content.Keisuke Kuroyanagi2014-10-157-72/+115
| | | | | | | | | | Bug: 14425059 Change-Id: I710055490d141539458cbf968adf5a7ccffd9552
* | Quit using weightChildNode for ADDITIONAL_PROXIMITY and SUBSTITUTION.Keisuke Kuroyanagi2014-10-144-9/+14
|/ | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | [Category diff] +1 0 -1 1 +2 0 -2 0 +3 0 -3 0 +4 1 -4 1 +5 8 -5 7 +6 0 -6 1 +7 1 -7 0 [Weighted category diff] +1 0 -1 1 +2 0 -2 0 +3 0 -3 0 +4 1 -4 1 +5 8 -5 7 +6 0 -6 1 +7 1 -7 0 Bug: 13756409 Change-Id: I6ac3567545676bbefbee3e87dda54bc083c15fb6
* Fix: BoS bigram from user history dictionary is too strong.Keisuke Kuroyanagi2014-10-132-4/+10
| | | | | | | They can be always stronger than BoS predictions from the contextual dictionary. Bug: 17961731 Change-Id: I70297d82436c10c790bdfad6f3dfefdb4bb2f852
* [ML23] Introduce a different accuracy/performance tradeoffJean Chalard2014-10-106-6/+43
| | | | | Bug: 11230254 Change-Id: Ic09518c818ae7b68942b1c63160dd462e5922cb5
* Merge "Make members of classes that are used with std::vector const"Keisuke Kuroyanagi2014-10-103-21/+19
|\
| * Make members of classes that are used with std::vector constKeisuke Kuroyanagi2014-10-093-21/+19
| | | | | | | | Change-Id: Id93fb87f5630230fc3f9cd339e12f3b0e2006ea9