aboutsummaryrefslogtreecommitdiffstats
path: root/dictionaries
diff options
context:
space:
mode:
authorJean Chalard <jchalard@google.com>2011-12-14 15:25:31 +0900
committerJean Chalard <jchalard@google.com>2011-12-14 15:25:31 +0900
commit4fc97c2c01646d877505295713abdf16d775d3d4 (patch)
tree3c9b6139f6871bd49147e5e4f15aede9373804ac /dictionaries
parent8e3faff244a03aa49dfff03f2a6d982590ff605c (diff)
downloadlatinime-4fc97c2c01646d877505295713abdf16d775d3d4.tar.gz
latinime-4fc97c2c01646d877505295713abdf16d775d3d4.tar.xz
latinime-4fc97c2c01646d877505295713abdf16d775d3d4.zip
Add a note of documentation to the sample word list
Change-Id: I95f09da03457933a14b549e04575d566de97dd49
Diffstat (limited to 'dictionaries')
-rw-r--r--dictionaries/sample.xml5
1 files changed, 3 insertions, 2 deletions
diff --git a/dictionaries/sample.xml b/dictionaries/sample.xml
index 85233b63a..ad98f2b6f 100644
--- a/dictionaries/sample.xml
+++ b/dictionaries/sample.xml
@@ -2,7 +2,9 @@
for use by the Latin IME.
The format of the word list is a flat list of word entries.
Each entry has a frequency between 255 and 0.
- Highest frequency words get more weight in the prediction algorithm.
+ Highest frequency words get more weight in the prediction algorithm. As a
+ special case, a weight of 0 is taken to mean profanity - words that should
+ not be considered a typo, but that should never be suggested explicitly.
You can capitalize words that must always be capitalized, such as "January".
You can have a capitalized and a non-capitalized word as separate entries,
such as "robin" and "Robin".
@@ -13,4 +15,3 @@
<w f="128">sample</w>
<w f="1">wordlist</w>
</wordlist>
-