Adding new dictionaries to aspell: Difference between revisions
m (Corrected the preg_split change) |
|||
Line 39: | Line 39: | ||
$misspelled .= "$word:$suggestions\n"; | $misspelled .= "$word:$suggestions\n"; | ||
</nowiki></pre> | </nowiki></pre> | ||
<b>NOTE:</b> After Changing the aspell language restart the spellchecker as the user zimbra with: 'zmspellctl stop; zmspellctl start' | |||
There is also a problem when splitting words. Replace (line 48 or so) | There is also a problem when splitting words. Replace (line 48 or so) |
Revision as of 16:06, 2 October 2007
How to add a new dictionary to aspell
- Download the dictionary from gnu.org
- Extract it from the tar file in a work directory
- Set Zimbra's aspell in the path with:
- PATH=/opt/zimbra/aspell-0.60.3/bin:$PATH
- Configure and install the dictionary according to the README
- ./configure
- make
- make install
- Edit the file /opt/zimbra/httpd/htdocs/aspell.php to reference the new dictionary. For example to add the french dictionary:
- $dictionary = pspell_new("en_EN");
- TO
- $dictionary = pspell_new("fr_FR");
NOTE: I think is better to change the value of the variable $locale at the start of the file.
NOTE: If you are using a non-english based language with special chars like tildes (spanish, for example), you have to modify aspell.php
This file is located at /opt/zimbra/httpd/htdocs/aspell.php. Replace this block (line 82 or so)
$suggestions = implode(",", pspell_suggest($dictionary, $word)); $misspelled .= "$word:$suggestions\n";
with this one:
$suggestions = implode(",", pspell_suggest($dictionary, $word)); $suggestions=iconv("iso-8859-1","UTF-8",$suggestions); $misspelled .= "$word:$suggestions\n";
NOTE: After Changing the aspell language restart the spellchecker as the user zimbra with: 'zmspellctl stop; zmspellctl start'
There is also a problem when splitting words. Replace (line 48 or so)
$words = preg_split('/[^\w\'-] /', $text);
with this one:
$words = preg_split('/[^\w\'\xc0-\xfd-]+/', $text);
This regexp line should be enough for most western Europe languages (Spanish, French, German, Portuguese and Italian). It includes all ISO8859 europeean letters in the range 192-253 of the table below.
- To add a set of custom words see this How-To