Fix for bug 9413 and the related Malayalam issue reported on wikitech-l.
author Tim Starling <tstarling@users.mediawiki.org>
Mon, 4 Jan 2010 08:28:50 +0000 (08:28 +0000)
committer Tim Starling <tstarling@users.mediawiki.org>
Mon, 4 Jan 2010 08:28:50 +0000 (08:28 +0000)
commit ad19c032b0fe6d5ecbf0ab5a94bc90f6df66d7e2
tree bb45f832e42273c8f9ba32c3c41711c0ea6ec2c2
parent dff724821eda8a867b7f457f71413b71e1892947
* Added $wgFixArchaicUnicode, which, if enabled, converts some deprecated Unicode sequences in Arabic and Malayalam text to their Unicode 5.1 equivalents.
* Added generateNormalizerData.php to generate the relevant data files, and added the generated data files themselves.
* Made most things call the new wrapper method $wgContLang->normalize() instead of UtfNormal::cleanUp(), so that Unicode normalization can be customised on a per-language basis.
* Added some generic support for conversion tables to Language so that subclasses can easily implement these kinds of transformations.
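The approach described in the list above can be sketched roughly as follows. This is an illustrative Python model, not the PHP code from this commit: the names `transform_using_table`, `ML_TABLE`, and the `fix_archaic` flag (standing in for $wgFixArchaicUnicode) are assumptions, and the table is a small hand-written sample rather than the actual contents of normalize-ml.ser. It maps the pre-5.1 Malayalam chillu sequences (consonant + virama + ZWJ) to the atomic chillu code points added in Unicode 5.1, and uses plain NFC normalization as a stand-in for UtfNormal::cleanUp().

```python
import re
import unicodedata

# Hypothetical sample table: pre-Unicode-5.1 chillu sequences
# (consonant + virama U+0D4D + ZWJ U+200D) mapped to atomic chillus.
ML_TABLE = {
    "\u0d23\u0d4d\u200d": "\u0d7a",  # chillu NN
    "\u0d28\u0d4d\u200d": "\u0d7b",  # chillu N
    "\u0d30\u0d4d\u200d": "\u0d7c",  # chillu RR
    "\u0d32\u0d4d\u200d": "\u0d7d",  # chillu L
    "\u0d33\u0d4d\u200d": "\u0d7e",  # chillu LL
}


def transform_using_table(text, table):
    """Replace every key of `table` found in `text` with its value.

    Longer keys are tried first so that a short key cannot shadow a
    longer sequence starting at the same position.
    """
    if not table:
        return text
    keys = sorted(table, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(k) for k in keys))
    return pattern.sub(lambda m: table[m.group(0)], text)


class Language:
    """Base class: generic normalization plus an optional table."""

    table = {}  # subclasses override this with their own conversions

    def normalize(self, text, fix_archaic=True):
        # NFC stands in for the generic cleanup step.
        text = unicodedata.normalize("NFC", text)
        if fix_archaic:
            text = transform_using_table(text, self.table)
        return text


class LanguageMl(Language):
    """Malayalam: adds the chillu conversions on top of the base step."""

    table = ML_TABLE


if __name__ == "__main__":
    old = "\u0d28\u0d4d\u200d"  # NA + virama + ZWJ
    new = LanguageMl().normalize(old)
    print([hex(ord(c)) for c in new])
```

Keeping the table on the subclass means callers only ever invoke `normalize()` on the content language object, so a language with no table behaves exactly like plain normalization.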
16 files changed:
RELEASE-NOTES
includes/DefaultSettings.php
includes/WebRequest.php
includes/Xml.php
includes/api/ApiResult.php
languages/Language.php
languages/classes/LanguageAr.php
languages/classes/LanguageMl.php [new file with mode: 0644]
maintenance/cleanupImages.php
maintenance/cleanupTitles.php
maintenance/cleanupWatchlist.php
maintenance/dumpTextPass.php
maintenance/language/generateNormalizerData.php [new file with mode: 0644]
maintenance/preprocessorFuzzTest.php
serialized/normalize-ar.ser [new file with mode: 0644]
serialized/normalize-ml.ser [new file with mode: 0644]