Strip Unicode 6.3.0 directional formatting characters from title
author Fomafix <fomafix@googlemail.com>
Thu, 22 Mar 2018 10:48:55 +0000 (11:48 +0100)
committer Fomafix <fomafix@googlemail.com>
Thu, 10 May 2018 06:58:23 +0000 (08:58 +0200)
commit 7564624d1ca80a4d2f1cc2b4d3d32d5d2e0bca38
tree ad3be8c5101905617a08947571acf0df82297033
parent 6b8a5a137d3f449b4056e3de82fa6747b45f1f9a

Unicode 6.3.0 (September 2013) added additional directional
formatting characters:

U+061C ARABIC LETTER MARK
U+2066 LEFT-TO-RIGHT ISOLATE
U+2067 RIGHT-TO-LEFT ISOLATE
U+2068 FIRST STRONG ISOLATE
U+2069 POP DIRECTIONAL ISOLATE

https://www.fileformat.info/info/unicode/version/6.3/index.htm

This change strips the new directional formatting characters from
titles, in the same way as the directional formatting characters from
Unicode 1.1.0 (June 1993) are already stripped.
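
The stripping described above can be sketched as a single character-class
replacement. This is a minimal illustration, not the code from this commit:
the names BIDI_FORMATTING and stripBidiFormatting are hypothetical, and the
Unicode 1.1.0 set is assumed to be U+200E, U+200F and U+202A..U+202E.

```javascript
// Hypothetical sketch; identifiers are illustrative, not MediaWiki's own.
// Assumed Unicode 1.1.0 directional formatting characters:
//   U+200E, U+200F, U+202A..U+202E
// Added in Unicode 6.3.0 (listed in this commit message):
//   U+061C, U+2066..U+2069
const BIDI_FORMATTING = /[\u200E\u200F\u202A-\u202E\u061C\u2066-\u2069]+/g;

// Remove every directional formatting character from a title string.
function stripBidiFormatting(title) {
  return title.replace(BIDI_FORMATTING, '');
}
```

For example, stripBidiFormatting('\u2066Main Page\u2069') returns
'Main Page', and a title without any of these characters is returned
unchanged.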

Existing titles that contain the new Unicode directional formatting
characters are stripped by running maintenance/cleanupTitles.php after
deployment.

This change also allows the new Unicode directional formatting
characters to be inserted into DISPLAYTITLE.
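
One way to see why stripping makes these characters usable in DISPLAYTITLE
is the sketch below. It assumes that a display title is accepted when it
normalizes to the same text as the page title; that mechanism is an
assumption here, not something this commit message states, and the names
normalizeTitle and displayTitleMatches are hypothetical.

```javascript
// Hypothetical sketch; not MediaWiki's actual DISPLAYTITLE check.
// Assumed set of directional formatting characters (old and new).
const BIDI = /[\u200E\u200F\u202A-\u202E\u061C\u2066-\u2069]+/g;

// Simplified normalization: drop directional formatting characters
// and surrounding whitespace. Real title normalization does more.
function normalizeTitle(text) {
  return text.replace(BIDI, '').trim();
}

// Assumption: a display title is acceptable when it normalizes to the
// same string as the page title.
function displayTitleMatches(displayTitle, pageTitle) {
  return normalizeTitle(displayTitle) === normalizeTitle(pageTitle);
}
```

Under this assumption, displayTitleMatches('\u2067Foo\u2069', 'Foo') is
true because the isolate characters are removed before comparison, while
displayTitleMatches('Bar', 'Foo') is false.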

Change-Id: I2279f51048f5252c2e4280ec6a13f060ff9967cb
RELEASE-NOTES-1.32
includes/title/MediaWikiTitleCodec.php
resources/src/mediawiki.Title/Title.js
tests/parser/parserTests.txt
tests/phpunit/includes/title/MediaWikiTitleCodecTest.php
tests/qunit/suites/resources/mediawiki/mediawiki.Title.test.js