Improve RemexStripTagHandler working with tables
author Erik Bernhardson <ebernhardson@wikimedia.org>
Thu, 14 Mar 2019 20:06:27 +0000 (13:06 -0700)
committer Erik Bernhardson <ebernhardson@wikimedia.org>
Thu, 14 Mar 2019 20:11:59 +0000 (13:11 -0700)
commit aef02d516d0fb20571120ada7278d1658d4d9222
tree 6a7f695c2fef9dee1294b0648a21fe19c33e897d
parent 5537b54249ba8d70cdf97e5a20a35c4249b33327
Improve RemexStripTagHandler working with tables

HTML generated by some infoboxes, and perhaps by other sources, is
stripped in a way that merges words that should stay separate.
Add tr, th, and td to the list of tags that force word
separation.

Bug: T218001
Change-Id: Ib374339628b1f543ea4e07f24aa3e3b76f3117b5
includes/parser/RemexStripTagHandler.php
tests/phpunit/includes/parser/SanitizerTest.php
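
The word-merging problem described in the commit message can be illustrated with a small standalone sketch. This is not the code from RemexStripTagHandler.php (which is PHP and is not shown on this page); it is a Python approximation built on the standard library's html.parser, and the names SEPARATING_TAGS, StripTags, and strip_tags are hypothetical. It assumes that "force word separation" means emitting a space wherever one of the listed tags opens or closes.

```python
from html.parser import HTMLParser

# Hypothetical tag set for illustration: only the three table tags named in
# the commit message. The real handler's full list is not shown on this page.
SEPARATING_TAGS = frozenset({"tr", "th", "td"})


class StripTags(HTMLParser):
    """Collect text content, adding a space at separating tag boundaries."""

    def __init__(self, separating_tags=frozenset()):
        super().__init__(convert_charrefs=True)
        self.separating_tags = separating_tags
        self.parts = []

    def handle_starttag(self, tag, attrs):
        # html.parser lowercases tag names before calling this method.
        if tag in self.separating_tags:
            self.parts.append(" ")

    def handle_endtag(self, tag):
        if tag in self.separating_tags:
            self.parts.append(" ")

    def handle_data(self, data):
        self.parts.append(data)


def strip_tags(html, separating_tags=frozenset()):
    """Return the text of `html` with all tags removed."""
    parser = StripTags(separating_tags)
    parser.feed(html)
    parser.close()
    # Collapse runs of whitespace so adjacent separators yield one space.
    return " ".join("".join(parser.parts).split())


if __name__ == "__main__":
    row = "<table><tr><th>Born</th><td>1912</td></tr></table>"
    print(strip_tags(row))                   # cells run together
    print(strip_tags(row, SEPARATING_TAGS))  # cells kept apart
```

With an empty separator set, the header and data cells are concatenated into a single token, which is the behaviour the commit sets out to fix; passing the table tags keeps each cell's text as a separate word.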