Last modified: 2014-02-12 23:38:11 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T23816, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 21816 - more agressive norm()
more agressive norm()
Status: NEW
Product: MediaWiki extensions
Classification: Unclassified
AbuseFilter (Other open bugs)
unspecified
All All
: Low enhancement (vote)
: ---
Assigned To: Nobody - You can work on this!
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2009-12-10 23:05 UTC by Martijn Hoekstra
Modified: 2014-02-12 23:38 UTC (History)
2 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Martijn Hoekstra 2009-12-10 23:05:20 UTC
norm() currently behaves as rmwhitespace(rmspecials(rmdoubles(ccnorm(arg1))))

this has the drawback that whitespace is removed last. With fairly common vandalism edits, we have stuff like
  "G  G G G GAY!". If doubles are removed before repetition is removed, the list of G's are unrecognised. I propose to either change

        $s = $this->ccnorm( $s );
        $s = $this->rmdoubles( $s );
        $s = $this->rmspecials( $s );
        $s = $this->rmwhitespace( $s );

to

        $s = $this->ccnorm( $s );
        $s = $this->rmspecials( $s );
        $s = $this->rmwhitespace( $s );
        $s = $this->rmdoubles( $s );

or to make a new function agrNorm() (for agressive norm) that mimics the behaviour of norm, but with the second operation order, not to break backwards compatibility.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links