Wikipedia:Bots/Requests for approval/WikiCleanerBot 21
- The following discussion is an archived debate. Please do not modify it. To request review of this BRFA, please start a new section at Wikipedia:Bots/Noticeboard. The result of the discussion was Approved.
Operator: NicoV (talk · contribs · SUL · edit count · logs · page moves · block log · rights log · ANI search)
Time filed: 06:34, Tuesday, September 8, 2020 (UTC)
Function overview: Fix some problems with missing whitespace before internal links.
Automatic, Supervised, or Manual: Automatic
Programming language(s): Java (WPCleaner)
Source code available: On GitHub (especially algorithm 557)
Links to relevant discussions (where appropriate): Request from Bakertheacre
Edit period(s): Twice a month
Estimated number of pages affected: On frwiki, it resulted in about 3k edits, so I guess a few times this number (10-20k maybe). A test run could give better insight on the potential number of pages.
Namespace(s): Main
Exclusion compliant (Yes/No): Yes
Function details: With tools like Visual Editor or Content Translation, users can easily make mistakes when creating internal links, by not selecting the entire text to be put in the link. This task will solve some of the problems when the internal link is directly after a letter, without any whitespace in beetwen. Currently, my bot can fix cases like:
ab[[Abcde|cde]]
is replaced by[[abcde]]
(like here)xx[[yyyy| zzzz]]
is replaced byxx [[yyyy|zzzz]]
xx[[ yyyy]]
is replaced byxx [[yyyy]]
xx[[yyyy|'zzzz]]
replaced byxx'[[yyyy|zzzz]]
(like here)
For the moment, the bot will run directly on a dump file to find the articles where it can fix the problem. At a later time (if I can reduce enough the number of false positives), I will generate a list of pages with such problems, like I do for other errors.
Discussion
[edit]Approved for trial (50 edits). Please provide a link to the relevant contributions and/or diffs when the trial is complete. Primefac (talk) 17:04, 15 September 2020 (UTC)[reply]
- Thanks Primefac. Trial complete. I've done 51 edits. I'm surprised by the small number of cases found (my bot went through more than 1.1 million articles), and by the fact that almost all cases are the second or third cases which are less interesting to do compared to what I saw on frwiki (except Constructed wetland, Apolipoprotein, Nick Ferrari). And not sure about this modification, which I can prevent if deemed better to do so. If you think it's not worth the trouble, request can be closed, or accepted only as cosmetic. I can also run the bot once only for the first or fourth cases which are more interesting to fix. --NicoV (Talk on frwiki) 19:25, 15 September 2020 (UTC)[reply]
- Approved. If you can manage to fix the Flag of Washington, D.C. apostrophe issue, then please do, otherwise skip if that happens. Primefac (talk) 15:22, 25 September 2020 (UTC)[reply]
- The above discussion is preserved as an archive of the debate. Please do not modify it. To request review of this BRFA, please start a new section at Wikipedia:Bots/Noticeboard.