Wikipedia:Bots/Requests for approval/B-bot 2
- The following discussion is an archived debate. Please do not modify it. To request review of this BRFA, please start a new section at WT:BRFA. The result of the discussion was Approved.
Operator: B (talk · contribs · SUL · edit count · logs · page moves · block log · rights log · ANI search)
Time filed: 12:15, Sunday, May 3, 2015 (UTC)
Automatic, Supervised, or Manual: Automatic
Programming language(s): C#
Source code available: source
Function overview: Check images in the dated orphaned non-free category that is due for deletion tomorrow to see if they are still orphaned. Remove any from the category that are not. Log any removals to the bot's event log in case I (or anyone else) would like to manually inspect them for NFCC compliance.
Links to relevant discussions (where appropriate):
Edit period(s): Once daily, probably around 20:00 UTC
Estimated number of pages affected: Maybe 2-3/day on an average day. On the average day, there are 50 orphaned fair use images, and only a small number of those turn out to be re-added to articles.
Exclusion compliant (Yes/No): In general, yes, but not for this task as no users are notified.
Already has a bot flag (Yes/No): Yes
Function details: This task will loop though the dated Category:Orphaned non-free use Wikipedia files category that is due to be deleted tomorrow. For each image in the category, it will purge the image and purge any articles for which there is a fair use rationale. If it finds the image to be no longer orphaned, it will remove the {{di-orphaned fair use}} tag.
The purpose of this exercise is (1) To assist the deleting admin by clearing out a tag that they would need to remove manually and (2) to reduce the likelihood that an image will be inadvertently deleted, either because the deleting admin does not see that it is in use or because the image links have not been updated and do not show the image to be in use.
For purposes of this bot, "orphaned" means "not used in article space" and "not orphaned" means "used in article space". (Tags will NOT be removed from images used in drafts, etc. Though WP:F5 does say "Reasonable exceptions may be made for images uploaded for an upcoming article", any judgment call there is for the deleting admin, not a bot.)
Discussion
[edit]I am doing a test run on Category:Orphaned non-free use Wikipedia files as of 2 May 2015 (the really big category from the first day the orphaned fair use tagging process ran). (In the real thing, the process will always be run on the category that is due to be deleted tomorrow, but for testing purposes, I am using a really big day because there would be nothing to do in the smaller days.) The event log is here. The edits the bot would make are at User:B-bot/Test page. --B (talk) 12:15, 3 May 2015 (UTC)[reply]
- Some comments:
- Consider using Category:All orphaned non-free use Wikipedia files instead of a dated category. It probably makes things simpler for people looking at the dated categories if files which have been added to articles are removed from the 'orphaned fair use' categories as soon as possible.
Match matchOrfud = Regex.Match(pgCurrentImagePage.text, @"\{\{\s*(d|D)i-orphaned fair use[^\{]*\}\}");
also seems to remove {{di-orphaned fair use-notice}} and other templates whose names begin with 'di-orphaned fair use'. Not sure if this is a problem. {{Di-orphaned fair use-notice}} should not be used on file information pages anyway.- What happens if the orfud category contains a page which is not a file? The category should only contain files, but there could be errors, such as people erroneously adding a category to a page using Hotcat.
- Instead of looking at the event log, note that you can also find these by filtering Special:Contributions/B-bot on the file namespace and looking for edits made more than a week ago which have not been deleted. --Stefan2 (talk) 19:56, 3 May 2015 (UTC)[reply]
- The reason for using the dated category instead of the general one was to try and catch them right before they are due to be deleted. It isn't really a huge problem if the template sits on the image for a few more days (it would be doing that even if the bot didn't exist). My purpose in doing this is to be an aid to the admin processing the category. I will modify the code to ignore non-files that it finds in the category. --B (talk) 21:12, 3 May 2015 (UTC)[reply]
((BotTrial|days=5)) -- Magioladitis (talk) 21:46, 3 May 2015 (UTC)[reply]
- @Magioladitis: FYI, today, for the first time, there was a non-redlinked category for the bot to examine. The log is at User:B-bot/Event_log#No_Longer_Orphaned_Checker_-_22:00.2C_5_May_2015_.28UTC.29. There was nothing interesting - there are only two images in the category and both are still orphans. Thursday afternoon, there is a category with 44 files that it will process and Friday afternoon is the big one (497 files). I fed it this day for my testing at User:B-bot/Test page, so I'm highly confident with it processing that day. I just wanted to note that Thursday and Friday will be the two days of the trial worth watching where there will actually be non-trivial work for it to do. The run is scheduled for around this time (starting 15 minutes ago ... 22:00 UTC / 6:00 PM Eastern) so in the (extremely unlikely) event that you should happen to notice something go wrong, please feel free to stop the bot at User:B-bot/STOP or block it if need be. (After the Friday run, I will turn off the task until you approve or authorize a further trial.) --B (talk) 22:17, 5 May 2015 (UTC)[reply]
Approved for trial (15 days). Please provide a link to the relevant contributions and/or diffs when the trial is complete. I extended the trial to 15 days. -- Magioladitis (talk) 22:30, 5 May 2015 (UTC)[reply]
- Okay, thanks. I have set the job to run at 22:00 UTC / 6:00 PM Eastern up until (and including) 5/19. --B (talk) 23:54, 5 May 2015 (UTC)[reply]
- FYI, I have moved the day that it processes to a day later. We're deleting images on the 8th day (after at least seven full 24-hour periods), not on the 7th day (after seven partial days, using the term "days" like you would when you say your vacation is 3 "days" and 2 nights). Last night, it ran the big day at User:B-bot/Event_log#No_Longer_Orphaned_Checker_-_22:00.2C_8_May_2015_.28UTC.29. That isn't due to be deleted until tomorrow and so I have changed it to wait one more day. Interestingly, last night, it removed only three tags. When I ran it in test mode, there were a lot more and I notice that Aspects (talk · contribs) has removed all of those. --B (talk) 14:19, 9 May 2015 (UTC)[reply]
- I was not aware that this bot was also checking at the end of the seven days to see if the file is being used. I have removed the booksmarks I had for the orphaned files for October 3 and 4, so you should be able to get a better sense of what the bot is doing. Aspects (talk) 01:12, 10 May 2015 (UTC)[reply]
@Magioladitis: Today should be the last day of the trial. Results are at User:B-bot/Event log (results over one week old are archived). Everything seems to have gone well - no images that should not have been were untagged and on days that I processed the deletions, I did not find any unused images that the bot missed. The most excitement was this past weekend when a user stopped the bot using its STOP page in an (incorrect) effort to opt out of orfud notifications. --B (talk) 18:35, 19 May 2015 (UTC)[reply]
Approved. -- Magioladitis (talk) 18:42, 19 May 2015 (UTC)[reply]
- Thanks. --B (talk) 18:50, 19 May 2015 (UTC)[reply]
- The above discussion is preserved as an archive of the debate. Please do not modify it. To request review of this BRFA, please start a new section at WT:BRFA.