Requests: Please list any requests for reports below in a new section. Be as specific as possible, including how often you would like the report run.
Request: editors by number of unreviewed pages
This would be very useful for prioritisation at new page patrol (related discussion: Wikipedia_talk:New_pages_patrol/Reviewers#Request:_Report_of_number_of_unreviewed_articles,_grouped_by_creator) and WP:PERM/A. I've mocked up an SQL query on quarry that's straightforward enough – can it be converted into a regular database report? – Joe (talk) 10:03, 27 June 2022 (UTC)
- Never mind. I've used {{Database report}} to create the report at Wikipedia:New pages patrol/Reports#Unreviewed new articles by creator (top 10). -MPGuy2824 (talk) 09:05, 2 November 2022 (UTC)
Request: Mainspace pages without talk pages
Ignore the redirects and the disambiguation pages, if possible. -MPGuy2824 (talk) 11:27, 27 August 2022 (UTC)
- Hi @MPGuy2824 - it looks like there are around 120K pages that meet those criteria. Any chance you could explain what the purpose of the report would be? For a one-off list have a look at this Quarry. Thparkth (talk) 22:01, 1 September 2022 (UTC)
- Thparkth, I was mulling over starting some sort of taskforce to add wikiproject tags to every talk page. Also, this seemed like a good beginner task to add to Wikipedia:Community portal/Open tasks. Thanks for the query, that will be enough for now. If my taskforce gets off the ground, I might get back here and ask for the top X of that query as a periodic report. -MPGuy2824 (talk) 02:34, 2 September 2022 (UTC)
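The request above (mainspace pages with no talk page, skipping redirects and disambiguation pages) could be sketched roughly as follows. This is not the linked Quarry query, which is not reproduced in this thread. It is a toy run against an in-memory SQLite database whose tables mimic only the handful of MediaWiki `page` and `page_props` columns the sketch needs, and the sample rows are invented for illustration.

```python
import sqlite3

# Toy stand-ins for the MediaWiki `page` and `page_props` tables
# (only the columns used here; the sample rows are made up).
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE page (page_id INTEGER PRIMARY KEY, page_namespace INTEGER,
                   page_title TEXT, page_is_redirect INTEGER);
CREATE TABLE page_props (pp_page INTEGER, pp_propname TEXT);
INSERT INTO page VALUES
  (1, 0, 'Alpha', 0),      -- article with a talk page
  (2, 1, 'Alpha', 0),      -- Talk:Alpha
  (3, 0, 'Beta', 0),       -- article without a talk page
  (4, 0, 'Gamma', 1),      -- redirect, ignored
  (5, 0, 'Delta', 0);      -- disambiguation page, ignored
INSERT INTO page_props VALUES (5, 'disambiguation');
""")

# Anti-join: keep namespace-0 pages with no namespace-1 page of the same
# title, and drop redirects and pages flagged as disambiguation pages.
QUERY = """
SELECT a.page_title
FROM page AS a
LEFT JOIN page AS t
  ON t.page_namespace = 1 AND t.page_title = a.page_title
LEFT JOIN page_props AS d
  ON d.pp_page = a.page_id AND d.pp_propname = 'disambiguation'
WHERE a.page_namespace = 0
  AND a.page_is_redirect = 0
  AND t.page_id IS NULL
  AND d.pp_page IS NULL
ORDER BY a.page_title
"""

missing_talk = [row[0] for row in conn.execute(QUERY)]
print(missing_talk)
```

A periodic "top X" version would presumably only need a `LIMIT` clause added to the same query.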
New articles proposed to Merge
Category:All articles to be merged currently has about 2,600 articles. Could a report be generated that shows which of these are Unreviewed articles (still listed in Special:NewPagesFeed), by date of proposed merge? MB 20:25, 8 September 2022 (UTC)
- I've used {{Database report}} to get this at Wikipedia:New pages patrol/Reports#Unreviewed articles with merge tags. It doesn't have the date of proposed merge, but maybe someone can tweak the sql further to get that. -MPGuy2824 (talk) 09:19, 2 November 2022 (UTC)
Links to userspace
A user link has been added to Template:Cleanup bare URLs, and as a result Wikipedia:Database reports/Articles containing links to the user space is now nearly 30,000 lines, any chance that template can be ignored?--Jac16888 Talk 15:14, 9 September 2022 (UTC)
- You could also replace that link with Wikipedia:Citation bot, a redirect. – Jonesey95 (talk) 13:19, 11 September 2022 (UTC)
Listing maintenance categories
For some reason, tonight's Empty Categories list has maintenance categories listed on it. They are typically omitted as they would overwhelm the content categories and because they don't stay empty for long and they do not get tagged for speedy deletion, CSD C1. I'm not sure if this talk page is monitored so I'll just ping Jonesey95 and see if they know what has happened. Liz Read! Talk! 01:11, 15 September 2022 (UTC)
- They've gone and changed the schema on us. See quarry:query/67346; what's now in lt_namespace and lt_title in the new linktargets table used to be in tl_namespace and tl_title (which now seem to always be 0 and ''?), and the database report still assumes they're there. —Cryptic 01:34, 15 September 2022 (UTC)
- Apparently it was announced in March. The change to templatelinks is (obviously) live; pagelinks, imagelinks, and categorylinks aren't yet, but will follow. —Cryptic 01:46, 15 September 2022 (UTC)
- Liz, you can ping me any time. I noticed that Wikipedia:Database reports/Transclusions of non-existent templates had been blanked by the bot this morning and figured that something screwy was happening with a database or one of the servers, so I restored the previous report and figured I'd give things a day to sort themselves out. The above wikitech-l posting is gibberish to me, but maybe Fastily will know if and what things need to change in that report. If this change affects a bunch of reports, I expect that we'll see a thread on VPT in the next day or two. – Jonesey95 (talk) 02:57, 15 September 2022 (UTC)
- This is a pretty succinct statement of how to update queries that read from templatelinks. For the non-existent template report, for example, you'd need to change this to this. —Cryptic 03:57, 15 September 2022 (UTC)
- This is all like reading a Greek textbook to me but I have enormous confidence in your abilities to get to the bottom of this, Cryptic and Jonesey95. Thank you for looking into this. There are only one or two of us that utilize this database report but it's one I check daily and helps us keep on top of the category clutter that comes out of deleting articles at AFD and categories at CFD. It also helps us notice if a new editor (they are almost always new editors) goes on a tear, creating dozens of unused categories. And lately a very experienced editor has been working on a major job recategorizing pages that left hundreds of empty categories to tag and delete.
- Now that I think about it, when there are problems with this list, I usually go directly to the bot operator, MZMcBride so I will ping him to this discussion in case he can follow all of this. I appreciate your help! Liz Read! Talk! 07:23, 15 September 2022 (UTC)
- Thanks for the fix, @Cryptic! -FASTILY 07:50, 15 September 2022 (UTC)
- Fastily, Wikipedia:Database reports/Transclusions of non-existent templates appears to be broken for the last couple of days. It should have 100+ entries on it every day (see typical pages in the history from a couple of weeks ago). – Jonesey95 (talk) 01:46, 19 September 2022 (UTC)
- Thanks for letting me know, looks like I missed a change in Cryptic's example; this should be fixed now. -FASTILY 04:35, 19 September 2022 (UTC)
- All fixed today, back to 270 entries. Thanks. – Jonesey95 (talk) 14:18, 19 September 2022 (UTC)
- That was my error, not yours - I was fooled by there still being enough rows in templatelinks with tl_title not the empty string that the results looked right, when I hadn't found even a single non-empty instance before that. —Cryptic 17:40, 19 September 2022 (UTC)
- Cryptic and Jonesey95, it happened again on tonight's Wikipedia:Database reports/Empty categories. Looks like they are maintenance categories involving files and Proposed deletions. There are plenty of empty clean-up categories that aren't appearing on this list; it's the daily, not monthly, maintenance. If you tell me that this situation will be lasting a while, then I'll stop pinging you every time it happens. Just thought I'd let you know. Liz Read! Talk! 01:13, 16 September 2022 (UTC)
- Wikipedia:Database reports/Empty categories is updated by BernsteinBot, which is maintained by MZMcBride & Legoktm; you'll probably have to ask one of them to fix it. -FASTILY 02:50, 16 September 2022 (UTC)
- Yes, I pinged MZMcBride (above) but I'll go to their talk page and ask about this. Liz Read! Talk! 03:11, 16 September 2022 (UTC)
- Sorry, I fixed another tool of mine (ours even), forgot about these. I'm traveling tomorrow, so it might not be until Saturday that I have time to fix the reports. Legoktm (talk) 05:51, 16 September 2022 (UTC)
- Hello, Legoktm,
- You know how to fix this problem? That's great! I look forward to it. Liz Read! Talk! 01:17, 17 September 2022 (UTC)
- I think I fixed most of them, hopefully the next runs of the reports are better. If there's a monthly report that's off let me know and I can kick it manually. Legoktm (talk) 23:19, 17 September 2022 (UTC)
- Oh, my, Legoktm. Things went back to normal for a few days and then in tonight's report, things went bananas! Even worse than before. Ayiiieeee! Liz Read! Talk! 01:24, 22 September 2022 (UTC)
- When I run the query in Wikipedia:Database reports/Empty categories/Configuration, I only get Category:MAX (band) video albums, Category:IIT Roorkee Alumni, and Category:Polish pilots, which looks right. —Cryptic 01:53, 22 September 2022 (UTC)
- Ughhh, I have no clue why and I'm mostly offline tomorrow, if it's still wrong after tomorrow's update I'll start poking at it again... Legoktm (talk) 08:38, 22 September 2022 (UTC)
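For readers trying to follow the schema change discussed in this section, here is a minimal sketch of what a query for transclusions of non-existent templates looks like once the target namespace and title live in a separate table. It is not the report's actual query, which is only linked from this thread. It assumes the new table is named `linktarget` (the thread calls it "linktargets") and that `templatelinks` points at it through a `tl_target_id` column; the in-memory SQLite tables and sample rows are invented for illustration.

```python
import sqlite3

# Toy copies of the relevant tables. The names `linktarget` and
# `tl_target_id` are assumptions about the post-change schema.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE page (page_id INTEGER PRIMARY KEY, page_namespace INTEGER, page_title TEXT);
CREATE TABLE linktarget (lt_id INTEGER PRIMARY KEY, lt_namespace INTEGER, lt_title TEXT);
CREATE TABLE templatelinks (tl_from INTEGER, tl_target_id INTEGER);
INSERT INTO page VALUES (1, 0, 'Some_article'), (2, 10, 'Existing_template');
INSERT INTO linktarget VALUES (100, 10, 'Existing_template'), (101, 10, 'Missing_template');
INSERT INTO templatelinks VALUES (1, 100), (1, 101);
""")

# Old style read tl_namespace/tl_title straight from templatelinks.
# New style joins through linktarget to get lt_namespace/lt_title,
# then anti-joins against page to find targets that do not exist.
QUERY = """
SELECT lt.lt_title, COUNT(*) AS transclusions
FROM templatelinks AS tl
JOIN linktarget AS lt ON lt.lt_id = tl.tl_target_id
LEFT JOIN page AS p
  ON p.page_namespace = lt.lt_namespace AND p.page_title = lt.lt_title
WHERE lt.lt_namespace = 10
  AND p.page_id IS NULL
GROUP BY lt.lt_title
"""

nonexistent = conn.execute(QUERY).fetchall()
print(nonexistent)
```

The only structural difference from a pre-change query is the extra join; any report that still filters on `tl_namespace` or `tl_title` would need the same substitution.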
Goodbye BernsteinBot, hello HaleBot
If you haven't seen the news yet, BernsteinBot has been disabled. HaleBot will take over most of the tasks that it used to do. There are a lot of scattered reports in various places, if you notice something isn't updating, please leave a note here and ping me.
A big thank you to MZMcBride for starting this project 14(!) years ago. Wikipedia is better because of it. Legoktm (talk) 15:10, 12 October 2022 (UTC)
- @Legoktm please document this bot's tasks on its userpage. — xaosflux Talk 15:18, 12 October 2022 (UTC)
- Hale hath no fury. --MZMcBride (talk) 18:18, 12 October 2022 (UTC)
- Legoktm, thanks to you and the bot approval team for your swift action. And thanks to all of our bot operators, like MZMcBride, of past and present. The tools you create make our editing lives so much easier. Liz Read! Talk! 02:00, 13 October 2022 (UTC)
Legoktm, can you please check on Wikipedia:Database reports/Unused templates and Wikipedia:Database reports/Uncategorized templates? The former was updating daily, and the latter was weekly, so it is not overdue yet. MZMcBride was also developing Wikipedia:Database reports/Unused templates (filtered) just before the bot retired (discussion); that would be a useful daily report. Thanks. – Jonesey95 (talk) 12:54, 13 October 2022 (UTC)
- I tried to get Unused templates working last night but messed up with the subst:#time calls, will fix that tonight. I found the code for the (filtered) report, I'll set that up tonight too. Uncategorized templates should be set to go on the regular schedule. Legoktm (talk) 15:43, 14 October 2022 (UTC)
- I saw the update and figured you were working on it. Did you notice that there were undesirable underscores, and that links with parens in them were not quite right, e.g.
1910s_in_music_
(in code here to make sure that the underscores show)? Maybe that's all tied up in the subst work. – Jonesey95 (talk) 17:00, 14 October 2022 (UTC)
- I did not notice that, it was me being lazy by using the pipe trick. Should be fixed now, though the last page of the report is missing because of an edit filter I just fixed. The (filtered) report is running daily now too. Legoktm (talk) 02:48, 15 October 2022 (UTC)
- Nice work. It's good to have the reports running again. Please see this discussion for suggestions about how the filtered report could benefit from a few more filters. It should be able to fit on one page pretty easily. – Jonesey95 (talk) 05:32, 15 October 2022 (UTC)
Legoktm, if there is any way to get Wikipedia:Database reports/Ownerless pages in the user space up and running again, it would be helpful. Most of these pages appear to be errors or the result of incomplete moves. I would guess that a monthly refresh is all that is needed for now. – Jonesey95 (talk) 17:30, 7 December 2022 (UTC)
- After cleaning up this report somewhat, I have noticed that redirects and soft redirects are included, but those pages are valid when an editor's username has been changed. Ideally, those pages would be excluded from the report. Soft redirects are in Category:User soft redirects. Regular redirects are often, but maybe not always, in Category:Redirects from moves. – Jonesey95 (talk) 19:00, 8 December 2022 (UTC)
- Yes. I should get to it over the weekend. Legoktm (talk) 05:05, 9 December 2022 (UTC)
Is it possible to get Wikipedia:Database reports/Articles containing links to the user space working again too?--Jac16888 Talk 14:56, 16 December 2022 (UTC)
- It is already being run; check the history of that page. Or otherwise the bot would be doing something wrong. 0xDeadbeef→∞ (talk to me) 15:13, 16 December 2022 (UTC)
- Yes but there is clearly an issue since it's coming out blank which is definitely not correct, there are certain articles that have a userspace link for legitimate reasons (Jimmy Wales for example), if the report was running correctly these would be listed--Jac16888 Talk 15:53, 16 December 2022 (UTC)
- Something is definitely wonky with that report, the /Configuration page is for a different report entirely! Legoktm (talk) 00:03, 17 December 2022 (UTC)
- @Jac16888: should be fixed in the next run. Legoktm (talk) 00:34, 17 December 2022 (UTC)
- Thanks--Jac16888 Talk 01:02, 17 December 2022 (UTC)
- Hi @Legoktm:, this week it seems to have included all articles featuring Template:Cleanup bare URLs (due to the citation bot link) which were previously not counted (approx. 4K articles), are you able to exclude this please?--Jac16888 Talk 15:25, 7 January 2023 (UTC)
- @Legoktm The Largely duplicative file names report stopped when the bot was switched. I just checked the page today and realised it hasn't updated since October. Is it possible to start running this one again? Thanks, Terasail[✉️] 04:23, 1 February 2023 (UTC)
- I have started working on a Rust rewrite of that report. Stay tuned! 0xDeadbeef→∞ (talk to me) 04:34, 1 February 2023 (UTC)
- The report was deployed. 0xDeadbeef→∞ (talk to me) 07:41, 1 February 2023 (UTC)
Database report template
{{Database report}} template can now be used to set up one-off or periodically updating reports in userspace or project namespace, given an SQL query. The template doc lists the supported formatting options. Feel free to give it a try and let me know if you face any issues. – SD0001 (talk) 15:43, 28 October 2022 (UTC)
- This is nice, thanks. Hopefully no-one kills the DBs with it. -MPGuy2824 (talk) 03:12, 29 October 2022 (UTC)
- There are some protections in place to prevent anyone from killing the DBs with it, see phab:T320657 for details. – SD0001 (talk) 10:45, 2 November 2022 (UTC)
- Hi SD0001. I played around with the wiki template-based approach the other day on bizarrely subnested userpages (configuration) and I quite enjoyed it. The wikilinks formatting options of {{database report}} are neat and it's cool that it supports on-demand updates by clicking a link. Nice job.
- Could we please switch the bot to use {{static row numbers}} similar to what Wikipedia:Database reports/Unused templates/2 is using?
- I also wonder whether configuration could be moved to a subpage, but for now I just made Wikipedia:Database reports/Bizarrely subnested userpages/Configuration a redirect. --MZMcBride (talk) 07:37, 1 February 2023 (UTC)
Polluted categories
Is there any particular reason why Wikipedia:Database reports/Polluted categories only runs once a month? Given the importance of cleaning polluted categories out, and the fact that running it only once a month means that there are typically hundreds of categories to deal with by the time it actually updates (thus making it an onerous task that people become significantly less likely to bother with at all), once a month isn't often enough. Bearcat (talk) 15:41, 22 November 2022 (UTC)
Inclusion of bot
Today's update to Wikipedia:List of Wikipedians by number of edits/1–1000 has suddenly added User:BG19bot - a deactivated bot that last ran in February 2017, so I cannot imagine what triggered its addition. Please could this be removed, as the list is not supposed to include bots - thanks - Arjayay (talk) 13:54, 1 February 2023 (UTC)
- Also reported at Wikipedia talk:List of Wikipedians by number of edits as it is unclear where it should be reported. - Arjayay (talk) 13:58, 1 February 2023 (UTC)
- This was already answered elsewhere, but the exclusion page is Wikipedia:List of Wikipedians by number of edits/Unflagged bots. We have many talk pages, it's true. --MZMcBride (talk) 17:06, 1 February 2023 (UTC)
Paginated reports using automated row numbering
Hi. For paginated reports such as Wikipedia:Database reports/Potential U5s/2, we currently restart the row numbering for each subsequent page. That is, page 2 is 1–3000 instead of 3001–6000. We could adjust this by specifying <https://developer.mozilla.org/en-US/docs/Web/CSS/counter-set>, but I'm still pondering whether that would be a good idea. --MZMcBride (talk) 19:19, 7 February 2023 (UTC)
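As a rough illustration of the `counter-set` idea, the offset for each page is just the number of rows on the pages before it. The sketch below assumes 3000 rows per page (as in the example above) and a CSS counter that is incremented on each row before being displayed. The counter name `row-number` is hypothetical and is not taken from {{static row numbers}}.

```python
def counter_start(page_number: int, rows_per_page: int = 3000) -> int:
    """Value the row counter should hold before the first row of a page."""
    if page_number < 1:
        raise ValueError("page_number starts at 1")
    return (page_number - 1) * rows_per_page


def counter_set_style(page_number: int,
                      counter_name: str = "row-number",  # hypothetical name
                      rows_per_page: int = 3000) -> str:
    """CSS declaration that makes a page's numbering continue from the last."""
    start = counter_start(page_number, rows_per_page)
    return f"counter-set: {counter_name} {start}"


# Page 2 would start from 3000, so its first row displays as 3001.
print(counter_set_style(2))
```

Whether continuous numbering is actually desirable is a separate question; this only shows that the arithmetic is simple if it is wanted.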
New report request: FAs by length
Would it be possible to have a database report listing WP:featured articles by word count or readable prose size (not wikitext size) or is there a better way to produce such a list? If it's to be a database report, it wouldn't need to be run more than once a month. Thanks! HJ Mitchell | Penny for your thoughts? 18:33, 11 February 2023 (UTC)
- Hi @HJ Mitchell, Is there a definition for how "word count" or "readable prose size" should be measured? Specifically, is there a list of what elements should be excluded or not? Legoktm (talk) 00:21, 13 February 2023 (UTC)
- This was Wikipedia:Featured articles/By length of course, though it just used the overall page size in bytes. --MZMcBride (talk) 07:46, 13 February 2023 (UTC)
- @Legoktm and MZMcBride: Wikipedia:Prosesize is able to produce prose size in both characters (bytes) and words; could that definition be used? I'm aware of FA/By length but as you say MZM, it only measures the total page size, which doesn't necessarily bear any resemblance to the amount of prose. Articles that cite lots of sources (especially web sources), for example, use more markup and hence have more wikitext than those that cite fewer sources more heavily (eg books). Hence Taylor Swift (10111 words) tops that list and Douglas MacArthur (18679 words) is at #34. Thanks, HJ Mitchell | Penny for your thoughts? 14:13, 13 February 2023 (UTC)
- @HJ Mitchell: here you go: Wikipedia:Database reports/Featured articles by size, it'll update weekly. Let me know if you want things in a different format, e.g. in kb rather than bytes.
- Note that the numbers will be slightly different than the prosesize gadget, as I re-implemented it for use in the bot. Legoktm (talk) 06:18, 14 February 2023 (UTC)
- Thank you very much for such a speedy response! HJ Mitchell | Penny for your thoughts? 10:02, 14 February 2023 (UTC)
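To make the difference between page size and prose size concrete, here is a minimal sketch of one way to measure readable prose from rendered HTML. It is neither the prosesize gadget nor the bot's re-implementation, whose exact rules are not given in this thread. The rule used here (count only text inside `<p>` elements and skip `<sup>` footnote markers) is an assumption chosen for illustration, as is the sample HTML.

```python
from html.parser import HTMLParser


class ProseCounter(HTMLParser):
    """Collect text inside <p> elements, skipping <sup> (e.g. footnote markers)."""

    def __init__(self):
        super().__init__()
        self.p_depth = 0      # how many <p> elements we are currently inside
        self.skip_depth = 0   # how many <sup> elements we are currently inside
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag == "p":
            self.p_depth += 1
        elif tag == "sup" and self.p_depth:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag == "p" and self.p_depth:
            self.p_depth -= 1
        elif tag == "sup" and self.skip_depth:
            self.skip_depth -= 1

    def handle_data(self, data):
        if self.p_depth and not self.skip_depth:
            self.chunks.append(data)


def prose_size(html: str) -> tuple[int, int]:
    """Return (bytes of UTF-8 prose, word count) under the assumed rule."""
    parser = ProseCounter()
    parser.feed(html)
    parser.close()
    text = "".join(parser.chunks)
    return len(text.encode("utf-8")), len(text.split())


sample = (
    "<p>Hello brave new world.<sup>[1]</sup></p>"
    "<table><tr><td>ignored cell</td></tr></table>"
    "<ul><li>ignored item</li></ul>"
)
size_bytes, size_words = prose_size(sample)
print(size_bytes, size_words)
```

Tables, lists and citation markers contribute nothing under this rule, which is why an article with heavy citation markup can rank very differently by prose size than by wikitext size.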
New WikiProjects report
Please add the redirect status to it, for each row. The italic font change isn't enough of a difference. -MPGuy2824 (talk) 02:39, 19 February 2023 (UTC)