User:CricketBot

CricketBot is a bot run by me, Stephen Turner, to help correct some common errors in cricket articles. It doesn't make any edits itself: it just identifies possible errors, and then I approve or discard each one. But if there are any problems, leave a message on User talk:CricketBot or User talk:Stephen Turner.


 * Task in progress:
 * None at the moment


 * Completed (although I tend to re-run them from time to time where appropriate):
 * Turn "test" into "Test".
 * Link "Test" and "Test match" to Test cricket, not to Test or Test match.
 * Find very short articles (see list at /stubs).
 * Find all links in the lists of players pointing to disambiguation pages (none found!).
 * Find all links in the lists of players pointing to redirects (see list at /redirects).
 * Make sure all short biographies have (done up to length 700; beyond that I was having to think too hard about whether it was a stub or not).
 * Spider Category:Cricketers and find articles not already in List of cricketers (see list at /missing cricketers; suggested by Ianbrown).
 * Spider all the cricket subcategories to find articles not already in List of cricketers or List of cricket topics (see list at /missing articles; suggested by Ianbrown).
 * Make a list of longest biographies, to help identify candidate Wikipedia 1.0 cricket articles (see list at /longarticles).
 * Find all links to CricketArchive scorecards, because the format has just changed (see list at /CricketArchive; requested by Tintin1107).
 * Populate new stub categories Zimbabwe-cricketbio-stub and Bangladesh-cricketbio-stub.
 * Change syntax in infobox from oversORballs=balls to balls=true. Delete oversORballs=overs.


 * Possible future tasks:
 * Add country-specific bio-stub to all cricket bio stubs (ask on country page first, because this could generate a lot of new stubs in their lists).
 * Find articles entitled "N (cricketer)" but not linked from "N".
 * Link "country" to "nationality cricket team" instead of to "country".
 * When a cricketer has initials in his usual name (e.g. W. G. Grace), format properly and make redirects from the other common systems (suggested by Tintin1107).
 * Standardise season names, once we've agreed on them (suggested by Ianbrown).
 * Standardise links to Cricinfo (use template and/or don't link to national mirrors).
 * Find all cricket articles that don't mention that they're about cricket in the text.
 * Make sure all Test and ODI players are in the corresponding categories (I've found a few without recently).
 * Statistics on which articles have the most edits per day (suggested by Tintin1107).
 * A list of cricketers without articles with the most other articles linking to them.
 * Add to all people without a date of death.
 * Find any international cricketers without an infobox.
 * No-one should be in both Category:English cricketers and Category:English Test cricketers etc. per consensus at this discussion.
 * Make sure all Wisden Cricketers of the Year have that fact mentioned, and are in the corresponding category.
 * Remove squad templates, and replace them with categories.
 * Make a list of all articles marked as stubs that are rather long (suggested by Dweller).
 * Change all Image:Replace this image1.svg back to Image:cricket no pic.png.

Length of an article
For the purpose of calculating the length of an article (so as to decide whether it's a stub), CricketBot does the following: This is meant to be a more accurate measure of the true amount of content in the article than just counting the amount of text shown on the page.
 * 1) Remove all text in templates, including players' infoboxes.
 * 2) Remove all external links.
 * 3) Count the number of letters and digits.