Commons:Bots/Requests
If you want to run a bot on Commons, you must get permission first. To do so, file a request following the instructions below.
Please read Commons:Bots before making a request for bot permission.
I | Create a user account (while logged in to your normal account) and user page for the bot
On the bot's userpage, add {{Bot}}, which automatically adds the page to Category:Commons bots. Then add the following information to the bot's userpage (all this is mandatory):
|
---|---|
II | Create your bot request:
Add your bot request to the list here:
|
III | Test run
You can be demanded to make a short test run with your bot account (30–50 edits/uploads) to allow other users to review your bot's tasks. Unauthorized test run is not allowed. |
IV | Waiting for approval.
You now need to wait for community approval. A bureaucrat will close the request and will also grant a bot flag, where necessary. Closed requests are moved to Commons:Bots/Archive. |
|
Requests made on this page are automatically transcluded in Commons:Requests and votes for wider comment.
Requests for permission to run a bot
[edit]Before making a bot request, please read the new version of the Commons:Bots page. Read Commons:Bots#Information on bots and make sure you have added the required details to the bot's page. A good example can be found here.
When complete, pages listed here should be archived to Commons:Bots/Archive.
Any user may comment on the merits of the request to run a bot. Please give reasons, as that makes it easier for the closing bureaucrat. Read Commons:Bots before commenting.
Operator: DaxServer (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)
Bot's tasks for which permission is being sought: Convert TIFF files to JPEG files and link both. As requested at Convert Commons:Bots/Work requests § Category:Photographs by Carol M. Highsmith to JPEG. The TIFF files at Category:Photographs by Carol M. Highsmith are [recursively] loaded into the bot and converted to JPEG using Wand, a Python binding for ImageMagick. The Exif metadata is copied over using PyExifTool, a Python binding for ExifTool by Phil Harvey. The metadata groups that are being copied over, that I've discovered so far, are: Author
, Camera
, Composite
, ExifIFD
, GPS
, ICC_Profile
, IFD0
, IPTC
, Location
and XMP-crs
. The entire metadata can be copied indiscriminately if that is preferred rather a selection. The new JPEG file will have the same wikitext as the TIFF file, with an addition of {{{other_versions}}}
gallery and but a removal of categories such as Uploaded by xyz user as it will be retained in the TIFF file page. The TIFF file page is edited with a link to the JPEG in the gallery and all the categories are removed with the addition of Category:LC TIF images with categorized JPGs. If duplicates are found, using the checksum, the page is skipped over and marked for manual verification and linking using gallery. The OpenCV strategy as described at User:Fæ/LOC#Housekeeping is rather out of my reach. The bot is being written using Pywikibot and is intended to run on Toolforge.
Automatic or manually assisted: Automatic
Edit type (e.g. Continuous, daily, one time run): Continuous
Maximum edit rate (e.g. edits per minute): 5
Bot flag requested: (Y/N): Y
Programming language(s): Python (Pywikibot)
-- DaxServer (talk) 15:07, 1 July 2024 (UTC)
- Discussion
- I'm not able to understand the issue we are trying to solve. All previews of these gigantic TIFFs load just fine for me (in under 2 seconds). I do not expencience much difference as compared to JPEGs. --Schlurcher (talk) 14:18, 2 July 2024 (UTC)
- On hold for the discussion linked -- DaxServer (talk) 08:58, 4 July 2024 (UTC)
- Operator: Taylor 49 (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)
- Bot: Taylorbot (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)
- Bot's tasks for which permission is being sought: move ca 34'000 files from Category:Audio files in Kotava to Category:Kotava pronunciation to make category use consistent with other languages, the uploader has agreed
- Automatic or manually assisted: automatic, supervised at the beginning
- Edit type: one time run as needed
- Maximum edit rate: 12 edits / mi
- Bot flag requested YES
- Creator of the bot code: myself (the operator)
- Programming language(s): ASM+BAS
Taylor 49 (talk) 14:01, 26 June 2024 (UTC)
- Discussion
- Please make test run. --EugeneZelenko (talk) 14:59, 27 June 2024 (UTC)
- @User:EugeneZelenko Done 100 files moved. Taylor 49 (talk) 12:09, 28 June 2024 (UTC)
- Looks OK for me. However, usual naming for pronunciation files is <language code>-<word>, so looks like another task for your bot. --EugeneZelenko (talk) 15:16, 28 June 2024 (UTC)
- @User:EugeneZelenko @User:Kotavusik Agree that following the standard naming pattern would be good. My bot recently acquired renaming/moving capability. But moving to another title and moving into another cat AFAIK cannot be done in a single request. So my bot will have to run through the ca 34'000 files two times anyway. Can you give the flag and approve the recategorization now? For the renaming I would like to hear an opinion of the uploader. Plus, many of those files are used, thus renaming them would require subsequent editing of other wikis, partially of such where I do NOT have a bot flag yet. Anyway, I will have to find out a way how to read out automatically on what wikis the files are used. But it might be a good idea to add sorting hints/keys when recategorizing, and do this not only on moved files, but also on those that are already in Category:Kotava pronunciation. Taylor 49 (talk) 21:03, 28 June 2024 (UTC)
- @User:EugeneZelenko @User:Kotavusik Semms that most files have names consisting of just the bare word, but some are suffixed by
(avk)
for example File:Pabú (avk).wav. So the new name would be constructed by stripping of(avk)
if such can be found, and addingAvk-
. So far no files begin withAvk-
. What to do about broken "File:JustaxoAudio files in Kotava.wav" and "File:BakesikJen elparolo de vorto en Kotavao.wav" ? Taylor 49 (talk) 11:35, 29 June 2024 (UTC)- Same bare word may be used in other language with different pronunciation, so this is whole point of using language code prefix (this convention came from Wikipedia and as far as I notice, more widespread) or suffix. --EugeneZelenko (talk) 13:58, 29 June 2024 (UTC)
- If we put Avk- in front of the kotava words, then the kotava audio files already present on Wiktionary would need to be replaced. Kotavusik (talk) 18:39, 29 June 2024 (UTC)
- @User:Bjh21 @User:EugeneZelenko @User:Kotavusik I definitely support the idea of mass renaming adding the prefix
Avk-
. Still how should I rename 34'000 files if I cannot rename the 2 most broken ones? The renaming in wikis where the file are used will be done by bot (CommmonsDelinker or my bot or other bot), it is NOT a task for Kotavusik. An additional advantage of this mass renaming is that theling=
parameter on eo wiktinoary will not be needed anymore, since the modul is able to read out the language from filenames following one of two supported standards. Can I get the approval and flag for the recategorization now? The mass renaming needs further discussion, but is sufficiently independent from the recategorization. Taylor 49 (talk) 14:52, 30 June 2024 (UTC)- @Taylor 49: The two renaming requests that I declined were submitted under criterion 3 (obvious error) with no further explanation. Criterion 3 covers factual errors in filenames, but neither "JustaxoAudio files in Kotava.wav" nor "File:BakesikJen elparolo de vorto en Kotavao.wav" contains any factual error. However, the files might still be eligible for renaming under other criteria. You seem to have the co-operation of the original uploader, so maybe they could request that the files be renamed. That would allow you to use criterion 1 (original uploader request), which is very simple and doesn't require any consideration of the current filename. You might also be able to use criterion 2 (ambiguous name) or 4 (harmonizing names), but 2 would require case-by-case evaluation and I'm not sure 4 applies to pronunciation files. Whatever you choose, make sure that the bot records the criterion in the edit summary, preferably with an explanation of how it applies. --bjh21 (talk) 12:57, 1 July 2024 (UTC)
- Are there any obstacles against approving the recategorization task? Should I make a separate request for the mass move task, or can it be approved here at same time or separately later? Taylor 49 (talk) 14:54, 8 July 2024 (UTC)
- @Taylor 49: The two renaming requests that I declined were submitted under criterion 3 (obvious error) with no further explanation. Criterion 3 covers factual errors in filenames, but neither "JustaxoAudio files in Kotava.wav" nor "File:BakesikJen elparolo de vorto en Kotavao.wav" contains any factual error. However, the files might still be eligible for renaming under other criteria. You seem to have the co-operation of the original uploader, so maybe they could request that the files be renamed. That would allow you to use criterion 1 (original uploader request), which is very simple and doesn't require any consideration of the current filename. You might also be able to use criterion 2 (ambiguous name) or 4 (harmonizing names), but 2 would require case-by-case evaluation and I'm not sure 4 applies to pronunciation files. Whatever you choose, make sure that the bot records the criterion in the edit summary, preferably with an explanation of how it applies. --bjh21 (talk) 12:57, 1 July 2024 (UTC)
- @User:Bjh21 @User:EugeneZelenko @User:Kotavusik I definitely support the idea of mass renaming adding the prefix
- @User:EugeneZelenko @User:Kotavusik Semms that most files have names consisting of just the bare word, but some are suffixed by
- @User:EugeneZelenko @User:Kotavusik Agree that following the standard naming pattern would be good. My bot recently acquired renaming/moving capability. But moving to another title and moving into another cat AFAIK cannot be done in a single request. So my bot will have to run through the ca 34'000 files two times anyway. Can you give the flag and approve the recategorization now? For the renaming I would like to hear an opinion of the uploader. Plus, many of those files are used, thus renaming them would require subsequent editing of other wikis, partially of such where I do NOT have a bot flag yet. Anyway, I will have to find out a way how to read out automatically on what wikis the files are used. But it might be a good idea to add sorting hints/keys when recategorizing, and do this not only on moved files, but also on those that are already in Category:Kotava pronunciation. Taylor 49 (talk) 21:03, 28 June 2024 (UTC)
- Looks OK for me. However, usual naming for pronunciation files is <language code>-<word>, so looks like another task for your bot. --EugeneZelenko (talk) 15:16, 28 June 2024 (UTC)
- @User:EugeneZelenko Done 100 files moved. Taylor 49 (talk) 12:09, 28 June 2024 (UTC)
Operator: Emijrp (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)
Bot's tasks for which permission is being sought: add depict value in people images when such image is used in P18 property in the Wikidata item.
Automatic or manually assisted: automatic
Edit type (e.g. Continuous, daily, one time run): continuous
Maximum edit rate (e.g. edits per minute): 1 edit/sec
Bot flag requested: (Y/N): no (it already has)
Programming language(s): python
emijrp (talk) 16:49, 22 June 2024 (UTC)
- Discussion
- How about setting Prominent status? --EugeneZelenko (talk) 15:04, 23 June 2024 (UTC)
- Sure, it's possible. If nobody says otherwise here, bot will mark them as prominent. emijrp (talk) 15:48, 23 June 2024 (UTC)
- Sounds good to me. If they are used to illustrate wikidata entries, they should be sufficent to be marked as "prominent" here. Please tag these kind of edits from the bot with Special:Tags "
BotSDC
". This can be added during the editentity api call as an additional parameter. This will allow people to effectively filter these type of edits from their watchlist, if they wish to do so. --Schlurcher (talk)- OK, adding that suggestion too. emijrp (talk) 14:59, 24 June 2024 (UTC)
- Please make another test run. --EugeneZelenko (talk) 14:03, 29 June 2024 (UTC)
- Occasionally people add pictures like this to P18 (a park in this case). They are related to the person but not portraits. I am thinking how to exclude them. emijrp (talk) 08:30, 30 June 2024 (UTC)
- Please make another test run. --EugeneZelenko (talk) 14:03, 29 June 2024 (UTC)
- OK, adding that suggestion too. emijrp (talk) 14:59, 24 June 2024 (UTC)
- Sounds good to me. If they are used to illustrate wikidata entries, they should be sufficent to be marked as "prominent" here. Please tag these kind of edits from the bot with Special:Tags "
https://meta.wikimedia.org/wiki/User:AkbarBot
Operator: Akbarali (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)
Bot's tasks for which permission is being sought:
- Upload files bulk to wikimedia commons,
- Add descriptions , caption and file name
Automatic or manually assisted:
Edit type (e.g. Continuous, daily, one time run): Intermittently
Maximum edit rate (e.g. edits per minute): 8 edits per minute
Bot flag requested: (Y/N): y
Programming language(s): Pywikibot, Python scripts are on PAWS https://hub-paws.wmcloud.org/hub/spawn-pending/Akbarali
Akbarali (talk) 13:57, 11 June 2024 (UTC)
- Discussion
- What are origins of each batch (at least in foreseeable future)? --EugeneZelenko (talk) 14:40, 12 June 2024 (UTC)
ImagehashBot (talk · contribs)
[edit]Operator:
- Zache (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)
- Ipr1 (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)
Bot's tasks for which permission is being sought: adding pHash checksum (P9310) and Imagehash difference hash (P12563) values to the photos.
- Documentation for the hashes
- pHash: https://www.hackerfactor.com/blog/index.php?/archives/432-Looks-Like-It.html
- dHash: https://www.hackerfactor.com/blog/index.php?/archives/529-Kind-of-Like-That.html
- Example images with P9310 and P12563 values
- https://commons.wikimedia.org/wiki/File:Patricia-Seppala-1995.jpg
- https://commons.wikimedia.org/wiki/File:Jean-Sibelius-1927.jpg
- https://commons.wikimedia.org/wiki/File:Potato_crop_lifting_(JOKAMT2Ju29-2).tif
First targets are photos from Europeana, Estonian, Finland, Sweden and Flickr, but long term target is to add imagehashes to all commons photos. Currently we have used FinnaUploadBot for Finna images. Reason for the new account is to make dedicated account and service for the non Finna related edits.
Automatic or manually assisted: automatic
Edit type (e.g. Continuous, daily, one time run): first a batch jobs, later continuous
Maximum edit rate (e.g. edits per minute):
Bot flag requested: (Y/N): Y
Programming language(s):
Zache (talk) 15:08, 12 April 2024 (UTC)
- Discussion
- What is use for such hashes? --EugeneZelenko (talk) 14:47, 13 April 2024 (UTC)
- One can use them to compare the similarity of pictures by checking how much the identifiers differ to detect duplicates and match photos in different repositories. We have used image hashes to prevent duplicates when uploading files and to prevent the wrong photos from being updated when reuploading photos from Finna with better quality and/or updating metadata. --Zache (talk) 16:31, 13 April 2024 (UTC)
- Such hashes make much more sense as part of Commons database. --EugeneZelenko (talk) 14:26, 14 April 2024 (UTC)
- In SDC they are filemetadata and in particular using SPARQL it would be easy way for querying and sharing the hashes for external usage. Ie. it is part of metadata for the files. Zache (talk) 14:52, 14 April 2024 (UTC)
- Also, even if the information would be added to the Wikimedia Commons database (there are good technical reasons why one would like to use an external service instead of adding this to the MediaWiki core), I would like to note that we are populating SDC values from the Commons internal database using bots. Most notable in this context are the SHA-1 checksum, mime type, image width, and image height. (Commons:Structured data/Modeling/Meta) And yes, there would be probaply better ways to do this, but currently using bots is the preferred method. --Zache (talk) 06:42, 18 April 2024 (UTC)
- Is there any community discussion that such data shall be generated at large scale? Krd 06:53, 18 April 2024 (UTC)
- I am not aware that there would have been a wider discussion. Current discussions, to my knowledge, are related to the Fæ's User:Fæ/Imagehash and village pump discussions 1 and 2. In my structured data property proposal in 2021, there were no follow-up comments in Wikimedia Commons. Phabricator has some tickets (for example, phab:T121797) related to image hashing.
- Also, just for background, I am running ImageHash-Toolforge, which has approximately 25% of Wikimedia Commons bitmap images (jpg, tiff, png) indexed with phash and dhash. I also made a Wikimania lightning talk proposal for it. (Proposals are currently under review.) My current idea was to proceed gradually when adding values to SDC, and my current personal need was to add hashes to European and Estonian photos before the Wikimedia Hackathon, Tallinn, in May so they would be available there. (see my question in Commons_talk:Bots/Requests#Extending_FinnaUploadBot).
- However, if you think I should do the village pump discussion or the discussion on the Structured Data talk pages, I am happy to start these. --Zache (talk) 07:49, 18 April 2024 (UTC)
- Please do. Krd 05:48, 21 April 2024 (UTC)
- Now I made a village pump proposal --Zache (talk) 16:44, 17 May 2024 (UTC)
- How do you interpret the discussion, how would you conclude? Krd 13:15, 3 July 2024 (UTC)
- Current status is 3 vs 2 and good general rule for bot edits is do only uncontroversial edits. Based on that rule it is good idea to skip the addition using bot and implement it in some other way. -- Zache (talk) 12:26, 4 July 2024 (UTC)
- How do you interpret the discussion, how would you conclude? Krd 13:15, 3 July 2024 (UTC)
- Now I made a village pump proposal --Zache (talk) 16:44, 17 May 2024 (UTC)
- Please do. Krd 05:48, 21 April 2024 (UTC)
- Is there any community discussion that such data shall be generated at large scale? Krd 06:53, 18 April 2024 (UTC)
- Such hashes make much more sense as part of Commons database. --EugeneZelenko (talk) 14:26, 14 April 2024 (UTC)
- One can use them to compare the similarity of pictures by checking how much the identifiers differ to detect duplicates and match photos in different repositories. We have used image hashes to prevent duplicates when uploading files and to prevent the wrong photos from being updated when reuploading photos from Finna with better quality and/or updating metadata. --Zache (talk) 16:31, 13 April 2024 (UTC)
GeertivpBot (talk · contribs)
[edit]Operator: Geertivp (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)
Bot's tasks for which permission is being sought:
- Add missing SDC depict statements on media files (File namespace)
- Add missing Wikidata Infobox template to Category pages (Category namespace)
Automatic or manually assisted: Automatically, but monitored
Edit type (e.g. Continuous, daily, one time run): Intermittently
Maximum edit rate (e.g. edits per minute): 8 edits per minute
Bot flag requested: (Y/N): Y
Programming language(s): Pywikibot, Python scripts are on GitHub:
- https://github.com/geertivp/Pywikibot/blob/main/add_image_from_sdc.py
- https://github.com/geertivp/Pywikibot/blob/main/copy_label.py
Test runs are here.
Geert Van Pamel (talk) 22:29, 3 January 2024 (UTC)
- Discussion
- Could you please elaborate how `depicts` is filled? For example, File:Novosibirsk Regional Museum at night 2.jpg should depict building and condition of shoot (night shoot) should be qualifier. --EugeneZelenko (talk) 15:40, 4 January 2024 (UTC)
- The image depicts a "night view of the Royal museum", expressed as SDC depicts (P180) nighttime view (Q28333482) with qualifier of (P642) City Trade House (Q19908752), based upon and generated by the original Wikidata statement Q19908752#P3451 City Trade House (Q19908752) nighttime view (P3451) (M19171168). By doing so, both the SDC depict statement in Wikimedia Commons and the Wikidata statement are describing the same fact.
- What qualifier would you use instead? Can you please elaborate more about the exact statement that you would create? Thanks. Geert Van Pamel (talk) 16:23, 4 January 2024 (UTC)
- Maybe you would like to see: depicts (P180) City Trade House (Q19908752) with qualifier depicted format (P7984) nighttime view (Q28333482)? Please give your point of view/preferences. Geert Van Pamel (talk) 14:24, 5 January 2024 (UTC)
- It makes sense to have broader discussion on matter of qualifiers. May be bot should be limited just for subjects for now? --EugeneZelenko (talk) 15:35, 5 January 2024 (UTC)
- @Geertivp: ? --Krd 14:34, 26 January 2024 (UTC)
- Or we might generate two statements without qualifiers:
- depicts (P180) City Trade House (Q19908752)
- depicted format (P7984) nighttime view (Q28333482) Geert Van Pamel (talk) 19:43, 26 January 2024 (UTC)
- @Geertivp: ? --Krd 14:34, 26 January 2024 (UTC)
- But here we have the problem that depicted format (P7984) may not be used as a qualifier: d:Property:P7984#P2302 => property scope constraint (Q53869507) as main value (Q54828448). In addition to that it can only be used with work of art (Q838948) entities and requires item-requires-statement constraint (Q21503247) genre (P136), which is in general not the case in this suggested usage. Which other qualifier property could be used instead? Geert Van Pamel (talk) 18:51, 18 February 2024 (UTC)
- I cannot follow. Please advise. --Krd 05:04, 7 March 2024 (UTC)
- @Geertivp: ? Krd 07:31, 31 March 2024 (UTC)
- In my previous reply I concluded that depicted format (P7984), because of its defined constraints, can't be used as a qualifier and only in some specific domains as work of art (Q838948). So we would need another more generic (currently not existing) property that could be used as a qualifier with a generic meaning of "type of representation", e.g. nighttime view (Q28333482) in the above example. Geert Van Pamel (talk) 20:53, 31 March 2024 (UTC)
- What does this mean regarding this request, what should be the conclusion? Krd 04:30, 3 April 2024 (UTC)
- We would need a new property "type of representation", that could be added as a qualifier to the Depict statements in Wikimedia Commons SDC for all images that are registered in Wikidata, and don't have a image (P18) property in Wikidata. Geert Van Pamel (talk) 07:45, 3 April 2024 (UTC)
- Are you going to take care of property creation? Krd 09:09, 3 April 2024 (UTC)
- See d:wikidata:property proposal/Type of representation Geert Van Pamel (talk) 19:28, 4 April 2024 (UTC)
- Thank you. Please advise when the discussion is closed. Krd 07:02, 18 April 2024 (UTC)
- See d:wikidata:property proposal/Type of representation Geert Van Pamel (talk) 19:28, 4 April 2024 (UTC)
- Are you going to take care of property creation? Krd 09:09, 3 April 2024 (UTC)
- We would need a new property "type of representation", that could be added as a qualifier to the Depict statements in Wikimedia Commons SDC for all images that are registered in Wikidata, and don't have a image (P18) property in Wikidata. Geert Van Pamel (talk) 07:45, 3 April 2024 (UTC)
- What does this mean regarding this request, what should be the conclusion? Krd 04:30, 3 April 2024 (UTC)
- In my previous reply I concluded that depicted format (P7984), because of its defined constraints, can't be used as a qualifier and only in some specific domains as work of art (Q838948). So we would need another more generic (currently not existing) property that could be used as a qualifier with a generic meaning of "type of representation", e.g. nighttime view (Q28333482) in the above example. Geert Van Pamel (talk) 20:53, 31 March 2024 (UTC)
- @Geertivp: ? Krd 07:31, 31 March 2024 (UTC)
- I cannot follow. Please advise. --Krd 05:04, 7 March 2024 (UTC)
- It makes sense to have broader discussion on matter of qualifiers. May be bot should be limited just for subjects for now? --EugeneZelenko (talk) 15:35, 5 January 2024 (UTC)
- Maybe you would like to see: depicts (P180) City Trade House (Q19908752) with qualifier depicted format (P7984) nighttime view (Q28333482)? Please give your point of view/preferences. Geert Van Pamel (talk) 14:24, 5 January 2024 (UTC)
@Geertivp: ? --Krd 05:23, 27 June 2024 (UTC)
- The property has been approved and is implemented. I need some more time to adapt my scripts and to run a few example transactions before I can request the approval of my bot script. I will notify when I am done. --Geert Van Pamel 09:31, 27 June 2024 (UTC)