Commons:Village pump/Technical

From Wikimedia Commons, the free media repository
Jump to navigation Jump to search

Shortcuts: COM:VP/T • COM:VPT

Welcome to the Village pump technical section
Technical discussion
Village pump/Technical
 Bug reports
 Code review
Tools
 Tools/Directory
 Idea Lab



This page is used for technical questions relating to the tools, gadgets, or other technical issues about Commons; it is distinguished from the main Village pump, which handles community-wide discussion of all kinds. The page may also be used to advertise significant discussions taking place elsewhere, such as on the talk page of a Commons policy. Recent sections with no replies for 30 days and sections tagged with {{Section resolved|1=--~~~~}} may be archived; for old discussions, see the archives; recent archives: /Archive/2024/09 /Archive/2024/10.

Please note
 
SpBot archives all sections tagged with {{Section resolved|1=~~~~}} after 1 day and sections whose most recent comment is older than 30 days.

Could translated pages be hidden from categories?

[edit]

For example see Category:Commons video resources – all those translated pages in that category make it cluttered, hard to go through the pages, and bury pages on the next page.

Would it be possible to hide translated pages so that only one is shown?

  • One could have the links to the translated pages at the top of that page
  • One could automatically open the respective translated page when opening the page depending on one's language settings
  • (and there are more alternatives)

Prototyperspective (talk) 22:00, 16 August 2024 (UTC)[reply]

Much less useful but still useful would be if the translated pages were also hidden or hidable in the File uses on commons section of file pages because it makes it cluttered and hard to see where a file is used; example. Prototyperspective (talk) 10:08, 19 August 2024 (UTC)[reply]
It's possible to place them in a subcategory, but I don't think status quo is necessarily an issue. They should be categorized in any case. Enhancing999 (talk) 12:17, 22 August 2024 (UTC)[reply]
It's currently not a big problem but it makes things far less overseeable and buries things on other pages or beneath the cluttered page. Imagine if there were 300 translated versions of a page which is just roughly the number of languages with a Wikipedia and not even all notable languages, it would make the category barely usable to find and organize things. Manually subcategorizing wouldn't be a good solution because it requires people to spend time manually doing so and new translations will be directly in the category again. Maybe there could be some multilingual redirect page that redirects to whatever language the user has configured if that language version exists and English otherwise? Prototyperspective (talk) 12:35, 22 August 2024 (UTC)[reply]
Another example: Category:Digitisation help. Prototyperspective (talk) 10:52, 18 September 2024 (UTC)[reply]

Vandals often move captions to other languages – detection needed

[edit]

What's going on with people somewhat apparently systematically moving captions from one language to a false one? This is happening frequently and often, if not usually, not detected & reverted by editors.

Could a detection of this please be developed? Compared to other vandalism that is well-detected automatically on Wikipedia by for example ClueBot NG (acc) I think it would be easy to detect if a caption got moved from the original language to another one, if possible with language detection so it also checks if the language it was moved indeed does not match the caption text language. Maybe a better place to ask about this would be the Bot requests or the ClueBot NG talk page / code repo.


Examples

Previously I was wondering how to search my contributions (edit summaries). I was trying to use standardized phrases or terms in edit summaries so I can easily look them up later, for example to replace texts I previously added with templates. I found the following useful tool and because I try to always use term "rvv" when reverting edits that are or seem to most likely be vandalism many examples of such edits can be found here. These are the most recent ones: 1 2 3 4 5 6 7. By the way, I think captions are more a problem or redundant than anything else since there already is the machine-translatable description field (it's useful sometimes when descriptions are long but these could also instead have a short version at the top or be shortened). Another problem is that when undoing changes to captions there is no prefilled edit summary so one has to tediously copy the contributions link of the user and write the edit summary anew. Prototyperspective (talk) 13:04, 17 August 2024 (UTC)[reply]

Given how often this happens (I was able to find a couple of recent instances in a few minutes of looking through RecentChanges) and how weirdly specific of an action it is (it only changes the language of the caption, never its content), I suspect this is a UI/UX issue, not deliberate vandalism, and I suspect it can be addressed by making some changes, like inserting a confirmation dialog when changing the language of an existing caption.
Does anyone know how/where the caption editing interface is implemented, and who's responsible for it? (I also have a couple of gripes about the language picker used in the caption interface - especially its appearance on mobile.) Omphalographer (talk) 05:16, 18 August 2024 (UTC)[reply]
you're right. try File:JPG Test.jpg.
given existing captions, users can change the language and then click publish. that results in the aforementioned problematic edits. RZuo (talk) 05:30, 18 August 2024 (UTC)[reply]
There is the new feature to require captchas for action defined in an abuse filter. I thought about requiring captchas for all IP edits on captions. This might reduce these king of accidental edits. GPSLeo (talk) 05:36, 18 August 2024 (UTC)[reply]
  • Could be but sometimes they do change the text or change multiple languages at once and it seems like it's always only done by new or unregistered users who sometimes did some other different problematic changes.
  • Another thing that could be done is automatic detection of the language and displaying at least a warning or adding the file to a maintenance cat if it doesn't match the specified language – this would also be useful since often people specify the wrong language even at upload.
  • @RZuo: What do you mean? Why would Omphalographer be right in that this is a UI/UX issue? What you described is exactly the expected behavior: changing the language and then clicking publish, how does this suggest it's not vandalism and a UI issue? I don't see why you and GPSLeo think it would be accidental / an UI issue if one has to deliberately click "publish".
  • Even if both captchas are added and they indeed reduce these changes, I still think there should be automatic detection of these changes as well as other likely vandalism. Why is ClueBot_NG not active on WMC? Does ORES work with WMC? There's lots of vandalism here (not just in the captions and the relatively hidden structured data) and I've come across multiple cases where it stayed on a relatively large page for a year or so. Bots/tools could build a queue of edits to check as well as automatically revert edits that are very likely to be vandalism. Moreover, they could learn from edits that specify that the reverted edit was likely vandalism or similarly nonconstructive (it doesn't matter if deliberate or not) if terms like RVV are used.
Prototyperspective (talk) 10:07, 18 August 2024 (UTC)[reply]
People are on a website an they see a language selection. They want to change the language of the text. They do not expect that they are able to change the content on a website where they do not even have an account. The button says publish but how is this translated into different languages and are these translated terms always that clear? GPSLeo (talk) 10:37, 18 August 2024 (UTC)[reply]
This type of "vandalism" is so specific that it would basically have to be a very dedicated LTA who utilizes IPs from all around the world; it's disruptive but I don't think it's deliberate. Gnomingstuff (talk) 04:49, 26 August 2024 (UTC)[reply]
Yes I think the explanation by GPSLeo is quite plausible. The main subject of this thread however is detecting such edits and auto-reverting them (also useful for other unconstructive edits / reducing maintenance workload) regardless of whether or not they are intentional or not. I don't know what you mean by "LTA" but it could have also been many people that found this to be an effective type of vandalism as it's often not detected and reverted but I already think inadvertent edits may be more likely. Maybe there could be some special confirmation box asking if the user really wants to publish that to the file data without using the word "publish" because maybe those users didn't understand that word. In any case, detecting if the text in the caption matches the languages seems useful and needed in any case, for example because many users add captions in English to other languages at upload or similar things. Prototyperspective (talk) 09:52, 26 August 2024 (UTC)[reply]
Something in the GUI needs improvement. It's plausibel that changing the language and saving it actually adds an additional language rather than deleting one as well: [1] Enhancing999 (talk) 10:10, 26 August 2024 (UTC)[reply]
LTA = long-term abuse, people who vandalize in their specific identifiable way for months or years.
The main tells on these are edits by infrequent editors with 2 changes per file. Not much help though once it's out of recent changes. I've found some by searching for captions with mismatched languages e.g. "Spanish the," but obviously that only works with certain patterns.
The problem though with any kind of auto-reverting is that it would have to not catch people fixing this stuff, especially when it's undetected. Gnomingstuff (talk) 11:29, 27 August 2024 (UTC)[reply]
Many don't have 2 changes per file. I think the main indicators are 1. language does not match specified caption language (no other indicator is needed; check language auto-detection of Google Translate or DeepL to see what I mean with language detection) 2. user isn't an editor with many unreverted edits (would only use this indicator early on as language mismatching is a general problem).
I don't understand what you mean with The problem though with any kind of auto-reverting is that it would have to not catch people fixing this stuff. People fixing this stuff would move the caption back to its matching language or remove the flawed captions so they wouldn't be detected. Prototyperspective (talk) 11:51, 27 August 2024 (UTC)[reply]
The solution may be mw:Moderator Tools/Automoderator but I don't know if it also checks changes to captions, does it? Prototyperspective (talk) 10:51, 18 September 2024 (UTC)[reply]
Can Automoderator detect these changes? People still move the captions. Maybe it's indeed some interface issue where things aren't clear and people think moving the caption to another language and then clicking "Publish" (or a translation of that word) would show the caption in their language. Prototyperspective (talk) 10:43, 25 September 2024 (UTC)[reply]
[edit]

With videos it is possible to specify a start (and end) time of the video – see Commons:Video#Temporal media fragments.

  • Is it possible to link to times of the videos in the file description?
  • Is that already possible somehow for audio files?

I think it would be very useful if in the file description of Spoken Wikipedia audio files, like those that I just uploaded, there were links to the different sections of the article. If one is only interested in a particular section of the article one could jump to it directly and listen to only that. It also gives some orientation where one is currently at when listening to articles. Linking to chapters is possible on YouTube and probably many users have already noticed some ways this can be useful. For example, I'd like to link the timestamps in the description here and add timestamp links to the different exercises here.

This doesn't seem to work with videos either as one would have to append ?start=00:26 but can only append things like #start=00:26. Is there some issue about these things? Prototyperspective (talk) 12:46, 22 August 2024 (UTC)[reply]

There isn't really a universal chapter methodology in HTML5 video. The HTML5 idea is that you create your own VTT file (which we don't yet support). These could refer to Commons link (they can be anything you like, as all support for it is completely custom work). Then you write custom Javascript to listen to the timedtext events, read the 'text' (a description and link in this case) and go do something with that (wrapped inside a videojs custom plugin).
Related tickets about this in phabricator. phab:T116154, phab:T301826. —TheDJ (talkcontribs) 18:08, 25 August 2024 (UTC)[reply]
You can link to start points in the file description page using start= and end= query params like:
File:Using_a_Power_Rack_in_bodybuilding,_powerlifting,_strength_training,_resistance_exercise.webm?start=00:10&end=00:15, but it is not that useful, as there is no autoplay etc. —TheDJ (talkcontribs) 18:14, 25 August 2024 (UTC)[reply]
  • The two issues you linked don't seem to be related to this at all. What you wrote is about annotations like a link or a balloon message informing about an inaccuracy in the video at specific times of the video. In contrast this post here is about video chapters and linking to different times of the video in the file description.
  • Yes, I know that one can link to start points that way and explicitly said that in the last paragraph. As said, one would need to link to these from there like #start=00:26 because when including a link like your example, it opens the video in a new tab at that starting point instead of directly jumping to it.
  • This is really important for Spoken Wikipedia where you may like to jump to a particular section. There also needs to be some proper audio player, with the current one only being the fallback, that for example is wider so you can jump to some timing better and with a -10 seconds feature.
Prototyperspective (talk) 10:49, 26 August 2024 (UTC)[reply]
It generally needs a modern improved audio-player that has not only chapter/timestamp-link functionality, but also several other widely established standard features (one could use an existing open source audio player). Prototyperspective (talk) 10:54, 18 September 2024 (UTC)[reply]

How to distinguish categories set by the Infobox from other cats?

[edit]

It would be great if somebody could create a report that lists categories that don't have any categories except for meta-categories set by the Wikidata Infobox like Category:Uses of Wikidata Infobox with no image as proposed here. Example

Is there some way to distinguish categories set by the Infobox from other categories?

It would be best if one could also distinguish between meta-categories set by the WD Infobox like the one above and other categories set by the WD Infobox like Category:Lonnie (given name) (from example) because some categories that only have categories set by the WD Infobox don't need any other categories so would best show up in some separate less important report at some point (and it may even be desirable to increase cats that only have cats set by their WD Infobox). Prototyperspective (talk) 19:47, 30 August 2024 (UTC)[reply]

Any other place to ask about this? Prototyperspective (talk) 10:45, 18 September 2024 (UTC)[reply]

Cat-a-lot performance, maintenance

[edit]

Cat-a-lot seems very slow, since a few days. For example It takes 8 mins to edit a batch of 500 files (locking the tab). Can this be confirmed to be a server or a scripting issue and checked and fixed for speed. rollback ? It should not be my bandwith, but maybe advice on a local setting? Thank you Peli (talk) 12:21, 1 September 2024 (UTC)[reply]

I have the same experience. Very slow. Wouter (talk) 12:51, 1 September 2024 (UTC)[reply]
Please see this thread. Prototyperspective (talk) 21:59, 1 September 2024 (UTC)[reply]

Tech News: 2024-36

[edit]

MediaWiki message delivery 01:02, 3 September 2024 (UTC)[reply]

upload wizard for books

[edit]

Is there a campaign interface (Upload Wizard configuration) that fills in {{Book}} instead of {{Information}}?

It could make it easier for people to understand files like the ones in the Chinese categories.
 ∞∞ Enhancing999 (talk) 12:30, 3 September 2024 (UTC)[reply]

Audio of music contain copyvio thumbnails

[edit]

The thumbnails are not showing up at the audio file but the thumbnail is embedded in them. However, they are embedded in the file and when downloading the file one can see or extract them. Example.

  1. Many of these thumbnails are copyrighted. This means usually the thumbnail would need to be removed. video2commons already imports audio files without the thumbnails. Could there be some script or bot that categorized all audio files with a thumbnail set into e.g. Category:Audio files with embedded thumbnail?
  2. Then as a next step one could remove all of them at scale and efficiently using some metadata removal tool, for example similar to command eyeD3 --remove-all-images **/*.opus (applied to all audio files in some category). I guess it would be best to not remove the thumbnail for identified cases where the thumbnail is CCBY as well, these could e.g. be moved to another category or audio files whose thumbnails should be removed to a subcategory of the category above. (A more sophisticated method would be to reverse image search each thumbnail for finds via tineye so only non-original works are deleted and thumbnails created by the person licensing the work under CCBY kept (if the CCBY license also applies to the thumbnail) but I don't think this would be necessary as it would cause a lot of manual work of checking whether it's indeed a copyvio and whether thumbnails without reverse search result are indeed not copyvios.)

Just as a note: the audio files of the example display 0:00 as duration instead of the duration which only shows after one has clicked play. Prototyperspective (talk) 00:07, 4 September 2024 (UTC)[reply]

When removing the thumbnail one could replace it with a link that enables people to easily download the thumbnail again from some metadata provider. So they should just contain a link or an ID with which to fetch the thumbnail but not a thumbnail image. Prototyperspective (talk) 15:12, 6 September 2024 (UTC)[reply]
Maybe this should be put into bot requests. I think thumbnails should be fetchable via e.g. MusicBrainz. Prototyperspective (talk) 10:45, 18 September 2024 (UTC)[reply]

A bot that moves categories to the bottom of the page

[edit]

Many of my files have the categories in the description. There’s too many to manually move, but is there a way to move them to the bottom with a bot as is done on Wikipedia? Immanuelle ❤️💚💙 (please tag me) 07:37, 8 September 2024 (UTC)[reply]

Example? Prototyperspective (talk) 09:53, 8 September 2024 (UTC)[reply]
@Immanuelle: You can almost certainly use COM:VFC to do it, but I'd suggest not bothering. The categories will work properly wherever they are, so no-one will care unless they're actually reading the wikitext. And if they're reading the wikitext they can fix it themselves. --bjh21 (talk) 11:11, 15 September 2024 (UTC)[reply]

Tech News: 2024-37

[edit]

MediaWiki message delivery 18:48, 9 September 2024 (UTC)[reply]

Troubleshooting needed for File:AMD Zen.svg

[edit]

Can someone investigate why the SVG graphic image File:AMD Zen.svg suddenly stopped working? On the Wikipedia pages where it's used, it's just a blank grey image, and if I click on it, it says "Sorry, the file cannot be displayed - There seems to be a technical issue. You can retry if it persists. Error: could not load image from https://upload.wikimedia.org/wikipedia/commons/thumb/9/9f/AMD_Zen.svg/800px-AMD_Zen.svg.png". Then when I go to Commons where it's hosted, I just see a link to "File:AMD Zen.svg" in place of where the image should be. Opening the link brings me to a page with the error "XML Parsing Error: prefix not bound to a namespace", and trying to open a lower-resolution render just results in a random WMF error like "server technical issue" or "Too many requests, try again later".

Obviously I've had a look at the file upload and page history for this item and there doesn't seem to be any recent changes (or vandalism) that could have caused this to happen. AP 499D25 (talk) 07:53, 13 September 2024 (UTC)[reply]

no xlink namespace declaration. We will need to wait for Commons image scalars to quiet down. Glrx (talk) 20:03, 13 September 2024 (UTC)[reply]
Now it's working again. Kinda bizarre that the other similar-looking files File:AMD Threadripper.svg and File:AMD Epyc.svg were still working at the time, which led me to think that perhaps there was a programming error or some code change that broke File:AMD Zen.svg. AP 499D25 (talk) 03:12, 14 September 2024 (UTC)[reply]

For some reason when I attempted to open File:Typhoon-Yagi 5.jpg, I get nothing but File not found: /v1/AUTH_mw/wikipedia-commons-local-public.d5/d/d5/Typhoon-Yagi_5.jpg. Neither switching browser nor clearing cache help the problem. Initially I thought the file was broken, but Túrelio informed me that he can access the file without problem.

From the archives there appears that there are 2 similar problems. One in February 2022 was resolved by clearing cache, while another in August 2022 ended up in Phabricator. I'd like to ask are there anyone having similar problems, and should I report the matter to Phabricator? Many thanks.廣九直通車 (talk) 13:02, 14 September 2024 (UTC)[reply]

Works perfectly well to me. Are you still getting the error message? — Alien  3
3 3
14:01, 14 September 2024 (UTC)[reply]
Well I just clicked on it and I'm getting the same error myself too! It must be related to my posting about File:AMD Zen.svg above. AP 499D25 (talk) 14:22, 14 September 2024 (UTC)[reply]
Maybe a browser issue? I'm on Firefox, and you? — Alien  3
3 3
14:30, 14 September 2024 (UTC)[reply]
i can also access it, both file page and original file https://upload.wikimedia.org/wikipedia/commons/d/d5/Typhoon-Yagi_5.jpg . using firefox on windows 11. RZuo (talk) 15:46, 14 September 2024 (UTC)[reply]
  • I'm on Win 11. File page does not display image with Chrome or Edge. Loading directly in Chrome, I get "File not found: /v1/AUTH_mw/wikipedia-commons-local-public.d5/d/d5/Typhoon-Yagi_5.jpg" out of the cache.
access-control-allow-origin: *
access-control-expose-headers: Age, Date, Content-Length, Content-Range, X-Content-Duration, X-Cache
age: 480
content-length: 85
content-type: text/html; charset=UTF-8
date: Sat, 14 Sep 2024 17:58:37 GMT
nel: { "report_to": "wm_nel", "max_age": 604800, "failure_fraction": 0.05, "success_fraction": 0.0}
report-to: { "group": "wm_nel", "max_age": 604800, "endpoints": [{ "url": "https://intake-logging.wikimedia.org/v1/events?stream=w3c.reportingapi.network_error&schema_uri=/w3c/reportingapi/network_error/1.0.0" }] }
server: envoy
server-timing: cache;desc="hit-front", host;desc="cp4052"
strict-transport-security: max-age=106384710; includeSubDomains; preload
timing-allow-origin: *
x-cache: cp4052 miss, cp4052 hit/4
x-cache-status: hit-front
x-content-type-options: nosniff
File page and JPEG display with Firefox 130.0 (64-bit).
Glrx (talk) 18:12, 14 September 2024 (UTC)[reply]
Different users get (or don't get) the same file from different servers?
 ∞∞ Enhancing999 (talk) 23:58, 14 September 2024 (UTC)[reply]
Update: tried Safari on iOS, also failed. Probably best to be dealt on Phabricator?廣九直通車 (talk) 06:58, 15 September 2024 (UTC)[reply]
Phabricator bug report filed at phab:T374773, FYI.廣九直通車 (talk) 07:19, 15 September 2024 (UTC)[reply]
Johannnes89 on Phabricator reported that he has no problem in accessing the file with Chrome and Safari, presumably in his home in Germany. Like to ask where did you access the file?廣九直通車 (talk) 09:24, 15 September 2024 (UTC)[reply]
Yes I accessed it from Germany indeed, so the issue might be about accessing it from different servers. Johannnes89 (talk) 09:37, 15 September 2024 (UTC)[reply]
Just tried it from Edge, and it worked.
accept-ranges: bytes
access-control-allow-origin: *
access-control-expose-headers: Age, Date, Content-Length, Content-Range, X-Content-Duration, X-Cache
age: 0
content-length: 7434740
content-type: image/jpeg
date: Sun, 15 Sep 2024 14:55:18 GMT
etag: fe68fa2d2c9fb9101db078cb263815cb
last-modified: Fri, 13 Sep 2024 09:57:44 GMT
nel: { "report_to": "wm_nel", "max_age": 604800, "failure_fraction": 0.05, "success_fraction": 0.0}
report-to: { "group": "wm_nel", "max_age": 604800, "endpoints": [{ "url": "https://intake-logging.wikimedia.org/v1/events?stream=w3c.reportingapi.network_error&schema_uri=/w3c/reportingapi/network_error/1.0.0" }] }
server: envoy
server-timing: cache;desc="miss", host;desc="cp1115"
strict-transport-security: max-age=106384710; includeSubDomains; preload
timing-allow-origin: *
x-cache: cp1115 miss, cp1115 miss
x-cache-status: miss
x-content-type-options: nosniff
x-object-meta-sha1base36: l1h10jxvtd5o73z4q51fcqsot4fy2wu
Glrx (talk) 14:59, 15 September 2024 (UTC)[reply]
server cp1115 seems to have the file, but not cp4052
 ∞∞ Enhancing999 (talk) 15:07, 15 September 2024 (UTC)[reply]
As of 16:00 UTC+8, I can now access the file without problem in Hong Kong. Will like to hear if anyone elsewhere still has trouble in accessing the file?廣九直通車 (talk) 08:14, 16 September 2024 (UTC)[reply]
Now that the task has been resolved on Phabricator, I think it's time to resolve and archive this thread. Thanks for all of your comments.廣九直通車 (talk) 10:33, 16 September 2024 (UTC)[reply]
Curious how often this happens. Apparently there is a weekly process to fix it, see phab:T374773#10147831.
 ∞∞ Enhancing999 (talk) 10:43, 16 September 2024 (UTC)[reply]

Reupload crashed midway

[edit]

On Poems Betham p9.jpg. I redid the colors, tried to reupload it, it lagged for a few minutes then crashed. A new version of the file has been added to the upload history, but the file itself is still exactly the same (including after purge), and when I try to reupload the corrected version, it gets refused as a duplicate of the "current version" of the file, which it is not. What should I do? — Alien  3
3 3
13:59, 14 September 2024 (UTC)[reply]

Undid the upload, redid the upload, all good now. Whatever... — Alien  3
3 3
14:54, 14 September 2024 (UTC)[reply]
Caching issue? I did see two different files when there were just two versions.
 ∞∞ Enhancing999 (talk) 15:43, 14 September 2024 (UTC)[reply]
No, purged everything twice, still didn't work. Once I undid and redid it, though, it started pretending that the first try worked. — Alien  3
3 3
15:52, 14 September 2024 (UTC)[reply]
Could be the typhoon problem mentioned above.
 ∞∞ Enhancing999 (talk) 13:40, 17 September 2024 (UTC)[reply]
Don't think so, it wasn't the same problem, as the file was not updating, but I didn't get a 404 error. — Alien  3
3 3
16:31, 17 September 2024 (UTC)[reply]

Tech News: 2024-38

[edit]

MediaWiki message delivery 23:58, 16 September 2024 (UTC)[reply]

Best way to batch upload from Youtube?

[edit]

I found on Youtube some interesting video collections under Creative Commons, mainly [15]https://www.youtube.com/@AnimadosICAIC Cartoons from Cuba]. Some of those are really good. But there are a ton of those cartoons under CC. Is there any way of batch uploading the collection?

I use video2commons to upload one per one, but it is a slow method. TaronjaSatsuma (talk) 12:55, 17 September 2024 (UTC)[reply]

I think asking at the talk page of video2commons would be more appropriate. There already is a thread about this: Commons talk:Video2commons#API for this tool albeit probably not easy to see due to its title. Some info on that there. Batch upload of all videos from a channel would be great. Your example isn't really good however, there are far better examples. I think it would be best be added directly into video2commons so you can simply enter a channel URL and it guides you through importing all CCBY videos where you can deselect some videos to not import, adjust the titles, and so on. Alternatively a separate tool could make use of V2C via some API or fork it for specifically this functionality. However it's implemented it should not lead to blocking others from using the tool so there would need to be some measures like some pause between every 5 videos or so. Another thing that is needed is that video2commons needs to check if a video with that youtube ID has already been imported so things don't get imported multiple times which can more easily happen once such functionality is there. Since nobody seems to yet developed such a tool according to the thread at V2C it may now indeed be good to ask here. Prototyperspective (talk) 15:21, 17 September 2024 (UTC)[reply]
Thanks. Very good reply. TaronjaSatsuma (talk) 15:30, 17 September 2024 (UTC)[reply]

Can we create a tracking category for Galleries not connected to Wikidata items?

[edit]

Can we create a tracking category for Galleries not connected on Wikidata items?

I would like to be able to see which Gallery pages are not being used in Wikidata's Gallery' Property Commons gallery (P935), but I have no idea what the best strategy is for a crosswiki tracking category like that would be. A bot? @Multichill: Anyone you think who would be good at this? Sadads (talk) 20:41, 19 September 2024 (UTC)[reply]

We already can't reliably track categories that are actually connected to Wikidata, so it's unlikely. Given the average quality of galleries, it's probably not really a priority for these.
 ∞∞ Enhancing999 (talk) 08:58, 21 September 2024 (UTC)[reply]
You can use SQL queries to fetch a list about gallery pages without Wikidata items (Example: Quarry: 86422) -- Zache (talk) 10:41, 21 September 2024 (UTC)[reply]

Defaultsort template

[edit]

Can someone explain how the template Template:DEFAULTSORT works as the page has no information of any kind like most other template have? Ww2censor (talk) 08:23, 21 September 2024 (UTC)[reply]

It seems to be for people who get the syntax wrong ("|" instead of ":"), see w:Template:DEFAULTSORT
 ∞∞ Enhancing999 (talk) 08:54, 21 September 2024 (UTC)[reply]

Filter request: parliamentdiagram images with incorrect licenses

[edit]

When an editor uses the parliament diagram tool (https://parliamentdiagram.toolforge.org/), the license should be {{PD-shape}}. However, if someone manually downloads the image and uploads it to Commons, instead of clicking the buttons at the bottom to upload the image though OAuth, they can choose any license, and frequently choose incorrect ones (I regularly see the combo of cc-zero + pd-algorithm because that's what the upload wizard seems to steer people towards). Is there anything we can do with filters to catch when this happens so they can be fixed? The Squirrel Conspiracy (talk) 13:57, 22 September 2024 (UTC)[reply]

Tech News: 2024-39

[edit]

MediaWiki message delivery 23:31, 23 September 2024 (UTC)[reply]

[edit]
Sharing in LINE.
Image being shared in sample

Odd, the link preview in LINE looks so unfriendly: File:Sharing Commons files in LINE.png Jidanni (talk) 13:31, 28 September 2024 (UTC)[reply]

The text is from MediaWiki:Upload-disallowed-here. The question is if Commons places that text too prominently or if LINE (software) just picks the wrong string (likely if it doesn't happen with most others). This should probably be reported at LINE instead.
 ∞∞ Enhancing999 (talk) 19:49, 28 September 2024 (UTC)[reply]

Search box for category pages

[edit]

Template:Search inside category is a neat template that can be manually added to category pages to add a search box where even unexperienced users can search the category contents.

This is most if not only useful when also searching subcategories so I forked it to Template:Search box inside category to use the deepcategory search operator that also searches subcategories. The problem with that is that it doesn't work in many categories where it would be most useful since deepcategory only works for relatively flat categories, not those with deeply nested subcategory branches. I created phab:T369808 and this was recently improved. However, it still fails on large categories and I think instead of failing it should show the results up to some level of subcategory depth and show a warning that not all subcategories could be included (maybe even list them).

Two examples where I think the search box searching all subcategories is useful: Category:Our World in Data and Category:Videos by Terra X.

Aside for the deepcategory issues there are now two further problems which is why I'm posting here:

  • How can it be made to use MediaSearch instead of the older SpecialSearch? (I've also asked here at mw:Extension:InputBox earlier)
  • This |namespaces= parameter doesn't seem to work – how can the box be made to search only Files?

Maybe at some point it (or a variant of it that is for example much smaller) could be added to category pages by default or to the Wikidata infobox. Better than these approaches would be if the Wikimedia search engine could show a dropdown whether to search all Wikimedia Commons or only the current category. The default would be to search all of WMC so the search experience for somebody looking to search all of Commons would stay the same but when looking to search only the category one would only need to select the second dropdown after entering some search terms into the box. This is like on GitHub where when entering something into the search bar when on some repository page (like this page) it shows the dropdown options "Search in this repository" or "Search all of GitHub".

If nobody knows, another page where to ask this would also be very useful.

--Prototyperspective (talk) 22:21, 28 September 2024 (UTC)[reply]

The search mode (MediaSearch vs. SpecialSearch) should default to the user's preferred choice. Commons:Village pump/Technical may be a better venue for, er, technical questions. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 12:57, 29 September 2024 (UTC)[reply]
I have configured MediaSearch to be the default but it searches SpecialSearch. Okay, I'll move this discussion there. Prototyperspective (talk) 20:44, 29 September 2024 (UTC)[reply]

Tech News: 2024-40

[edit]

MediaWiki message delivery 22:15, 30 September 2024 (UTC)[reply]