User talk:Dominic/2022
This is an archive of past discussions. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current talk page. |
File:"An enemy ear may be near" - NARA - 513844.jpg has been listed at Commons:Deletion requests so that the community can discuss whether it should be kept or not. We would appreciate it if you could go to voice your opinion about this at its entry.
If you created this file, please note that the fact that it has been proposed for deletion does not necessarily mean that we do not value your kind contribution. It simply means that one person believes that there is some specific problem with it, such as a copyright issue. Please see Commons:But it's my own work! for a guide on how to address these issues. |
Darrelljon (talk) 05:21, 19 February 2022 (UTC)
File:"An enemy ear may be near" - NARA - 513844.tif has been listed at Commons:Deletion requests so that the community can discuss whether it should be kept or not. We would appreciate it if you could go to voice your opinion about this at its entry.
If you created this file, please note that the fact that it has been proposed for deletion does not necessarily mean that we do not value your kind contribution. It simply means that one person believes that there is some specific problem with it, such as a copyright issue. Please see Commons:But it's my own work! for a guide on how to address these issues. |
Darrelljon (talk) 07:07, 19 February 2022 (UTC)
File:"In the Wind", 1961 - NARA - 558854.tif has been listed at Commons:Deletion requests so that the community can discuss whether it should be kept or not. We would appreciate it if you could go to voice your opinion about this at its entry.
If you created this file, please note that the fact that it has been proposed for deletion does not necessarily mean that we do not value your kind contribution. It simply means that one person believes that there is some specific problem with it, such as a copyright issue. Please see Commons:But it's my own work! for a guide on how to address these issues. |
Thuresson (talk) 18:02, 23 March 2022 (UTC)
DPLA import, withheld files
Dominic,
A few files in the DPLA import show a "withdrawal sheet" instead of the expected image: File:Vice President Cheney Reviews Document with David Addington - DPLA - 31915140eded6ee84b980959ef3d8611.pdf. Were such files intended to be part of the import? MKFI (talk) 13:43, 5 May 2022 (UTC)
- @MKFI: It wasn't uploaded mistakenly, in the sense that this is apparently a real file as represented in the US National Archives' catalog and the metadata is correct. At the same time, it wasn't deliberately chosen for uploaded, since I'm not really curating uploads beyond verifying the license. If this is an issue, I can see if there is a way to filter these out (though there isn't a very clear indication from the metadata that this is a withdrawal sheet rather than the record itself). I searched for other withdrawal sheets in the catalog, and found many where only a partial record or page was replaced with a withdrawal sheet, but the rest of the PDF included archival materials, e.g. https://catalog.archives.gov/id/118569226. Maybe this was just a rare case where the document itself was short enough, or the redaction major enough, that the entire PDF is a withdrawal sheet. If it is not common, maybe not a big deal? Dominic (talk) 16:12, 5 May 2022 (UTC)
- I have only found a handful of such. I guess we can just ignore them. MKFI (talk) 18:14, 5 May 2022 (UTC)
File:AS06-02-1187 - Apollo 6 - NARA - 16659349.jpg has been listed at Commons:Deletion requests so that the community can discuss whether it should be kept or not. We would appreciate it if you could go to voice your opinion about this at its entry.
If you created this file, please note that the fact that it has been proposed for deletion does not necessarily mean that we do not value your kind contribution. It simply means that one person believes that there is some specific problem with it, such as a copyright issue. Please see Commons:But it's my own work! for a guide on how to address these issues. |
𝟙𝟤𝟯𝟺𝐪𝑤𝒆𝓇𝟷𝟮𝟥𝟜𝓺𝔴𝕖𝖗𝟰 (𝗍𝗮𝘭𝙠) 00:58, 6 June 2022 (UTC)
Uploading to testwiki?
testwiki:Special:Contributions/DPLA_bot - Is your bot supposed to be uploading so many file to test wiki? Reedy (talk) 16:58, 17 August 2022 (UTC)
- I've just blocked it... It's uploaded well over 1000 images today, this doesn't seem right. Reedy (talk) 17:03, 17 August 2022 (UTC)
- In fact, 17,772 images. Yeah, this definitely can't be right. Reedy (talk) 17:08, 17 August 2022 (UTC)
- @Reedy: Sorry about that, let me look into it. Dominic (talk) 17:48, 17 August 2022 (UTC)
- @Reedy: The bot was just doing a regular upload batch that should have been going to Commons, but was running on an outdated configuration. That has been killed and it's now uploading to Commons correctly. If you have the ability to easily nuke all uploads from that account for cleanup, that would be great. Dominic (talk) 18:29, 17 August 2022 (UTC)
- @Reedy: Sorry about that, let me look into it. Dominic (talk) 17:48, 17 August 2022 (UTC)
- In fact, 17,772 images. Yeah, this definitely can't be right. Reedy (talk) 17:08, 17 August 2022 (UTC)
Duplicate files
File:Success (Ship) (approximately 1912) - DPLA - 17a87722055a5f63d79708f76d1670c0.jpg is marked as a duplicate of File:Smith, Dennison Billings Home, Toledo, Ohio, 1876 - DPLA - 1ebbbf7aee100a25a845ce81f666988a.jpg, since they are. However, the latter one is a completely different photo than its description, so it looks like an error at the source which associated the wrong photo with that record. Would it be easiest to simply delete it, or is there a way to get the original corrected and re-uploaded? Or should we delete until that happens? Carl Lindberg (talk) 03:52, 13 October 2022 (UTC)
Duplicates (by message digest)
DPLA bot is uploading 50000+ duplicates of files already uploaded by the bot - and it is ongoing. C.Suthorn (talk) 15:17, 2 November 2022 (UTC)
- @C.Suthorn: Just saw this, and I am looking into it. Dominic (talk) 16:28, 2 November 2022 (UTC)
- @C.Suthorn: So far, I think the bot has only uploaded about 30K files total since the current batch started this week. I checked and not all appeared to have the issue of duplicates, so hopefully there are not 50+K (unless you meant including original plus duplicate). I paused uploads from one of our partners, Plains to Peaks, but left the others coming in from Massachusetts running for now. But please let me know if you are seeing the issue. Meanwhile, I will look at what has occurred and see how I can fix it, but I wanted to stop whatever was happening, and also note here that I was being responsive even if you see some edits continuing. There are multiple upload batches from separate institutional partners ongoing. The issue is in the data and not the bot code. Dominic (talk) 16:38, 2 November 2022 (UTC)
- It'll be good idea if bot will check file existence on Commons before uploading. Data may be not ideal, so bot should not rely on it entirely. --EugeneZelenko (talk) 04:17, 6 November 2022 (UTC)
- @C.Suthorn: So far, I think the bot has only uploaded about 30K files total since the current batch started this week. I checked and not all appeared to have the issue of duplicates, so hopefully there are not 50+K (unless you meant including original plus duplicate). I paused uploads from one of our partners, Plains to Peaks, but left the others coming in from Massachusetts running for now. But please let me know if you are seeing the issue. Meanwhile, I will look at what has occurred and see how I can fix it, but I wanted to stop whatever was happening, and also note here that I was being responsive even if you see some edits continuing. There are multiple upload batches from separate institutional partners ongoing. The issue is in the data and not the bot code. Dominic (talk) 16:38, 2 November 2022 (UTC)
File:F - NARA - 34381123 (page 4).jpg has been listed at Commons:Deletion requests so that the community can discuss whether it should be kept or not. We would appreciate it if you could go to voice your opinion about this at its entry.
If you created this file, please note that the fact that it has been proposed for deletion does not necessarily mean that we do not value your kind contribution. It simply means that one person believes that there is some specific problem with it, such as a copyright issue. Please see Commons:But it's my own work! for a guide on how to address these issues. |
Alan Liefting (talk) 22:07, 11 November 2022 (UTC)
Filemoving
Hi, FYI User talk:DPLA bot#Filemoving •2003:DE:720:1683:FDFD:4C52:1F18:FB25 07:00, 13 November 2022 (UTC)
Would you have any objection to turning Category:Media contributed by Seattle Public Library into a hidden category? Seems to me it's not a "topical" category, it has to do with the provenance of the materials. - Jmabel ! talk 23:43, 22 November 2022 (UTC)
- 5 days, no response. I'm going ahead with this. If you have a problem with it, it can always be undone. - Jmabel ! talk 16:37, 27 November 2022 (UTC)
- @Jmabel: Hey, so sorry for the sluggish response. I was out a lot for Thanksgiving. I think it's up to you. Initially, I had made all upload tracking categories like this hidden. However, that ended up resulting in hundreds of thousands of images getting "uncategorized" tags added, which upset some people. As a result, I do not make the institution category hidden, but I do make the aggregator categories (DPLA and Northwest Digital Heritage, in this case), since I am guessing if anything is at least arguably topical, it's the local institution. I agree with your reasoning, especially if you've done a good job of categorizing most of them anyway. Sometimes I am making decisions just based on maintaining over 3 million file uploads and doing whatever rocks the boat the least at that scale. Dominic (talk) 19:17, 1 December 2022 (UTC)
- Frankly, we are better off with those being tagged as "uncategorized", because effectively they are. The only way to avoid that is if there is one or more topical category that can also be associated with an entire archival collection at upload time. If there is no particular topic of the collection, then just knowing that it came from that collection is not useful categorization. If there is a topic -- even a broad one -- that would be useful to have (as would a hidden category under ). - Jmabel ! talk 19:25, 1 December 2022 (UTC)
- @Jmabel: Hey, so sorry for the sluggish response. I was out a lot for Thanksgiving. I think it's up to you. Initially, I had made all upload tracking categories like this hidden. However, that ended up resulting in hundreds of thousands of images getting "uncategorized" tags added, which upset some people. As a result, I do not make the institution category hidden, but I do make the aggregator categories (DPLA and Northwest Digital Heritage, in this case), since I am guessing if anything is at least arguably topical, it's the local institution. I agree with your reasoning, especially if you've done a good job of categorizing most of them anyway. Sometimes I am making decisions just based on maintaining over 3 million file uploads and doing whatever rocks the boat the least at that scale. Dominic (talk) 19:17, 1 December 2022 (UTC)