00:02:12cascode quits [Read error: Connection reset by peer]
00:02:26cascode joins
00:06:25Wohlstand quits [Quit: Wohlstand]
00:19:49etnguyen03 (etnguyen03) joins
00:28:11Webuser369573 joins
00:28:20Webuser369573 quits [Client Quit]
00:37:39dabs quits [Quit: Leaving]
00:40:40nine quits [Client Quit]
00:40:52nine joins
00:40:52nine quits [Changing host]
00:40:52nine (nine) joins
00:46:55xkey quits [Quit: WeeChat 4.6.3]
00:47:07xkey (xkey) joins
00:57:59hexagonwin joins
00:59:39hexagonwin_ quits [Ping timeout: 260 seconds]
01:00:31<xarph>Can an s3 > archivebot boffin check that assets.shopcats.app s3 bucket now has file listings on?
01:01:12<xarph>The site operator threw the switch for us but didn't have time to check and I'm fighting the costco eastern front
01:01:30<pabs>loading that page in a browser gives a file listing yeah
01:02:32<pabs>hmm, little-things s3-bucket-list crashes with: Marker loop (empty marker in response despite providing one)
01:04:30<pokechu22>Often when it's on a subdomain, ?marker= won't work on the subdomain but will work on the original bucket at https://shopcats-prod-bucket.s3.amazonaws.com/
01:05:00<pokechu22>in that case, I generate a list using the s3.amazonaws.com URL but then modify it to be on their subdomain (other than the marker URLs)
01:05:52<pokechu22>compare https://assets.shopcats.app/?marker=protected/us-east-1:017b3273-680c-4e68-82ab-d40508ca7b49/8abac6b2-741e-4993-b9f8-1cb835250b63ms.jpg and https://shopcats-prod-bucket.s3.amazonaws.com/?marker=protected/us-east-1:017b3273-680c-4e68-82ab-d40508ca7b49/8abac6b2-741e-4993-b9f8-1cb835250b63ms.jpg
01:07:02<pabs>https://shopcats-prod-bucket.s3.amazonaws.com/ indeed doesn't crash
01:07:16<pokechu22>I'm currently listing it and will run it
01:12:43<pokechu22>https://transfer.archivete.am/BG5eS/assets.shopcats.app_shopcats-prod-bucket.s3.amazonaws.com_urls.txt.zst
01:14:28nicolas17 clicks a random URL and goes "awwww" already
01:19:16Guest58 joins
01:27:23<xarph>wahoo 17,000 bodega cat images
01:33:45<nicolas17>xarph: why do I see 31462?
01:35:54<xarph>because I misread a field
01:36:24<nicolas17>hm was the website shut down already?
01:36:27<xarph>yes
01:36:44<xarph>the frontend went down this morning
01:36:51<xarph>the owner is leaving the s3 bucket up for us to mirror
01:37:04<xarph>still negotiating about the postgres database
01:40:56<nicolas17>hm what's this? user profile pictures? https://assets.shopcats.app/protected/us-east-1%3Aabb8265f-b3ec-4429-846d-1b31d07e73fe/3f678254-3980-4d6c-8453-c0df50692a22l.jpg https://assets.shopcats.app/protected/us-east-1%3A2af9aa03-8217-4541-b8e3-eb8b6f452cd0/517d7cc8-4615-4e9a-b055-cd70abc3a772l.jpg
01:41:45<xarph>yeah it did have profile pics
01:42:01<xarph>s3cmd sync shows 160648 items
01:42:40<nicolas17>yeah every image has several downscaled versions
01:42:47<xarph>I was about to write that
01:43:01<xarph>idgaf I'll go to the best buy down the street and buy a giant hard drive if I need it
01:43:23<xarph>not one cat image shall die on our watch
01:44:27<nicolas17>xarph: http://archivebot.com/?initialFilter=shopcats
01:45:42malteeez joins
01:49:01GradientCat quits [Client Quit]
02:13:31malteeez quits [Client Quit]
02:22:18BornOn420 quits [Remote host closed the connection]
02:25:55nicolas17 quits [Quit: Konversation terminated!]
02:26:22nicolas17 joins
02:29:34<anonymoususer852>There's a typo on the Current_Projects page for the AT wiki. I've made applied them, but am awaiting moderation. If anyone has access to moderation, please also undo the Talk page that I previously contributed to main page, as I have found the correct place.
02:44:19cuphead2527480 quits [Quit: Connection closed for inactivity]
02:55:34cascode quits [Ping timeout: 240 seconds]
02:56:24cascode joins
03:03:01etnguyen03 quits [Client Quit]
03:06:06cascode quits [Read error: Connection reset by peer]
03:07:07cascode joins
03:09:21etnguyen03 (etnguyen03) joins
03:14:27Webuser335071 joins
03:15:04Webuser335071 quits [Client Quit]
03:40:33etnguyen03 quits [Remote host closed the connection]
03:44:44@imer quits [Ping timeout: 260 seconds]
03:57:44imer (imer) joins
03:57:44@ChanServ sets mode: +o imer
05:02:32egallager quits [Quit: This computer has gone to sleep]
05:23:22egallager joins
05:47:54awauwa (awauwa) joins
06:54:33Island quits [Read error: Connection reset by peer]
07:35:09nothere quits [Ping timeout: 260 seconds]
07:43:34Radzig quits [Ping timeout: 240 seconds]
08:15:39Radzig joins
08:16:29nothere_ joins
08:25:13Webuser105032 joins
08:25:17Webuser105032 quits [Client Quit]
08:37:01archiveDrill quits [Quit: The Lounge - https://thelounge.chat]
08:56:51Webuser581666 joins
08:57:00Webuser581666 quits [Client Quit]
09:23:50Dada joins
09:24:38<@OrIdow6>On Itch.io embeds, I'm thinking of either
09:24:49<@OrIdow6>- THrowing it all into SPN (probably would encounter IA difficulties)
09:25:17<@OrIdow6>- Throwing it into JSEater and pushing to have those accepted to the WBM
09:25:53<@OrIdow6>- Setting up some little setup with a headless browser, maybe just running on my own/a few volunteers' machines, which will record the URLs of the resources, then we'll throw those into AB or somewhere
09:26:02<@OrIdow6>- Considering using API keys
09:26:12<@OrIdow6>Third is what I'm leaning towards
09:28:54archiveDrill joins
10:02:54Radzig quits [Ping timeout: 240 seconds]
10:10:38Radzig joins
10:15:10T31M quits [Quit: ZNC - https://znc.in]
10:15:32T31M joins
10:22:34chrismrtn quits [Ping timeout: 260 seconds]
10:23:09jspiros quits [Ping timeout: 260 seconds]
10:43:22Wohlstand (Wohlstand) joins
11:00:04Bleo182600722719623455222 quits [Quit: The Lounge - https://thelounge.chat]
11:02:47Bleo182600722719623455222 joins
11:29:08<BlankEclair>i _think_ wikiforge is now dead? see https://wm-bot.wmcloud.org/browser/index.php?start=07%2F27%2F2025&end=07%2F27%2F2025&display=%23miraheze-offtopic (convo starts at 2025-07-27 02:42:48)
12:55:23etnguyen03 (etnguyen03) joins
13:25:47GradientCat (GradientCat) joins
13:35:06Wohlstand quits [Remote host closed the connection]
13:40:28Wohlstand (Wohlstand) joins
13:43:17etnguyen03 quits [Client Quit]
13:46:26etnguyen03 (etnguyen03) joins
14:02:32jspiros (jspiros) joins
14:04:27chrismrtn (chrismrtn) joins
15:07:01etnguyen03 quits [Client Quit]
15:45:08<h2ibot>Dragon789 created Talk:Banned from youtube (+519, Youtube Project "Your're likely banned"): https://wiki.archiveteam.org/?title=Talk%3ABanned%20from%20youtube
15:45:09<h2ibot>JustAGrook edited Alive... OR ARE THEY (+210, /* Endangered */): https://wiki.archiveteam.org/?diff=56599&oldid=55903
15:45:10<h2ibot>KamafaDelgato edited Itch.io (+4): https://wiki.archiveteam.org/?diff=56600&oldid=56593
15:45:11<h2ibot>R74n edited Deathwatch (+153, /* 2025 */ Note about Firebase Dynamic Links…): https://wiki.archiveteam.org/?diff=56602&oldid=56585
15:45:12<h2ibot>Legowerewolf created Talk:Goo.gl (+214, /* Memory leak? */ new section): https://wiki.archiveteam.org/?title=Talk%3AGoo.gl
15:45:13<h2ibot>Anonymoususer852 edited Talk:Main Page (+228, /* Typo under the section Meta */ new section): https://wiki.archiveteam.org/?diff=56604&oldid=51271
15:45:14<h2ibot>Anonymoususer852 edited Main Page/Current Projects (+0, /* Medium-term projects */ Fixed typo with Meta…): https://wiki.archiveteam.org/?diff=56605&oldid=56493
15:46:09<h2ibot>Arkiver changed the user rights of User:Anonymoususer852
15:46:10<h2ibot>Arkiver changed the user rights of User:R74n
16:03:49<@arkiver>OrIdow6: on the three options. 1. i would not trust SPN too much, especially when using some form of automatic submission. 2. JSEater would work i guess. 3. this would work as well, unless certain headers are needed, information that is not preserved by listing URLs alone. 4. what are these keys for, are they a type of login?
16:09:37Guest58 quits [Quit: My Mac has gone to sleep. ZZZzzz…]
16:10:01Guest58 joins
16:10:18Guest58 quits [Client Quit]
16:10:49<@OrIdow6>arkiver: Yes, keys derived from an account. But technically the only way to be 100% sure we have everything, as the embeds in question are just little static sites hosted by itch.io, and technically the JS embedded in them can load resources dynamically (though it *looks* like most load all the resources up front)
16:13:23<@OrIdow6>Anyway like I say I am going with 3
16:13:32<@OrIdow6>Probably
16:15:35grill (grill) joins
16:16:39jspiros quits [Client Quit]
16:17:05linuxgemini quits [Quit: Ping timeout (120 seconds)]
16:17:55jspiros (jspiros) joins
16:21:34grill quits [Ping timeout: 240 seconds]
16:23:17dabs joins
16:27:13linuxgemini (linuxgemini) joins
16:27:33grill (grill) joins
16:43:15linuxgemini quits [Client Quit]
16:46:18<h2ibot>Anonymoususer852 edited Talk:Main Page (-228, Undo revision 56604 by…): https://wiki.archiveteam.org/?diff=56606&oldid=56604
17:03:20<h2ibot>OrIdow6 edited Shutdown rumors, hoaxes, and scares (+507, Amino): https://wiki.archiveteam.org/?diff=56607&oldid=52757
17:14:22<h2ibot>Anonymoususer852 edited Talk:Banned from youtube (+986, /* Re: Banned from youtube */ new section): https://wiki.archiveteam.org/?diff=56608&oldid=56598
17:37:56<egallager>Tom Lehrer died: https://tomlehrersongs.com/
17:38:46<pokechu22>We've saved it several times in the past (https://archive.fart.website/archivebot/viewer/domain/tomlehrersongs.com) so it might be best to wait for a notice on the site
17:39:01<pokechu22>though it's only 5gb so probably not too big of an issue
17:40:20<@OrIdow6>RIP
17:40:49<@OrIdow6>That thing was on Deathwatch for what, 5 years?
17:41:15etnguyen03 (etnguyen03) joins
17:43:26<h2ibot>Cooljeanius edited Shutdown rumors, hoaxes, and scares (+32, use URL template): https://wiki.archiveteam.org/?diff=56609&oldid=56607
17:44:50feed quits [Quit: Limnoria 2024.12.20]
17:45:30feed (feed) joins
17:48:14grill quits [Ping timeout: 260 seconds]
17:50:02hexagonwin_ joins
17:50:22DartRetaliator_ joins
17:51:44hexagonwin quits [Ping timeout: 260 seconds]
17:53:29DartRetaliator__ quits [Ping timeout: 260 seconds]
17:53:39etnguyen03 quits [Client Quit]
17:55:07etnguyen03 (etnguyen03) joins
18:03:18grill (grill) joins
18:12:03dabs quits [Client Quit]
18:12:13dabs joins
18:12:30<h2ibot>Anonymoususer852 edited Data compression algorithms and tools (+2427, /* TAR/SHAR */ New section - created primarily…): https://wiki.archiveteam.org/?diff=56610&oldid=56537
18:14:31<h2ibot>Anonymoususer852 edited Formats (+85, /* Compression */ Add missing links, especially…): https://wiki.archiveteam.org/?diff=56611&oldid=49459
18:16:54atphoenix_ quits [Read error: Connection reset by peer]
18:17:29lennier2_ quits [Read error: Connection reset by peer]
18:17:51atphoenix_ (atphoenix) joins
18:18:09lennier2_ joins
18:24:32<h2ibot>Anonymoususer852 edited Data compression algorithms and tools (+108, /* TAR/SHAR */ Add reference to Tape Archive…): https://wiki.archiveteam.org/?diff=56612&oldid=56610
18:40:09grill quits [Ping timeout: 260 seconds]
18:45:44grill (grill) joins
18:49:13<@OrIdow6>pokechu22: On itch,io, how are you doing on identification? IDK what their rates look like but a fallback approach if we can't reliably identify which ones are NSFW might be to cross reference with search results
18:49:30<@OrIdow6>Famous last words but I think I'm nearly done with the script
18:49:56<pokechu22>OrIdow6: the data.json files, which I downloaded with archivebot at con=3, d=100 (I know that at con=6, d=100 you get 429s, so there is a limit)
18:50:40<pokechu22>https://transfer.archivete.am/R5zMT/itch.io_find_nsfw_games.py is what I used
18:50:41<eggdrop>inline (for browser viewing): https://transfer.archivete.am/inline/R5zMT/itch.io_find_nsfw_games.py
18:51:35<pokechu22>data.json should also be usable to identify games tagged html5 (which might not exactly match games that have html5 embeds), and free/name your price games (that should be exact since data.json has price information)
18:52:54midou quits [Ping timeout: 240 seconds]
19:01:57midou joins
19:04:32awauwa quits [Quit: awauwa]
19:20:33szczot3k quits [Remote host closed the connection]
19:20:42szczot3k (szczot3k) joins
19:27:08lemuria quits [Read error: Connection reset by peer]
19:27:54lemuria (lemuria) joins
19:31:48etnguyen03 quits [Client Quit]
19:32:09etnguyen03 (etnguyen03) joins
19:36:34grill quits [Ping timeout: 240 seconds]
19:41:59etnguyen03 quits [Client Quit]
19:51:58dabs quits [Client Quit]
20:19:19dabs joins
20:42:36Island joins
20:46:23mls (mls) joins
21:57:27Webuser090327 quits [Quit: Ooops, wrong browser tab.]
22:00:38etnguyen03 (etnguyen03) joins
22:09:51pokechu22 quits [Quit: Network maintenance]
22:14:49cascode quits [Ping timeout: 260 seconds]
22:15:01cascode joins
22:35:49cascode quits [Ping timeout: 260 seconds]
22:35:55simon816 quits [Remote host closed the connection]
22:36:01cascode joins
22:40:14cascode quits [Ping timeout: 240 seconds]
22:40:26cascode joins
22:40:51simon816 (simon816) joins
22:41:49pokechu22 (pokechu22) joins
22:44:01Guest58 joins
22:47:19Webuser172778 joins
22:48:03Webuser172778 quits [Client Quit]
22:49:14cascode quits [Ping timeout: 260 seconds]
22:53:54pokechu22 quits [Ping timeout: 240 seconds]
22:56:16pokechu22 (pokechu22) joins
22:59:02Dada quits [Remote host closed the connection]
23:12:31hyenatown joins
23:23:13cascode joins
23:29:56Hackerpcs quits [Quit: Hackerpcs]
23:35:27cascode quits [Read error: Connection reset by peer]
23:35:39cascode joins
23:36:39Hackerpcs (Hackerpcs) joins
23:42:11etnguyen03 quits [Client Quit]