00:00:17<flashfire42|m>Would it eventually grab all them or not tho?
00:02:20<@JAA>Finland also redirects to .de, wut?
00:02:35<@JAA>Maybe it detects Finnish Hetzner IPs as German.
00:02:43<qwertyasdfuiopghjkl>In Estonia, got redirected to https://www.southparkstudios.com/forum/index.php
00:02:49<@JAA>UK: https://www.southparkstudios.co.uk/forum/index.php
00:02:59<pokechu22>US gives https://southpark.cc.com/forum/index.php as-is
00:03:03<@JAA>NL: https://www.southparkstudios.com/forum/index.php
00:03:19<nicolas17>who thought this was a good idea /o\
00:03:58<@JAA>NZ: https://www.southparkstudios.com/forum/index.php
00:04:15<nicolas17>AR: https://www.southpark.lat/forum/index.php
00:04:38<nicolas17>yet the forum contents seem generic/international?
00:05:36<@JAA>Yeah, same content everywhere it seems.
00:05:37<nicolas17>DigitalOcean NYC: no redirect
00:06:05<Doranwen>Maggie's hand-saving the fic but wanted to pass it on. Thanks for looking at it!
00:06:09<@JAA>Also, given the most recent posts, they seem to have given up completely on fighting spam.
00:10:03<nicolas17>JAA: https://twitter.com/ryanqnorth/status/1433861047404961825
00:10:03<eggdrop>nitter: https://nitter.net/ryanqnorth/status/1433861047404961825
00:12:18<@JAA>Yup
00:16:06DLoader_ (DLoader) joins
00:17:09DLoader quits [Ping timeout: 272 seconds]
00:17:16DLoader_ is now known as DLoader
00:33:10<fireonlive>DigitalDragons: ahh ok, so no action needed :)
00:34:46<fireonlive>oh wow, the southpark forum is dying :o
00:34:54<fireonlive>>Join Our Discord
00:34:57<fireonlive>fucking kill me
00:35:07<fireonlive>canada: https://www.southparkstudios.com/forum/index.php
00:35:18<fireonlive>oh JAA covered that, ignore me
00:40:16<Ryz>Is it possible to save it through AB? oo;
00:40:42<@JAA>Under one of the other domains, yes. Not under southpark.cc.com because we don't currently have a pipeline in the US.
00:40:59<@JAA>Well, actually, no, because AB is much too slow to archive it in time, but you get the idea.
00:45:24<@JAA>I've started qwarc from a machine in the US, and it should take around 10 hours.
00:46:05<@JAA>Grabbing only the topic pages, as usual.
00:48:15<fireonlive>thanks JAA :)
01:00:29<@JAA>Maybe longer, site is still getting slower. :-|
01:00:42<fireonlive>:(
01:01:20bocci quits [Ping timeout: 240 seconds]
01:11:59ell quits [Quit: Ping timeout (120 seconds)]
01:12:44ell (ell) joins
01:36:43<@JAA>I guess maybe I should've gone with sequential topic IDs on this one rather than random order since most of the recent topics are probably spam.
01:36:52<@JAA>Too late to fix that now though.
02:08:24<Ryz>:C
02:09:53Kitty quits [Ping timeout: 272 seconds]
02:16:58Naruyoko joins
02:46:54Doranwen quits [Remote host closed the connection]
02:49:34test joins
02:50:29<test>The Hobbes OS/2 archive is going down forever in April. https://hobbes.nmsu.edu/
02:51:00test quits [Remote host closed the connection]
03:14:47<tech234a>LinkTree acquired Koji, which is shutting down on the 31st https://www.prnewswire.com/news-releases/linktree-acquires-koji-302015100.html
03:17:16Laura-CFIA joins
03:17:39parfait quits [Ping timeout: 272 seconds]
03:17:55<Laura-CFIA>Hello! I don't have permissions in the archivebot channel so am dropping in here to see if I can get guidance/assistance :)
03:20:40<Laura-CFIA>I'm aiming to archive the site Room Escape Artist (http://roomescapeartist.com), it has about 5,000 articles/pages. It's part of a larger effort I'm organizing to archive pages and materials related to the genre of immersive art. In this case, REA is the sole documentation for a lot of these experiences, many of which have since disappeared
03:21:57<h2ibot>Tech234a edited Deathwatch (+177, /* 2024 */ Koji): https://wiki.archiveteam.org/?diff=51479&oldid=51478
03:23:20tzt quits [Ping timeout: 240 seconds]
03:23:39<Laura-CFIA>I've been in contact with the site owner and they're willing to add code to the site if that's necessary for archiving purposes
03:26:43<nicolas17>that looks like a straightforward wordpress blog, no weird javascript stuff
03:27:40<nicolas17>JAA: we can probably throw the homepage into archivebot and let it crawl
03:32:22BearFortress joins
03:39:16<@JAA>Yep, started.
03:52:03Doranwen (Doranwen) joins
03:57:42ctag (ctag) joins
04:15:41<Laura-CFIA>Rad, thank you!
04:15:41tzt (tzt) joins
04:22:42<nicolas17>Laura-CFIA: a friend *makes* escape rooms but I don't feel like I could write a review with this quality
04:25:19<Laura-CFIA>nicolas17 Yeah, haha! They're the best around, been doing it since escape rooms started around 2014 (so 10 years now, kind of amazing)
04:27:42<Laura-CFIA>I have a general question, also... there are several other sites I'd love to add to the archive eventually, is it easiest to just come in here and make the request? I'm semi-comfortable with IRC commands but I don't want to mess anything up
04:28:20<nicolas17>yeah
04:28:34<nicolas17>for some specific websites we have specific channels and specialized tooling
04:29:06<nicolas17>anyone can go to #imgone and run "!a https://i.imgur.com/0NjLWyR.jpg" to archive something from imgur
04:29:09<Laura-CFIA>Great, thanks! Is there any kind of guideline on how often a site should be crawled? For example with REA and some of these other ones, they're posting multiple articles a day
04:29:13<Laura-CFIA>Ooh, interesting!
04:29:53<nicolas17>and it won't just get that one URL, it will extract the image ID 0njLWyR and get the webpage, image, and some other stuff
04:31:02<nicolas17>#archivebot for generic archival is restricted, only users with +o or +v permissions can add stuff, but just ask and someone can add it for you or tell you why not
04:31:56<nicolas17>and if you're going to stick around you should probably get a real IRC client instead of using the webchat ;)
04:38:15<Laura-CFIA>Hahah, I can do that :) Thank you!
04:50:16qwertyasdfuiopghjkl quits [Ping timeout: 265 seconds]
04:51:11<@OrIdow6^2>Though !tell bot kinda makes webchat more useful than it used to be
04:52:57<project10>eggdrop++
04:52:57<eggdrop>[karma] 'eggdrop' now has 6 karma!
04:55:59<fireonlive>:D
04:58:36ell quits [Client Quit]
04:58:44ell (ell) joins
05:03:14ell quits [Client Quit]
05:03:23ell (ell) joins
05:35:19qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
05:44:21DogsRNice quits [Read error: Connection reset by peer]
06:08:01fuzzy8021 quits [Ping timeout: 272 seconds]
06:09:02fuzzy8021 joins
06:09:02fuzzy8021 quits [Changing host]
06:09:02fuzzy8021 (fuzzy8021) joins
06:13:51Island quits [Read error: Connection reset by peer]
06:37:42shreyasminocha quits [Read error: Connection reset by peer]
06:37:42evan quits [Read error: Connection reset by peer]
06:37:50c3manu quits [Ping timeout: 240 seconds]
06:37:50thehedgeh0g quits [Ping timeout: 240 seconds]
06:39:41systwi quits [Ping timeout: 272 seconds]
06:49:36systwi (systwi) joins
06:54:06Laura-CFIA quits [Remote host closed the connection]
07:09:14f00dgroupie joins
07:10:13f00dgroupie leaves
07:27:39lennier2 joins
07:30:59lennier2_ quits [Ping timeout: 272 seconds]
07:31:10lennier2_ joins
07:36:03lennier2 quits [Ping timeout: 272 seconds]
07:50:31qwertyasdfuiopghjkl quits [Client Quit]
07:55:03thehedgeh0g (mrHedgehog0) joins
07:55:28c3manu (c3manu) joins
07:55:55shreyasminocha (shreyasminocha) joins
07:56:03lennier2_ quits [Read error: Connection reset by peer]
07:56:05fireonlive is now known as you
07:56:10you is now known as fireonlive
07:56:16lennier2_ joins
07:56:35evan joins
08:04:09evan quits [Read error: Connection reset by peer]
08:04:10shreyasminocha quits [Read error: Connection reset by peer]
08:04:10lennier2 joins
08:04:11thehedgeh0g quits [Read error: Connection reset by peer]
08:04:13c3manu quits [Read error: Connection reset by peer]
08:05:16thehedgeh0g (mrHedgehog0) joins
08:06:07c3manu (c3manu) joins
08:06:36shreyasminocha (shreyasminocha) joins
08:06:50lennier2_ quits [Ping timeout: 240 seconds]
08:07:16evan joins
08:14:56Arcorann (Arcorann) joins
08:29:14nexusxe quits [Read error: Connection reset by peer]
08:32:20thehedgeh0g quits [Ping timeout: 240 seconds]
08:33:20shreyasminocha quits [Ping timeout: 240 seconds]
08:33:20c3manu quits [Ping timeout: 240 seconds]
08:33:50evan quits [Ping timeout: 240 seconds]
08:42:50thehedgeh0g (mrHedgehog0) joins
08:42:50c3manu (c3manu) joins
08:42:59shreyasminocha (shreyasminocha) joins
08:43:36shreyasminocha quits [Remote host closed the connection]
08:43:36c3manu quits [Remote host closed the connection]
08:43:36thehedgeh0g quits [Remote host closed the connection]
08:43:40evan joins
08:43:42c3manu (c3manu) joins
08:43:43thehedgeh0g (mrHedgehog0) joins
08:43:43shreyasminocha (shreyasminocha) joins
08:47:50evan quits [Ping timeout: 240 seconds]
08:48:09c3manu quits [Read error: Connection reset by peer]
08:48:20shreyasminocha quits [Ping timeout: 240 seconds]
08:48:20thehedgeh0g quits [Ping timeout: 240 seconds]
08:50:20evan joins
08:53:30evan quits [Read error: Connection reset by peer]
08:54:57thehedgeh0g (mrHedgehog0) joins
08:54:59c3manu (c3manu) joins
08:55:10shreyasminocha (shreyasminocha) joins
08:55:17evan joins
09:00:40evan quits [Read error: Connection reset by peer]
09:00:40c3manu quits [Read error: Connection reset by peer]
09:00:40shreyasminocha quits [Read error: Connection reset by peer]
09:01:50thehedgeh0g quits [Ping timeout: 240 seconds]
09:50:47<@arkiver>nicolas17: just saw https://hackint.logs.kiska.pw/archiveteam-bs/20240103#c399918 - always feel free to get this data and upload it
09:50:52<@arkiver>especially in these interesting cases
09:52:11Ruthalas59 quits [Client Quit]
09:52:33Ruthalas59 (Ruthalas) joins
10:00:01Bleo18260 quits [Client Quit]
10:01:17Bleo18260 joins
10:06:25c3manu (c3manu) joins
10:06:42thehedgeh0g (mrHedgehog0) joins
10:06:59evan joins
10:08:08shreyasminocha (shreyasminocha) joins
10:11:53shreyasminocha quits [Read error: Connection reset by peer]
10:11:54c3manu quits [Read error: Connection reset by peer]
10:11:54thehedgeh0g quits [Read error: Connection reset by peer]
10:11:54evan quits [Read error: Connection reset by peer]
10:21:53bocci (bocci) joins
10:34:20bocci quits [Remote host closed the connection]
10:39:14bocci (bocci) joins
11:32:46eroc19905 (eroc1990) joins
11:35:27eroc1990 quits [Ping timeout: 272 seconds]
12:03:24Wohlstand (Wohlstand) joins
12:21:41HotSwap quits [Ping timeout: 272 seconds]
12:39:13decky_e quits [Read error: Connection reset by peer]
12:45:45Arcorann quits [Ping timeout: 272 seconds]
13:30:38HotSwap joins
13:54:16RealPerson leaves
14:33:20eroc19905 is now known as eroc1990
14:42:42<@JAA>My qwarc grab of southpark.cc.com finished some hours ago and seems to have successfully grabbed almost everything. There are a couple ancient broken topics that return an error page, but otherwise, I didn't see any significant problems.
14:49:16<@JAA>182775 topics, 218971 topic pages retrieved according to my log. That's about 10k short of the counter on the homepage, but that's not unexpected. There were some login-required topics, though I haven't looked into whether there are areas of the forums accessible by anyone with an account.
14:51:27<@JAA>I'm also running an update thingy that will continue to grab new posts every few minutes until the site goes down. Although there's little of value there; it's all spam.
15:03:21<@JAA>797732 posts in those topic pages vs ~870k per the homepage.
15:23:18qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
15:26:48katocala quits [Remote host closed the connection]
15:56:50itachi1706 quits [Ping timeout: 240 seconds]
16:01:03itachi1706 (itachi1706) joins
16:21:39<fireonlive>^_^
16:22:04<fireonlive>> Google is shutting down websites built with Google Business Profiles in March 2024. (via #archiveteam) sheesh lol
16:22:47<h2ibot>Nulldata edited Deathwatch (+274, /* 2024 */ Added Google Business Profile Websites): https://wiki.archiveteam.org/?diff=51480&oldid=51479
16:23:15<fireonlive>i’ve seen business.site in use but not negocio.site before, must be a regional thing
16:23:55<nulldata>Death eventually comes for ~~all of us~~ every Google property.
16:25:47<fireonlive>true!
16:28:33qwertyasdfuiopghjkl quits [Remote host closed the connection]
16:32:50qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
16:45:01bocci quits [Remote host closed the connection]
16:45:15bocci (bocci) joins
16:57:02riku quits [Quit: WeeChat 4.1.2]
16:57:56<h2ibot>YetAnotherArchiver edited The WARC Ecosystem (+751, Create a wikitable for deprecated tools): https://wiki.archiveteam.org/?diff=51481&oldid=51454
16:57:57<h2ibot>Ufarwisan edited Discord (+131, update): https://wiki.archiveteam.org/?diff=51482&oldid=50931
16:57:58<h2ibot>Ufarwisan edited Pastebin (-346, the wayback machine has begun to ignore the…): https://wiki.archiveteam.org/?diff=51483&oldid=51460
16:57:59<h2ibot>Ufarwisan edited Matrix (+92, /* Archival tools */): https://wiki.archiveteam.org/?diff=51484&oldid=46312
16:58:00<h2ibot>RealPerson edited List of website hosts (+42, added https://www.000webhost.com/): https://wiki.archiveteam.org/?diff=51485&oldid=51453
17:01:54RealPerson joins
17:13:41<fireonlive>ahh... 000webhost....
17:18:11<nicolas17>arkiver: that iOS beta has been archived via archivebot
17:18:24<nicolas17>it took really long
17:21:17<nicolas17>I know someone who has an archive of ~all iOS builds (including some that Apple has since deleted), it's like 50TB...
17:23:51katocala joins
17:27:12Naruyoko5 joins
17:28:51Naruyoko quits [Ping timeout: 272 seconds]
17:31:51<nicolas17>arkiver: I'd like some advice on the samsung open source thing, but we seem to have non-overlapping activity times on IRC :P
17:33:38Naruyoko joins
17:35:20Naruyoko5 quits [Ping timeout: 240 seconds]
17:43:20bocci quits [Ping timeout: 240 seconds]
17:54:41c3manu (c3manu) joins
18:07:48<nicolas17>(or your pings are still broken)
18:28:28shreyasminocha (shreyasminocha) joins
18:28:56evan joins
18:29:15c3manu_ (c3manu) joins
18:29:18thehedgeh0g (mrHedgehog0) joins
18:57:52Gooshka (Gooshka) joins
18:58:35<Gooshka>https://www.icc-cpi.int/streaming-all-displays - streams of ICC, I guess these videos can be saved as radio recordings are saved by the IA
19:07:28Gooshka quits [Remote host closed the connection]
19:18:00<@JAA>The spam on the South Park forums seems to have started on 2023-06-14 or so. Initially only a few topics daily.
19:18:22<@JAA>I'm beginning to get a shutdown message randomly.
19:19:12riku (riku) joins
19:22:16Dango360_ joins
19:24:59<@JAA>Now it's solidly the shutdown message.
19:25:50<@JAA>There were about 96k topic IDs before the spam began, and there were about 265k topic IDs just before the shutdown.
19:26:01Dango360 quits [Ping timeout: 272 seconds]
19:26:17<@JAA>So just over a third of all topics are not spam...
19:29:00<nicolas17>finding spam on the internet is like finding hay in a haystack
19:31:03<@JAA>Usually, it gets deleted though. They clearly didn't give a shit for half a year, then decided to shut the forums down instead.
19:31:42<nicolas17>bet they laid off the moderators
19:33:08riku quits [Client Quit]
19:35:35<fireonlive>>Howdy Ho, South Park fans! The South Park Forums might be closed, but fear not, our bond’s as solid as Cartman’s love for Cheesy Poofs! Join us (@SouthPark) on our social channels for news, updates and more.
19:35:38<fireonlive>lol
19:35:59<nicolas17>hot take, moderating a forum is easier than moderating a discord
19:36:07<fireonlive>i like how the <title> of that page is "Social Media Layout"
19:36:43c3manu quits [Client Quit]
19:36:43c3manu_ is now known as c3manu
19:37:02<fireonlive>i'd believe that, a little less 'real-time' perhaps?
19:37:55<fireonlive>https://images.paramount.tech/path/mgid:file:gsp:entertainment-assets:/sps/shared/forum/BoysWaving-800px.png < interesting url for the bye image
19:38:03<fireonlive>mgid, etc
19:40:41<fireonlive>(anyone know what that is?)
19:43:00<@JAA>Interesting, avatars still work but there's no geo-IP redirect on those URLs.
19:43:31<fireonlive>oh huh
19:46:34_Dango360 joins
19:49:38<@JAA>Oh, there were attachments.
19:50:43Dango360_ quits [Ping timeout: 272 seconds]
19:50:53<@JAA>Those pages still work, but the attachments seem to be gone. Maybe that's a relic from the ancient times.
19:51:19superkuh joins
19:51:32<@JAA>> <p>The selected attachment does not exist anymore.</p>
19:51:44<@JAA>E.g. https://southpark.cc.com/forum/download/file.php?id=1358 (highest ID)
19:58:20hitgrr8 quits [Ping timeout: 240 seconds]
20:05:29<@JAA>A lot of the avatars are 404s, actually. Either they're deleting them right now, or they were already broken, can't tell.
20:05:36<@JAA>I'm grabbing whatever's left though.
20:14:46Megame (Megame) joins
20:17:19cm quits [Ping timeout: 272 seconds]
20:19:28<fireonlive>:)
20:19:56cm joins
20:23:57riku (riku) joins
20:33:56Island joins
20:34:43<thuban>nice work!
20:37:14<fireonlive>JAA++
20:37:15<eggdrop>[karma] 'JAA' now has 11 karma!
20:37:32<fireonlive>Paramount--
20:37:32<eggdrop>[karma] 'Paramount' now has -1 karma!
20:39:08<@JAA>:-)
20:45:27<@JAA>5.00G/5.00G [01:22<00:00, 65.2MiB/s]
20:45:32<@JAA>Nice upload speed :-)
20:45:52DogsRNice joins
20:47:32DogsRNice_ joins
20:49:16<fireonlive>=]
20:51:20DogsRNice quits [Ping timeout: 240 seconds]
21:06:52BlueMaxima joins
21:16:28<@JAA>Turns out that those attachments I saw were only introduced in 2022: https://web.archive.org/web/20240106091745/https://southpark.cc.com/forum/viewtopic.php?f=2&t=94997
21:16:52<fireonlive>ahh
21:18:01<@JAA>There are three attachments in the WBM, all captured about a year ago: https://web.archive.org/web/*/https://southpark.cc.com/forum/download/file.php*
21:19:07<@JAA>So it broke sometime in the past 11 months or so, I guess.
21:19:23<@JAA>Or I was just too slow today.
21:26:36jacksonchen666 (jacksonchen666) joins
21:28:08Wohlstand quits [Remote host closed the connection]
21:37:01Wohlstand (Wohlstand) joins
21:51:42jacksonchen666 quits [Ping timeout: 255 seconds]
22:07:37<nulldata>fireonlive - RE the URL, it's probably from the DAM Paramount is using - likely https://www.opentext.com/products/media-management
22:08:18<fireonlive>ohh interesting
22:09:03<fireonlive>custom URI schemes for everything is neat :)
22:13:36qwertyasdfuiopghjkl quits [Remote host closed the connection]
22:14:31qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
22:14:31qwertyasdfuiopghjkl quits [Excess Flood]
22:14:48qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
22:18:26qwertyasdfuiopghjkl quits [Excess Flood]
22:18:29<fireonlive>found this weird non-redirecting subdomain but seems the same story for files: https://forums.southpark.cc.com/forum/download/file.php?id=34
22:18:41qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
22:19:05<fireonlive>the api.php actually exposes a 'direct from mediawiki' error instead of 'covering it up' with a generic page https://forums.southpark.cc.com/w/api.php
22:19:18<fireonlive>(also, seemingly no geo-redirect)
22:20:36<fireonlive>see also: https://forums.southpark.cc.com/wiki/Special:RecentChanges vs https://southpark.cc.com/wiki/Special:RecentChanges
22:31:56<@JAA>Yeah, I saw that subdomain earlier (wiki page creation with all the various forum URLs soon). The avatars are also served from that domain.
22:32:18<@JAA>Interesting that it serves the wiki, too.
22:33:02<fireonlive>ahh
22:33:23<fireonlive>indeed hm
22:34:44qwertyasdfuiopghjkl quits [Remote host closed the connection]
22:36:04qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
22:36:17BlueMaxima_ joins
22:37:50qwertyasdfuiopghjkl quits [Excess Flood]
22:38:45<DogsRNice_>https://twitter.com/JoeyCheerio/status/1745143832881098845
22:38:45<eggdrop>nitter: https://nitter.net/JoeyCheerio/status/1745143832881098845
22:38:47<DogsRNice_>https://twitter.com/JoeyCheerio/status/1745150271230038228
22:39:49BlueMaxima quits [Ping timeout: 272 seconds]
22:40:08<fireonlive>sheesh, what's with the DMCA takedowns on REing lately lol
22:40:14qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
22:40:36<DogsRNice_>especally with valve, they dont do that often
22:41:33<DogsRNice_>did anyone archive the portal 64 rom patcher?
22:44:05pedatic-darwin joins
22:44:36<pedatic-darwin>greetings
22:44:47<pedatic-darwin>i was forwarded here by https://findyoutubevideo.thetechrobo.ca/
22:45:08<pedatic-darwin>how do i go about requesting a deleted youtube video
22:45:16<TheTechRobo>you are probably looking for #youtubearchive, not #archiveteam-bs
22:45:18<TheTechRobo>/join #youtubearchive
22:45:24<pedatic-darwin>thank you, my mistake
22:46:20hackbug quits [Remote host closed the connection]
22:50:53hackbug (hackbug) joins
22:59:49pedantic-darwin joins
23:01:06pedatic-darwin quits [Remote host closed the connection]
23:12:07<h2ibot>JustAnotherArchivist created South Park Forums (+2096, Created page with "{{Infobox project | URL =…): https://wiki.archiveteam.org/?title=South%20Park%20Forums
23:13:07<h2ibot>JustAnotherArchivist edited Deathwatch (+36, /* 2024 */ Add South Park Forums): https://wiki.archiveteam.org/?diff=51487&oldid=51480
23:31:02robin-rpr joins
23:34:33parfait (kdqep) joins
23:34:33ctag quits [Read error: Connection reset by peer]
23:35:26ctag (ctag) joins
23:42:43parfait quits [Client Quit]
23:55:31Letur quits [Quit: Ping timeout (120 seconds)]
23:56:53Letur joins