00:00:56<nicolas17>fireonlive: wait what did you archivebot? the release notes page?
00:05:42<fireonlive>nicolas17: te
00:05:43<fireonlive>te
00:05:44<fireonlive>ye
00:06:00<nicolas17>probably won't work, it's a JS-infested SPA :P
00:06:06<nicolas17>maybe I should SPN it
00:06:58<h2ibot>Switchnode edited Template:CTA URL lists (-129, move category link to hat position; tighten up…): https://wiki.archiveteam.org/?diff=51461&oldid=51240
00:13:33<fireonlive>rip
00:14:16<fireonlive>much nicer, re: CTA URL
00:23:01lflare quits [Killed (ing.hackint.org (Nickname regained by services))]
00:23:03lflare (lflare) joins
00:23:59lunik173 quits [Remote host closed the connection]
00:24:36lunik173 joins
00:43:05<h2ibot>Switchnode edited URLs (-74, replace CTA... carefully!): https://wiki.archiveteam.org/?diff=51462&oldid=51456
00:45:05bocci_ quits [Ping timeout: 272 seconds]
00:54:04qwertyasdfuiopghjkl quits [Client Quit]
00:54:17parfait (kdqep) joins
00:55:08<h2ibot>Switchnode edited ARGENTeaM (+0, private donkey kong?): https://wiki.archiveteam.org/?diff=51463&oldid=51452
00:55:45qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
00:55:57<tomodachi94>@jacksonchen666:hackint.org: You beat me to the 0w0.is and gender.systems shutdown lol
00:58:50parfait quits [Ping timeout: 240 seconds]
00:59:21<fireonlive>pls add to deathwatch
01:13:11<h2ibot>Switchnode edited Deathwatch (+339, /* 2024 */ add 0w0.is and gender.systems): https://wiki.archiveteam.org/?diff=51464&oldid=51457
01:17:18<tomodachi94>Damnit I just edited that too :(
01:17:55<tomodachi94>Is there a reason for an edit moderation queue on the wiki?
01:18:09<thuban>tomodachi94: just drive-by spammers
01:19:12<thuban>what do we even do with mastodon these days? still totally borked in archivebot, right?
01:19:32DogsRNice joins
01:32:38<fireonlive>yeah it’s fucked in AB since v4.?
01:32:43<fireonlive>no known
01:32:47<fireonlive>procedure at the moment
01:39:17<tomodachi94>That's unfortunate
01:41:04<fireonlive>yeah :/
01:41:12<fireonlive>thank the devs who removed the no js fallback
01:44:37monoxane (monoxane) joins
01:46:55monoxane7 (monoxane) joins
01:48:50monoxane quits [Ping timeout: 240 seconds]
01:48:50monoxane7 is now known as monoxane
01:51:36Doranwen (Doranwen) joins
01:51:37<eggdrop>[tell] Doranwen: [2024-01-03T19:28:09Z] <fireonlive> do you have a wiki account?
01:52:30<Doranwen>fireonlive: No, I haven't gotten one yet. I've hardly edited wikis so it takes eternally long looking up the syntax because I don't know it yet. Which means I tend to avoid doing it, lol.
01:52:47<fireonlive>ah :)
01:52:55<fireonlive>I know that feeling
01:53:28<fireonlive>we can always supply a little fixes after tho
01:54:13<Doranwen>Yeah, I need to just get around to it… eventually… there always seems to be something more important, lol.
01:54:20tbc1887 quits [Ping timeout: 240 seconds]
01:55:24<fireonlive>ye :$
01:55:26<fireonlive>:)
02:07:38tbc1887 (tbc1887) joins
02:13:58Ruthalas59 quits [Client Quit]
02:44:19Mateon2 joins
02:45:20Mateon1 quits [Ping timeout: 240 seconds]
02:45:20Mateon2 is now known as Mateon1
03:16:37parfait (kdqep) joins
03:50:50parfait quits [Client Quit]
04:13:24Ruthalas59 (Ruthalas) joins
04:21:42BlueMaxima_ joins
04:23:50BlueMaxima quits [Ping timeout: 240 seconds]
04:30:33treora quits [Ping timeout: 272 seconds]
04:50:57nic90705 (nic) joins
04:51:27nic9070 quits [Ping timeout: 272 seconds]
04:51:27nic90705 is now known as nic9070
06:12:28Island quits [Read error: Connection reset by peer]
06:20:43AlsoHP_Archivist joins
06:21:03c3manu (c3manu) joins
06:23:50HP_Archivist quits [Ping timeout: 240 seconds]
06:25:37<mgrandi>Pedrosso: it seems that my steam workshop downloader works fine with portal 2
06:44:39treora joins
07:14:10c3manu quits [Remote host closed the connection]
07:42:21Arcorann (Arcorann) joins
08:01:07decky_e quits [Read error: Connection reset by peer]
08:01:31decky_e joins
08:20:30<mgrandi>i have a few bugs i need to work out, such as deleting files as they get downloaded so you aren't duplicating space within the "steamcmd" folder and cleaing up the code but it seems to work
08:29:55DogsRNice quits [Read error: Connection reset by peer]
08:57:37qwertyasdfuiopghjkl quits [Ping timeout: 259 seconds]
09:00:27Chris5010 (Chris5010) joins
09:07:13<mgrandi>example log of it running for 1 page of the workshop (30 items): https://gist.github.com/mgrandi/a1f7dfeae765890e91b987debadf3d09
10:00:03Bleo18260 quits [Client Quit]
10:01:24Bleo18260 joins
10:09:13<Pedrosso>Well, that's great
10:42:25sec^nd quits [Remote host closed the connection]
10:42:47sec^nd (second) joins
11:02:00igloo22225 quits [Quit: The Lounge - https://thelounge.chat]
11:02:26igloo22225 (igloo22225) joins
11:07:38qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
11:25:49bocci_ joins
11:49:17<h2ibot>OrIdow6 edited Google Drive (+376, Pubhtml pages): https://wiki.archiveteam.org/?diff=51468&oldid=51399
11:50:05Earendil7_ quits [Ping timeout: 272 seconds]
11:50:43Earendil7 (Earendil7) joins
11:55:50Mist8kenGAS quits [Read error: Connection reset by peer]
11:57:19<h2ibot>OrIdow6 edited Google Drive (+153, /* Notes */ /pub): https://wiki.archiveteam.org/?diff=51469&oldid=51468
12:01:03shreyasminocha quits [Remote host closed the connection]
12:01:03evan quits [Read error: Connection reset by peer]
12:01:03thehedgeh0g quits [Remote host closed the connection]
12:01:07evan joins
12:01:11thehedgeh0g (mrHedgehog0) joins
12:01:11shreyasminocha (shreyasminocha) joins
12:03:25lflare quits [Client Quit]
12:11:36lflare (lflare) joins
12:21:26qwertyasdfuiopghjkl quits [Remote host closed the connection]
12:23:01qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
12:34:56lflare is now known as RJHacker14101
12:34:58lflare (lflare) joins
12:35:03RJHacker14101 quits [Ping timeout: 272 seconds]
12:36:39BlueMaxima_ quits [Read error: Connection reset by peer]
12:42:39lflare quits [Ping timeout: 272 seconds]
12:51:06lflare (lflare) joins
13:01:01Arcorann quits [Ping timeout: 272 seconds]
13:35:47<nulldata>https://twitter.com/TafferKing451/status/1742646316927459685
13:35:47<eggdrop>nitter: https://nitter.net/TafferKing451/status/1742646316927459685
13:36:40<nulldata>More layoffs at 3D Realms and Slipgate. Probably wouldn't be a bad idea to throw them in AB.
13:37:05<nulldata>https://3drealms.com/
13:37:22<nulldata>https://www.slipgate-ironworks.com/
13:38:31<nulldata>Also https://twitter.com/SlipgateIron https://twitter.com/3DRealms
13:38:31<eggdrop>nitter: https://nitter.net/SlipgateIron https://nitter.net/3DRealms
13:57:59<Barto>nulldata: i took care of the websites
14:02:30<nulldata>Thanks!
14:12:44<Exorcism>!tell Megame I was able to obtain a list of subdomains for https://cgsociety.org/ (If you want to put them in the archivebot): https://chibi.mint.lgbt/s/oBrR9I7jVS2Y
14:12:44<eggdrop>[tell] ok, I'll tell Megame when they join next
15:02:00<ctag>If someone with AB would grab http://www.vbas.org please, and let me know.
15:02:19<ctag>I guess setting that up broke the https redirect, so I'm hoping to put it back to normal soonish
15:04:41<Barto>ctag: let me test this :-)
15:05:58<Barto>is the ftp still working?
15:08:53<ctag>Uhhh, probably not
15:09:03<Barto>dayum
15:09:19<ctag>I do have a backup of it somewhere around here though
15:09:41<ctag>We made a big push to migrate to google drive this past year, to make files more accessible to regular membership
15:09:58<ctag>So most of it should be over there too
15:11:13<ctag>That FTP server is still running though. It's a 32-bit SuSe Linux machine that we got from government auction around 200(2?)
15:11:25<ctag>4x500GB Raid
15:11:37<Barto>lol, the trusty old beast
15:11:38<ctag>Was hot stuff back in the day, I imagine.
15:11:58<ctag>But it's never needed any maintenance. Has just ran under a desk for 20 years straight
15:13:17<Barto>yeah, cant reach it from here
15:14:24<ctag>Hmm. Try ftp.vbas.org?
15:14:32<ctag>I don't have an ftp client at this computer
15:15:13redrock joins
15:15:28redrock quits [Remote host closed the connection]
15:19:46<ctag>Nope, there's no ftp daemon running anymore, I just checked. We switched to ssh-rsync and filebrowser apparently.
15:20:47<ctag>The blog is broken too, Hrm
15:28:22<ctag>:D
15:28:27<ctag>Blog should be fixed-ish now
15:28:39<ctag>Looks like a lot of image resources are missing though
15:29:12<ctag>I wish we had a copy of the website pre-2011 :-/ Oh well
15:34:59<ctag>! There's versions in wayback machine back to 1999!
15:35:21<ctag>I'm gonna go tell our society historian haha. How have I not checked that before
15:36:20treora quits [Ping timeout: 240 seconds]
15:37:08<Exorcism>Barto: can you archive this website with AB ? https://0w0.is/
15:43:46treora joins
15:45:33<ctag>Hrm. The version that I'm trying to save seem well preserved on wayback machine. I'm not sure if it's a good idea to save another copy now.
15:45:52<ctag>My original goal was to keep the old site available as an archive on-site for our organization
15:46:15<ctag>Is there a way to retrieve warcs from wayback machine?
15:55:07<Barto>i let others do Mastodon Exorcism
15:59:22<Exorcism>👍️
16:00:50bocci_ quits [Ping timeout: 240 seconds]
16:11:18<ctag>Barto, I think I'm going to yank that url redirect. I'm less sure if it's a good idea to archive it on IA like this.
16:14:01<joepie91|m><Exorcism> "Barto: can you archive this..." <- note, mastodon content should only be archived with consent
16:14:18<joepie91|m>or well, fedi content* I suppose
16:29:14bocci (bocci) joins
16:39:17<nulldata>Can someone throw this into AB? A well known GTA modder is leaving the scene. https://zolika1351.pages.dev/
16:42:42<Barto>joepie91|m: i dont have the history on this, so i cant make any comment on why we do it this way
16:44:17<nulldata>Thanks Barto!
16:44:35<Barto>nulldata: not sure when i'll do twitter, but it's in my account list
16:45:51treora quits [Ping timeout: 272 seconds]
16:46:43<nulldata>He has a Discord too, but invite links are dead - I am still in it though. If I have get a moment later I'll look into archiving it.
16:47:40<Barto>#discard :-)
16:48:45treora joins
16:49:00<@arkiver>Barto: what do you mean "this way"?
16:57:31<Barto>arkiver: the fact that we dont usually throw mastodon instances in AB.
17:01:03bocci quits [Ping timeout: 272 seconds]
17:01:21<fireonlive>can add the twitters to the queue at https://pad.notkiska.pw/p/archivebot-twitter too
17:01:37bocci (bocci) joins
17:11:36<Barto>good idea. will need to do a bit of cleanup and check capitalization of handle firsts
17:11:39<Barto>first*
17:12:03<fireonlive>:)
17:14:16aprego (apregoa) joins
17:17:02<aprego>after a website is excluded from the wayback machine, is it possible to download snapshots of the site?
17:20:16<aprego>is it true you can still find the WARCs in collections?
17:26:11bocci quits [Client Quit]
17:31:10<Barto>fireonlive: added, i think you can add case ok to all of them
17:32:42<Barto>it's a mix of space stuffs, gabonese handles, owasp drama, hacked company handles, swiss catholic groups, and some other proactive shit.
17:58:55<fireonlive>thanks :)
18:01:16bocci (bocci) joins
18:14:15<TheTechRobo>aprego: Depends on how the site was archived
18:14:38<aprego>TheTechRobo: SavePageNow mostly
18:19:52mr_sarge quits [Read error: Connection reset by peer]
18:22:12<nicolas17>aprego: I think savepagenow WARCs are *always* inaccessible in order to support the *possibility* that they may get excluded from wayback machine in the future
18:24:01raxxy-137409 quits [Ping timeout: 272 seconds]
18:24:08raxxy-137409 joins
18:24:16mr_sarge (sarge) joins
18:27:06<aprego>i guess it's over
18:33:15bocci quits [Client Quit]
18:52:53DogsRNice joins
18:55:43<SketchCow>Testflight Crashland Project was taken down off Internet Archive and Wayback
18:56:58<Barto>damn
19:16:23c3manu (c3manu) joins
19:21:33<Dango360>"Where do all the saved files go? Files are ultimately uploaded to Internet Archive on the archiveteam collection." we kept the warcs, right?
19:25:27<nicolas17>Dango360: you mean for testflight?
19:25:38<nicolas17>it's possible the warcs are still on IA's servers but they are not publicly accessible
19:25:54<audrooku|m>it is my understanding that that is IA's policy yes
19:26:03<fireonlive>oh.
19:26:19<fireonlive>sad news :(
19:26:37<fireonlive>i was just coming here to relay that the discord said to "Under no circumstance, for any reason, upload another copy of the data to the Internet Archive/archive.org."
19:26:44<fireonlive>and was confused as to why, but now I know
19:27:14<fireonlive>s/said/pinged @here/
19:28:54sum1 joins
19:32:39<audrooku|m>which discord?
19:32:58<fireonlive>"TestFlight"
19:33:01<fireonlive>one sec
19:33:11<nicolas17>"kids talking about the testflight leak" discord
19:33:12<fireonlive>i'll PM you to keep it out of here
19:33:14<fireonlive>yeah
19:33:57<fireonlive>this whole testflight 'leak' thing is a big can of shitfuckery that was presented in bad faith from the discovering party and then clickbait media being clickbait media ran foaming at the mouth with it
19:34:01<fireonlive>from what i understand anyways
19:34:07<fireonlive>(it was never a leak)
19:34:40<fireonlive>at least some tried to rebrand it later to something else, but i don't think it stuck
19:45:20treora quits [Ping timeout: 240 seconds]
19:46:44sum1 quits [Remote host closed the connection]
19:56:06treora joins
20:43:07<@JAA>Exorcism's gender.systems subdomain list at https://chibi.mint.lgbt/s/PddyAUliT1qk (from #archiveteam earlier) in plain text rather than using the world's worst pastebin: matrix.gender.systems chat.im.gender.systems matrix.im.gender.systems im.gender.systems read.gender.systems
20:44:36<fireonlive>"<noscript><strong>We're sorry but chibisafe doesn't work properly without JavaScript enabled. Please enable it to continue.</strong></noscript>"
20:44:37<fireonlive>ah
20:45:58<@JAA>Ditto for https://chibi.mint.lgbt/s/oBrR9I7jVS2Y from here earlier: https://transfer.archivete.am/inline/10tKkw/cgsociety.org-subdomains
20:46:49<@JAA>Imagine requiring JS to literally just render a few lines of plain text...
20:47:52<fireonlive>the future is here :D
20:58:46<Exorcism>fireonlive, JAA: there is also the RAW version 😭 : https://chibi.mint.lgbt/api/snippet/oBrR9I7jVS2Y/raw
21:16:55Mateon1 quits [Remote host closed the connection]
21:26:19Mateon1 joins
21:33:43BlueMaxima joins
21:35:32c3manu quits [Remote host closed the connection]
21:39:22<joepie91|m>Barto: I don't know the exact history of how things went with mastodon stuff *in archiveteam* but as a general rule, fedi instances tend to be deliberately ephemeral and not friendly towards scraping
21:39:42<joepie91|m>and heavily focused around consent
21:41:11<joepie91|m>fedi instances tend to be closer to someone's living room than to a public square
21:43:12<@OrIdow6>I wasn't there for the incident that prompted that rule but I do think that enough time has passed that it can be written up about on the wiki
21:51:34Gooshka (Gooshka) joins
21:52:20<Gooshka>List of Geonames sources, may be useful for saving government content around the world https://www.geonames.org/datasources/
22:02:07Gooshka quits [Ping timeout: 265 seconds]
22:05:24Island joins
22:24:05<nicolas17>given a .warc, do I have any chance of compressing it back to the original .warc.gz? like gzipping each record in the same way wget-at does?
22:24:54<@JAA>I'm not aware of tooling for that. Bit-identical output would likely be virtually impossible.
22:25:19<fireonlive>https://static.space/ < interesting/strange
22:25:29<fireonlive>via https://static.space/sha2-256:d5b215dd588bda164aca31a2eb08aab56ac64bc03cef24e3b67e35654316a446
22:25:44<@JAA>It's one of those things I intend to support in my WIP tooling though.
22:28:21<h2ibot>Switchnode edited Template:CTA URL lists (+24, additional fixes): https://wiki.archiveteam.org/?diff=51470&oldid=51461
22:42:53<Barto>joepie91|m: that is kinda why i said i'd let others do it, hoping they have a better view on the mastodon event. Same with the process to take if we want to crawl it, others may have a better experience than me.
23:06:24jacksonchen666 quits [Ping timeout: 255 seconds]
23:08:26jacksonchen666 (jacksonchen666) joins
23:17:19aprego quits [Remote host closed the connection]
23:30:27Arcorann (Arcorann) joins
23:34:21celestial quits [Ping timeout: 272 seconds]
23:35:41celestial joins
23:43:48Lord_Nightmare quits [Quit: ZNC - http://znc.in]
23:49:20Lord_Nightmare (Lord_Nightmare) joins