00:00:56 | <nicolas17> | fireonlive: wait what did you archivebot? the release notes page? |
00:05:42 | <fireonlive> | nicolas17: te |
00:05:43 | <fireonlive> | te |
00:05:44 | <fireonlive> | ye |
00:06:00 | <nicolas17> | probably won't work, it's a JS-infested SPA :P |
00:06:06 | <nicolas17> | maybe I should SPN it |
00:06:58 | <h2ibot> | Switchnode edited Template:CTA URL lists (-129, move category link to hat position; tighten up…): https://wiki.archiveteam.org/?diff=51461&oldid=51240 |
00:13:33 | <fireonlive> | rip |
00:14:16 | <fireonlive> | much nicer, re: CTA URL |
00:23:01 | | lflare is now authenticated as * |
00:23:01 | | lflare quits [Killed (ing.hackint.org (Nickname regained by services))] |
00:23:03 | | lflare (lflare) joins |
00:23:59 | | lunik173 quits [Remote host closed the connection] |
00:24:36 | | lunik173 joins |
00:43:05 | <h2ibot> | Switchnode edited URLs (-74, replace CTA... carefully!): https://wiki.archiveteam.org/?diff=51462&oldid=51456 |
00:45:05 | | bocci_ quits [Ping timeout: 272 seconds] |
00:54:04 | | qwertyasdfuiopghjkl quits [Client Quit] |
00:54:17 | | parfait (kdqep) joins |
00:55:08 | <h2ibot> | Switchnode edited ARGENTeaM (+0, private donkey kong?): https://wiki.archiveteam.org/?diff=51463&oldid=51452 |
00:55:45 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
00:55:57 | <tomodachi94> | @jacksonchen666:hackint.org: You beat me to the 0w0.is and gender.systems shutdown lol |
00:58:50 | | parfait quits [Ping timeout: 240 seconds] |
00:59:21 | <fireonlive> | pls add to deathwatch |
01:13:11 | <h2ibot> | Switchnode edited Deathwatch (+339, /* 2024 */ add 0w0.is and gender.systems): https://wiki.archiveteam.org/?diff=51464&oldid=51457 |
01:17:18 | <tomodachi94> | Damnit I just edited that too :( |
01:17:55 | <tomodachi94> | Is there a reason for an edit moderation queue on the wiki? |
01:18:09 | <thuban> | tomodachi94: just drive-by spammers |
01:19:12 | <thuban> | what do we even do with mastodon these days? still totally borked in archivebot, right? |
01:19:32 | | DogsRNice joins |
01:32:38 | <fireonlive> | yeah it’s fucked in AB since v4.? |
01:32:43 | <fireonlive> | no known |
01:32:47 | <fireonlive> | procedure at the moment |
01:39:17 | <tomodachi94> | That's unfortunate |
01:41:04 | <fireonlive> | yeah :/ |
01:41:12 | <fireonlive> | thank the devs who removed the no js fallback |
01:44:37 | | monoxane (monoxane) joins |
01:46:55 | | monoxane7 (monoxane) joins |
01:48:50 | | monoxane quits [Ping timeout: 240 seconds] |
01:48:50 | | monoxane7 is now known as monoxane |
01:51:36 | | Doranwen (Doranwen) joins |
01:51:37 | <eggdrop> | [tell] Doranwen: [2024-01-03T19:28:09Z] <fireonlive> do you have a wiki account? |
01:52:30 | <Doranwen> | fireonlive: No, I haven't gotten one yet. I've hardly edited wikis so it takes eternally long looking up the syntax because I don't know it yet. Which means I tend to avoid doing it, lol. |
01:52:47 | <fireonlive> | ah :) |
01:52:55 | <fireonlive> | I know that feeling |
01:53:28 | <fireonlive> | we can always supply a little fixes after tho |
01:54:13 | <Doranwen> | Yeah, I need to just get around to it… eventually… there always seems to be something more important, lol. |
01:54:20 | | tbc1887 quits [Ping timeout: 240 seconds] |
01:55:24 | <fireonlive> | ye :$ |
01:55:26 | <fireonlive> | :) |
02:07:38 | | tbc1887 (tbc1887) joins |
02:13:58 | | Ruthalas59 quits [Client Quit] |
02:44:19 | | Mateon2 joins |
02:45:20 | | Mateon1 quits [Ping timeout: 240 seconds] |
02:45:20 | | Mateon2 is now known as Mateon1 |
03:16:37 | | parfait (kdqep) joins |
03:50:50 | | parfait quits [Client Quit] |
04:13:24 | | Ruthalas59 (Ruthalas) joins |
04:21:42 | | BlueMaxima_ joins |
04:23:50 | | BlueMaxima quits [Ping timeout: 240 seconds] |
04:30:33 | | treora quits [Ping timeout: 272 seconds] |
04:50:57 | | nic90705 (nic) joins |
04:51:27 | | nic9070 quits [Ping timeout: 272 seconds] |
04:51:27 | | nic90705 is now known as nic9070 |
06:12:28 | | Island quits [Read error: Connection reset by peer] |
06:20:43 | | AlsoHP_Archivist joins |
06:21:03 | | c3manu (c3manu) joins |
06:23:50 | | HP_Archivist quits [Ping timeout: 240 seconds] |
06:25:37 | <mgrandi> | Pedrosso: it seems that my steam workshop downloader works fine with portal 2 |
06:44:39 | | treora joins |
07:14:10 | | c3manu quits [Remote host closed the connection] |
07:42:21 | | Arcorann (Arcorann) joins |
08:01:07 | | decky_e quits [Read error: Connection reset by peer] |
08:01:31 | | decky_e joins |
08:20:30 | <mgrandi> | i have a few bugs i need to work out, such as deleting files as they get downloaded so you aren't duplicating space within the "steamcmd" folder and cleaing up the code but it seems to work |
08:29:55 | | DogsRNice quits [Read error: Connection reset by peer] |
08:57:37 | | qwertyasdfuiopghjkl quits [Ping timeout: 259 seconds] |
09:00:27 | | Chris5010 (Chris5010) joins |
09:07:13 | <mgrandi> | example log of it running for 1 page of the workshop (30 items): https://gist.github.com/mgrandi/a1f7dfeae765890e91b987debadf3d09 |
10:00:03 | | Bleo18260 quits [Client Quit] |
10:01:24 | | Bleo18260 joins |
10:09:13 | <Pedrosso> | Well, that's great |
10:42:25 | | sec^nd quits [Remote host closed the connection] |
10:42:47 | | sec^nd (second) joins |
11:02:00 | | igloo22225 quits [Quit: The Lounge - https://thelounge.chat] |
11:02:26 | | igloo22225 (igloo22225) joins |
11:07:38 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
11:25:49 | | bocci_ joins |
11:49:17 | <h2ibot> | OrIdow6 edited Google Drive (+376, Pubhtml pages): https://wiki.archiveteam.org/?diff=51468&oldid=51399 |
11:50:05 | | Earendil7_ quits [Ping timeout: 272 seconds] |
11:50:43 | | Earendil7 (Earendil7) joins |
11:55:50 | | Mist8kenGAS quits [Read error: Connection reset by peer] |
11:57:19 | <h2ibot> | OrIdow6 edited Google Drive (+153, /* Notes */ /pub): https://wiki.archiveteam.org/?diff=51469&oldid=51468 |
12:01:03 | | shreyasminocha quits [Remote host closed the connection] |
12:01:03 | | evan quits [Read error: Connection reset by peer] |
12:01:03 | | thehedgeh0g quits [Remote host closed the connection] |
12:01:07 | | evan joins |
12:01:11 | | thehedgeh0g (mrHedgehog0) joins |
12:01:11 | | shreyasminocha (shreyasminocha) joins |
12:03:25 | | lflare quits [Client Quit] |
12:11:36 | | lflare (lflare) joins |
12:21:26 | | qwertyasdfuiopghjkl quits [Remote host closed the connection] |
12:23:01 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
12:34:56 | | lflare is now authenticated as * |
12:34:56 | | lflare is now known as RJHacker14101 |
12:34:58 | | lflare (lflare) joins |
12:35:03 | | RJHacker14101 quits [Ping timeout: 272 seconds] |
12:36:39 | | BlueMaxima_ quits [Read error: Connection reset by peer] |
12:42:39 | | lflare quits [Ping timeout: 272 seconds] |
12:51:06 | | lflare (lflare) joins |
13:01:01 | | Arcorann quits [Ping timeout: 272 seconds] |
13:35:47 | <nulldata> | https://twitter.com/TafferKing451/status/1742646316927459685 |
13:35:47 | <eggdrop> | nitter: https://nitter.net/TafferKing451/status/1742646316927459685 |
13:36:40 | <nulldata> | More layoffs at 3D Realms and Slipgate. Probably wouldn't be a bad idea to throw them in AB. |
13:37:05 | <nulldata> | https://3drealms.com/ |
13:37:22 | <nulldata> | https://www.slipgate-ironworks.com/ |
13:38:31 | <nulldata> | Also https://twitter.com/SlipgateIron https://twitter.com/3DRealms |
13:38:31 | <eggdrop> | nitter: https://nitter.net/SlipgateIron https://nitter.net/3DRealms |
13:57:59 | <Barto> | nulldata: i took care of the websites |
14:02:30 | <nulldata> | Thanks! |
14:12:44 | <Exorcism> | !tell Megame I was able to obtain a list of subdomains for https://cgsociety.org/ (If you want to put them in the archivebot): https://chibi.mint.lgbt/s/oBrR9I7jVS2Y |
14:12:44 | <eggdrop> | [tell] ok, I'll tell Megame when they join next |
15:02:00 | <ctag> | If someone with AB would grab http://www.vbas.org please, and let me know. |
15:02:19 | <ctag> | I guess setting that up broke the https redirect, so I'm hoping to put it back to normal soonish |
15:04:41 | <Barto> | ctag: let me test this :-) |
15:05:58 | <Barto> | is the ftp still working? |
15:08:53 | <ctag> | Uhhh, probably not |
15:09:03 | <Barto> | dayum |
15:09:19 | <ctag> | I do have a backup of it somewhere around here though |
15:09:41 | <ctag> | We made a big push to migrate to google drive this past year, to make files more accessible to regular membership |
15:09:58 | <ctag> | So most of it should be over there too |
15:11:13 | <ctag> | That FTP server is still running though. It's a 32-bit SuSe Linux machine that we got from government auction around 200(2?) |
15:11:25 | <ctag> | 4x500GB Raid |
15:11:37 | <Barto> | lol, the trusty old beast |
15:11:38 | <ctag> | Was hot stuff back in the day, I imagine. |
15:11:58 | <ctag> | But it's never needed any maintenance. Has just ran under a desk for 20 years straight |
15:13:17 | <Barto> | yeah, cant reach it from here |
15:14:24 | <ctag> | Hmm. Try ftp.vbas.org? |
15:14:32 | <ctag> | I don't have an ftp client at this computer |
15:15:13 | | redrock joins |
15:15:28 | | redrock quits [Remote host closed the connection] |
15:19:46 | <ctag> | Nope, there's no ftp daemon running anymore, I just checked. We switched to ssh-rsync and filebrowser apparently. |
15:20:47 | <ctag> | The blog is broken too, Hrm |
15:28:22 | <ctag> | :D |
15:28:27 | <ctag> | Blog should be fixed-ish now |
15:28:39 | <ctag> | Looks like a lot of image resources are missing though |
15:29:12 | <ctag> | I wish we had a copy of the website pre-2011 :-/ Oh well |
15:34:59 | <ctag> | ! There's versions in wayback machine back to 1999! |
15:35:21 | <ctag> | I'm gonna go tell our society historian haha. How have I not checked that before |
15:36:20 | | treora quits [Ping timeout: 240 seconds] |
15:37:08 | <Exorcism> | Barto: can you archive this website with AB ? https://0w0.is/ |
15:43:46 | | treora joins |
15:45:33 | <ctag> | Hrm. The version that I'm trying to save seem well preserved on wayback machine. I'm not sure if it's a good idea to save another copy now. |
15:45:52 | <ctag> | My original goal was to keep the old site available as an archive on-site for our organization |
15:46:15 | <ctag> | Is there a way to retrieve warcs from wayback machine? |
15:55:07 | <Barto> | i let others do Mastodon Exorcism |
15:59:22 | <Exorcism> | 👍️ |
16:00:50 | | bocci_ quits [Ping timeout: 240 seconds] |
16:11:18 | <ctag> | Barto, I think I'm going to yank that url redirect. I'm less sure if it's a good idea to archive it on IA like this. |
16:14:01 | <joepie91|m> | <Exorcism> "Barto: can you archive this..." <- note, mastodon content should only be archived with consent |
16:14:18 | <joepie91|m> | or well, fedi content* I suppose |
16:29:14 | | bocci (bocci) joins |
16:39:17 | <nulldata> | Can someone throw this into AB? A well known GTA modder is leaving the scene. https://zolika1351.pages.dev/ |
16:42:42 | <Barto> | joepie91|m: i dont have the history on this, so i cant make any comment on why we do it this way |
16:44:17 | <nulldata> | Thanks Barto! |
16:44:35 | <Barto> | nulldata: not sure when i'll do twitter, but it's in my account list |
16:45:51 | | treora quits [Ping timeout: 272 seconds] |
16:46:43 | <nulldata> | He has a Discord too, but invite links are dead - I am still in it though. If I have get a moment later I'll look into archiving it. |
16:47:40 | <Barto> | #discard :-) |
16:48:45 | | treora joins |
16:49:00 | <@arkiver> | Barto: what do you mean "this way"? |
16:57:31 | <Barto> | arkiver: the fact that we dont usually throw mastodon instances in AB. |
17:01:03 | | bocci quits [Ping timeout: 272 seconds] |
17:01:21 | <fireonlive> | can add the twitters to the queue at https://pad.notkiska.pw/p/archivebot-twitter too |
17:01:37 | | bocci (bocci) joins |
17:11:36 | <Barto> | good idea. will need to do a bit of cleanup and check capitalization of handle firsts |
17:11:39 | <Barto> | first* |
17:12:03 | <fireonlive> | :) |
17:14:16 | | aprego (apregoa) joins |
17:17:02 | <aprego> | after a website is excluded from the wayback machine, is it possible to download snapshots of the site? |
17:20:16 | <aprego> | is it true you can still find the WARCs in collections? |
17:26:11 | | bocci quits [Client Quit] |
17:31:10 | <Barto> | fireonlive: added, i think you can add case ok to all of them |
17:32:42 | <Barto> | it's a mix of space stuffs, gabonese handles, owasp drama, hacked company handles, swiss catholic groups, and some other proactive shit. |
17:58:55 | <fireonlive> | thanks :) |
18:01:16 | | bocci (bocci) joins |
18:14:15 | <TheTechRobo> | aprego: Depends on how the site was archived |
18:14:38 | <aprego> | TheTechRobo: SavePageNow mostly |
18:19:52 | | mr_sarge quits [Read error: Connection reset by peer] |
18:22:12 | <nicolas17> | aprego: I think savepagenow WARCs are *always* inaccessible in order to support the *possibility* that they may get excluded from wayback machine in the future |
18:24:01 | | raxxy-137409 quits [Ping timeout: 272 seconds] |
18:24:08 | | raxxy-137409 joins |
18:24:16 | | mr_sarge (sarge) joins |
18:27:06 | <aprego> | i guess it's over |
18:33:15 | | bocci quits [Client Quit] |
18:52:53 | | DogsRNice joins |
18:55:43 | <SketchCow> | Testflight Crashland Project was taken down off Internet Archive and Wayback |
18:56:58 | <Barto> | damn |
19:16:23 | | c3manu (c3manu) joins |
19:21:33 | <Dango360> | "Where do all the saved files go? Files are ultimately uploaded to Internet Archive on the archiveteam collection." we kept the warcs, right? |
19:25:27 | <nicolas17> | Dango360: you mean for testflight? |
19:25:38 | <nicolas17> | it's possible the warcs are still on IA's servers but they are not publicly accessible |
19:25:54 | <audrooku|m> | it is my understanding that that is IA's policy yes |
19:26:03 | <fireonlive> | oh. |
19:26:19 | <fireonlive> | sad news :( |
19:26:37 | <fireonlive> | i was just coming here to relay that the discord said to "Under no circumstance, for any reason, upload another copy of the data to the Internet Archive/archive.org." |
19:26:44 | <fireonlive> | and was confused as to why, but now I know |
19:27:14 | <fireonlive> | s/said/pinged @here/ |
19:28:54 | | sum1 joins |
19:32:39 | <audrooku|m> | which discord? |
19:32:58 | <fireonlive> | "TestFlight" |
19:33:01 | <fireonlive> | one sec |
19:33:11 | <nicolas17> | "kids talking about the testflight leak" discord |
19:33:12 | <fireonlive> | i'll PM you to keep it out of here |
19:33:14 | <fireonlive> | yeah |
19:33:57 | <fireonlive> | this whole testflight 'leak' thing is a big can of shitfuckery that was presented in bad faith from the discovering party and then clickbait media being clickbait media ran foaming at the mouth with it |
19:34:01 | <fireonlive> | from what i understand anyways |
19:34:07 | <fireonlive> | (it was never a leak) |
19:34:40 | <fireonlive> | at least some tried to rebrand it later to something else, but i don't think it stuck |
19:45:20 | | treora quits [Ping timeout: 240 seconds] |
19:46:44 | | sum1 quits [Remote host closed the connection] |
19:56:06 | | treora joins |
20:43:07 | <@JAA> | Exorcism's gender.systems subdomain list at https://chibi.mint.lgbt/s/PddyAUliT1qk (from #archiveteam earlier) in plain text rather than using the world's worst pastebin: matrix.gender.systems chat.im.gender.systems matrix.im.gender.systems im.gender.systems read.gender.systems |
20:44:36 | <fireonlive> | "<noscript><strong>We're sorry but chibisafe doesn't work properly without JavaScript enabled. Please enable it to continue.</strong></noscript>" |
20:44:37 | <fireonlive> | ah |
20:45:58 | <@JAA> | Ditto for https://chibi.mint.lgbt/s/oBrR9I7jVS2Y from here earlier: https://transfer.archivete.am/inline/10tKkw/cgsociety.org-subdomains |
20:46:49 | <@JAA> | Imagine requiring JS to literally just render a few lines of plain text... |
20:47:52 | <fireonlive> | the future is here :D |
20:58:46 | <Exorcism> | fireonlive, JAA: there is also the RAW version 😭 : https://chibi.mint.lgbt/api/snippet/oBrR9I7jVS2Y/raw |
21:16:55 | | Mateon1 quits [Remote host closed the connection] |
21:26:19 | | Mateon1 joins |
21:33:43 | | BlueMaxima joins |
21:35:32 | | c3manu quits [Remote host closed the connection] |
21:39:22 | <joepie91|m> | Barto: I don't know the exact history of how things went with mastodon stuff *in archiveteam* but as a general rule, fedi instances tend to be deliberately ephemeral and not friendly towards scraping |
21:39:42 | <joepie91|m> | and heavily focused around consent |
21:41:11 | <joepie91|m> | fedi instances tend to be closer to someone's living room than to a public square |
21:43:12 | <@OrIdow6> | I wasn't there for the incident that prompted that rule but I do think that enough time has passed that it can be written up about on the wiki |
21:51:34 | | Gooshka (Gooshka) joins |
21:52:20 | <Gooshka> | List of Geonames sources, may be useful for saving government content around the world https://www.geonames.org/datasources/ |
22:02:07 | | Gooshka quits [Ping timeout: 265 seconds] |
22:05:24 | | Island joins |
22:24:05 | <nicolas17> | given a .warc, do I have any chance of compressing it back to the original .warc.gz? like gzipping each record in the same way wget-at does? |
22:24:54 | <@JAA> | I'm not aware of tooling for that. Bit-identical output would likely be virtually impossible. |
22:25:19 | <fireonlive> | https://static.space/ < interesting/strange |
22:25:29 | <fireonlive> | via https://static.space/sha2-256:d5b215dd588bda164aca31a2eb08aab56ac64bc03cef24e3b67e35654316a446 |
22:25:44 | <@JAA> | It's one of those things I intend to support in my WIP tooling though. |
22:28:21 | <h2ibot> | Switchnode edited Template:CTA URL lists (+24, additional fixes): https://wiki.archiveteam.org/?diff=51470&oldid=51461 |
22:42:53 | <Barto> | joepie91|m: that is kinda why i said i'd let others do it, hoping they have a better view on the mastodon event. Same with the process to take if we want to crawl it, others may have a better experience than me. |
23:06:24 | | jacksonchen666 quits [Ping timeout: 255 seconds] |
23:08:26 | | jacksonchen666 (jacksonchen666) joins |
23:17:19 | | aprego quits [Remote host closed the connection] |
23:30:27 | | Arcorann (Arcorann) joins |
23:34:21 | | celestial quits [Ping timeout: 272 seconds] |
23:35:41 | | celestial joins |
23:43:48 | | Lord_Nightmare quits [Quit: ZNC - http://znc.in] |
23:49:20 | | Lord_Nightmare (Lord_Nightmare) joins |