00:07:52 | <ctag> | What is argenteam? |
00:08:23 | <ctag> | I torrent stuff that's in the clear, but that thing mentions subtitles, which gives me pause. |
00:10:03 | <nicolas17> | ctag: argenteam was a fansub website |
00:10:33 | <ctag> | I don't know what fansub means, one sec |
00:10:52 | <nicolas17> | they downloaded movies and TV show episodes in their original language and made their own translated subtitles |
00:11:12 | <FireFly> | ctag: basically community-made subtitles for media, so the subtitles should be just fine afaik |
00:11:31 | <ctag> | Hmm |
00:11:52 | <ctag> | OK, I'll toss it in the grinder, thank you both for the explanations. |
00:12:25 | <nicolas17> | the website had magnet and e2dk links to the videos, and downloading/seeding those is plain old piracy, but the subtitles were made by the community |
00:12:59 | <nicolas17> | and now it was shut down |
00:13:23 | <nicolas17> | and they made a single 1.8GB torrent with *all* the subtitles they had |
00:13:53 | <ctag> | Ah |
00:14:42 | <ctag> | It's got plenty of seeders, I'm guessing archival involves more than just keeping the torrent active? |
00:15:56 | <nicolas17> | yeah the shutdown was today so there's probably a shitload of people downloading it right now |
00:16:21 | <nicolas17> | ctag: I already archivebot'd the website and subtitles and forum last month, I only posted it here now to get the shutdown notice archived |
00:16:57 | <ctag> | Ah, OK thanks |
00:17:23 | <ctag> | Does IA host piracy agacent material? |
00:17:30 | <ctag> | Or is this to get it saved but blackholed |
00:36:24 | <h2ibot> | Pokechu22 edited List of website hosts (+32, /* B */ bplaced also uses square7.ch): https://wiki.archiveteam.org/?diff=51453&oldid=47760 |
00:40:32 | | driib quits [Quit: The Lounge - https://thelounge.chat] |
00:45:02 | | icedice (icedice) joins |
00:45:32 | | icedice quits [Remote host closed the connection] |
01:00:49 | | driib (driib) joins |
01:23:23 | | icedice (icedice) joins |
01:32:13 | | TastyWiener954 quits [Quit: So long, farewell, auf wiedersehen, good night] |
01:34:14 | | TastyWiener954 (TastyWiener95) joins |
01:44:08 | | nicolas17 quits [Read error: Connection reset by peer] |
02:08:55 | | nicolas17 joins |
02:45:15 | <manu|m> | learned about a mastodon instance that will likely shut down soon, no idea what to do about it.. https://woof.group/@aphyr/111683303140271139 |
02:48:13 | | c3manu (c3manu) joins |
02:52:24 | | c3manu quits [Client Quit] |
03:12:03 | | nic907 quits [Quit: The Lounge - https://thelounge.chat] |
03:12:43 | | atphoenix_ quits [Remote host closed the connection] |
03:12:54 | | nic9070 (nic) joins |
03:13:27 | | atphoenix_ (atphoenix) joins |
03:20:09 | | icedice quits [Client Quit] |
03:37:18 | | HP_Archivist (HP_Archivist) joins |
04:31:54 | | tbc1887 (tbc1887) joins |
04:37:09 | | qwertyasdfuiopghjkl quits [Remote host closed the connection] |
04:41:48 | | Craigle quits [Quit: The Lounge - https://thelounge.chat] |
04:42:18 | | Craigle (Craigle) joins |
04:53:13 | | tbc1887 quits [Client Quit] |
05:00:50 | | tbc1887 (tbc1887) joins |
05:02:50 | <pabs> | manu|m: can you ask them if the instance should be archived? and add it to https://wiki.archiveteam.org/index.php/Mastodon |
05:03:21 | <pabs> | not sure if AT does do fediverse archiving, there was a bit if a backlash before IIRC |
05:11:04 | <manu|m> | well, it was offline because the instance admin isn't reachable. there's probably not going to be a consensus among its userbase either.. |
05:12:32 | <pabs> | for the logs, the instance is https://bear.community/ |
05:13:07 | | nicolas17 covers fireonlive's eyes |
05:14:01 | <fireonlive> | :O |
05:14:21 | <fireonlive> | :D |
05:22:50 | <fireonlive> | archivebot can't mastodon vlatest anymore, so we'd have to something else |
05:46:22 | | sec^nd quits [Remote host closed the connection] |
05:46:44 | | sec^nd (second) joins |
06:15:26 | | Island quits [Read error: Connection reset by peer] |
06:20:04 | | line joins |
06:20:22 | | Arcorann (Arcorann) joins |
06:41:50 | | monoxane quits [Ping timeout: 240 seconds] |
06:41:57 | | datechnoman quits [Quit: The Lounge - https://thelounge.chat] |
06:42:59 | | datechnoman (datechnoman) joins |
06:46:56 | | Ruthalas59 quits [Read error: Connection reset by peer] |
06:47:09 | | Ruthalas59 (Ruthalas) joins |
06:58:39 | | BlueMaxima quits [Read error: Connection reset by peer] |
07:09:27 | | DogsRNice quits [Read error: Connection reset by peer] |
07:52:42 | | nulldata quits [Quit: Ping timeout (120 seconds)] |
07:53:10 | | nulldata (nulldata) joins |
08:38:17 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
08:58:20 | | Church quits [Ping timeout: 240 seconds] |
09:12:40 | | sepro quits [Quit: Bye!] |
10:00:04 | | Bleo18260 quits [Client Quit] |
10:01:26 | | Bleo18260 joins |
10:23:47 | | hitgrr8 joins |
10:48:20 | | Matthww119 quits [Ping timeout: 240 seconds] |
10:57:33 | | Matthww119 joins |
10:57:44 | | TastyWiener954 quits [Client Quit] |
10:59:50 | | TastyWiener954 (TastyWiener95) joins |
11:28:58 | | Earendil7_ (Earendil7) joins |
11:31:19 | | Earendil7 quits [Ping timeout: 272 seconds] |
11:49:37 | <h2ibot> | YetAnotherArchiver edited The WARC Ecosystem (+592, /* Tools */ Add more tools): https://wiki.archiveteam.org/?diff=51454&oldid=51015 |
11:49:38 | <h2ibot> | Igloos edited Deathwatch (+240, Added arcalive/b/genshin): https://wiki.archiveteam.org/?diff=51455&oldid=51449 |
12:01:30 | | kiryu (kiryu) joins |
12:18:58 | | mgrytbak quits [Quit: Ping timeout (120 seconds)] |
12:19:07 | | mgrytbak joins |
12:27:18 | | mgrytbak quits [Client Quit] |
12:27:29 | | mgrytbak joins |
12:30:13 | | Arcorann quits [Ping timeout: 272 seconds] |
12:36:25 | | brandan joins |
12:37:29 | <brandan> | hey i'd like to archive an entire school's website for wiki-en reasons, i already tried to use archivebot but then it dawned on me that i need a second hand to authorize the usage of archivebot in the first place |
12:37:35 | <brandan> | sorry if i misspell anything im on a newer keyboard |
12:44:35 | <Barto> | on it |
13:15:35 | | eroc19905 quits [Quit: The Lounge - https://thelounge.chat] |
13:16:05 | | eroc1990 (eroc1990) joins |
13:22:08 | | katia quits [Remote host closed the connection] |
13:23:03 | | katia (katia) joins |
13:55:23 | | iminonet joins |
14:24:06 | | litech joins |
14:25:41 | <litech> | Hello everyone, has anyone archived the comments to this deleted livejournal post? https://web.archive.org/web/20191114140929/https://varlamov.ru/3267082.html |
14:25:54 | | litech quits [Remote host closed the connection] |
14:26:09 | | litech joins |
14:27:00 | <litech> | Hello everyone, has anyone archived the comments to this LiveJournal post? https://web.archive.org/web/20191114140929/https://varlamov.ru/3267082.html Thank you for looking into this. |
14:38:09 | | bocci (bocci) joins |
14:46:01 | | qwertyasdfuiopghjkl quits [Remote host closed the connection] |
14:46:01 | | litech quits [Remote host closed the connection] |
14:46:36 | | litech joins |
14:51:23 | | brandan quits [Remote host closed the connection] |
14:52:30 | | iminonet35 joins |
14:53:42 | | BearFortress quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.] |
14:56:07 | | iminonet quits [Ping timeout: 265 seconds] |
15:07:50 | | tertu quits [Quit: so long...] |
15:09:14 | | litech quits [Remote host closed the connection] |
15:10:23 | | tertu (tertu) joins |
15:14:39 | | VerifiedJ quits [Quit: The Lounge - https://thelounge.chat] |
15:15:13 | | VerifiedJ (VerifiedJ) joins |
15:25:37 | | jacksonchen666 (jacksonchen666) joins |
15:39:11 | | kiryu quits [Remote host closed the connection] |
15:48:02 | | brandan joins |
15:48:17 | | brandan quits [Remote host closed the connection] |
15:52:20 | | bocci quits [Ping timeout: 240 seconds] |
16:03:05 | | iminonet35 quits [Remote host closed the connection] |
16:17:22 | | Atkin joins |
16:18:32 | | Atkin quits [Remote host closed the connection] |
16:29:36 | | BearFortress joins |
16:34:41 | | Matthww119 quits [Ping timeout: 272 seconds] |
16:40:08 | | Matthww119 joins |
16:47:00 | | aninternettroll quits [Remote host closed the connection] |
17:03:44 | | usr joins |
17:07:37 | | usr quits [Remote host closed the connection] |
17:10:38 | | aninternettroll (aninternettroll) joins |
17:19:54 | | sec^nd quits [Ping timeout: 255 seconds] |
17:23:53 | | sec^nd (second) joins |
17:24:26 | | Megame (Megame) joins |
17:26:25 | | iminonet joins |
17:34:25 | | iminonet quits [Remote host closed the connection] |
17:42:01 | | benjins2_ quits [Read error: Connection reset by peer] |
17:44:55 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
17:45:38 | | benjins2 joins |
17:52:18 | | jacksonchen666 quits [Ping timeout: 255 seconds] |
17:54:29 | | sec^nd quits [Remote host closed the connection] |
17:54:29 | <that_lurker> | Anyone know any good way to archive a podcast to IA? |
17:54:43 | <that_lurker> | s/way/tool |
17:55:37 | | sec^nd (second) joins |
18:03:41 | <nulldata> | that_lurker - You can grab the RSS feed of a podcast if it's on iTunes using the details here https://superuser.com/a/782413 and then throwing that into something like https://github.com/lightpohl/podcast-dl or https://codeberg.org/janw/podcast-archiver |
18:04:11 | <nulldata> | or if it's on SoundCloud or YouTube you could use yt-dlp to grab the entire user/channel |
18:05:05 | | jacksonchen666 (jacksonchen666) joins |
18:07:07 | <that_lurker> | ahh podcast-archiver was the one I was looking for. Thanks |
18:08:51 | <nulldata> | Though as for what switches to use and the kosher way to package it for IA I'm not sure. Probably could use some guidance from someone on formatting, tags, etc. The question has come up a few times I've thought about making a Wiki article. |
18:43:30 | | DogsRNice joins |
18:54:13 | | bocci (bocci) joins |
19:03:55 | | icedice (icedice) joins |
19:07:21 | | kaz (Kaz) joins |
19:07:21 | | @ChanServ sets mode: +o kaz |
20:10:20 | | Barto quits [Ping timeout: 240 seconds] |
20:13:16 | | Barto (Barto) joins |
20:20:50 | | BlueMaxima joins |
20:31:01 | | Island joins |
20:32:27 | <h2ibot> | FireonLive edited URLs (-19, remove CTA for now): https://wiki.archiveteam.org/?diff=51456&oldid=51406 |
20:53:54 | | katia quits [Remote host closed the connection] |
20:54:11 | | katia (katia) joins |
20:55:55 | | katia quits [Remote host closed the connection] |
20:56:38 | | katia (katia) joins |
20:57:27 | | katia quits [Remote host closed the connection] |
20:58:26 | | katia (katia) joins |
21:18:46 | | c3manu (c3manu) joins |
21:40:34 | | BlueMaxima quits [Read error: Connection reset by peer] |
21:45:20 | | icedice quits [Ping timeout: 240 seconds] |
22:11:29 | | katia quits [Client Quit] |
22:12:11 | | katia_ (katia) joins |
22:14:09 | | Megame quits [Ping timeout: 272 seconds] |
22:22:00 | <Pedrosso> | JAA: Is the steam workshop downloads grab planned to be done at some point? |
22:23:09 | | BlueMaxima joins |
22:33:35 | | ScenarioPlanet8 joins |
22:35:39 | | c3manu quits [Remote host closed the connection] |
22:37:18 | | ScenarioPlanet quits [Client Quit] |
22:37:19 | | ScenarioPlanet8 is now known as ScenarioPlanet |
22:42:10 | | ScenarioPlanet is now authenticated as ScenarioPlanet |
22:42:10 | | ScenarioPlanet quits [Changing host] |
22:42:10 | | ScenarioPlanet (ScenarioPlanet) joins |
22:50:50 | | bocci quits [Ping timeout: 240 seconds] |
23:00:35 | <mgrandi> | @Pedrosso: steam workshop grab? Like the websites or files? |
23:00:40 | | Island quits [Read error: Connection reset by peer] |
23:01:52 | | qwertyasdfuiopghjkl quits [Ping timeout: 265 seconds] |
23:04:07 | <Pedrosso> | mgrandi: In this instance I mean files but getting the comments may also be good. https://hackint.logs.kiska.pw/archiveteam-bs/20231219#c396229 |
23:04:21 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
23:06:18 | | Island joins |
23:10:43 | <mgrandi> | I actually have code to get files |
23:10:57 | <mgrandi> | I downloaded the entirety of CSGO's maps before cs2 came out |
23:12:48 | <mgrandi> | I admittedly was lazy and just wrapped SteamCmd since I don't know of a way that you can get another games workshop items otherwise since it requires a license and other stuff |
23:18:31 | <@JAA> | Pedrosso: No specific plans, but it's one of those 'I'd like to do this someday' things. If someone beats me to it, all the better. |
23:19:40 | <Pedrosso> | I didn't exactly understand how you got the download links from the api |
23:20:20 | | Kitty quits [Ping timeout: 240 seconds] |
23:20:30 | <Pedrosso> | mgrandi: Did you ever find a way to get all the ids? My way has been very clunky of simply asking steam to search through all the dates and iterating through its search pages. |
23:21:17 | <mgrandi> | Good old fashion iterating the steam workshop pages |
23:21:30 | <Pedrosso> | Ah yes, iteration. how many 404s do you get? |
23:21:36 | <mgrandi> | I didn't get any really |
23:21:39 | <Pedrosso> | wow |
23:21:53 | <mgrandi> | Just on the workshop gallery pages , not each individual ones |
23:22:18 | <mgrandi> | https://steamcommunity.com/workshop/browse/?appid=730 like that |
23:22:27 | <Pedrosso> | ohh |
23:23:43 | <Pedrosso> | Would you be able to download and upload the p2 workshop items to IA? I have a list of portal 2 workshop item ids which is up-to-date up til the upload date here https://archive.org/details/portal2_workshopIDs_20231212 |
23:27:00 | <mgrandi> | I can get that started , it requires windows so I can't easily do it on my server |
23:27:46 | <Pedrosso> | Awesome. How will you upload it? Like, in item fragments? |
23:28:04 | <mgrandi> | But right now it's storing everything as rows in a database since it was easiest to get working fast, 7z compressed since there wasn't really a good reason to use warc since I'm not the one downloading it (SteamCmd is) |
23:28:33 | <mgrandi> | It can be changed to do whatever, or I can upload the code and you can run it as well |
23:34:14 | <Pedrosso> | I am concerned about getting banned & rate limiting. Regardless, I would indeed like to see the code |
23:34:33 | <mgrandi> | You don't seem to get banned |
23:34:47 | <mgrandi> | I did run into a few rate limits but it seems like it correctly handles it |
23:35:22 | <Pedrosso> | that's good |
23:36:30 | <mgrandi> | Since it's a command line program and they don't have good exit codes I have to parse the stdout which is fun |
23:39:14 | <mgrandi> | Let me look at the code tonight and clean it up and publish it since I've been meaning to do that |
23:47:35 | <mgrandi> | It also might need some adjustments since steam workshop is basically hashtag yolo , even within the Counter strike global offensive workshop, I found several different formats of "maps" |
23:52:32 | <Pedrosso> | Oh wait, I do have a download script, I had just believed that you all had had better ways of doing it. |
23:52:37 | | sepro (sepro) joins |
23:52:41 | <Pedrosso> | Only for the portal maps though |
23:58:51 | <mgrandi> | I mean the python code just automates it |
23:59:21 | <mgrandi> | But it also compressed it, creates a database of the files, is resumable , handles errors or if we get rate limited, dtc |
23:59:34 | <Pedrosso> | That's better than my code at least |
23:59:45 | <Pedrosso> | Mine just stuffs the files in a .tar file and is done with it |