00:00:38 | | yarrow2 quits [Client Quit] |
00:04:23 | | yarrow quits [Read error: Connection reset by peer] |
00:05:49 | | yarrow (yarrow) joins |
00:30:37 | | nepeat quits [Ping timeout: 272 seconds] |
00:32:26 | | nepeat (nepeat) joins |
01:04:23 | | grid joins |
01:25:53 | | loug4 quits [Client Quit] |
01:34:21 | | Wohlstand quits [Client Quit] |
02:25:47 | | etnguyen03 quits [Client Quit] |
02:31:46 | | yarrow quits [Read error: Connection reset by peer] |
02:33:42 | | yarrow (yarrow) joins |
02:34:55 | | Notrealname1234 (Notrealname1234) joins |
02:35:22 | | Notrealname1234 quits [Client Quit] |
02:36:39 | | lflare quits [Ping timeout: 272 seconds] |
03:19:59 | <yarrow> | Anyone working on archiving videos from Vimeo? |
03:24:17 | | grid quits [Client Quit] |
03:54:27 | <pabs> | looks like AT isn't https://wiki.archiveteam.org/index.php/Vimeo |
04:02:07 | <@OrIdow6> | Is it at risk? |
04:05:33 | | ilnrja quits [Remote host closed the connection] |
04:05:55 | | ilnrja (ilnrja) joins |
04:07:09 | | Megame quits [Client Quit] |
04:07:18 | | SkilledAlpaca8 joins |
04:09:45 | | SkilledAlpaca quits [Ping timeout: 272 seconds] |
04:09:45 | | SkilledAlpaca8 is now known as SkilledAlpaca |
04:28:48 | <h2ibot> | Exorcism edited Bugzilla (+0, /* Status */): https://wiki.archiveteam.org/?diff=53014&oldid=53012 |
04:29:46 | | wickedplayer494 quits [Ping timeout: 255 seconds] |
04:30:37 | | wickedplayer494 joins |
04:30:49 | | wickedplayer494 is now authenticated as wickedplayer494 |
04:33:42 | | wickedplayer494 quits [Remote host closed the connection] |
04:33:49 | <h2ibot> | Exorcism edited Bugzilla (+0, /* Status */): https://wiki.archiveteam.org/?diff=53015&oldid=53014 |
04:34:05 | | wickedplayer494 joins |
04:37:50 | <h2ibot> | Exorcism edited Bugzilla (+0, /* Status */): https://wiki.archiveteam.org/?diff=53016&oldid=53015 |
04:39:39 | | wickedplayer494 is now authenticated as wickedplayer494 |
04:39:50 | <h2ibot> | Exorcism edited Bugzilla (+0, /* Status */): https://wiki.archiveteam.org/?diff=53017&oldid=53016 |
04:40:50 | <h2ibot> | Exorcism edited Bugzilla (+0, /* Status */): https://wiki.archiveteam.org/?diff=53018&oldid=53017 |
04:44:37 | <yarrow> | This channel may be at risk: https://vimeo.com/user1646158 |
04:45:13 | | qw3rty__ quits [Ping timeout: 272 seconds] |
04:51:04 | | qw3rty__ joins |
05:04:55 | | shgaqnyrjp_ (shgaqnyrjp) joins |
05:05:09 | | shgaqnyrjp quits [Remote host closed the connection] |
05:14:24 | | Guest54 quits [Client Quit] |
05:26:41 | <yarrow> | Gonna attempt to back this channel up myself, but I don't know what I'm doing |
05:36:53 | <tek_dmn> | Step 1, youtube-dl, step 2, amazon S3, step 3, $200k S3 bill? |
05:38:03 | <that_lurker> | yarrow: Do you mean to the internet archive? |
05:38:49 | <yarrow> | I'm just trying to download one Vimeo channel with 42 very low-resolution videos from 13+ years ago |
05:39:02 | <yarrow> | that_lurker: yes :) |
05:39:52 | <that_lurker> | hmm tubeup is the way to go, but IA might not like it :-) |
05:40:23 | <that_lurker> | https://github.com/bibanon/tubeup |
05:42:40 | | wickedplayer494 quits [Ping timeout: 255 seconds] |
05:43:12 | | wickedplayer494 joins |
05:44:07 | | Island quits [Read error: Connection reset by peer] |
05:54:50 | <yarrow> | every tool for this is so hard to use for a typical end-user |
05:56:04 | | DogsRNice quits [Read error: Connection reset by peer] |
05:57:10 | <that_lurker> | yarrow: I can grab that once I get home. Though archivebot could maybe grab that too if I remember correctly. |
05:57:22 | <pabs> | yt-dlp is typically better than youtube-dl :) |
05:59:24 | <yarrow> | the documentation for yt-dlp is making my scalp itch |
06:00:18 | <that_lurker> | thats why tubeup is easy. Few steps and you download and upload to IA |
06:00:33 | <yarrow> | I'm not running Linux though |
06:00:53 | <yarrow> | I'm cool and play video games |
06:01:08 | <that_lurker> | wsl for the win :-P |
06:01:33 | <lemuria> | And I often find myself visiting ancient websites from what, 2005? |
06:02:13 | <lemuria> | Still going through my records and looking at the investigations that I did to determine if those ancient website owners were still alive |
06:15:24 | | lemuria is now known as lemuria_ |
06:15:35 | | lemuria_ is now known as lemuria__ |
06:15:41 | | lemuria__ is now known as lemuria |
06:17:34 | <yarrow> | archivebot is taking a run at the vimeo channel :) |
06:25:27 | | shgaqnyrjp_ is now known as shgaqnyrjp |
06:30:33 | <yarrow> | that_lurker: good tip, trying that now :) |
06:36:18 | <yarrow> | either I'm doing something wrong or Tubeup doesn't work with Vimeo account URLs |
06:39:25 | <yarrow> | also encountered an error trying a single video URL |
06:46:37 | | BlueMaxima quits [Read error: Connection reset by peer] |
06:59:42 | <lemuria> | wait how does the archivebot download vimeo exactly |
07:02:46 | <yarrow> | ¯\_(ツ)_/¯ |
07:05:19 | <yarrow> | I've got tubeup to work, but I have to paste each Vimeo video URL in one by one, rather than just pasting the channel URL |
07:06:03 | | wickedplayer494 is now authenticated as wickedplayer494 |
07:09:37 | | Unholy236192464537713 quits [Ping timeout: 272 seconds] |
07:19:53 | <that_lurker> | well if AB can get them, then there is no need to use tubeup. Though AB will make them visible on in the wayback machine. |
07:42:14 | | lflare (lflare) joins |
08:35:31 | <yarrow> | Well, it's done: https://archive.org/details/@fhivimeobackup |
08:35:52 | <yarrow> | I'm never going to do that again :P |
08:42:33 | <h2ibot> | Exorcism edited Bugzilla (+0, /* Status */): https://wiki.archiveteam.org/?diff=53019&oldid=53018 |
08:49:34 | <h2ibot> | Exorcism edited Bugzilla (+0, /* Status */): https://wiki.archiveteam.org/?diff=53020&oldid=53019 |
08:51:13 | | qw3rty__ quits [Ping timeout: 255 seconds] |
08:57:36 | <h2ibot> | Exorcism edited Bugzilla (+0, /* Status */): https://wiki.archiveteam.org/?diff=53021&oldid=53020 |
08:58:11 | | qw3rty__ joins |
09:00:02 | | Bleo1826007227196 quits [Client Quit] |
09:00:14 | | loug4 joins |
09:01:26 | | Bleo1826007227196 joins |
09:01:36 | <h2ibot> | Exorcism edited Bugzilla (+0, /* Status */): https://wiki.archiveteam.org/?diff=53022&oldid=53021 |
09:02:34 | | kiryu quits [Remote host closed the connection] |
09:03:37 | <h2ibot> | Exorcism edited Bugzilla (+0, /* Status */): https://wiki.archiveteam.org/?diff=53023&oldid=53022 |
09:05:37 | <h2ibot> | Exorcism edited Bugzilla (+0, /* Status */): https://wiki.archiveteam.org/?diff=53024&oldid=53023 |
09:24:41 | <h2ibot> | Exorcism edited Bugzilla (+0, /* Status */): https://wiki.archiveteam.org/?diff=53025&oldid=53024 |
09:27:41 | <h2ibot> | Exorcism edited Bugzilla (+0, /* Status */): https://wiki.archiveteam.org/?diff=53026&oldid=53025 |
09:28:44 | | yarrow quits [Read error: Connection reset by peer] |
09:36:50 | | yarrow (yarrow) joins |
09:59:36 | <c3manu> | yarrow: why didn't you just use the list i !ao'd earlier? ^^ |
10:00:17 | <c3manu> | could have looped over it: https://transfer.archivete.am/RJ4ND/2024-07-22_future-of-humanity-institute_vimeo-pages.txt |
10:00:17 | <eggdrop> | inline (for browser viewing): https://transfer.archivete.am/inline/RJ4ND/2024-07-22_future-of-humanity-institute_vimeo-pages.txt |
10:00:32 | <c3manu> | minus the first line ofc |
10:00:39 | <yarrow_irccloud> | Can you just paste a bunch of links at once into tubeup? |
10:03:02 | <c3manu> | i've never used tubeup, but that would have been an easy for loop |
10:03:32 | <yarrow_irccloud> | I don’t know what that means |
10:04:54 | <c3manu> | in a shell you could have used a loop to iterate the items of the list, and call tubeup for each individual one |
10:05:39 | <c3manu> | like in bash for example, sth along the lines of: while read line; do tubeup "$line" ; done < list.txt |
10:05:52 | <c3manu> | (don't quote me on the syntax) |
10:08:32 | <c3manu> | i'm sure there's equivalents for powershell etc. |
10:08:49 | <c3manu> | but anyways, you've uploaded it already :) |
10:09:00 | <yarrow_irccloud> | It’s okay, I deserve to suffer for my sins |
10:09:18 | <yarrow_irccloud> | God intended for me to be slowly copy/pasting links over and over :P |
10:12:50 | <c3manu> | :D |
10:31:52 | <h2ibot> | Exorcism edited Bugzilla (+0, /* Status */): https://wiki.archiveteam.org/?diff=53027&oldid=53026 |
11:00:03 | | Bleo1826007227196 quits [Client Quit] |
11:01:20 | | Bleo1826007227196 joins |
11:09:05 | <tzt> | https://baynetlibs.org/ - "The BayNetLibs.org website will closed down effective July 31, 2024. |
11:24:23 | <yarrow_irccloud> | tzt: thanks :) I copy/pasted your message to #archivebot |
11:37:56 | | SkilledAlpaca quits [Client Quit] |
11:38:52 | <yarrow_irccloud> | c3manu: genuinely thank you for the advice. I tend to do things the hard way because I don’t know how to automate anything. If you know how I can automate archiving podcasts (without spending 2 years learning how to code), please let me know. |
11:40:02 | | SkilledAlpaca joins |
12:45:10 | | Guest54 joins |
13:01:12 | | jumbo joins |
13:02:28 | | jumbo quits [Client Quit] |
13:26:32 | | icedice quits [Client Quit] |
13:57:29 | | katocala quits [Ping timeout: 272 seconds] |
13:57:46 | | katocala joins |
13:57:46 | | katocala is now authenticated as katocala |
14:07:37 | | katocala quits [Ping timeout: 272 seconds] |
14:08:31 | | katocala joins |
14:08:31 | | katocala is now authenticated as katocala |
14:21:39 | | PredatorIWD joins |
14:30:40 | <h2ibot> | Exorcism edited Bugzilla (+0, /* Status */): https://wiki.archiveteam.org/?diff=53028&oldid=53027 |
14:30:42 | <that_lurker> | yarrow_irccloud: Maybe this is something you are looking for? https://github.com/janw/podcast-archiver Also https://github.com/janw/tapedrive is a thing, but it seems to be somewhat stalled. |
14:31:18 | <that_lurker> | there is also https://github.com/lightpohl/podcast-dl |
14:32:15 | <that_lurker> | all those should work with the rss feeds, so you should not need to manually get the individual podcast urls |
15:09:47 | <h2ibot> | Exorcism edited Bugzilla (+80, /* Status */): https://wiki.archiveteam.org/?diff=53029&oldid=53028 |
15:19:49 | | Doranwen quits [Ping timeout: 272 seconds] |
15:25:21 | | Doranwen (Doranwen) joins |
15:35:01 | | qw3rty__ quits [Ping timeout: 272 seconds] |
15:36:13 | | Doranwen quits [Ping timeout: 255 seconds] |
15:36:17 | | that_lurker quits [Ping timeout: 272 seconds] |
15:38:25 | | Doranwen (Doranwen) joins |
15:41:21 | | qw3rty__ joins |
15:49:31 | <c3manu> | yarrow_irccloud: glad to help, i don't judge doing it the hard/annoying way either :) what platform are you on? |
16:04:06 | | PredatorIWD quits [Read error: Connection reset by peer] |
16:07:39 | | PredatorIWD joins |
16:25:00 | <h2ibot> | IDKhowToEdit edited Discourse/uncategorized (+60, added https://www.isharkfly.com/): https://wiki.archiveteam.org/?diff=53030&oldid=52749 |
16:30:46 | | BPCZ quits [Quit: eh???] |
16:32:06 | | Bleo1826007227196 quits [Client Quit] |
16:32:26 | | Bleo1826007227196 joins |
16:33:01 | | BPCZ (BPCZ) joins |
16:41:28 | | briansxml joins |
16:42:20 | | briansxml quits [Client Quit] |
17:29:13 | | Island joins |
17:34:50 | | JaffaCakes118 (JaffaCakes118) joins |
17:39:11 | | Juesto (Juest) joins |
17:41:41 | | Juest quits [Ping timeout: 272 seconds] |
17:41:41 | | Juesto is now known as Juest |
17:58:35 | <yarrow_irccloud> | c3manu: using Windows 11, but I ran Tubeup in an Ubuntu virtual machine as part of Windows Subsystem for Linux |
18:04:18 | <yarrow_irccloud> | that_lurker: I use this app because it has a GUI https://github.com/cnovel/PodcastBulkDownloader |
18:10:46 | <yarrow_irccloud> | I’ve done about 230 podcasts so far (started in June). It’s very time-consuming and I’m very aware of what a tiny fraction of podcasts I’m getting. |
18:16:18 | <yarrow_irccloud> | It’s a highly neglected area of digital archiving. For example, Barack Obama’s podcast from 2021 was not archived until I uploaded it about a week ago. |
18:17:29 | <yarrow_irccloud> | Even if I wanted to archive just, say, every podcast that has been nominated for a major podcast award, that would probably be too much work for me to take on, using my current method. |
18:30:25 | <yarrow_irccloud> | A podcast published by one of the political parties in Quebec, which holds about 10% of the seats in Quebec’s parliament, was never archived and is now fully lost media. |
18:32:29 | <yarrow_irccloud> | Page is still on Spotify but the episodes can’t be played or downloaded anywhere: https://open.spotify.com/show/0o6HMksGWjlR1xuWlVoLll?si=tWFk1LUrQlS9VN0h4H2Mmg |
18:46:43 | | DogsRNice joins |
19:43:14 | | wickedplayer494 quits [Read error: Connection reset by peer] |
19:45:14 | | wickedplayer494 joins |
19:45:46 | | wickedplayer494 is now authenticated as wickedplayer494 |
20:39:11 | | Wohlstand (Wohlstand) joins |
21:30:58 | | midou_ joins |
21:32:13 | | midou quits [Ping timeout: 272 seconds] |
21:32:37 | | midou_ is now known as midou |
21:46:05 | <JaffaCakes118> | Not sure if anyones archived the site recently but Kaspersky is soon being banned from selling its products in US, just wondering if anyone has archived the USA site before it gets taken down - https://usa.kaspersky.com/ |
21:50:01 | <Barto> | i did save the main kaspersky website, not the usa subdomain |
21:50:17 | <Barto> | i'll queue that one |
21:50:25 | <JaffaCakes118> | Thanks Barto |
21:50:41 | <Barto> | pipelines are a bit full, gotta wait a bit first |
21:50:46 | <JaffaCakes118> | No problem |
21:53:00 | | etnguyen03 (etnguyen03) joins |
21:54:34 | <JaffaCakes118> | Barto there's also https://forum.kaspersky.com/ not sure if that should be archived again before the ban happens |
21:56:09 | <JaffaCakes118> | Also unrelated to the ban but I noticed https://opentip.kaspersky.com/ has not been archived by archivebot before, there's analysis links that are archived from crawlers etc but not the main site itself and its embedded links |
21:56:50 | <JaffaCakes118> | But might be good to archive that considering the ban is going to be pretty major, not sure what sort of stuff they will end up removing |
21:58:45 | | abirkill quits [Quit: Let us prepare to grapple with the ineffable itself, and see if we may not eff it after all.] |
21:59:28 | <katia> | opentip.kaspersky.com seems to get data via POST requests |
22:00:06 | <JaffaCakes118> | It wasn't so much about archiving the analyses, just the main opentip page, documentation etc |
22:00:26 | <JaffaCakes118> | by the looks of it the documentation for opentip doesn't have much coverage |
22:00:33 | <JaffaCakes118> | and a lot of the archives are old |
22:01:17 | <JaffaCakes118> | https://opentip.kaspersky.com/Help/Doc_data/About.htm |
22:07:05 | | loug4 quits [Client Quit] |
22:19:00 | | lennier2 joins |
22:19:28 | <Barto> | i just didnt iterate over all subdomains, was tired that day, i'll maybe queue them tomorrow |
22:19:53 | <Barto> | at least those you mentioned, if anybody goes faster than me, good thing, i'll just head to bed for now |
22:21:40 | | lennier2_ quits [Ping timeout: 255 seconds] |
22:37:02 | | BlueMaxima joins |
22:37:09 | | pseudorizer quits [Client Quit] |
22:38:08 | | pseudorizer (pseudorizer) joins |
22:40:07 | | midou quits [Ping timeout: 255 seconds] |
22:51:34 | | BearFortress_ joins |
22:54:58 | | BearFortress quits [Ping timeout: 255 seconds] |
23:08:55 | | Radzig quits [Ping timeout: 255 seconds] |
23:24:10 | | Radzig joins |