00:01:07 | <thuban> | ^ did the discord scrape _only_ get links to those sm? or are those links in addition to a main group site? |
00:02:22 | <icedice> | Those + main sites |
00:02:33 | <icedice> | https://transfer.archivete.am/PsqdM/urls-2024-04-14-discord-urls-for-scanlation-group-sites.txt |
00:02:33 | <eggdrop> | inline (for browser viewing): https://transfer.archivete.am/inline/PsqdM/urls-2024-04-14-discord-urls-for-scanlation-group-sites.txt |
00:03:27 | <thuban> | ok, cool |
00:04:10 | <icedice> | Discord links should be skipped because of privacy concerns. MangaDex and MangaUpdates links can be skipped too, since archiving those sites in full would make more sense, and in the case of MangaDex that is probably too big of a project |
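The skip rule described above could be applied to the URL list before queuing it, roughly as follows. This is only a sketch: the hostnames in `SKIPPED_DOMAINS` are assumptions about which domains the three services use, and `should_skip` and `filter_urls` are hypothetical helper names, not part of any ArchiveTeam tool.

```python
from urllib.parse import urlsplit

# Assumed hostnames for Discord, MangaDex and MangaUpdates; adjust as needed.
SKIPPED_DOMAINS = (
    "discord.gg",
    "discord.com",
    "discordapp.com",
    "mangadex.org",
    "mangaupdates.com",
)


def should_skip(url: str) -> bool:
    """Return True if the URL's host is a skipped domain or one of its subdomains."""
    host = (urlsplit(url).hostname or "").lower()
    return any(host == d or host.endswith("." + d) for d in SKIPPED_DOMAINS)


def filter_urls(urls):
    """Keep only the URLs that are not on a skipped domain, preserving order."""
    return [u for u in urls if not should_skip(u)]
```

Matching on the parsed hostname rather than on a substring avoids dropping unrelated sites that merely mention one of these names in their path.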
00:06:24 | <thuban> | as for e-hentai, i'm reluctant to crawl an entire website for urls that _may_ have been posted in the comments of _some_ of the items, particularly as they're not true links and would have to be extracted from the warcs and not just the wpull log |
00:06:36 | | etnguyen03 (etnguyen03) joins |
00:13:17 | <icedice> | I see |
00:23:10 | <icedice> | The scope could be narrowed by only crawling the manga listings which seem to start with /g/ in the URL and blocking the domains that serve images |
00:23:41 | <icedice> | Still a lot of work, probably |
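The narrowed scope suggested above (only pages whose path starts with /g/, nothing from the image-serving domains) might look like the check below. It is a sketch under stated assumptions: `in_scope` is a hypothetical helper, and the site host and image hosts are passed in as parameters because the log does not name them.

```python
from urllib.parse import urlsplit


def in_scope(url: str, site_host: str, image_hosts: set) -> bool:
    """Accept only listing pages under /g/ on the main site.

    site_host and image_hosts are supplied by the caller; the values used in
    the usage example are placeholders, not the real hostnames.
    """
    parts = urlsplit(url)
    host = (parts.hostname or "").lower()
    if host in image_hosts:
        # Blocked outright: these hosts only serve image data.
        return False
    return host == site_host and parts.path.startswith("/g/")


# Usage with placeholder hostnames:
candidates = [
    "https://gallery.example/g/123/abc/",
    "https://gallery.example/tag/foo",
    "https://img.example/g/123/1.jpg",
]
kept = [u for u in candidates if in_scope(u, "gallery.example", {"img.example"})]
```

This only decides which pages get fetched. Pulling URLs out of comment text would still need a separate pass over the WARCs, as noted earlier in the discussion.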
00:27:36 | | etnguyen03 quits [Client Quit] |
00:38:28 | | etnguyen03 (etnguyen03) joins |
00:41:23 | <fireonlive> | if only it was e-bara :( |
00:59:47 | | Doranwen quits [Ping timeout: 272 seconds] |
01:01:44 | | etnguyen03 quits [Client Quit] |
01:02:55 | | Doranwen (Doranwen) joins |
01:23:34 | | etnguyen03 (etnguyen03) joins |
01:26:37 | | Doranwen quits [Ping timeout: 255 seconds] |
01:57:11 | | etnguyen03 quits [Client Quit] |
02:17:03 | | hackbug quits [Ping timeout: 272 seconds] |
02:17:54 | | etnguyen03 (etnguyen03) joins |
02:46:37 | | Doranwen (Doranwen) joins |
03:13:58 | | hackbug (hackbug) joins |
04:08:02 | | pixel leaves |
04:20:57 | | etnguyen03 quits [Client Quit] |
04:25:38 | | etnguyen03 (etnguyen03) joins |
04:49:36 | | etnguyen03 quits [Remote host closed the connection] |
05:00:49 | | BlueMaxima quits [Read error: Connection reset by peer] |
05:04:51 | | Island quits [Read error: Connection reset by peer] |
05:09:55 | | Larsenv quits [Client Quit] |
05:29:09 | | Larsenv (Larsenv) joins |
05:41:55 | | grid joins |
07:05:03 | | Unholy2361 quits [Remote host closed the connection] |
07:06:10 | | Unholy2361 (Unholy2361) joins |
07:15:59 | | anarcat quits [Ping timeout: 272 seconds] |
07:23:29 | | pixel (pixel) joins |
07:32:23 | | anarcat (anarcat) joins |
07:51:48 | | grid quits [Client Quit] |
09:00:01 | | Bleo182600 quits [Client Quit] |
09:00:37 | | Arcorann (Arcorann) joins |
09:01:16 | | Bleo182600 joins |
11:02:48 | | pixel leaves |
11:02:48 | | pixel (pixel) joins |
11:08:09 | | apache2 quits [Remote host closed the connection] |
11:08:32 | | apache2 joins |
12:07:25 | <thuban> | nulldata: did you end up running any of the sbnation podcasts through archivebot? i am going to start working on them now |
12:38:09 | <nulldata> | I haven't |
13:07:00 | | SootBector quits [Ping timeout: 255 seconds] |
13:07:22 | | SootBector (SootBector) joins |
13:27:11 | | etnguyen03 (etnguyen03) joins |
13:38:57 | | kiryu quits [Remote host closed the connection] |
13:40:15 | | kiryu joins |
13:40:15 | | kiryu is now authenticated as kiryu |
13:40:15 | | kiryu quits [Changing host] |
13:40:15 | | kiryu (kiryu) joins |
13:46:07 | | Arcorann quits [Ping timeout: 272 seconds] |
14:04:36 | <thuban> | ok, ty |
14:08:43 | | fangfufu quits [Quit: ZNC 1.8.2+deb3.1 - https://znc.in] |
14:12:15 | | fangfufu joins |
14:12:33 | | fangfufu is now authenticated as fangfufu |
14:25:34 | | Wohlstand quits [Ping timeout: 255 seconds] |
14:39:19 | | etnguyen03 quits [Client Quit] |
15:07:22 | | Mannie joins |
15:08:15 | <Mannie> | I was just browsing the archiveteam site and found that https://www.rmiembassyus.org/ is on the list of sites not yet archived. Clicked through and it is down!! |
15:24:28 | <Vokun> | I just woke up nice and early for my opening shift, after finishing a closing shift, and my eyes aren't working well rn this early. I thought I was having a stroke reading that url, thinking it said madeinabyss |
15:25:07 | <kiska> | I suppose it is now inaabyss :D |
15:44:46 | | Ruthalas59 quits [Ping timeout: 255 seconds] |
15:45:39 | | Ruthalas59 (Ruthalas) joins |
15:45:55 | | knecht4 quits [Client Quit] |
15:48:13 | | knecht4 joins |
16:02:00 | | etnguyen03 (etnguyen03) joins |
16:02:43 | | f_ (funderscore) joins |
16:11:55 | | Notrealname1234 (Notrealname1234) joins |
16:16:48 | | Notrealname1234 quits [Client Quit] |
16:17:03 | | Notrealname1234 (Notrealname1234) joins |
16:27:41 | | Notrealname1234 quits [Client Quit] |
16:32:27 | | Notrealname1234 (Notrealname1234) joins |
16:35:46 | | Notrealname1234 quits [Client Quit] |
16:42:42 | | knecht4 quits [Client Quit] |
16:42:57 | <eroc19905> | For those that may not have seen yet, https://roosterteeth.com will be shutting down May 15, 2024. |
16:43:07 | <eroc19905> | see https://roosterteeth.com/g/post/ebc5b2cd-bd04-4935-ae36-7bb5056e043f |
16:43:16 | | eroc19905 is now known as eroc1990 |
16:43:52 | | knecht4 joins |
17:04:50 | <balrog> | Is there a way to archive WikiWikiWeb content? http://www.dairiki.org/HammondWiki |
17:04:59 | <balrog> | I guess AB, but I don't want to mess anything up |
17:05:17 | <balrog> | (this is the old, first wiki platform) |
17:14:29 | | eightthree quits [Ping timeout: 272 seconds] |
17:15:13 | | eightthree joins |
17:46:27 | | f_ quits [Ping timeout: 255 seconds] |
17:48:56 | | Mannie quits [Client Quit] |
17:54:00 | | JaffaCakes118 quits [Remote host closed the connection] |
17:54:24 | | JaffaCakes118 (JaffaCakes118) joins |
18:18:05 | | f_ (funderscore) joins |
18:26:34 | | DogsRNice joins |
18:30:32 | | DogsRNice_ joins |
18:31:42 | | JaffaCakes118 quits [Remote host closed the connection] |
18:33:49 | | Larsenv quits [Client Quit] |
18:33:58 | | DogsRNice quits [Ping timeout: 255 seconds] |
18:35:03 | | f_ quits [Ping timeout: 255 seconds] |
19:14:14 | | etnguyen03 quits [Client Quit] |
19:41:38 | | Island joins |
20:32:04 | | JaffaCakes118 (JaffaCakes118) joins |
21:00:31 | | ThetaDev quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.] |
21:00:39 | | ThetaDev joins |
21:22:00 | | Larsenv (Larsenv) joins |
21:32:15 | | midou quits [Ping timeout: 272 seconds] |
21:35:18 | | midou joins |
21:37:02 | | whoom joins |
21:38:16 | <whoom> | Sorry if this is off-topic, but could someone help me find potential archives of something? |
21:38:32 | <whoom> | I'm looking for archives of an old Invisionfree board |
21:38:33 | <whoom> | There are only a few captures directly accessible on the wayback machine |
21:38:41 | <whoom> | but I'm wondering if I could possibly find more elsewhere |
21:39:05 | <whoom> | here’s a link to an IA snapshot: https://web.archive.org/web/20091204051950/http://z6.invisionfree.com/Ponyville/index.php |
21:39:14 | | whoom quits [Client Quit] |
21:47:10 | <@JAA> | !8ball Web chat? |
21:47:10 | <eggdrop> | 🎱: JAA, you may rely on it |
22:03:48 | | qwertyasdfuiopghjkl quits [Client Quit] |
22:04:36 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
22:13:05 | | etnguyen03 (etnguyen03) joins |
22:16:54 | | BlueMaxima joins |
22:20:04 | <icedice> | thuban: Have you started archiving the Great Discord Links Hub (aka Scan Group Directory) links? |
22:30:23 | | BornOn420 quits [Client Quit] |
22:31:10 | | BornOn420 (BornOn420) joins |
22:41:21 | | parfait (kdqep) joins |
22:55:16 | | etnguyen03 quits [Client Quit] |
22:55:42 | | qwertyasdfuiopghjkl quits [Client Quit] |
22:56:28 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
23:07:34 | | midou quits [Ping timeout: 255 seconds] |
23:10:28 | | Guest quits [Quit: Connection closed] |
23:10:28 | | qwertyasdfuiopghjkl quits [Client Quit] |
23:11:08 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
23:46:17 | | etnguyen03 (etnguyen03) joins |