00:01:07<thuban>^ did the discord scrape _only_ get links to those sm? or are those links in addition to a main group site?
00:02:22<icedice>Those + main sites
00:02:33<icedice>https://transfer.archivete.am/PsqdM/urls-2024-04-14-discord-urls-for-scanlation-group-sites.txt
00:02:33<eggdrop>inline (for browser viewing): https://transfer.archivete.am/inline/PsqdM/urls-2024-04-14-discord-urls-for-scanlation-group-sites.txt
00:03:27<thuban>ok, cool
00:04:10<icedice>Discord links should be skipped because of privacy concerns, MangaDex and Mangaupdates links can be skipped since a complete archivation of those sites would make more sense and in the case of MangaDex is probably too big of a project
00:06:24<thuban>as for e-hentai, i'm reluctant to crawl an entire website for urls that _may_ have been posted in the comments of _some_ of the items, particularly as they're not true links and would have to be extracted from the warcs and not just the wpull log
00:06:36etnguyen03 (etnguyen03) joins
00:13:17<icedice>I see
00:23:10<icedice>The scope could be narrowed by only crawling the manga listings which seem to start with /g/ in the URL and blocking the domains that serve images
00:23:41<icedice>Still a lot of work, probably
00:27:36etnguyen03 quits [Client Quit]
00:38:28etnguyen03 (etnguyen03) joins
00:41:23<fireonlive>if only it was e-bara :(
00:59:47Doranwen quits [Ping timeout: 272 seconds]
01:01:44etnguyen03 quits [Client Quit]
01:02:55Doranwen (Doranwen) joins
01:23:34etnguyen03 (etnguyen03) joins
01:26:37Doranwen quits [Ping timeout: 255 seconds]
01:57:11etnguyen03 quits [Client Quit]
02:17:03hackbug quits [Ping timeout: 272 seconds]
02:17:54etnguyen03 (etnguyen03) joins
02:46:37Doranwen (Doranwen) joins
03:13:58hackbug (hackbug) joins
04:08:02pixel leaves
04:20:57etnguyen03 quits [Client Quit]
04:25:38etnguyen03 (etnguyen03) joins
04:49:36etnguyen03 quits [Remote host closed the connection]
05:00:49BlueMaxima quits [Read error: Connection reset by peer]
05:04:51Island quits [Read error: Connection reset by peer]
05:09:55Larsenv quits [Client Quit]
05:29:09Larsenv (Larsenv) joins
05:41:55grid joins
07:05:03Unholy2361 quits [Remote host closed the connection]
07:06:10Unholy2361 (Unholy2361) joins
07:15:59anarcat quits [Ping timeout: 272 seconds]
07:23:29pixel (pixel) joins
07:32:23anarcat (anarcat) joins
07:51:48grid quits [Client Quit]
09:00:01Bleo182600 quits [Client Quit]
09:00:37Arcorann (Arcorann) joins
09:01:16Bleo182600 joins
11:02:48pixel leaves
11:02:48pixel (pixel) joins
11:08:09apache2 quits [Remote host closed the connection]
11:08:32apache2 joins
12:07:25<thuban>nulldata: did you end up running any of the sbnation podcasts through archivebot? i am going to start working on them now
12:38:09<nulldata>I haven't
13:07:00SootBector quits [Ping timeout: 255 seconds]
13:07:22SootBector (SootBector) joins
13:27:11etnguyen03 (etnguyen03) joins
13:38:57kiryu quits [Remote host closed the connection]
13:40:15kiryu joins
13:40:15kiryu quits [Changing host]
13:40:15kiryu (kiryu) joins
13:46:07Arcorann quits [Ping timeout: 272 seconds]
14:04:36<thuban>ok, ty
14:08:43fangfufu quits [Quit: ZNC 1.8.2+deb3.1 - https://znc.in]
14:12:15fangfufu joins
14:25:34Wohlstand quits [Ping timeout: 255 seconds]
14:39:19etnguyen03 quits [Client Quit]
15:07:22Mannie joins
15:08:15<Mannie>I was just browsing the archiveteam site and find that https://www.rmiembassyus.org/ is on the list of not yet archived. clicked though and it is down!!
15:24:28<Vokun>I just woke up nice and early for my opening shift, after finishing a closing shift, and my eyes aren't working well rn this early. I thought I was having a stroke reading that url, thinking it said madeinabyss
15:25:07<kiska>I suppose it is now inaabyss :D
15:44:46Ruthalas59 quits [Ping timeout: 255 seconds]
15:45:39Ruthalas59 (Ruthalas) joins
15:45:55knecht4 quits [Client Quit]
15:48:13knecht4 joins
16:02:00etnguyen03 (etnguyen03) joins
16:02:43f_ (funderscore) joins
16:11:55Notrealname1234 (Notrealname1234) joins
16:16:48Notrealname1234 quits [Client Quit]
16:17:03Notrealname1234 (Notrealname1234) joins
16:27:41Notrealname1234 quits [Client Quit]
16:32:27Notrealname1234 (Notrealname1234) joins
16:35:46Notrealname1234 quits [Client Quit]
16:42:42knecht4 quits [Client Quit]
16:42:57<eroc19905>For those that may not have seen yet, https://roosterteeth.com will be shutting down May 15, 2024.
16:43:07<eroc19905>see https://roosterteeth.com/g/post/ebc5b2cd-bd04-4935-ae36-7bb5056e043f
16:43:16eroc19905 is now known as eroc1990
16:43:52knecht4 joins
17:04:50<balrog>Is there a way to archive WikiWikiWeb content? http://www.dairiki.org/HammondWiki
17:04:59<balrog>I guess AB, but I don't want to mess anything up
17:05:17<balrog>(this is the old, first wiki platform)
17:14:29eightthree quits [Ping timeout: 272 seconds]
17:15:13eightthree joins
17:46:27f_ quits [Ping timeout: 255 seconds]
17:48:56Mannie quits [Client Quit]
17:54:00JaffaCakes118 quits [Remote host closed the connection]
17:54:24JaffaCakes118 (JaffaCakes118) joins
18:18:05f_ (funderscore) joins
18:26:34DogsRNice joins
18:30:32DogsRNice_ joins
18:31:42JaffaCakes118 quits [Remote host closed the connection]
18:33:49Larsenv quits [Client Quit]
18:33:58DogsRNice quits [Ping timeout: 255 seconds]
18:35:03f_ quits [Ping timeout: 255 seconds]
19:14:14etnguyen03 quits [Client Quit]
19:41:38Island joins
20:32:04JaffaCakes118 (JaffaCakes118) joins
21:00:31ThetaDev quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]
21:00:39ThetaDev joins
21:22:00Larsenv (Larsenv) joins
21:32:15midou quits [Ping timeout: 272 seconds]
21:35:18midou joins
21:37:02whoom joins
21:38:16<whoom>Sorry if this is off-topic, but could someone help me find potential archives of something?
21:38:32<whoom>I'm looking for archives of an old Invisionfree board
21:38:33<whoom>There are only a few captures directly accessible on the wayback machine
21:38:41<whoom>but I'm wondering if I could possibly find more elsewhere
21:39:05<whoom>here’s a link to an IA snapshot: https://web.archive.org/web/20091204051950/http://z6.invisionfree.com/Ponyville/index.php
21:39:14whoom quits [Client Quit]
21:47:10<@JAA>!8ball Web chat?
21:47:10<eggdrop>🎱: JAA, you may rely on it
22:03:48qwertyasdfuiopghjkl quits [Client Quit]
22:04:36qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
22:13:05etnguyen03 (etnguyen03) joins
22:16:54BlueMaxima joins
22:20:04<icedice>thuban: Have you started archiving the Great Discord Links Hub (aka Scan Group Directory) links?
22:30:23BornOn420 quits [Client Quit]
22:31:10BornOn420 (BornOn420) joins
22:41:21parfait (kdqep) joins
22:55:16etnguyen03 quits [Client Quit]
22:55:42qwertyasdfuiopghjkl quits [Client Quit]
22:56:28qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
23:07:34midou quits [Ping timeout: 255 seconds]
23:10:28Guest quits [Quit: Connection closed]
23:10:28qwertyasdfuiopghjkl quits [Client Quit]
23:11:08qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
23:46:17etnguyen03 (etnguyen03) joins