00:11:01Wohlstand quits [Ping timeout: 255 seconds]
00:22:56qw3rty joins
00:23:29qw3rty_ quits [Ping timeout: 272 seconds]
00:26:11ned joins
00:26:30ned quits [Client Quit]
00:27:01etnguyen03 quits [Client Quit]
00:39:35etnguyen03 (etnguyen03) joins
00:44:00Jackster joins
00:51:21<Jackster>Anyone got grab-site to successfully login to vbulletin forum? The cookies dont work for me.
00:55:39eroc19905 (eroc1990) joins
00:57:41eroc1990 quits [Ping timeout: 272 seconds]
01:06:00Arcorann (Arcorann) joins
01:29:08<pabs>so, uh, Iran...
01:37:26<anarcat>what about it
01:38:34<nicolas17>anarcat: iran vs israel attacks going on
01:39:41<anarcat>yes, well
01:39:55<anarcat>anything needs archiving?
01:43:29<kpcyrd>"where do you get your news from?" - "some irc channel for library enthusiasts"
01:50:56etnguyen03 quits [Client Quit]
01:53:29<pabs>archive both sides in case of escalation?
01:55:01tbc1887 quits [Quit: The Lounge - https://thelounge.chat]
02:00:12tbc1887 (tbc1887) joins
02:30:51Jackster quits [Client Quit]
02:42:14<thuban>!remindme 3h scrape bbc media guides for urls-sources
02:42:15<eggdrop>[remind] ok, i'll remind you at 2024-04-14T05:42:14Z
02:50:44<fireonlive>kpcyrd: ikr? i find out about so many things here lol
02:53:56etnguyen03 (etnguyen03) joins
03:05:39Wohlstand (Wohlstand) joins
03:41:13etnguyen03 quits [Client Quit]
03:58:48etnguyen03 (etnguyen03) joins
04:01:02HP_Archivist quits [Quit: Leaving]
04:26:02DogsRNice quits [Read error: Connection reset by peer]
04:36:11Doranwen quits [Ping timeout: 272 seconds]
04:37:27Doranwen (Doranwen) joins
04:39:22etnguyen03 quits [Client Quit]
05:02:25etnguyen03 (etnguyen03) joins
05:19:07etnguyen03 quits [Remote host closed the connection]
05:42:15<eggdrop>[remind] thuban: scrape bbc media guides for urls-sources
06:09:25DogsRNice joins
06:12:11Island quits [Read error: Connection reset by peer]
06:16:07DogsRNice quits [Read error: Connection reset by peer]
06:20:55Wohlstand quits [Ping timeout: 255 seconds]
07:05:02Unholy2361 quits [Remote host closed the connection]
07:06:09Unholy2361 (Unholy2361) joins
07:15:50BlueMaxima quits [Read error: Connection reset by peer]
07:56:36pabs quits [Remote host closed the connection]
07:57:22pabs (pabs) joins
09:00:01Bleo182600 quits [Client Quit]
09:01:20Bleo182600 joins
10:02:00igloo22225 quits [Quit: The Lounge - https://thelounge.chat]
10:02:25igloo22225 (igloo22225) joins
10:31:24jacksonchen666 (jacksonchen666) joins
10:53:23kiryu quits [Remote host closed the connection]
10:54:50kiryu joins
10:54:50kiryu quits [Changing host]
10:54:50kiryu (kiryu) joins
11:01:55MrMcNuggets (MrMcNuggets) joins
11:02:59MrMcNuggets quits [Client Quit]
11:42:15f_ (funderscore) joins
11:47:12f_ quits [Remote host closed the connection]
11:49:11f_ (funderscore) joins
11:54:33f_ quits [Ping timeout: 255 seconds]
12:47:26HP_Archivist (HP_Archivist) joins
13:07:59icedice (icedice) joins
13:10:49<icedice>thuban: I remembered another scanlation group link directory, a Discord server called Great Discord Links Hub (previously known as Scan Group Directory): https://discord.gg/xAsyVb52a9
13:11:45<icedice>With Mangaupdates, MangaDex, Vatoto, and Great Discord Links Hub we should have pretty good coverage of scanlation group sites
13:12:04<icedice>I'll see if someone in #discard can scrape the links
13:40:34Arcorann quits [Ping timeout: 255 seconds]
14:14:47etnguyen03 (etnguyen03) joins
14:38:50IDK quits [Quit: Connection closed for inactivity]
14:39:51BornOn420 (BornOn420) joins
14:44:22etnguyen03 quits [Client Quit]
14:48:07etnguyen03 (etnguyen03) joins
14:48:30IDK (IDK) joins
14:57:07Jackster joins
15:00:31<tapos>You should scrape https://e-hentai.org/ for scanlation sites as well, scanlators sometimes post their site in the comments section
15:01:07<tapos>If it's too much work to scrape for links, then you could scape it via Bing
15:01:14<tapos>Not as good, but better than nothing
15:03:39<tapos>Also, I think there'll probably be stuff on there that isn't covered by the link lists you've already scraped
15:16:26etnguyen03 quits [Client Quit]
15:18:41etnguyen03 (etnguyen03) joins
15:30:15<Jackster>Nuked offline more like
15:33:30etnguyen03 quits [Client Quit]
15:39:48etnguyen03 (etnguyen03) joins
15:40:45<tapos>Nevermind the Bing scape, it seems like Bing doesn't index E-Hentai
15:40:58<tapos>I'm guessing FAKKU got them to censor out the whole domain
15:41:07<tapos>That publisher tends to go nuclear
15:52:41Notrealname1234 (Notrealname1234) joins
15:54:30albertlarsan68 (AlbertLarsan68) joins
15:55:58Notrealname1234 quits [Client Quit]
16:23:32Larsenv quits [Client Quit]
16:31:12etnguyen03 quits [Client Quit]
16:50:08f_ (funderscore) joins
16:50:50Larsenv (Larsenv) joins
17:01:32Island joins
17:05:48etnguyen03 (etnguyen03) joins
17:23:45f_ quits [Client Quit]
17:36:33Notrealname1234 (Notrealname1234) joins
17:38:32Notrealname1234 quits [Client Quit]
17:48:57etnguyen03 quits [Client Quit]
18:09:59qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
18:18:40tzt quits [Ping timeout: 255 seconds]
18:41:09jacksonchen666 quits [Client Quit]
19:14:49JaffaCakes118 (JaffaCakes118) joins
19:15:13<h2ibot>Manu edited Deathwatch (+294, add gimpscripts.net): https://wiki.archiveteam.org/?diff=52046&oldid=52024
19:16:45etnguyen03 (etnguyen03) joins
19:27:17midou quits [Ping timeout: 272 seconds]
19:43:19<h2ibot>Manu edited Deathwatch (+38, add gimpscripts.net job reference): https://wiki.archiveteam.org/?diff=52047&oldid=52046
19:48:22midou joins
19:59:54etnguyen03 quits [Client Quit]
20:03:33Naruyoko5 quits [Quit: Leaving]
20:20:45HP_Archivist quits [Client Quit]
20:31:56tzt (tzt) joins
21:05:41Wohlstand (Wohlstand) joins
21:09:57etnguyen03 (etnguyen03) joins
21:23:27Barto (Barto) joins
21:24:23<Barto>woop woop irc is back after this bigass btrfs volume failure, still a lot of files to recover, but that that part was saved ;-)
21:59:06@AlsoJAA quits [Quit: So long, and thanks for all the fish!]
22:02:45Arcorann (Arcorann) joins
22:09:58Arcorann quits [Ping timeout: 255 seconds]
22:13:31etnguyen03 quits [Client Quit]
22:36:42JTL quits [Quit: .]
22:36:58JTL (JTL) joins
22:39:45BlueMaxima joins
22:53:30<icedice>thuban: Vokun is taking care of scraping that Discord server for links
23:21:17parfait_ quits [Quit: Leaving]
23:21:47pseudorizer quits [Quit: ZNC 1.9.0 - https://znc.in]
23:24:08pseudorizer (pseudorizer) joins
23:41:46<icedice>Should we expand the scope of the scanlation group archivation project to included social media (other than Discord)?
23:41:53<icedice>Seems like a good idea
23:43:26<icedice>Their Discord servers shold probably be left alone since it's sort of an invasion of privacy to archive that and index it publicly
23:45:23<thuban>icedice: depends which social media; we don't have a good way of handling most of the major sites (facebook, twitter, instagram) right now.
23:47:16<thuban>fwiw, for the mangaupdates and vatoto scrapes i grabbed all links listed, and dumped relevant urls into appropriate projects (including telegram)
23:49:34<icedice>Ok, nice
23:50:00<icedice>Not sure what sites MangaDex lets you list, probably Twitter and Facebook, at least
23:51:01<icedice>Vokun got Tumblr, Facebook, Twitter, and Instagram links from the Discord scrape
23:51:21<icedice>The Tumblr ones are important since some groups use that for their websites
23:51:47<icedice>The rest we dump into the relevant projects, I guess?
23:55:56<thuban>mangadex's group schema only includes website, irc, discord, email, twitter, and mangaupdates links, so nothing useful for us there. (there are a bunch of groups with other social media, like telegram or vkontakte, but they get listed as 'website' so i've already covered them)
23:58:01<icedice>Ok
23:58:50<icedice>That thing said earlier about scraping E-Hentai for links might be a good idea
23:59:09<icedice>If any Google-hosted sites are going to get yeeted, it's those
23:59:48Jackster quits [Client Quit]