| 00:11:01 | | Wohlstand quits [Ping timeout: 255 seconds] |
| 00:22:56 | | qw3rty joins |
| 00:23:29 | | qw3rty_ quits [Ping timeout: 272 seconds] |
| 00:26:11 | | ned joins |
| 00:26:30 | | ned quits [Client Quit] |
| 00:27:01 | | etnguyen03 quits [Client Quit] |
| 00:39:35 | | etnguyen03 (etnguyen03) joins |
| 00:44:00 | | Jackster joins |
| 00:51:21 | <Jackster> | Anyone got grab-site to successfully login to vbulletin forum? The cookies dont work for me. |
| 00:55:39 | | eroc19905 (eroc1990) joins |
| 00:57:41 | | eroc1990 quits [Ping timeout: 272 seconds] |
| 01:06:00 | | Arcorann (Arcorann) joins |
| 01:29:08 | <pabs> | so, uh, Iran... |
| 01:37:26 | <anarcat> | what about it |
| 01:38:34 | <nicolas17> | anarcat: iran vs israel attacks going on |
| 01:39:41 | <anarcat> | yes, well |
| 01:39:55 | <anarcat> | anything needs archiving? |
| 01:43:29 | <kpcyrd> | "where do you get your news from?" - "some irc channel for library enthusiasts" |
| 01:50:56 | | etnguyen03 quits [Client Quit] |
| 01:53:29 | <pabs> | archive both sides in case of escalation? |
| 01:55:01 | | tbc1887 quits [Quit: The Lounge - https://thelounge.chat] |
| 02:00:12 | | tbc1887 (tbc1887) joins |
| 02:30:51 | | Jackster quits [Client Quit] |
| 02:42:14 | <thuban> | !remindme 3h scrape bbc media guides for urls-sources |
| 02:42:15 | <eggdrop> | [remind] ok, i'll remind you at 2024-04-14T05:42:14Z |
| 02:50:44 | <fireonlive> | kpcyrd: ikr? i find out about so many things here lol |
| 02:53:56 | | etnguyen03 (etnguyen03) joins |
| 03:05:39 | | Wohlstand (Wohlstand) joins |
| 03:41:13 | | etnguyen03 quits [Client Quit] |
| 03:58:48 | | etnguyen03 (etnguyen03) joins |
| 04:01:02 | | HP_Archivist quits [Quit: Leaving] |
| 04:26:02 | | DogsRNice quits [Read error: Connection reset by peer] |
| 04:36:11 | | Doranwen quits [Ping timeout: 272 seconds] |
| 04:37:27 | | Doranwen (Doranwen) joins |
| 04:39:22 | | etnguyen03 quits [Client Quit] |
| 05:02:25 | | etnguyen03 (etnguyen03) joins |
| 05:19:07 | | etnguyen03 quits [Remote host closed the connection] |
| 05:42:15 | <eggdrop> | [remind] thuban: scrape bbc media guides for urls-sources |
| 06:09:25 | | DogsRNice joins |
| 06:12:11 | | Island quits [Read error: Connection reset by peer] |
| 06:16:07 | | DogsRNice quits [Read error: Connection reset by peer] |
| 06:20:55 | | Wohlstand quits [Ping timeout: 255 seconds] |
| 07:05:02 | | Unholy2361 quits [Remote host closed the connection] |
| 07:06:09 | | Unholy2361 (Unholy2361) joins |
| 07:15:50 | | BlueMaxima quits [Read error: Connection reset by peer] |
| 07:56:36 | | pabs quits [Remote host closed the connection] |
| 07:57:22 | | pabs (pabs) joins |
| 09:00:01 | | Bleo182600 quits [Client Quit] |
| 09:01:20 | | Bleo182600 joins |
| 10:02:00 | | igloo22225 quits [Quit: The Lounge - https://thelounge.chat] |
| 10:02:25 | | igloo22225 (igloo22225) joins |
| 10:31:24 | | jacksonchen666 (jacksonchen666) joins |
| 10:53:23 | | kiryu quits [Remote host closed the connection] |
| 10:54:50 | | kiryu joins |
| 10:54:50 | | kiryu is now authenticated as kiryu |
| 10:54:50 | | kiryu quits [Changing host] |
| 10:54:50 | | kiryu (kiryu) joins |
| 11:01:55 | | MrMcNuggets (MrMcNuggets) joins |
| 11:02:59 | | MrMcNuggets quits [Client Quit] |
| 11:42:15 | | f_ (funderscore) joins |
| 11:47:12 | | f_ quits [Remote host closed the connection] |
| 11:49:11 | | f_ (funderscore) joins |
| 11:54:33 | | f_ quits [Ping timeout: 255 seconds] |
| 12:47:26 | | HP_Archivist (HP_Archivist) joins |
| 13:07:59 | | icedice (icedice) joins |
| 13:10:49 | <icedice> | thuban: I remembered another scanlation group link directory, a Discord server called Great Discord Links Hub (previously known as Scan Group Directory): https://discord.gg/xAsyVb52a9 |
| 13:11:45 | <icedice> | With Mangaupdates, MangaDex, Vatoto, and Great Discord Links Hub we should have pretty good coverage of scanlation group sites |
| 13:12:04 | <icedice> | I'll see if someone in #discard can scrape the links |
| 13:40:34 | | Arcorann quits [Ping timeout: 255 seconds] |
| 14:14:47 | | etnguyen03 (etnguyen03) joins |
| 14:38:50 | | IDK quits [Quit: Connection closed for inactivity] |
| 14:39:51 | | BornOn420 (BornOn420) joins |
| 14:44:22 | | etnguyen03 quits [Client Quit] |
| 14:48:07 | | etnguyen03 (etnguyen03) joins |
| 14:48:30 | | IDK (IDK) joins |
| 14:57:07 | | Jackster joins |
| 15:00:31 | <tapos> | You should scrape https://e-hentai.org/ for scanlation sites as well, scanlators sometimes post their site in the comments section |
| 15:01:07 | <tapos> | If it's too much work to scrape for links, then you could scape it via Bing |
| 15:01:14 | <tapos> | Not as good, but better than nothing |
| 15:03:39 | <tapos> | Also, I think there'll probably be stuff on there that isn't covered by the link lists you've already scraped |
| 15:16:26 | | etnguyen03 quits [Client Quit] |
| 15:18:41 | | etnguyen03 (etnguyen03) joins |
| 15:30:15 | <Jackster> | Nuked offline more like |
| 15:33:30 | | etnguyen03 quits [Client Quit] |
| 15:39:48 | | etnguyen03 (etnguyen03) joins |
| 15:40:45 | <tapos> | Nevermind the Bing scape, it seems like Bing doesn't index E-Hentai |
| 15:40:58 | <tapos> | I'm guessing FAKKU got them to censor out the whole domain |
| 15:41:07 | <tapos> | That publisher tends to go nuclear |
| 15:52:41 | | Notrealname1234 (Notrealname1234) joins |
| 15:54:30 | | albertlarsan68 (AlbertLarsan68) joins |
| 15:55:58 | | Notrealname1234 quits [Client Quit] |
| 16:23:32 | | Larsenv quits [Client Quit] |
| 16:31:12 | | etnguyen03 quits [Client Quit] |
| 16:50:08 | | f_ (funderscore) joins |
| 16:50:50 | | Larsenv (Larsenv) joins |
| 17:01:32 | | Island joins |
| 17:05:48 | | etnguyen03 (etnguyen03) joins |
| 17:23:45 | | f_ quits [Client Quit] |
| 17:36:33 | | Notrealname1234 (Notrealname1234) joins |
| 17:38:32 | | Notrealname1234 quits [Client Quit] |
| 17:48:57 | | etnguyen03 quits [Client Quit] |
| 18:09:59 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
| 18:18:40 | | tzt quits [Ping timeout: 255 seconds] |
| 18:41:09 | | jacksonchen666 quits [Client Quit] |
| 19:14:49 | | JaffaCakes118 (JaffaCakes118) joins |
| 19:15:13 | <h2ibot> | Manu edited Deathwatch (+294, add gimpscripts.net): https://wiki.archiveteam.org/?diff=52046&oldid=52024 |
| 19:16:45 | | etnguyen03 (etnguyen03) joins |
| 19:27:17 | | midou quits [Ping timeout: 272 seconds] |
| 19:43:19 | <h2ibot> | Manu edited Deathwatch (+38, add gimpscripts.net job reference): https://wiki.archiveteam.org/?diff=52047&oldid=52046 |
| 19:48:22 | | midou joins |
| 19:59:54 | | etnguyen03 quits [Client Quit] |
| 20:03:33 | | Naruyoko5 quits [Quit: Leaving] |
| 20:20:45 | | HP_Archivist quits [Client Quit] |
| 20:31:56 | | tzt (tzt) joins |
| 21:05:41 | | Wohlstand (Wohlstand) joins |
| 21:09:57 | | etnguyen03 (etnguyen03) joins |
| 21:23:27 | | Barto (Barto) joins |
| 21:24:23 | <Barto> | woop woop irc is back after this bigass btrfs volume failure, still a lot of files to recover, but that that part was saved ;-) |
| 21:59:06 | | @AlsoJAA quits [Quit: So long, and thanks for all the fish!] |
| 22:02:45 | | Arcorann (Arcorann) joins |
| 22:09:58 | | Arcorann quits [Ping timeout: 255 seconds] |
| 22:13:31 | | etnguyen03 quits [Client Quit] |
| 22:36:42 | | JTL quits [Quit: .] |
| 22:36:58 | | JTL (JTL) joins |
| 22:39:45 | | BlueMaxima joins |
| 22:53:30 | <icedice> | thuban: Vokun is taking care of scraping that Discord server for links |
| 23:21:17 | | parfait_ quits [Quit: Leaving] |
| 23:21:47 | | pseudorizer quits [Quit: ZNC 1.9.0 - https://znc.in] |
| 23:24:08 | | pseudorizer (pseudorizer) joins |
| 23:41:46 | <icedice> | Should we expand the scope of the scanlation group archivation project to included social media (other than Discord)? |
| 23:41:53 | <icedice> | Seems like a good idea |
| 23:43:26 | <icedice> | Their Discord servers shold probably be left alone since it's sort of an invasion of privacy to archive that and index it publicly |
| 23:45:23 | <thuban> | icedice: depends which social media; we don't have a good way of handling most of the major sites (facebook, twitter, instagram) right now. |
| 23:47:16 | <thuban> | fwiw, for the mangaupdates and vatoto scrapes i grabbed all links listed, and dumped relevant urls into appropriate projects (including telegram) |
| 23:49:34 | <icedice> | Ok, nice |
| 23:50:00 | <icedice> | Not sure what sites MangaDex lets you list, probably Twitter and Facebook, at least |
| 23:51:01 | <icedice> | Vokun got Tumblr, Facebook, Twitter, and Instagram links from the Discord scrape |
| 23:51:21 | <icedice> | The Tumblr ones are important since some groups use that for their websites |
| 23:51:47 | <icedice> | The rest we dump into the relevant projects, I guess? |
| 23:55:56 | <thuban> | mangadex's group schema only includes website, irc, discord, email, twitter, and mangaupdates links, so nothing useful for us there. (there are a bunch of groups with other social media, like telegram or vkontakte, but they get listed as 'website' so i've already covered them) |
| 23:58:01 | <icedice> | Ok |
| 23:58:50 | <icedice> | That thing said earlier about scraping E-Hentai for links might be a good idea |
| 23:59:09 | <icedice> | If any Google-hosted sites are going to get yeeted, it's those |
| 23:59:48 | | Jackster quits [Client Quit] |