00:11:01 | | Wohlstand quits [Ping timeout: 255 seconds] |
00:22:56 | | qw3rty joins |
00:23:29 | | qw3rty_ quits [Ping timeout: 272 seconds] |
00:26:11 | | ned joins |
00:26:30 | | ned quits [Client Quit] |
00:27:01 | | etnguyen03 quits [Client Quit] |
00:39:35 | | etnguyen03 (etnguyen03) joins |
00:44:00 | | Jackster joins |
00:51:21 | <Jackster> | Anyone got grab-site to successfully log in to a vBulletin forum? The cookies don't work for me. |
00:55:39 | | eroc19905 (eroc1990) joins |
00:57:41 | | eroc1990 quits [Ping timeout: 272 seconds] |
01:06:00 | | Arcorann (Arcorann) joins |
01:29:08 | <pabs> | so, uh, Iran... |
01:37:26 | <anarcat> | what about it |
01:38:34 | <nicolas17> | anarcat: iran vs israel attacks going on |
01:39:41 | <anarcat> | yes, well |
01:39:55 | <anarcat> | anything needs archiving? |
01:43:29 | <kpcyrd> | "where do you get your news from?" - "some irc channel for library enthusiasts" |
01:50:56 | | etnguyen03 quits [Client Quit] |
01:53:29 | <pabs> | archive both sides in case of escalation? |
01:55:01 | | tbc1887 quits [Quit: The Lounge - https://thelounge.chat] |
02:00:12 | | tbc1887 (tbc1887) joins |
02:30:51 | | Jackster quits [Client Quit] |
02:42:14 | <thuban> | !remindme 3h scrape bbc media guides for urls-sources |
02:42:15 | <eggdrop> | [remind] ok, i'll remind you at 2024-04-14T05:42:14Z |
02:50:44 | <fireonlive> | kpcyrd: ikr? i find out about so many things here lol |
02:53:56 | | etnguyen03 (etnguyen03) joins |
03:05:39 | | Wohlstand (Wohlstand) joins |
03:41:13 | | etnguyen03 quits [Client Quit] |
03:58:48 | | etnguyen03 (etnguyen03) joins |
04:01:02 | | HP_Archivist quits [Quit: Leaving] |
04:26:02 | | DogsRNice quits [Read error: Connection reset by peer] |
04:36:11 | | Doranwen quits [Ping timeout: 272 seconds] |
04:37:27 | | Doranwen (Doranwen) joins |
04:39:22 | | etnguyen03 quits [Client Quit] |
05:02:25 | | etnguyen03 (etnguyen03) joins |
05:19:07 | | etnguyen03 quits [Remote host closed the connection] |
05:42:15 | <eggdrop> | [remind] thuban: scrape bbc media guides for urls-sources |
06:09:25 | | DogsRNice joins |
06:12:11 | | Island quits [Read error: Connection reset by peer] |
06:16:07 | | DogsRNice quits [Read error: Connection reset by peer] |
06:20:55 | | Wohlstand quits [Ping timeout: 255 seconds] |
07:05:02 | | Unholy2361 quits [Remote host closed the connection] |
07:06:09 | | Unholy2361 (Unholy2361) joins |
07:15:50 | | BlueMaxima quits [Read error: Connection reset by peer] |
07:56:36 | | pabs quits [Remote host closed the connection] |
07:57:22 | | pabs (pabs) joins |
09:00:01 | | Bleo182600 quits [Client Quit] |
09:01:20 | | Bleo182600 joins |
10:02:00 | | igloo22225 quits [Quit: The Lounge - https://thelounge.chat] |
10:02:25 | | igloo22225 (igloo22225) joins |
10:31:24 | | jacksonchen666 (jacksonchen666) joins |
10:53:23 | | kiryu quits [Remote host closed the connection] |
10:54:50 | | kiryu joins |
10:54:50 | | kiryu is now authenticated as kiryu |
10:54:50 | | kiryu quits [Changing host] |
10:54:50 | | kiryu (kiryu) joins |
11:01:55 | | MrMcNuggets (MrMcNuggets) joins |
11:02:59 | | MrMcNuggets quits [Client Quit] |
11:42:15 | | f_ (funderscore) joins |
11:47:12 | | f_ quits [Remote host closed the connection] |
11:49:11 | | f_ (funderscore) joins |
11:54:33 | | f_ quits [Ping timeout: 255 seconds] |
12:47:26 | | HP_Archivist (HP_Archivist) joins |
13:07:59 | | icedice (icedice) joins |
13:10:49 | <icedice> | thuban: I remembered another scanlation group link directory, a Discord server called Great Discord Links Hub (previously known as Scan Group Directory): https://discord.gg/xAsyVb52a9 |
13:11:45 | <icedice> | With Mangaupdates, MangaDex, Vatoto, and Great Discord Links Hub we should have pretty good coverage of scanlation group sites |
13:12:04 | <icedice> | I'll see if someone in #discard can scrape the links |
13:40:34 | | Arcorann quits [Ping timeout: 255 seconds] |
14:14:47 | | etnguyen03 (etnguyen03) joins |
14:38:50 | | IDK quits [Quit: Connection closed for inactivity] |
14:39:51 | | BornOn420 (BornOn420) joins |
14:44:22 | | etnguyen03 quits [Client Quit] |
14:48:07 | | etnguyen03 (etnguyen03) joins |
14:48:30 | | IDK (IDK) joins |
14:57:07 | | Jackster joins |
15:00:31 | <tapos> | You should scrape https://e-hentai.org/ for scanlation sites as well, scanlators sometimes post their site in the comments section |
15:01:07 | <tapos> | If it's too much work to scrape for links, then you could scrape it via Bing |
15:01:14 | <tapos> | Not as good, but better than nothing |
15:03:39 | <tapos> | Also, I think there'll probably be stuff on there that isn't covered by the link lists you've already scraped |
15:16:26 | | etnguyen03 quits [Client Quit] |
15:18:41 | | etnguyen03 (etnguyen03) joins |
15:30:15 | <Jackster> | Nuked offline more like |
15:33:30 | | etnguyen03 quits [Client Quit] |
15:39:48 | | etnguyen03 (etnguyen03) joins |
15:40:45 | <tapos> | Never mind the Bing scrape, it seems like Bing doesn't index E-Hentai |
15:40:58 | <tapos> | I'm guessing FAKKU got them to censor out the whole domain |
15:41:07 | <tapos> | That publisher tends to go nuclear |
15:52:41 | | Notrealname1234 (Notrealname1234) joins |
15:54:30 | | albertlarsan68 (AlbertLarsan68) joins |
15:55:58 | | Notrealname1234 quits [Client Quit] |
16:23:32 | | Larsenv quits [Client Quit] |
16:31:12 | | etnguyen03 quits [Client Quit] |
16:50:08 | | f_ (funderscore) joins |
16:50:50 | | Larsenv (Larsenv) joins |
17:01:32 | | Island joins |
17:05:48 | | etnguyen03 (etnguyen03) joins |
17:23:45 | | f_ quits [Client Quit] |
17:36:33 | | Notrealname1234 (Notrealname1234) joins |
17:38:32 | | Notrealname1234 quits [Client Quit] |
17:48:57 | | etnguyen03 quits [Client Quit] |
18:09:59 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
18:18:40 | | tzt quits [Ping timeout: 255 seconds] |
18:41:09 | | jacksonchen666 quits [Client Quit] |
19:14:49 | | JaffaCakes118 (JaffaCakes118) joins |
19:15:13 | <h2ibot> | Manu edited Deathwatch (+294, add gimpscripts.net): https://wiki.archiveteam.org/?diff=52046&oldid=52024 |
19:16:45 | | etnguyen03 (etnguyen03) joins |
19:27:17 | | midou quits [Ping timeout: 272 seconds] |
19:43:19 | <h2ibot> | Manu edited Deathwatch (+38, add gimpscripts.net job reference): https://wiki.archiveteam.org/?diff=52047&oldid=52046 |
19:48:22 | | midou joins |
19:59:54 | | etnguyen03 quits [Client Quit] |
20:03:33 | | Naruyoko5 quits [Quit: Leaving] |
20:20:45 | | HP_Archivist quits [Client Quit] |
20:31:56 | | tzt (tzt) joins |
21:05:41 | | Wohlstand (Wohlstand) joins |
21:09:57 | | etnguyen03 (etnguyen03) joins |
21:23:27 | | Barto (Barto) joins |
21:24:23 | <Barto> | woop woop irc is back after this bigass btrfs volume failure, still a lot of files to recover, but that part was saved ;-) |
21:59:06 | | @AlsoJAA quits [Quit: So long, and thanks for all the fish!] |
22:02:45 | | Arcorann (Arcorann) joins |
22:09:58 | | Arcorann quits [Ping timeout: 255 seconds] |
22:13:31 | | etnguyen03 quits [Client Quit] |
22:36:42 | | JTL quits [Quit: .] |
22:36:58 | | JTL (JTL) joins |
22:39:45 | | BlueMaxima joins |
22:53:30 | <icedice> | thuban: Vokun is taking care of scraping that Discord server for links |
23:21:17 | | parfait_ quits [Quit: Leaving] |
23:21:47 | | pseudorizer quits [Quit: ZNC 1.9.0 - https://znc.in] |
23:24:08 | | pseudorizer (pseudorizer) joins |
23:41:46 | <icedice> | Should we expand the scope of the scanlation group archival project to include social media (other than Discord)? |
23:41:53 | <icedice> | Seems like a good idea |
23:43:26 | <icedice> | Their Discord servers should probably be left alone since it's sort of an invasion of privacy to archive that and index it publicly |
23:45:23 | <thuban> | icedice: depends which social media; we don't have a good way of handling most of the major sites (facebook, twitter, instagram) right now. |
23:47:16 | <thuban> | fwiw, for the mangaupdates and vatoto scrapes i grabbed all links listed, and dumped relevant urls into appropriate projects (including telegram) |
23:49:34 | <icedice> | Ok, nice |
23:50:00 | <icedice> | Not sure what sites MangaDex lets you list, probably Twitter and Facebook, at least |
23:51:01 | <icedice> | Vokun got Tumblr, Facebook, Twitter, and Instagram links from the Discord scrape |
23:51:21 | <icedice> | The Tumblr ones are important since some groups use that for their websites |
23:51:47 | <icedice> | The rest we dump into the relevant projects, I guess? |
23:55:56 | <thuban> | mangadex's group schema only includes website, irc, discord, email, twitter, and mangaupdates links, so nothing useful for us there. (there are a bunch of groups with other social media, like telegram or vkontakte, but they get listed as 'website' so i've already covered them) |
23:58:01 | <icedice> | Ok |
23:58:50 | <icedice> | That earlier suggestion about scraping E-Hentai for links might be a good idea |
23:59:09 | <icedice> | If any Google-hosted sites are going to get yeeted, it's those |
23:59:48 | | Jackster quits [Client Quit] |