00:12:41 | | bass joins |
00:13:57 | | bass quits [Client Quit] |
00:18:37 | | etnguyen03 quits [Client Quit] |
00:20:30 | | etnguyen03 (etnguyen03) joins |
00:36:14 | | etnguyen03 quits [Client Quit] |
00:40:46 | | tzt (tzt) joins |
00:42:01 | | etnguyen03 (etnguyen03) joins |
00:52:46 | | etnguyen03 quits [Client Quit] |
01:00:05 | | Hackerpcs quits [Quit: Hackerpcs] |
01:02:01 | | Hackerpcs (Hackerpcs) joins |
01:06:12 | | etnguyen03 (etnguyen03) joins |
01:48:55 | | michaelblob_ (michaelblob) joins |
01:50:08 | | ds joins |
01:50:37 | | ds quits [Client Quit] |
01:52:16 | | michaelblob quits [Ping timeout: 255 seconds] |
02:31:05 | | michaelblob (michaelblob) joins |
02:35:01 | | michaelblob_ quits [Ping timeout: 255 seconds] |
03:16:32 | | HP_Archivist (HP_Archivist) joins |
03:39:12 | <nicolas17> | #archiveteam topic may need updating (pointless to still mention taringa, dunno about the other two) |
03:48:30 | | nulldata8 (nulldata) joins |
03:50:33 | | nulldata quits [Ping timeout: 272 seconds] |
03:50:33 | | nulldata8 is now known as nulldata |
03:57:36 | | deadorbit joins |
04:07:43 | | nicolas17 quits [Ping timeout: 255 seconds] |
04:23:28 | | midou quits [Ping timeout: 255 seconds] |
04:24:32 | | midou joins |
04:43:45 | | pabs quits [Ping timeout: 272 seconds] |
04:49:28 | | pabs (pabs) joins |
04:59:02 | | Island quits [Read error: Connection reset by peer] |
04:59:38 | | etnguyen03 quits [Client Quit] |
05:02:46 | | etnguyen03 (etnguyen03) joins |
05:15:29 | | etnguyen03 quits [Remote host closed the connection] |
05:18:43 | | deadorbit quits [Client Quit] |
05:46:39 | | @arkiver is back from a few days of lower availability |
05:50:17 | | fireonlive waves to arkiver |
05:50:21 | <fireonlive> | welcome back! |
05:50:25 | <@arkiver> | thanks :) |
05:50:28 | <fireonlive> | :) |
06:05:15 | | JaffaCakes118_2 quits [Remote host closed the connection] |
06:07:00 | | f_ (funderscore) joins |
06:10:13 | | JaffaCakes118 (JaffaCakes118) joins |
06:11:54 | | DogsRNice quits [Read error: Connection reset by peer] |
06:12:14 | | f_ quits [Remote host closed the connection] |
06:32:34 | | f_ (funderscore) joins |
07:00:33 | <h2ibot> | JAABot edited CurrentWarriorProject (-38): https://wiki.archiveteam.org/?diff=52051&oldid=52007 |
07:05:03 | | Unholy2361 quits [Remote host closed the connection] |
07:06:11 | | Unholy23619 (Unholy2361) joins |
07:07:41 | | BlueMaxima quits [Read error: Connection reset by peer] |
07:17:05 | | Guest quits [Client Quit] |
07:17:06 | | qwertyasdfuiopghjkl quits [Client Quit] |
07:17:22 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
07:22:11 | | f_ quits [Client Quit] |
07:22:31 | | f_ (funderscore) joins |
07:28:48 | | Naruyoko joins |
07:39:03 | | f_ quits [Client Quit] |
07:52:16 | | beastbg8_ quits [Ping timeout: 255 seconds] |
08:20:08 | | blue_0000ff quits [Read error: Connection reset by peer] |
08:20:51 | | blue_0000ff joins |
08:56:37 | | Arcorann (Arcorann) joins |
09:00:02 | | Bleo182600 quits [Client Quit] |
09:01:21 | | Bleo182600 joins |
09:50:09 | | pseudorizer quits [Quit: ZNC 1.9.0 - https://znc.in] |
09:51:05 | | pseudorizer (pseudorizer) joins |
09:56:22 | | f_ (funderscore) joins |
11:06:55 | | jacksonchen666 (jacksonchen666) joins |
11:22:36 | | f_ quits [Ping timeout: 255 seconds] |
11:48:41 | <c3manu> | fyi: a german journalist apparently has a case against him for linking to the linksunten-indymedia archive. that's after offices were raided and electronic devices confiscated. only got a german language link for now: https://rote-hilfe.de/meldungen/unbequeme-berichterstattung-prozess-gegen-linken-journalisten |
11:49:38 | <c3manu> | that's an english one from last year: https://cpj.org/2023/01/german-police-search-office-of-independent-broadcaster-and-2-journalists-homes-seize-equipment-and-documents/ |
12:12:06 | | jacksonchen666 quits [Ping timeout: 255 seconds] |
12:14:00 | | jacksonchen666 (jacksonchen666) joins |
12:52:56 | <murb> | how is (2) https://www.gesetze-im-internet.de/stgb/__85.html actually interpreted by the courts? |
13:02:52 | | etnguyen03 (etnguyen03) joins |
13:05:01 | <c3manu> | murb: not sure, the case hasn't been decided yet. it seems to be based on §129 though. and it's really fishy in this whole matter. for making the website illegal, they declared linksunten to be a german "Verein", which is not at all what it was |
13:05:14 | <c3manu> | assuming you can read german: https://www.tagesschau.de/inland/indymedia-verbot-101.html |
13:05:42 | <c3manu> | i don't remember what the verdict on that was though, i would have to read up on that |
13:15:38 | <h2ibot> | Manu edited Mailman/2 (+147, http://jul.es/pipermail): https://wiki.archiveteam.org/?diff=52052&oldid=52050 |
13:18:26 | | eroc1990 quits [Quit: The Lounge - https://thelounge.chat] |
13:18:58 | | eroc1990 (eroc1990) joins |
13:22:44 | | Naruyoko5 joins |
13:23:39 | <h2ibot> | Manu edited Mailman/2 (+0, http://dovecot.org/pipermail/): https://wiki.archiveteam.org/?diff=52053&oldid=52052 |
13:25:16 | | Naruyoko quits [Ping timeout: 255 seconds] |
13:27:11 | | Naruyoko joins |
13:29:19 | | Naruyoko5 quits [Ping timeout: 255 seconds] |
13:33:22 | | Arcorann quits [Ping timeout: 255 seconds] |
13:58:32 | | Exorcism quits [Excess Flood] |
13:58:44 | | Exorcism (exorcism) joins |
14:00:11 | | jacksonchen666 quits [Remote host closed the connection] |
14:00:59 | | jacksonchen666 (jacksonchen666) joins |
14:20:05 | | midou quits [Ping timeout: 272 seconds] |
14:26:29 | | midou joins |
14:29:06 | | deadorbit joins |
14:30:22 | | deadorbit22 joins |
14:34:25 | | deadorbit quits [Ping timeout: 265 seconds] |
14:34:52 | <h2ibot> | Manu edited Mailman/2 (-30, Running https://erlang.org/mailman): https://wiki.archiveteam.org/?diff=52054&oldid=52053 |
14:41:02 | | etnguyen03 quits [Client Quit] |
14:41:38 | | kiryu_ joins |
14:42:16 | | kiryu__ joins |
14:44:28 | | kiryu quits [Ping timeout: 255 seconds] |
14:46:16 | | kiryu_ quits [Ping timeout: 255 seconds] |
15:10:40 | | deadorbit22 quits [Ping timeout: 265 seconds] |
15:34:38 | | Naruyoko5 joins |
15:36:55 | | kiryu_ joins |
15:37:34 | | driib quits [Ping timeout: 255 seconds] |
15:38:01 | | Naruyoko quits [Ping timeout: 255 seconds] |
15:40:10 | | Naruyoko joins |
15:41:01 | | Naruyoko quits [Client Quit] |
15:41:10 | | kiryu__ quits [Ping timeout: 255 seconds] |
15:42:58 | | Naruyoko5 quits [Ping timeout: 255 seconds] |
15:57:08 | | beastbg8 (beastbg8) joins |
15:57:32 | | driib (driib) joins |
16:11:26 | | f_ (funderscore) joins |
16:45:49 | | nicolas17 joins |
16:49:46 | <@JAA> | I'm getting rid of a bunch of old project channels today. You won't notice anything as they've been inaccessible since late 2022 already anyway. They're also marked accordingly on the wiki since then. |
16:53:25 | | fireonlive pours several out |
17:01:27 | | JaffaCakes118 quits [Remote host closed the connection] |
17:01:51 | | JaffaCakes118 (JaffaCakes118) joins |
17:23:30 | | SootBector quits [Ping timeout: 255 seconds] |
17:25:39 | | SootBector (SootBector) joins |
17:26:05 | | deadorbit joins |
17:26:18 | <deadorbit> | has anyone thought of archiving help.openstreetmap.org |
17:27:06 | | DogsRNice joins |
17:40:08 | <@JAA> | Yes, it was fully archived with ArchiveBot last month. |
17:42:11 | <nicolas17> | and coordinated with the openstreetmap admins |
17:52:45 | | f_ quits [Ping timeout: 255 seconds] |
17:58:52 | <deadorbit> | nice |
18:01:29 | | DogsRNice_ joins |
18:05:10 | | DogsRNice quits [Ping timeout: 255 seconds] |
18:06:04 | | DogsRNice_ quits [Ping timeout: 255 seconds] |
18:15:59 | | Island joins |
18:50:21 | | etnguyen03 (etnguyen03) joins |
19:04:07 | | deadorbit quits [Ping timeout: 265 seconds] |
19:15:50 | | etnguyen03 quits [Client Quit] |
19:21:49 | | Guest joins |
19:39:48 | <tapos> | thuban: For some reason I've decided to torture myself by manually getting every Google Site and Blogspot link from E-Hentai |
19:40:03 | | myself screams in anguish |
19:40:17 | <tapos> | Done with Google Sites, I'll do Blogspot as my mental state allows |
19:40:38 | <thuban> | o7 |
19:40:54 | <tapos> | I'd assume most freely hosted hentai scanlation sites are on there |
19:42:42 | <tapos> | I also just got a good idea |
19:43:05 | <tapos> | Kemono has a ton of NSFW Google Sites and Blogspot links |
19:43:07 | <tapos> | https://kemono.su/posts?q=sites.google.com |
19:43:15 | <tapos> | https://kemono.su/posts?q=blogspot.com |
19:43:41 | <tapos> | It's basically a Patreon/etc. archiver |
19:47:02 | <tapos> | I wonder if one of these softwares saves the text that has the links: https://github.com/search?q=Kemono&type=repositories&s=stars&o=desc |
19:47:36 | <tapos> | Still, a bunch of separate txt files is a pain in the ass to deal with |
19:48:07 | <tapos> | So I guess a custom scrape would be best |
19:48:14 | <tapos> | The site uses DDoS-Guard though |
19:52:11 | <tapos> | This could maybe be rewritten for Google Sites and Blogspot: https://github.com/SatyamSSJ10/Kemono-youtube-fetch |
20:00:28 | <tapos> | thuban: https://transfer.archivete.am/15bVDx/E-Hentai%20Google%20Sites.txt |
20:00:29 | <eggdrop> | inline (for browser viewing): https://transfer.archivete.am/inline/15bVDx/E-Hentai%20Google%20Sites.txt |
20:00:56 | <tapos> | I skipped the sites that were behind a Google login |
20:01:08 | <tapos> | Which was most of them |
20:02:03 | <tapos> | And for one group I included their other links in there while I was at it |
20:02:19 | <tapos> | Seems to mostly be artists using Google Sites |
20:02:28 | <nyany> | i'm really curious as to why I was highlighted for that |
20:02:34 | <thuban> | i have no idea how we handle google sites, actually--we had a project but i think it was just for the 'classic' sites. no idea whether it would work on current sites |
20:03:05 | <tapos> | So vanilla ArchiveBot wouldn't cut it? |
20:03:42 | <pokechu22> | Archivebot does work with google sites to my understanding |
20:03:48 | <tapos> | Nice |
20:04:13 | <pokechu22> | but you do have to start one archivebot job per site, which makes it not super useful for large quantities of sites that need to be saved quickly |
20:04:13 | <tapos> | thuban do you think you can scrape Kemono for Google Sites and Blogspot links? |
20:04:26 | <thuban> | ^ right, just not sure whether something else would be more apt |
20:04:40 | <tapos> | Ok |
20:04:50 | <thuban> | sorry, i'm rather busy at present |
20:04:58 | <tapos> | Well, it's just 16 Google Sites from E-Hentai |
20:05:19 | <tapos> | So it could probably be fed site by site |
20:05:26 | <tapos> | Ok, no worries |
20:05:39 | <nyany> | Hey, it's inporntant. We'll figure it out :D |
20:06:35 | <tapos> | Yeah |
20:06:48 | <tapos> | I'm not doing Kemono manually though lol |
20:07:25 | <tapos> | 823 posts (17 pages) of Google Sites links |
20:07:52 | <tapos> | 9612 posts (193 pages) of Blogspot links |
20:08:56 | <thuban> | blogspot we can just dump in #frogger, so that's fine |
20:10:54 | <thuban> | google sites we could _maybe_ do through ab with queueh2ibot, but it would make sense to find out whether #nearlylostmygoogles does/can apply first |
20:11:48 | <@JAA> | 16 sites is few enough to just do it manually. |
20:12:09 | <thuban> | yeah, but 823... |
20:13:08 | <@JAA> | Oh, two different sources, right. |
20:28:20 | | JaffaCakes118 quits [Remote host closed the connection] |
20:42:06 | | JaffaCakes118 (JaffaCakes118) joins |
21:07:37 | <tapos> | thuban it's 823 posts, not 823 sites |
21:08:10 | <tapos> | Most likely it's like 30 sites with a few of them being linked to in hundreds of posts each |
21:08:18 | | pedantic-darwin quits [Client Quit] |
21:08:48 | <tapos> | Since some artists put their site link in every post |
21:10:59 | <tapos> | There's no way of seeing which artist made which post without opening the post though |
21:11:16 | <tapos> | Otherwise I could just speedrun through the search pages manually |
21:11:44 | <tapos> | Now if I want to do it manually I'd have to open every single post |
21:12:24 | <tapos> | Even if I could speedrun it the Blogspot ones are too much |
21:17:50 | <thuban> | oic, thought you were using that scraper you linked |
21:18:15 | | pedantic-darwin joins |
21:25:54 | | BlueMaxima joins |
22:46:36 | | etnguyen03 (etnguyen03) joins |
22:49:36 | | Hackerpcs quits [Client Quit] |
22:51:25 | | Hackerpcs (Hackerpcs) joins |
23:17:29 | <fireonlive> | -+rss- Show HN: A self-published art book about Google's first 25 years: This took me 3 years to finish. (It is 100% self-published, not endorsed by Google.) So… I wrote a book. It’s a different book with a unique approach. It’s not a novel or a technical book. It’s a biography, a company’s biography. My hope is that it serves two |
23:17:29 | <fireonlive> | purposes: to inspire founders and to captivate interior designers. It all [...] https://news.ycombinator.com/item?id=40067484 |
23:17:39 | <fireonlive> | i hope this gets preserved somehow.. |
23:31:31 | | fuzzy8021 quits [Read error: Connection reset by peer] |
23:31:39 | | decky joins |
23:32:09 | | myself quits [Quit: Ping timeout (120 seconds)] |
23:32:11 | | eroc1990 quits [Client Quit] |
23:32:28 | | myself (myself) joins |
23:32:35 | | Bleo182600 quits [Client Quit] |
23:32:36 | | eroc1990 (eroc1990) joins |
23:32:53 | | Bleo182600 joins |
23:34:52 | | fuzzy8021 (fuzzy8021) joins |
23:35:01 | | decky_e quits [Ping timeout: 255 seconds] |