| 00:15:40 | | Arcorann_ joins |
| 00:25:15 | | anarcat (anarcat) joins |
| 00:25:19 | <anarcat> | hey |
| 00:25:29 | <anarcat> | is someone on top of github.com/freenode, which is also hijacked |
| 00:25:31 | <anarcat> | ? |
| 00:51:19 | | Frogging101 joins |
| 01:03:14 | | dm4v_ joins |
| 01:04:19 | | dm4v quits [Ping timeout: 258 seconds] |
| 01:04:19 | | dm4v_ is now known as dm4v |
| 01:04:19 | | dm4v is now authenticated as dm4v |
| 01:04:19 | | dm4v quits [Changing host] |
| 01:04:19 | | dm4v (dm4v) joins |
| 01:13:35 | | onetruth joins |
| 01:37:02 | | luke2m joins |
| 01:58:28 | | Iki joins |
| 02:00:40 | | Iki1 quits [Ping timeout: 258 seconds] |
| 02:02:17 | | driib1 (driib) joins |
| 02:04:22 | | driib quits [Ping timeout: 250 seconds] |
| 02:04:22 | | driib1 is now known as driib |
| 02:08:01 | | HP_Archivist quits [Read error: Connection reset by peer] |
| 02:08:08 | | benjins is now authenticated as benjins |
| 02:08:25 | | HP_Archivist (HP_Archivist) joins |
| 02:14:47 | | luke2m quits [Client Quit] |
| 02:29:25 | | Larsenv quits [Quit: ZNC 1.8.2+deb1+focal2 - https://znc.in] |
| 02:30:34 | | Larsenv (Larsenv) joins |
| 02:44:49 | <@JAA> | anarcat: I dumped all the repos onto IA shortly after the takeover. https://archive.org/details/github.com_freenode_bundles_20210521 Doesn't include issues etc., but there was an AB job for it, too, which should grab most of it. #gitgud is unfortunately still broken, so can't get a more thorough grab at the moment. |
| 02:45:06 | | ThreeHM quits [Ping timeout: 250 seconds] |
| 02:47:06 | | ThreeHM (ThreeHeadedMonkey) joins |
| 03:17:41 | | qw3rty_ joins |
| 03:21:33 | | qw3rty__ quits [Ping timeout: 258 seconds] |
| 03:33:18 | | Larsenv quits [Remote host closed the connection] |
| 03:36:46 | | Larsenv (Larsenv) joins |
| 03:57:15 | | DogsRNice quits [Read error: Connection reset by peer] |
| 04:20:31 | | HP_Archivist quits [Read error: Connection reset by peer] |
| 04:20:55 | | HP_Archivist (HP_Archivist) joins |
| 04:21:53 | | lukash7 quits [Client Quit] |
| 04:22:24 | | lukash7 joins |
| 04:29:33 | | BlueMaxima_ joins |
| 04:33:37 | | BlueMaxima quits [Ping timeout: 258 seconds] |
| 05:13:39 | | MaxG quits [Ping timeout: 244 seconds] |
| 05:35:14 | | fuzzy8021 quits [Killed (NickServ (GHOST command used by fuzzy802!~fuzzy8021@173-224-26-244.ptcnet.net))] |
| 05:35:20 | | fuzzy8021 (fuzzy8021) joins |
| 05:52:26 | | MaxG joins |
| 06:24:42 | | fuzzy8021 quits [Killed (NickServ (GHOST command used by fuzzy802!~fuzzy8021@173-224-26-244.ptcnet.net))] |
| 06:24:48 | | fuzzy8021 (fuzzy8021) joins |
| 07:06:28 | | BlueMaxima__ joins |
| 07:10:24 | | BlueMaxima_ quits [Ping timeout: 258 seconds] |
| 07:21:53 | | BlueMaxima_ joins |
| 07:25:44 | | BlueMaxima__ quits [Ping timeout: 258 seconds] |
| 07:58:49 | | fuzzy8021 quits [Killed (NickServ (GHOST command used by fuzzy802!~fuzzy8021@173-224-26-244.ptcnet.net))] |
| 07:58:55 | | fuzzy8021 (fuzzy8021) joins |
| 08:20:33 | | HP_Archivist quits [Ping timeout: 258 seconds] |
| 08:25:57 | | BlueMaxima_ quits [Read error: Connection reset by peer] |
| 08:38:58 | | robbi5 (robbi5) joins |
| 10:01:18 | | Tweebie joins |
| 10:01:49 | <Tweebie> | hi ... is this archive.today? |
| 10:09:05 | <Jake> | Nope. |
| 10:14:21 | | Tweebie quits [Ping timeout: 244 seconds] |
| 10:22:26 | | fuzzy8021 quits [Killed (NickServ (GHOST command used by fuzzy802!~fuzzy8021@173-224-26-244.ptcnet.net))] |
| 10:22:32 | | fuzzy8021 (fuzzy8021) joins |
| 10:42:14 | | noteness quits [Remote host closed the connection] |
| 10:42:30 | | noteness (noteness) joins |
| 10:44:34 | | lennier1 quits [Quit: Going offline, see ya! (www.adiirc.com)] |
| 10:45:49 | | lennier1 (lennier1) joins |
| 11:19:51 | | Tweebie joins |
| 11:20:35 | <Tweebie> | good day |
| 11:21:57 | <Tweebie> | how do i search the archived content based on exact keyword? using "xxx" doesnt seem to work. |
| 11:25:57 | <@EggplantN> | What exactly are you looking for. We are ArchiveTeam.org |
| 11:26:16 | | LeGoupil joins |
| 11:32:22 | | Tweebie quits [Ping timeout: 244 seconds] |
| 11:38:16 | | Tweebie joins |
| 11:39:21 | <Tweebie> | oo sorry. you guys arent affiliated with archive.is / archive.today ? |
| 11:42:49 | <Jake> | nope! |
| 11:43:36 | <Jake> | (but from my knowledge, archive.is doesn't have any full text search. I believe they answer questions here: https://blog.archive.today/ask ) |
| 11:43:40 | | LeGoupil quits [Client Quit] |
| 11:45:48 | | Tweebie quits [Ping timeout: 244 seconds] |
| 11:49:22 | | Tweebie joins |
| 11:50:30 | <Tweebie> | @Jake thanks, appreciate it |
| 11:50:40 | | Tweebie quits [Remote host closed the connection] |
| 12:22:48 | | LeGoupil joins |
| 13:01:37 | <anarcat> | JAA: gotcha |
| 13:01:46 | <anarcat> | JAA: git repos were what i was mostly worried about |
| 13:02:05 | <anarcat> | in general, i'm not too worried about software on github, their stuff is fairly reliable, and it's well replicated |
| 13:02:11 | <anarcat> | but it was worth a ping |
| 13:54:49 | | Iki quits [Ping timeout: 258 seconds] |
| 14:22:12 | <@JAA> | arkiver: 'No.' :-/ |
| 14:31:28 | | hexa- quits [Quit: WeeChat 3.1] |
| 14:32:30 | | hexa- (hexa-) joins |
| 15:40:14 | | Arcorann_ quits [Ping timeout: 258 seconds] |
| 15:51:36 | | dm4v quits [Read error: Connection reset by peer] |
| 16:30:43 | | hexa- quits [Quit: WeeChat 3.1] |
| 16:30:55 | | hexa- (hexa-) joins |
| 16:31:28 | | hexa- quits [Quit: WeeChat 3.1] |
| 16:31:45 | | hexa- (hexa-) joins |
| 17:16:08 | | ave quits [Read error: Connection reset by peer] |
| 17:16:08 | | lun4 quits [Read error: Connection reset by peer] |
| 17:17:26 | | ave (ave) joins |
| 17:17:31 | | lun4 (lun4) joins |
| 17:32:55 | | Ryz quits [Remote host closed the connection] |
| 17:33:28 | | Ryz (Ryz) joins |
| 17:54:27 | | spirit joins |
| 17:55:53 | | DogsRNice (Webuser299) joins |
| 18:21:14 | | Daloader joins |
| 18:35:42 | | HP_Archivist (HP_Archivist) joins |
| 18:37:22 | | spirit quits [Client Quit] |
| 18:55:40 | <driib> | Hi, are there plans to archive newsru? See https://www.newsru.com/russia/31may2021/newsrucomoutoforder2.html and https://en.wikipedia.org/wiki/NEWSru |
| 18:58:09 | <@JAA> | Now there are. :-) |
| 18:58:11 | <@JAA> | Thanks! |
| 18:59:57 | <driib> | Thank you! |
| 19:00:20 | <@arkiver> | JAA: yeah pretty clear answer this time |
| 19:00:32 | <@JAA> | driib: Do you know whether https://www.inopressa.ru/ and http://www.meddaily.ru/ are also affected? |
| 19:00:53 | <@arkiver> | nice site |
| 19:01:02 | <@JAA> | arkiver: Better than silence I suppose. |
| 19:01:41 | <driib> | Sorry, not familiar with the two websites in question. |
| 19:02:43 | | noteness quits [Remote host closed the connection] |
| 19:03:30 | | noteness (noteness) joins |
| 19:03:43 | <@JAA> | Better safe than sorry, I guess. They're also operated by NEWSru. |
| 19:05:28 | <@JAA> | + http://superstyle.ru/ https://www.newsru.co.il/ |
| 19:08:51 | <@JAA> | http://www.newsru.ua/ already disappeared some years ago it seems. Redirecting to https://ua-news.in.ua/ these days, no idea if related. |
| 19:13:09 | <@JAA> | driib: Actually, the main site has already been running through ArchiveBot since this morning. :-) Started the other ones now. |
| 19:22:31 | | LeGoupil quits [Client Quit] |
| 19:58:41 | | MaxG quits [Remote host closed the connection] |
| 20:44:37 | | Iki joins |
| 20:47:07 | | Daloader quits [Ping timeout: 250 seconds] |
| 21:08:36 | <@JAA> | OpenGrey archival is in progress now. |
| 21:09:30 | <@arkiver> | JAA: any estimates? |
| 21:13:58 | | spirit joins |
| 21:21:33 | | HP_Archivist quits [Read error: Connection reset by peer] |
| 21:21:58 | | HP_Archivist (HP_Archivist) joins |
| 21:24:01 | <@JAA> | arkiver: Just over 1 million entries, but for almost all of them, it's only metadata, so it's tiny. |
| 21:25:25 | <@JAA> | There is already a dump that I mirrored, but it's missing entries as well as attachments (on the very few entries that have them). |
| 21:43:14 | <Frogging101> | arkiver: does archive.org accept WARCs for wayback ingestion via archiveteam people? |
| 21:43:17 | <Frogging101> | if this grab works I might have some |
| 21:43:25 | | Daloader joins |
| 21:45:28 | <tech234a> | ark iver: when you get a chance could you take a look at the last few days of messages in #down-the-tube? |
| 22:04:42 | | Daloader quits [Ping timeout: 250 seconds] |
| 22:13:02 | <@JAA> | So travis-ci.org is shutting down 'by end of May 2021'. They changed the notice on the website on 20 or 21 May... Cool. |
| 22:13:14 | <@JAA> | They've already deleted a lot of the old build logs, apparently. |
| 22:13:22 | | Wayward quits [Ping timeout: 250 seconds] |
| 22:23:13 | | sec^nd quits [Remote host closed the connection] |
| 22:23:52 | | pcr quits [Quit: Gateway shutdown] |
| 22:23:52 | | genofire quits [Quit: Gateway shutdown] |
| 22:24:44 | | sec^nd (second) joins |
| 22:31:03 | | HP_Archivist quits [Read error: Connection reset by peer] |
| 22:31:29 | | HP_Archivist (HP_Archivist) joins |
| 22:31:45 | | benjins quits [Remote host closed the connection] |
| 22:33:58 | | benjins joins |
| 22:34:15 | | benjins quits [Remote host closed the connection] |
| 22:35:48 | | benjins joins |
| 22:36:01 | | djsrv (djsrv) joins |
| 22:36:45 | | benjins quits [Remote host closed the connection] |
| 22:38:13 | | benjins joins |
| 22:42:59 | | HP_Archivist quits [Remote host closed the connection] |
| 22:43:07 | | HP_Archivist (HP_Archivist) joins |
| 22:46:02 | | mgrandi quits [Write error: Connection reset by peer] |
| 22:47:36 | | godane1 joins |
| 22:50:12 | | godane quits [Ping timeout: 250 seconds] |
| 22:51:56 | | mgrandi (mgrandi) joins |
| 23:12:49 | <@arkiver> | Frogging101: please archive through archivebot |
| 23:13:11 | <@arkiver> | tech234a: yes |
| 23:18:33 | <Frogging101> | arkiver: i'm using custom scripts but I can send a URL list to archivebot once I have them all, would mean hitting the site twice though |
| 23:18:50 | <thuban> | arkiver: i believe Frogging101 is archiving misterpoll.com, which is age-gated (via a POST form, not a GET link). |
| 23:19:00 | <Frogging101> | oh yeah and there's an age gate |
| 23:19:14 | <thuban> | afaik ab still can't accept cookies (https://github.com/ArchiveTeam/ArchiveBot/issues/416). |
| 23:19:16 | <Frogging101> | stored in the server-side session |
| 23:19:40 | <Frogging101> | thuban: it's also session specific, there's no single "age-gate passed" cookie |
| 23:19:50 | <thuban> | mhm |
| 23:19:51 | <Frogging101> | not sure when sessions expire |
| 23:20:59 | | LordThanatos quits [Quit: WeeChat 2.3] |
| 23:22:04 | | webdownload joins |
| 23:22:21 | <thuban> | i asked about this the other day and didn't get a response, but is there anyone whitelisted for the wbm who can get stuff like this via grab-site (with wpull-args=--load-cookies=), at least until/unless it gets implemented in archivebot? |
| 23:22:50 | <webdownload> | What is the best way to go about archiving YouTube comments? |
| 23:31:52 | <thuban> | hm: it occurs to me that the other site i was asking wrt does set its cookie in a GET. does archivebot store session cookies, such that hitting the start page first would be sufficient to get the rest working? |
| 23:51:33 | <Frogging101> | I'll just continue my grab and if I get it all without getting kicked off I'll upload it to IA |
| 23:52:01 | <Frogging101> | even if it's not waybackable it's probably still useful to someone |
| 23:53:08 | <@JAA> | POST stuff can't work in the WBM anyway. |
| 23:53:52 | <Frogging101> | POST is only for the the age gate, after you get your session token you can GET all the objects |
| 23:55:24 | <Frogging101> | until the server deletes your session, but I have no indication of how long that takes. More than 3 hours at least, since that's the longest I've gone with one token |
| 23:57:16 | <thuban> | it's four hours |
| 23:59:02 | <Frogging101> | Yeah, that seems to be the client-side TTL. |