00:30:36 | | etnguyen03 quits [Client Quit] |
01:15:54 | | etnguyen03 (etnguyen03) joins |
01:16:19 | | devsnek quits [] |
01:28:53 | <h2ibot> | OrIdow6 edited Tracker (+85, /* Rate limit, backfeed, queues, etc. */): https://wiki.archiveteam.org/?diff=54029&oldid=53762 |
01:33:05 | <@JAA> | OrIdow6: That depends (strongly) on the project. |
01:43:10 | | Jake quits [Quit: Ping timeout (120 seconds)] |
01:43:25 | | Jake (Jake) joins |
02:15:06 | | chains joins |
02:20:16 | | Radzig quits [Remote host closed the connection] |
02:26:56 | | Radzig joins |
02:30:04 | <eggdrop> | [remind] OrIdow6: dw |
02:35:03 | <h2ibot> | TheTechRobo edited Tracker (+78, More detailed (and quite possibly wrong)…): https://wiki.archiveteam.org/?diff=54030&oldid=54029 |
02:35:05 | <TheTechRobo> | JAA: ^ is that more accurate? |
02:36:36 | <@JAA> | TheTechRobo: Yeah, that's accurate. |
02:41:55 | <TheTechRobo> | JAA: Does offloading also apply to done, or is that separate? |
02:42:35 | <@JAA> | It does. |
02:43:32 | <TheTechRobo> | Hm, then I have a question. Why do all the queues have accurate counts on the tracker, but on e.g. telegram, done is 0 while done_counter is the correct value? |
02:46:28 | <@JAA> | I forgot how that works exactly, but we discard done items there because it'd be far too large. Same on #//. done_counter keeps track of just the number of completed items. |
02:49:03 | <TheTechRobo> | Ah, so on some projects, they aren't kept at all. Makes sense |
02:49:18 | <TheTechRobo> | I guess they don't really need to be, since the pipeline stores the item name in the WARC. |
02:49:41 | <@JAA> | Yes, and we do keep a record of them somewhere else if we really need to reconstruct it. |
02:55:06 | <h2ibot> | TheTechRobo edited Tracker (+511, Add offloading information about claims and done): https://wiki.archiveteam.org/?diff=54031&oldid=54030 |
03:28:17 | <@OrIdow6> | Thank you TheTechRobo, I intended that initial edit purely as something that would have gone into my personal notes but which was of public benefit, but it accidentally ended up as "the best way to get an answer to a question on the Internet is to give the wrong answer" |
03:30:13 | <eggdrop> | [remind] OrIdow6: dw2 |
03:30:31 | <@OrIdow6> | !remindme 5h dw2 |
03:30:31 | <eggdrop> | [remind] ok, i'll remind you at 2024-12-15T08:30:31Z |
03:51:58 | | pabs quits [Quit: Don't rest until all the world is paved in moss and greenery.] |
03:59:52 | | pabs (pabs) joins |
04:10:40 | | etnguyen03 quits [Client Quit] |
04:16:43 | | etnguyen03 (etnguyen03) joins |
04:39:40 | | etnguyen03 quits [Remote host closed the connection] |
05:01:10 | | sec^nd quits [Ping timeout: 276 seconds] |
05:04:31 | | sec^nd (second) joins |
05:13:44 | | Webuser111291 joins |
05:14:58 | | second (second) joins |
05:15:16 | | Webuser111291 quits [Client Quit] |
05:17:25 | | sec^nd quits [Ping timeout: 276 seconds] |
05:17:26 | | second is now known as sec^nd |
05:20:45 | | Webuser341703 joins |
05:21:59 | | Webuser341703 quits [Client Quit] |
06:08:28 | | second (second) joins |
06:12:01 | | sec^nd quits [Ping timeout: 276 seconds] |
06:12:02 | | second is now known as sec^nd |
06:14:32 | | lennier2 quits [Ping timeout: 260 seconds] |
06:14:43 | <h2ibot> | Tech234a edited Twitch.tv (+270, /* Broadcast retention changes */ Add current…): https://wiki.archiveteam.org/?diff=54032&oldid=53983 |
06:14:50 | | lennier2 joins |
06:16:44 | <h2ibot> | Tech234a edited Twitch.tv (+20, /* Broadcast retention changes */ Fix old…): https://wiki.archiveteam.org/?diff=54033&oldid=54032 |
06:34:38 | | Webuser680517 joins |
06:34:55 | | Webuser680517 quits [Client Quit] |
06:46:39 | | chains quits [Read error: Connection reset by peer] |
06:46:57 | | chains joins |
06:47:55 | | Webuser268013 joins |
06:48:04 | | Webuser268013 quits [Client Quit] |
06:59:57 | | chains_ joins |
07:04:07 | | chains quits [Ping timeout: 260 seconds] |
07:09:22 | | Unholy23619246453771312 quits [Ping timeout: 260 seconds] |
07:29:12 | | chains_ quits [Ping timeout: 260 seconds] |
07:41:06 | | VannevarK joins |
07:44:16 | <pabs> | do we have anything that supports archiving TLS 1.0? |
07:44:55 | <pabs> | I mean sites that use that |
07:48:01 | <pabs> | ArchiveBot just gets operation not permitted AFAICT |
07:49:08 | <pabs> | SPN gives "Save Page Now could not capture this URL because it was unreachable." |
07:49:26 | <pabs> | example URL: https://ssl.linuxtag.org/survey/ |
07:54:06 | <VannevarK> | Hello, I had a couple questions about the GameFront archive, https://wiki.archiveteam.org/index.php/GameFront |
07:54:06 | <VannevarK> | I'm trying to track down some old game mods (this is Halo 2 for Xbox), I have about 150 filefront links from a forum mirror, but I'm not sure if this data (which is all from ~2004-2008) would have still been on GameFront at the time it was scraped. (also, none of it seems to be on moddb.com, possibly because they removed content for modded consoles?) Another thing I'm trying to figure out, is how to convert old Filefront URLs to GameFront URLs, if that's even possible to do. The URLs I have are in the format of "http://files.filefront.com/Purgatoryrar/;4121654;;/fileinfo.html". I'm still digging into this, but any help would be appreciated. (approaches that require programming ok) |
07:55:44 | <VannevarK> | (i suppose i could walk the CDX files and look for the target filenames if thats reliable) |
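A minimal sketch of pulling search keys out of the old Filefront links before walking the CDX files. The URL shape is assumed from the single example quoted above, and the names `FilefrontLink` and `parse_filefront_url` are hypothetical. Whether the slug or numeric ID survives in GameFront URLs is not established here; this only extracts them so they can be tried as search terms.

```python
import re
from typing import NamedTuple, Optional


class FilefrontLink(NamedTuple):
    slug: str      # e.g. "Purgatoryrar" (appears to be the filename without punctuation)
    file_id: str   # e.g. "4121654"


# Assumed shape, based only on the one example in the log:
#   http://files.filefront.com/<slug>/;<numeric id>;;/fileinfo.html
_FILEFRONT_RE = re.compile(
    r"^https?://files\.filefront\.com/([^/]+)/;(\d+);[^/]*/fileinfo\.html$",
    re.IGNORECASE,
)


def parse_filefront_url(url: str) -> Optional[FilefrontLink]:
    """Return the slug and numeric ID, or None if the URL has another shape."""
    match = _FILEFRONT_RE.match(url.strip())
    if match is None:
        return None
    return FilefrontLink(slug=match.group(1), file_id=match.group(2))
```

URLs that do not match return `None`, so a list of roughly 150 links can be filtered and the leftovers inspected by hand.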
08:05:47 | | lennier2 quits [Ping timeout: 265 seconds] |
08:06:57 | | lennier2_ joins |
08:30:31 | <eggdrop> | [remind] OrIdow6: dw2 |
09:15:50 | | Ketchup901 (Ketchup901) joins |
09:42:44 | | MrMcNuggets (MrMcNuggets) joins |
10:00:08 | | ducky quits [Ping timeout: 260 seconds] |
10:00:20 | <h2ibot> | PaulWise edited Mailing Lists (+301, more stuff): https://wiki.archiveteam.org/?diff=54034&oldid=54004 |
10:07:30 | | ducky (ducky) joins |
10:13:34 | | khaoohs quits [Read error: Connection reset by peer] |
10:13:51 | | khaoohs joins |
10:14:17 | | khaoohs quits [Read error: Connection reset by peer] |
10:14:49 | | khaoohs joins |
10:24:04 | <szczot3k> | h2ibot doesn't take User: namespace into account? |
10:28:14 | <szczot3k> | https://wiki.archiveteam.org/index.php/Template:IRC-Hackint - this template may now be deleted, as it's unused, and shouldn't be used. |
10:29:08 | <szczot3k> | https://wiki.archiveteam.org/index.php?title=Template:Y - this template probably could be deleted, as it's unused, and looks like a test template |
10:29:40 | <szczot3k> | https://wiki.archiveteam.org/index.php?title=Template:Pink&redirect=no - ...pink? |
10:34:13 | <szczot3k> | https://wiki.archiveteam.org/index.php/Template:W2 - this actually seems used _somewhere_, but I can't find it anywhere in the wiki export. |
10:35:08 | <szczot3k> | https://wiki.archiveteam.org/index.php/Special:WhatLinksHere/Template:W2 |
10:35:09 | <szczot3k> | Huh |
10:36:00 | <szczot3k> | JAA, I see your revisions on W2. Do you think I should move it to [[Template:Wikipedia]]? |
10:39:16 | <szczot3k> | Same with W2+ |
10:40:09 | <szczot3k> | Oh, this change broke some pages to be sure. |
10:43:27 | <h2ibot> | Szczot3k edited Instagram (+34, Change W2/W2+ templates to Wikipedia. Still…): https://wiki.archiveteam.org/?diff=54037&oldid=53357 |
10:45:27 | <h2ibot> | Szczot3k edited MediaWiki talk:Spam-blacklist (+7, W2 cleanup): https://wiki.archiveteam.org/?diff=54038&oldid=38454 |
10:47:11 | <@OrIdow6> | VannevarK: Yeah that's the best way |
10:47:21 | <@OrIdow6> | You can usually generate all the CDX URLs using the "internetarchive" CLI tool and some sed/grep/whatever |
10:47:51 | <@OrIdow6> | ia search "collection:archiveteam_egloos" | jq -r .identifier | awk '{printf("https://archive.org/download/%s/%s.cdx.gz\n", $1, $1);}' | wget --input-file - --waitretry=30 |
10:50:18 | <szczot3k> | W2 cleanup here isn't possible with the current template, as it doesn't take second argument. https://wiki.archiveteam.org/index.php/Talk:Main_Page. Also I don't think that the Wikipedia template is actually a good thing in the current form. I think it should substitute {{Wikipedia:WP:IP|guest users}} to [[wikipedia:{{1}}|{{2}}]] <sup>Wikipedia</sup> |
10:50:38 | <szczot3k> | because currently it breaks the flow of the text |
10:53:11 | | BennyOtt quits [Quit: ZNC 1.9.1 - https://znc.in] |
10:54:22 | | BennyOtt (BennyOtt) joins |
11:00:04 | <szczot3k> | Proposed changes, that won't break the flow of the text. https://wiki.archiveteam.org/index.php/User:Szczot3k/Sandbox/Wikipedia - Template at https://wiki.archiveteam.org/index.php/User:Szczot3k/Sandbox/Template:Wikipedia |
11:00:56 | <szczot3k> | If there will be consensus to change the template, I'll work on the template's page so it looks better |
11:21:00 | | Webuser299363 joins |
11:21:48 | | Webuser299363 quits [Client Quit] |
11:29:24 | <@OrIdow6> | !remindme 12h dw2 |
11:29:24 | <eggdrop> | [remind] ok, i'll remind you at 2024-12-15T23:29:24Z |
11:32:12 | <szczot3k> | !remindme 8h wiki template |
11:32:13 | <eggdrop> | [remind] ok, i'll remind you at 2024-12-15T19:32:12Z |
12:00:07 | | Bleo182600722719623 quits [Quit: The Lounge - https://thelounge.chat] |
12:02:48 | | Bleo182600722719623 joins |
12:05:53 | | Webuser533934 joins |
12:06:07 | | Webuser533934 quits [Client Quit] |
12:10:22 | | katocala quits [Ping timeout: 260 seconds] |
12:10:34 | | katocala joins |
12:23:47 | | katocala quits [Ping timeout: 260 seconds] |
12:24:35 | | katocala joins |
12:26:13 | | qwertyasdfuiopghjkl2 (qwertyasdfuiopghjkl2) joins |
12:35:10 | <VannevarK> | OrIdow6 Thanks, I was just looking for a good way to parse the CDX files |
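One possible way to scan the downloaded `.cdx.gz` files for target filenames, as a sketch only. It assumes the CDX files are gzip-compressed text with one capture per line and does a case-insensitive substring match on the whole line instead of relying on a particular column layout. The function name `find_in_cdx` is hypothetical.

```python
import gzip
from typing import Iterable, Iterator


def find_in_cdx(cdx_path: str, needles: Iterable[str]) -> Iterator[str]:
    """Yield every CDX line that contains any needle, ignoring case."""
    wanted = [needle.lower() for needle in needles]
    # errors="replace" keeps the scan going if a line has odd bytes in its URL.
    with gzip.open(cdx_path, "rt", encoding="utf-8", errors="replace") as handle:
        for line in handle:
            lowered = line.lower()
            if any(needle in lowered for needle in wanted):
                yield line.rstrip("\n")
```

Matching on the whole line is deliberately loose: it can return false positives, but it avoids missing hits if a file uses a different field order. Each matching line names the WARC file and offset, which can then be checked by hand.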
12:39:25 | | useretail joins |
12:43:04 | | SkilledAlpaca418962 quits [Quit: SkilledAlpaca418962] |
12:45:46 | | SkilledAlpaca418962 joins |
13:19:46 | <szczot3k> | Do we still have some EFNet channels? |
13:23:20 | <k> | no |
15:47:27 | | nulldata8 (nulldata) joins |
15:49:47 | | nulldata quits [Ping timeout: 265 seconds] |
15:49:47 | | nulldata8 is now known as nulldata |
16:01:55 | | loug8318142 quits [Quit: The Lounge - https://thelounge.chat] |
16:02:13 | | loug8318142 joins |
16:18:05 | | DogsRNice joins |
16:31:11 | | pedantic-darwin quits [Quit: The Lounge - https://thelounge.chat] |
16:31:24 | | pedantic-darwin joins |
16:39:52 | | Commander001 quits [Ping timeout: 260 seconds] |
16:40:05 | | Commander001 joins |
16:40:39 | | PredatorIWD2 quits [Read error: Connection reset by peer] |
16:43:01 | | PredatorIWD2 joins |
16:44:57 | | Commander001 quits [Read error: Connection reset by peer] |
16:45:08 | | Commander001 joins |
16:48:23 | | etnguyen03 (etnguyen03) joins |
17:32:40 | | etnguyen03 quits [Client Quit] |
17:39:55 | | DopefishJustin joins |
17:39:55 | | DopefishJustin is now authenticated as DopefishJustin |
17:50:11 | | etnguyen03 (etnguyen03) joins |
17:57:28 | | Webuser021770 joins |
17:59:25 | | Webuser021770 quits [Client Quit] |
18:10:31 | | Notrealname1234 (Notrealname1234) joins |
18:11:13 | | Notrealname1234 quits [Client Quit] |
18:34:47 | | Commander001 quits [Ping timeout: 260 seconds] |
18:35:24 | | Commander001 joins |
19:19:50 | | etnguyen03 quits [Client Quit] |
19:31:49 | | etnguyen03 (etnguyen03) joins |
19:32:12 | <eggdrop> | [remind] szczot3k: wiki template |
19:33:20 | <szczot3k> | !remindme 22h wikitemplate |
19:33:21 | <eggdrop> | [remind] ok, i'll remind you at 2024-12-16T17:33:20Z |
19:34:26 | <szczot3k> | Open to feedback |
20:03:46 | | Ryz quits [Excess Flood] |
20:04:38 | | Ryz (Ryz) joins |
20:10:47 | | second (second) joins |
20:13:04 | <h2ibot> | Usernam edited List of websites excluded from the Wayback Machine (+35, …): https://wiki.archiveteam.org/?diff=54045&oldid=54025 |
20:14:25 | | sec^nd quits [Ping timeout: 276 seconds] |
20:14:26 | | second is now known as sec^nd |
20:25:07 | <h2ibot> | Himond000 edited Twitch.tv (-128, /* twitch-logger-docker */ fix): https://wiki.archiveteam.org/?diff=54046&oldid=54033 |
20:33:08 | <h2ibot> | Himond000 edited Deathwatch (+223, /* 2025 */ add kidona.rakuten.co.jp): https://wiki.archiveteam.org/?diff=54047&oldid=54022 |
20:43:10 | <h2ibot> | Himond000 edited Deathwatch (+232, /* 2025 */ add LINE Doctor): https://wiki.archiveteam.org/?diff=54048&oldid=54047 |
21:00:13 | <h2ibot> | JAABot edited List of websites excluded from the Wayback Machine (+0): https://wiki.archiveteam.org/?diff=54049&oldid=54045 |
21:01:35 | <szczot3k> | JAA ^huh? |
21:02:24 | <szczot3k> | https://wiki.archiveteam.org/?diff=54049&oldid=54045 seems like a buggy behaviour, it switched the place, AND it incremented the number of things |
21:04:03 | <TheTechRobo> | It incremented the number of things because Usernam added one, and it moved the URL so the list is alphabetical. |
21:04:25 | <TheTechRobo> | see Usernam's edit: https://wiki.archiveteam.org/?diff=54045&oldid=54025 |
21:04:33 | <szczot3k> | Yeah, when you put it like that it makes sense |
21:04:36 | <szczot3k> | Brain farted |
21:09:14 | <h2ibot> | Himond000 edited Deathwatch (+273, /* 2025 */ add rakuten ip-phone-smart): https://wiki.archiveteam.org/?diff=54050&oldid=54048 |
21:21:33 | | chains_ joins |
21:21:33 | | chains_ quits [Remote host closed the connection] |
21:29:28 | | BlueMaxima joins |
21:50:18 | | MrMcNuggets quits [Quit: WeeChat 4.3.2] |
21:59:14 | <myself> | has this been grabbed? https://forums.othernet.is/t/shutting-down-forum/7980 |
22:05:30 | <k> | myself: does not look like it |
22:06:54 | <k> | myself: i started a job for it in archivebot, thanks for letting us know |
22:07:28 | <myself> | thank you |
22:09:17 | <k> | :) <3 |
22:09:26 | | etnguyen03 quits [Client Quit] |
22:09:41 | <k> | !clockscan 1mo |
22:09:42 | <eggdrop> | [clockscan] parsed "1 months 22:09:42Z" as 1736978982 → 2025-01-15T22:09:42Z |
22:09:53 | <k> | !remindme 1mo grab https://forums.othernet.is/t/shutting-down-forum/7980 again? cc myself |
22:09:55 | <eggdrop> | [remind] ok, i'll remind you at 2025-01-15T22:09:53Z |
22:33:48 | <@JAA> | pabs: Hmm, I thought we had enabled down to TLS 1.0 on AB. (SSLv3 would require a custom OpenSSL build.) |
22:39:56 | <@JAA> | szczot3k: I'm unsure about the space before the <sup>. Personally, I'd put no space at all there, but if others disagree, I think it should at least be a non-breaking space so the <sup>[Wikipedia]</sup> doesn't end up on a new line. |
22:40:11 | <@JAA> | Also, ATrescue's a blast from the past. lol |
22:42:18 | | etnguyen03 (etnguyen03) joins |
23:18:55 | <@OrIdow6> | szczot3k: I am in favor of destroying the wiki |
23:23:23 | <pabs> | JAA: the recent linuxtag.org jobs got "Operation not permitted" on TLS 1.0 subdomains like ssl.linuxtag.org |
23:26:54 | <k> | OrIdow6: wat |
23:28:52 | <@OrIdow6> | k: Way too much "someone decided this in 10 minutes in 2012 and now we're stuck with it" on the wiki |
23:29:25 | <eggdrop> | [remind] OrIdow6: dw2 |
23:31:55 | <@OrIdow6> | eggdrop: Waiting to pester me until you see I'm online huh |
23:32:06 | <@OrIdow6> | !remindme 4h dw2 |
23:32:06 | <eggdrop> | [remind] ok, i'll remind you at 2024-12-16T03:32:06Z |
23:36:41 | | DogsRNice quits [Ping timeout: 265 seconds] |
23:43:02 | <@JAA> | pabs: Thanks, I'll investigate later. In a brief test, the OpenSSL config we use should be enough. |