00:00:12 | | Webuser413474 joins |
00:00:25 | | Webuser413474 quits [Client Quit] |
00:01:23 | <anarcat> | szczot3k: right, down like that |
00:13:33 | <runxiyu> | Alright, anyways |
00:13:41 | <runxiyu> | I'm considering running a warrior on native AMD64 |
00:13:42 | <runxiyu> | er |
00:13:45 | <runxiyu> | Native aarch64* |
00:13:46 | <runxiyu> | lol |
00:13:55 | <runxiyu> | I won't connect to AT servers or now |
00:13:58 | <runxiyu> | for now* |
00:14:29 | <runxiyu> | > No major censorship. If you believe your country implements major censorship, do not run a warrior. |
00:14:33 | <runxiyu> | Oh, I can't do it anyway |
00:16:40 | <runxiyu> | Anyways, modern ARM64 is also little endian. I don't think endianness is the issue, so maybe that should be checked |
00:16:48 | | simon816 quits [Quit: ZNC 1.9.1 - https://znc.in] |
00:17:19 | <katia> | runxiyu, there's other problems with data integrity in other architectures |
00:17:23 | <katia> | afaik. |
00:17:54 | <runxiyu> | Sounds like time to update the wiki and not just say endianness |
00:17:59 | <katia> | afaik, it has not been tested that it makes correct WARCs |
00:18:09 | <katia> | i think you can sign up for an account. |
00:18:48 | <runxiyu> | https://github.com/ArchiveTeam/warrior-code2 hasn't been touched in 6 years |
00:19:56 | <@JAA> | runxiyu: It is on the wiki: https://wiki.archiveteam.org/index.php/ArchiveTeam_Warrior#Can_I_run_the_Warrior_on_ARM_or_some_other_unusual_architecture? |
00:20:27 | <runxiyu> | JAA: Yes, and I'm saying that that section is strange, because it blames endianness which is the same |
00:20:45 | <katia> | questionable practices in the Wget archive-creating components? well i never |
00:20:47 | <@JAA> | Well, 'among other things'. |
00:20:50 | | simon816 (simon816) joins |
00:21:05 | <@JAA> | Endianness is an obvious potential issue. There might be others we haven't identified yet. |
00:21:28 | <runxiyu> | I can accept that there are other issues. I cannot imagine how endianness would be an issue on ARM64 |
00:21:32 | | Commander001 quits [Ping timeout: 260 seconds] |
00:22:14 | <@JAA> | It wouldn't if the code were solid. It wasn't even doing error checks on file operations until quite recently. |
00:24:54 | | etnguyen03 quits [Client Quit] |
00:26:12 | | loug8318142 quits [Quit: The Lounge - https://thelounge.chat] |
00:26:57 | <@JAA> | So without thorough verification (which is in itself hard enough to do), no non-x86. |
00:27:28 | <@OrIdow6> | ... I may have asked this before but what would verification entail? Might update that FAQ with that info |
00:29:14 | <@JAA> | There's no established procedure. Retrieving various test payloads and checking that the WARC contents are correct, basically. And ideally, the test cases should cover the entire warc.c code. So including dedupe, writing/loading CDX, etc. |
00:30:25 | <nicolas17> | and the hashing bug escaped testing :( |
00:30:36 | <@JAA> | The 'checking that the WARC contents are correct' bit is the hard part. There's no good tooling for this. It's part of why I started writing pywarc from scratch based on the specification. |
00:31:03 | <@OrIdow6> | nicolas17: Yeah that's been in the back of my mind the last few weeks |
00:33:00 | <@OrIdow6> | I hesitate to say it but we may benefit from "modern" code review things - forcing someone else to approve it, even such thigns as AI review, etc |
00:33:19 | <nicolas17> | let's rewrite it in rust /s |
00:33:19 | <@OrIdow6> | But that would've just stopped it (maybe!) from being introduced... not problems added in 2011 |
00:33:48 | <nicolas17> | welp I have 600MB of HTML from rrpicturearchives already |
00:34:04 | <@JAA> | Yeah, I have been reviewing a bunch of arkiver's changes to wget-at. These are tech debt from the old days. |
00:34:46 | <runxiyu> | does AT use wget or wget2? |
00:35:01 | <@JAA> | Neither |
00:35:09 | <nicolas17> | https://github.com/ArchiveTeam/wget-lua |
00:35:15 | <@JAA> | A fork of a fork of wget |
00:35:18 | <k> | wget3 |
00:35:53 | <@JAA> | Although the first fork was really written pretty much for AT, so I guess just one level of forking, despite the technical situation of the repos. |
00:36:10 | <@JAA> | wget2 doesn't have any WARC support. |
00:37:08 | <@OrIdow6> | <nicolas17> let's rewrite it in rust /s |
00:37:46 | <@OrIdow6> | You add the /s but I don't think this would be too bad an idea |
00:38:01 | <nicolas17> | I assume there's a reason we started from wget and not from scratch |
00:38:03 | <@OrIdow6> | Main difficulty is that it'd basically be impossible to add into vanilla wget |
00:42:05 | <nicolas17> | (also I'm not volunteering, I have less than 5 days of Rust experience) |
00:43:08 | <@OrIdow6> | nicolas17: Maybe other know more about this but from what I understand, AT started with vanilla wget producing files that went into tars and the like |
00:43:36 | <@OrIdow6> | So it was just that people were already familiar with the system |
00:43:52 | <@JAA> | That sounds about right, although it was long before my time here. |
00:44:10 | <@JAA> | The WARC implementation in wget is by AT people, IIRC. |
00:44:15 | <@OrIdow6> | Yep https://web.archive.org/web/20150403095211/https://twitter.com/textfiles/status/232550071235907584 |
00:44:17 | <eggdrop> | nitter: https://nitter.poast.org/textfiles/status/232550071235907584 |
00:44:23 | <@OrIdow6> | By https://github.com/gvtulder whoever that is |
00:52:33 | | etnguyen03 (etnguyen03) joins |
01:14:35 | | Island joins |
01:16:22 | | Sidpatchy quits [Ping timeout: 260 seconds] |
01:19:52 | <h2ibot> | PaulWise edited Mailing Lists (+183, mention extra archiving requirements): https://wiki.archiveteam.org/?diff=54055&oldid=54034 |
01:19:53 | <h2ibot> | PaulWise edited Mailing Lists (+0, typo): https://wiki.archiveteam.org/?diff=54056&oldid=54055 |
01:21:53 | <h2ibot> | PaulWise edited Usenet (+48, add links to other usenet archives): https://wiki.archiveteam.org/?diff=54057&oldid=49394 |
01:21:54 | <h2ibot> | PaulWise edited Usenet (-3, fix formatting): https://wiki.archiveteam.org/?diff=54058&oldid=54057 |
01:59:34 | | etnguyen03 quits [Client Quit] |
02:03:48 | | BornOn420 quits [Remote host closed the connection] |
02:04:19 | | BornOn420 (BornOn420) joins |
02:06:58 | <nicolas17> | datechnoman: another fun quirk of rrpicturearchives is that it runs on a Windows server, so filenames are case-insensitive |
02:07:26 | <nicolas17> | http://rrpicturearchives.net/aRcHiVeThUmBs.aspx?id=185053 |
02:07:44 | <nicolas17> | Windows Server-- |
02:07:45 | <eggdrop> | [karma] 'Windows Server' now has -1 karma! |
02:12:37 | <datechnoman> | Offt thats a fun one |
02:14:21 | <nicolas17> | but WBM is also case insensitive so uh >.> |
02:21:15 | | etnguyen03 (etnguyen03) joins |
02:26:57 | | lennier2 quits [Ping timeout: 260 seconds] |
02:29:21 | | lennier2_ joins |
02:51:07 | <h2ibot> | JustAnotherArchivist edited List of websites excluded from the Wayback Machine (+61): https://wiki.archiveteam.org/?diff=54059&oldid=54049 |
03:00:08 | <h2ibot> | JAABot edited List of websites excluded from the Wayback Machine (+0): https://wiki.archiveteam.org/?diff=54060&oldid=54059 |
03:24:33 | | etnguyen03 quits [Client Quit] |
03:33:16 | | etnguyen03 (etnguyen03) joins |
03:38:18 | | etnguyen03 quits [Remote host closed the connection] |
03:44:06 | <@arkiver> | to be clear |
03:44:10 | <@arkiver> | we're not getting rid of the wiki |
03:44:25 | <@arkiver> | the wiki is here to stay |
03:49:08 | <nicolas17> | got all rrpa album IDs, now retrying those that failed (99.9% are actually missing, but there's a few I missed in my first pass for whatever reason, maybe got a network timeout, or in some cases they were "restored" in the last few hours) |
03:49:54 | <@arkiver> | nicolas17: what do you think is the best way to archive them? |
03:50:22 | <nicolas17> | arkiver: the fact that I got this far without getting IP-banned suggests archivebot is feasible |
03:50:30 | <nicolas17> | they used to limit requests per day per IP |
03:50:36 | <@arkiver> | that sounds good yeah |
03:51:30 | <nicolas17> | well, the fact that there's 5 million pictures may suggest otherwise ^^ |
03:55:43 | <@JAA> | I wonder if they could in theory restore some things from the AB job from last year. I'm sure it'd be painful though. |
03:55:48 | <nicolas17> | http://rrpicturearchives.net/archiveList.aspx this is an ASP.NET __VIEWSTATE monstrosity |
03:56:11 | <nicolas17> | JAA: I saw a few albums added in the last few hours like this http://rrpicturearchives.net/archivethumbs.aspx?id=151998 |
03:56:20 | <nicolas17> | "Restored Album" |
03:56:37 | <nicolas17> | wait that had images |
03:56:50 | <@JAA> | I was about to say, looks pretty empty, yeah. |
03:57:15 | <nicolas17> | http://rrpicturearchives.net/archivethumbs.aspx?id=151013 |
03:57:38 | <@JAA> | Mhm |
03:58:06 | <nicolas17> | so I think they're messing around with things right now, manually restoring albums without their metadata or something |
03:59:18 | <@JAA> | Right |
04:09:52 | | Commander001 joins |
04:16:44 | <nicolas17> | http://rrpicturearchives.net/archivethumbs.aspx?id=103576 |
04:16:46 | <nicolas17> | >300 pages |
04:17:01 | <nicolas17> | different people use albums in different ways... |
04:18:16 | | DogsRNice quits [Read error: Connection reset by peer] |
04:18:56 | <nicolas17> | http://rrpicturearchives.net/archivethumbs.aspx?id=147502 biggest |
04:22:10 | <Flashfire42> | Well German government did a vote of no confidence and krygystan has got rid of its prime minister |
04:23:25 | <nicolas17> | Flashfire42: and south korea impeached their president |
04:23:40 | <nicolas17> | and I think to myself, what a wonderful world |
04:23:41 | <@JAA> | The one in Germany is as expected. |
04:24:35 | <Flashfire42> | well off to go find more stuff to archive except archivebot is close to full and youtube isnt running and telegram is full so who wants a drink |
05:00:22 | <@OrIdow6> | arkiver: I was joking with "destroying the wiki" |
05:01:04 | | BlueMaxima quits [Read error: Connection reset by peer] |
05:44:47 | | eroc19906 quits [Quit: The Lounge - https://thelounge.chat] |
05:45:11 | | eroc1990 (eroc1990) joins |
05:48:47 | | chains quits [Ping timeout: 260 seconds] |
06:12:07 | | Snivy quits [Ping timeout: 260 seconds] |
06:13:17 | | G4te_Keep3r3492415 quits [Ping timeout: 260 seconds] |
06:29:21 | <that_lurker> | lfgss shutting down mentioned in HN by the person running it https://news.ycombinator.com/item?id=42433044 |
06:29:41 | <that_lurker> | AB jobs is going smoothly, but its slow. |
06:34:48 | | Snivy (Snivy) joins |
06:48:01 | | Wohlstand quits [Quit: Wohlstand] |
06:48:26 | | Wohlstand (Wohlstand) joins |
06:50:31 | | lennier2 joins |
06:53:32 | | lennier2_ quits [Ping timeout: 260 seconds] |
07:00:11 | <BornOn420> | Flashfire42 I just woke up so I'm skipping on the drinks. In for a coffee? |
07:00:41 | <Flashfire42> | you can pay for my maccas and my bail for when I get arrested for going through these terrorism sites |
07:06:21 | | Snivy quits [Ping timeout: 252 seconds] |
07:09:17 | | Unholy23619246453771312 quits [Ping timeout: 260 seconds] |
07:09:32 | | Snivy (Snivy) joins |
07:45:33 | | PredatorIWD2 quits [Read error: Connection reset by peer] |
07:46:22 | | Wohlstand quits [Remote host closed the connection] |
07:46:36 | | Wohlstand (Wohlstand) joins |
07:48:17 | | Wohlstand quits [Client Quit] |
07:54:12 | | SF quits [Ping timeout: 260 seconds] |
08:12:24 | | SF joins |
08:21:00 | | PredatorIWD2 joins |
08:33:52 | | Snivy quits [Ping timeout: 260 seconds] |
08:44:50 | | k quits [Read error: Connection reset by peer] |
08:51:52 | | Island quits [Read error: Connection reset by peer] |
09:09:27 | | Commander001 quits [Read error: Connection reset by peer] |
09:10:17 | | Commander001 joins |
09:10:44 | | Wohlstand (Wohlstand) joins |
09:13:53 | | parfait quits [Quit: Leaving] |
09:23:57 | | Radzig2 joins |
09:26:57 | | Radzig quits [Ping timeout: 260 seconds] |
09:26:57 | | Radzig2 is now known as Radzig |
09:27:30 | | Snivy (Snivy) joins |
10:05:55 | | loug8318142 joins |
10:34:20 | | pabs quits [Read error: Connection reset by peer] |
10:39:19 | | pabs (pabs) joins |
10:53:22 | | k (katia) joins |
11:58:44 | | Wohlstand quits [Client Quit] |
12:00:07 | | Bleo182600722719623 quits [Quit: The Lounge - https://thelounge.chat] |
12:02:50 | | Bleo182600722719623 joins |
12:05:37 | | rohvani quits [Ping timeout: 260 seconds] |
12:41:36 | | VerifiedJ9 quits [Quit: The Lounge - https://thelounge.chat] |
12:42:12 | | VerifiedJ9 (VerifiedJ) joins |
12:44:47 | | SkilledAlpaca418962 joins |
13:06:05 | | Commander001 quits [Remote host closed the connection] |
14:01:42 | | SF quits [Ping timeout: 260 seconds] |
14:14:53 | | SF joins |
14:20:58 | <h2ibot> | Bzc6p uploaded File:Hudomains logo.png (Logo of official .hu registry, 2024-12-17.): https://wiki.archiveteam.org/?title=File%3AHudomains%20logo.png |
14:36:42 | | SF quits [Ping timeout: 260 seconds] |
14:49:20 | | SF joins |
14:53:27 | | ymgve_ joins |
14:57:07 | | ymgve quits [Ping timeout: 260 seconds] |
15:05:04 | <h2ibot> | Bzc6p created .hu domains seed (+3349, Created page with "{{Infobox project | title =…): https://wiki.archiveteam.org/?title=.hu%20domains%20seed |
15:06:05 | <h2ibot> | Bzc6p edited Template:Hungarian websites (+28, + [[.hu domains seed]]): https://wiki.archiveteam.org/?diff=54063&oldid=53635 |
15:08:24 | | Ketchup901 quits [Remote host closed the connection] |
15:08:36 | | Ketchup901 (Ketchup901) joins |
15:42:11 | <h2ibot> | Bzc6p edited .hu domains seed (+781, /* Progress and statistics */ Improve table): https://wiki.archiveteam.org/?diff=54064&oldid=54062 |
15:43:15 | | MrMcNuggets (MrMcNuggets) joins |
16:23:18 | <h2ibot> | Bzc6p edited .hu domains seed (+0, /* Progress and statistics */ Actually, the…): https://wiki.archiveteam.org/?diff=54065&oldid=54064 |
16:27:18 | <h2ibot> | Bzc6p edited .hu domains seed (+267, /* Methodology */ using a different API to…): https://wiki.archiveteam.org/?diff=54066&oldid=54065 |
16:35:20 | <h2ibot> | Bzc6p edited .hu domains seed (+17, /* Progress and statistics */ realize that data…): https://wiki.archiveteam.org/?diff=54067&oldid=54066 |
17:05:29 | | i_have_n0_idea9 quits [Quit: The Lounge - https://thelounge.chat] |
17:05:55 | | i_have_n0_idea9 (i_have_n0_idea) joins |
17:11:55 | | Island joins |
17:16:51 | | abirkill- (abirkill) joins |
17:18:52 | | abirkill quits [Ping timeout: 260 seconds] |
17:18:52 | | abirkill- is now known as abirkill |
17:22:57 | | BPCZ quits [Ping timeout: 260 seconds] |
18:03:47 | | balrog quits [Ping timeout: 260 seconds] |
18:08:15 | | balrog (balrog) joins |
18:27:37 | <h2ibot> | Cooljeanius edited Twitch.tv (+8, /* Broadcast retention changes */ use URL template): https://wiki.archiveteam.org/?diff=54068&oldid=54046 |
19:08:18 | | MrMcNuggets quits [Quit: WeeChat 4.3.2] |
19:47:36 | | chains joins |
19:52:43 | | ducky quits [Ping timeout: 260 seconds] |
19:53:26 | | ducky (ducky) joins |
20:23:57 | <h2ibot> | Himond000 edited Deathwatch (+203, /* 2025 */ add fanblogs.jp): https://wiki.archiveteam.org/?diff=54069&oldid=54054 |
20:41:18 | | yasomi quits [Quit: ZNC 1.9.1 - https://znc.in] |
20:46:05 | | yasomi (yasomi) joins |
20:49:49 | | qinplus_mobile joins |
20:56:53 | | ducky quits [Ping timeout: 260 seconds] |
20:59:25 | | ducky (ducky) joins |
21:00:13 | <nulldata> | https://www.nbcboston.com/news/local/an-everett-newspaper-is-shutting-down-after-the-mayor-sued-for-defamation/3579003/ |
21:00:30 | <nulldata> | Unfortunately it looks like the site has already been taken down. |
21:00:35 | <nulldata> | https://everettleader.com/ |
21:07:46 | | JaffaCakes118_2 quits [Remote host closed the connection] |
21:09:49 | | JaffaCakes118_2 (JaffaCakes118) joins |
21:09:54 | <pokechu22> | Looks like it wasn't in wikidata either, so no archiveteam_urls coverage: https://web.archive.org/web/20240000000000*/https://everettleader.com |
21:10:06 | | JaffaCakes118_2 quits [Read error: Connection reset by peer] |
21:13:06 | <pokechu22> | https://www.wikidata.org/wiki/Q131462554 (not that that will do much good now) |
21:13:09 | | pixel leaves [Error from remote client] |
21:54:10 | | yasomi quits [Remote host closed the connection] |
22:03:50 | | yasomi (yasomi) joins |
22:12:54 | | etnguyen03 (etnguyen03) joins |
22:13:19 | | BlueMaxima joins |
22:14:49 | | riteo quits [Remote host closed the connection] |
22:46:42 | | pokechu22 quits [Ping timeout: 260 seconds] |
22:47:25 | | DogsRNice joins |
22:48:57 | | pokechu22 (pokechu22) joins |
22:59:31 | | qinplus_mobile quits [Client Quit] |
23:05:21 | <Ryz> | So, regarding trying to be open and accessible; I noticed that potentially, one of the reasons that people who haven't been in IRC for too long is because they're accessing it on a mobile device, which is different than just being on a personal computer like a desktop or laptop |
23:05:51 | | kokos- quits [Quit: ZNC - https://znc.in] |
23:05:51 | | katia_ quits [Quit: ZNC - https://znc.in] |
23:06:18 | <Ryz> | Reason for saying this is I don't really know if the connection breaks if the tab goes inactive after X amount of time, since I personally never accessed IRC through a mobile device like a smartphone or tablet, or at least that's how I remember |
23:08:11 | | kokos- joins |
23:08:31 | <Ryz> | I'm not sure if it is covered on the wiki yet regarding access to IRC channels, but I feel that in general, if wanting input and answers, especially those from the outside that never been here, it would be strongly suggested to stay in IRC |
23:08:55 | <Ryz> | There have been many countless instances where the person just leaves |
23:09:52 | <Ryz> | Before an answer arrives to them accordingly oo; |
23:15:23 | <steering> | Ryz: plenty of people just leave even on a pc :P |
23:16:29 | <steering> | while its possible that browsers might kill the tab, most people probably just get tired of waiting and close the tab... or back in the day, they'd get tired of waiting and close whatever IRC client they downloaded and used for probably the first time ever :P |
23:16:46 | <Ryz> | I feel it's moreso on mobile, because of the tabs having to be deloaded |
23:17:53 | <Ryz> | Like, basically not their own volition to leave like that |
23:18:07 | | PredatorIWD2 quits [Read error: Connection reset by peer] |
23:21:27 | | etnguyen03 quits [Client Quit] |
23:22:35 | | etnguyen03 (etnguyen03) joins |
23:23:23 | | PredatorIWD2 joins |
23:24:39 | | katia_ (katia) joins |
23:41:27 | <nightpool> | Yeah, most people do not have access to a bouncer or persistent connection. They're using mobile, or a laptop. This is a persistent issue with accessibility on IRC, it really only works for people who only desktops and also never turn them off. |
23:41:33 | <nightpool> | Even desktop chrome unloads tabs these days if they're inactive |
23:41:59 | <nightpool> | I use hackint's matrix bridge for this reason, we might recommend on the wiki that people try that |
23:54:17 | | BornOn420 quits [Ping timeout: 276 seconds] |
23:55:05 | | BornOn420 (BornOn420) joins |