00:00:12Webuser413474 joins
00:00:25Webuser413474 quits [Client Quit]
00:01:23<anarcat>szczot3k: right, down like that
00:13:33<runxiyu>Alright, anyways
00:13:41<runxiyu>I'm considering running a warrior on native AMD64
00:13:42<runxiyu>er
00:13:45<runxiyu>Native aarch64*
00:13:46<runxiyu>lol
00:13:55<runxiyu>I won't connect to AT servers or now
00:13:58<runxiyu>for now*
00:14:29<runxiyu>> No major censorship. If you believe your country implements major censorship, do not run a warrior.
00:14:33<runxiyu>Oh, I can't do it anyway
00:16:40<runxiyu>Anyways, modern ARM64 is also little endian. I don't think endianness is the issue, so maybe that should be checked
00:16:48simon816 quits [Quit: ZNC 1.9.1 - https://znc.in]
00:17:19<katia>runxiyu, there's other problems with data integrity in other architectures
00:17:23<katia>afaik.
00:17:54<runxiyu>Sounds like time to update the wiki and not just say endianness
00:17:59<katia>afaik, it has not been tested that it makes correct WARCs
00:18:09<katia>i think you can sign up for an account.
00:18:48<runxiyu>https://github.com/ArchiveTeam/warrior-code2 hasn't been touched in 6 years
00:19:56<@JAA>runxiyu: It is on the wiki: https://wiki.archiveteam.org/index.php/ArchiveTeam_Warrior#Can_I_run_the_Warrior_on_ARM_or_some_other_unusual_architecture?
00:20:27<runxiyu>JAA: Yes, and I'm saying that that section is strange, because it blames endianness which is the same
00:20:45<katia>questionable practices in the Wget archive-creating components? well i never
00:20:47<@JAA>Well, 'among other things'.
00:20:50simon816 (simon816) joins
00:21:05<@JAA>Endianness is an obvious potential issue. There might be others we haven't identified yet.
00:21:28<runxiyu>I can accept that there are other issues. I cannot imagine how endianness would be an issue on ARM64
00:21:32Commander001 quits [Ping timeout: 260 seconds]
00:22:14<@JAA>It wouldn't if the code were solid. It wasn't even doing error checks on file operations until quite recently.
00:24:54etnguyen03 quits [Client Quit]
00:26:12loug8318142 quits [Quit: The Lounge - https://thelounge.chat]
00:26:57<@JAA>So without thorough verification (which is in itself hard enough to do), no non-x86.
00:27:28<@OrIdow6>... I may have asked this before but what would verification entail? Might update that FAQ with that info
00:29:14<@JAA>There's no established procedure. Retrieving various test payloads and checking that the WARC contents are correct, basically. And ideally, the test cases should cover the entire warc.c code. So including dedupe, writing/loading CDX, etc.
00:30:25<nicolas17>and the hashing bug escaped testing :(
00:30:36<@JAA>The 'checking that the WARC contents are correct' bit is the hard part. There's no good tooling for this. It's part of why I started writing pywarc from scratch based on the specification.
00:31:03<@OrIdow6>nicolas17: Yeah that's been in the back of my mind the last few weeks
00:33:00<@OrIdow6>I hesitate to say it but we may benefit from "modern" code review things - forcing someone else to approve it, even such thigns as AI review, etc
00:33:19<nicolas17>let's rewrite it in rust /s
00:33:19<@OrIdow6>But that would've just stopped it (maybe!) from being introduced... not problems added in 2011
00:33:48<nicolas17>welp I have 600MB of HTML from rrpicturearchives already
00:34:04<@JAA>Yeah, I have been reviewing a bunch of arkiver's changes to wget-at. These are tech debt from the old days.
00:34:46<runxiyu>does AT use wget or wget2?
00:35:01<@JAA>Neither
00:35:09<nicolas17>https://github.com/ArchiveTeam/wget-lua
00:35:15<@JAA>A fork of a fork of wget
00:35:18<k>wget3
00:35:53<@JAA>Although the first fork was really written pretty much for AT, so I guess just one level of forking, despite the technical situation of the repos.
00:36:10<@JAA>wget2 doesn't have any WARC support.
00:37:08<@OrIdow6><nicolas17> let's rewrite it in rust /s
00:37:46<@OrIdow6>You add the /s but I don't think this would be too bad an idea
00:38:01<nicolas17>I assume there's a reason we started from wget and not from scratch
00:38:03<@OrIdow6>Main difficulty is that it'd basically be impossible to add into vanilla wget
00:42:05<nicolas17>(also I'm not volunteering, I have less than 5 days of Rust experience)
00:43:08<@OrIdow6>nicolas17: Maybe other know more about this but from what I understand, AT started with vanilla wget producing files that went into tars and the like
00:43:36<@OrIdow6>So it was just that people were already familiar with the system
00:43:52<@JAA>That sounds about right, although it was long before my time here.
00:44:10<@JAA>The WARC implementation in wget is by AT people, IIRC.
00:44:15<@OrIdow6>Yep https://web.archive.org/web/20150403095211/https://twitter.com/textfiles/status/232550071235907584
00:44:17<eggdrop>nitter: https://nitter.poast.org/textfiles/status/232550071235907584
00:44:23<@OrIdow6>By https://github.com/gvtulder whoever that is
00:52:33etnguyen03 (etnguyen03) joins
01:14:35Island joins
01:16:22Sidpatchy quits [Ping timeout: 260 seconds]
01:19:52<h2ibot>PaulWise edited Mailing Lists (+183, mention extra archiving requirements): https://wiki.archiveteam.org/?diff=54055&oldid=54034
01:19:53<h2ibot>PaulWise edited Mailing Lists (+0, typo): https://wiki.archiveteam.org/?diff=54056&oldid=54055
01:21:53<h2ibot>PaulWise edited Usenet (+48, add links to other usenet archives): https://wiki.archiveteam.org/?diff=54057&oldid=49394
01:21:54<h2ibot>PaulWise edited Usenet (-3, fix formatting): https://wiki.archiveteam.org/?diff=54058&oldid=54057
01:59:34etnguyen03 quits [Client Quit]
02:03:48BornOn420 quits [Remote host closed the connection]
02:04:19BornOn420 (BornOn420) joins
02:06:58<nicolas17>datechnoman: another fun quirk of rrpicturearchives is that it runs on a Windows server, so filenames are case-insensitive
02:07:26<nicolas17>http://rrpicturearchives.net/aRcHiVeThUmBs.aspx?id=185053
02:07:44<nicolas17>Windows Server--
02:07:45<eggdrop>[karma] 'Windows Server' now has -1 karma!
02:12:37<datechnoman>Offt thats a fun one
02:14:21<nicolas17>but WBM is also case insensitive so uh >.>
02:21:15etnguyen03 (etnguyen03) joins
02:26:57lennier2 quits [Ping timeout: 260 seconds]
02:29:21lennier2_ joins
02:51:07<h2ibot>JustAnotherArchivist edited List of websites excluded from the Wayback Machine (+61): https://wiki.archiveteam.org/?diff=54059&oldid=54049
03:00:08<h2ibot>JAABot edited List of websites excluded from the Wayback Machine (+0): https://wiki.archiveteam.org/?diff=54060&oldid=54059
03:24:33etnguyen03 quits [Client Quit]
03:33:16etnguyen03 (etnguyen03) joins
03:38:18etnguyen03 quits [Remote host closed the connection]
03:44:06<@arkiver>to be clear
03:44:10<@arkiver>we're not getting rid of the wiki
03:44:25<@arkiver>the wiki is here to stay
03:49:08<nicolas17>got all rrpa album IDs, now retrying those that failed (99.9% are actually missing, but there's a few I missed in my first pass for whatever reason, maybe got a network timeout, or in some cases they were "restored" in the last few hours)
03:49:54<@arkiver>nicolas17: what do you think is the best way to archive them?
03:50:22<nicolas17>arkiver: the fact that I got this far without getting IP-banned suggests archivebot is feasible
03:50:30<nicolas17>they used to limit requests per day per IP
03:50:36<@arkiver>that sounds good yeah
03:51:30<nicolas17>well, the fact that there's 5 million pictures may suggest otherwise ^^
03:55:43<@JAA>I wonder if they could in theory restore some things from the AB job from last year. I'm sure it'd be painful though.
03:55:48<nicolas17>http://rrpicturearchives.net/archiveList.aspx this is an ASP.NET __VIEWSTATE monstrosity
03:56:11<nicolas17>JAA: I saw a few albums added in the last few hours like this http://rrpicturearchives.net/archivethumbs.aspx?id=151998
03:56:20<nicolas17>"Restored Album"
03:56:37<nicolas17>wait that had images
03:56:50<@JAA>I was about to say, looks pretty empty, yeah.
03:57:15<nicolas17>http://rrpicturearchives.net/archivethumbs.aspx?id=151013
03:57:38<@JAA>Mhm
03:58:06<nicolas17>so I think they're messing around with things right now, manually restoring albums without their metadata or something
03:59:18<@JAA>Right
04:09:52Commander001 joins
04:16:44<nicolas17>http://rrpicturearchives.net/archivethumbs.aspx?id=103576
04:16:46<nicolas17>>300 pages
04:17:01<nicolas17>different people use albums in different ways...
04:18:16DogsRNice quits [Read error: Connection reset by peer]
04:18:56<nicolas17>http://rrpicturearchives.net/archivethumbs.aspx?id=147502 biggest
04:22:10<Flashfire42>Well German government did a vote of no confidence and krygystan has got rid of its prime minister
04:23:25<nicolas17>Flashfire42: and south korea impeached their president
04:23:40<nicolas17>and I think to myself, what a wonderful world
04:23:41<@JAA>The one in Germany is as expected.
04:24:35<Flashfire42>well off to go find more stuff to archive except archivebot is close to full and youtube isnt running and telegram is full so who wants a drink
05:00:22<@OrIdow6>arkiver: I was joking with "destroying the wiki"
05:01:04BlueMaxima quits [Read error: Connection reset by peer]
05:44:47eroc19906 quits [Quit: The Lounge - https://thelounge.chat]
05:45:11eroc1990 (eroc1990) joins
05:48:47chains quits [Ping timeout: 260 seconds]
06:12:07Snivy quits [Ping timeout: 260 seconds]
06:13:17G4te_Keep3r3492415 quits [Ping timeout: 260 seconds]
06:29:21<that_lurker>lfgss shutting down mentioned in HN by the person running it https://news.ycombinator.com/item?id=42433044
06:29:41<that_lurker>AB jobs is going smoothly, but its slow.
06:34:48Snivy (Snivy) joins
06:48:01Wohlstand quits [Quit: Wohlstand]
06:48:26Wohlstand (Wohlstand) joins
06:50:31lennier2 joins
06:53:32lennier2_ quits [Ping timeout: 260 seconds]
07:00:11<BornOn420>Flashfire42 I just woke up so I'm skipping on the drinks. In for a coffee?
07:00:41<Flashfire42>you can pay for my maccas and my bail for when I get arrested for going through these terrorism sites
07:06:21Snivy quits [Ping timeout: 252 seconds]
07:09:17Unholy23619246453771312 quits [Ping timeout: 260 seconds]
07:09:32Snivy (Snivy) joins
07:45:33PredatorIWD2 quits [Read error: Connection reset by peer]
07:46:22Wohlstand quits [Remote host closed the connection]
07:46:36Wohlstand (Wohlstand) joins
07:48:17Wohlstand quits [Client Quit]
07:54:12SF quits [Ping timeout: 260 seconds]
08:12:24SF joins
08:21:00PredatorIWD2 joins
08:33:52Snivy quits [Ping timeout: 260 seconds]
08:44:50k quits [Read error: Connection reset by peer]
08:51:52Island quits [Read error: Connection reset by peer]
09:09:27Commander001 quits [Read error: Connection reset by peer]
09:10:17Commander001 joins
09:10:44Wohlstand (Wohlstand) joins
09:13:53parfait quits [Quit: Leaving]
09:23:57Radzig2 joins
09:26:57Radzig quits [Ping timeout: 260 seconds]
09:26:57Radzig2 is now known as Radzig
09:27:30Snivy (Snivy) joins
10:05:55loug8318142 joins
10:34:20pabs quits [Read error: Connection reset by peer]
10:39:19pabs (pabs) joins
10:53:22k (katia) joins
11:58:44Wohlstand quits [Client Quit]
12:00:07Bleo182600722719623 quits [Quit: The Lounge - https://thelounge.chat]
12:02:50Bleo182600722719623 joins
12:05:37rohvani quits [Ping timeout: 260 seconds]
12:41:36VerifiedJ9 quits [Quit: The Lounge - https://thelounge.chat]
12:42:12VerifiedJ9 (VerifiedJ) joins
12:44:47SkilledAlpaca418962 joins
13:06:05Commander001 quits [Remote host closed the connection]
14:01:42SF quits [Ping timeout: 260 seconds]
14:14:53SF joins
14:20:58<h2ibot>Bzc6p uploaded File:Hudomains logo.png (Logo of official .hu registry, 2024-12-17.): https://wiki.archiveteam.org/?title=File%3AHudomains%20logo.png
14:36:42SF quits [Ping timeout: 260 seconds]
14:49:20SF joins
14:53:27ymgve_ joins
14:57:07ymgve quits [Ping timeout: 260 seconds]
15:05:04<h2ibot>Bzc6p created .hu domains seed (+3349, Created page with "{{Infobox project | title =…): https://wiki.archiveteam.org/?title=.hu%20domains%20seed
15:06:05<h2ibot>Bzc6p edited Template:Hungarian websites (+28, + [[.hu domains seed]]): https://wiki.archiveteam.org/?diff=54063&oldid=53635
15:08:24Ketchup901 quits [Remote host closed the connection]
15:08:36Ketchup901 (Ketchup901) joins
15:42:11<h2ibot>Bzc6p edited .hu domains seed (+781, /* Progress and statistics */ Improve table): https://wiki.archiveteam.org/?diff=54064&oldid=54062
15:43:15MrMcNuggets (MrMcNuggets) joins
16:23:18<h2ibot>Bzc6p edited .hu domains seed (+0, /* Progress and statistics */ Actually, the…): https://wiki.archiveteam.org/?diff=54065&oldid=54064
16:27:18<h2ibot>Bzc6p edited .hu domains seed (+267, /* Methodology */ using a different API to…): https://wiki.archiveteam.org/?diff=54066&oldid=54065
16:35:20<h2ibot>Bzc6p edited .hu domains seed (+17, /* Progress and statistics */ realize that data…): https://wiki.archiveteam.org/?diff=54067&oldid=54066
17:05:29i_have_n0_idea9 quits [Quit: The Lounge - https://thelounge.chat]
17:05:55i_have_n0_idea9 (i_have_n0_idea) joins
17:11:55Island joins
17:16:51abirkill- (abirkill) joins
17:18:52abirkill quits [Ping timeout: 260 seconds]
17:18:52abirkill- is now known as abirkill
17:22:57BPCZ quits [Ping timeout: 260 seconds]
18:03:47balrog quits [Ping timeout: 260 seconds]
18:08:15balrog (balrog) joins
18:27:37<h2ibot>Cooljeanius edited Twitch.tv (+8, /* Broadcast retention changes */ use URL template): https://wiki.archiveteam.org/?diff=54068&oldid=54046
19:08:18MrMcNuggets quits [Quit: WeeChat 4.3.2]
19:47:36chains joins
19:52:43ducky quits [Ping timeout: 260 seconds]
19:53:26ducky (ducky) joins
20:23:57<h2ibot>Himond000 edited Deathwatch (+203, /* 2025 */ add fanblogs.jp): https://wiki.archiveteam.org/?diff=54069&oldid=54054
20:41:18yasomi quits [Quit: ZNC 1.9.1 - https://znc.in]
20:46:05yasomi (yasomi) joins
20:49:49qinplus_mobile joins
20:56:53ducky quits [Ping timeout: 260 seconds]
20:59:25ducky (ducky) joins
21:00:13<nulldata>https://www.nbcboston.com/news/local/an-everett-newspaper-is-shutting-down-after-the-mayor-sued-for-defamation/3579003/
21:00:30<nulldata>Unfortunately it looks like the site has already been taken down.
21:00:35<nulldata>https://everettleader.com/
21:07:46JaffaCakes118_2 quits [Remote host closed the connection]
21:09:49JaffaCakes118_2 (JaffaCakes118) joins
21:09:54<pokechu22>Looks like it wasn't in wikidata either, so no archiveteam_urls coverage: https://web.archive.org/web/20240000000000*/https://everettleader.com
21:10:06JaffaCakes118_2 quits [Read error: Connection reset by peer]
21:13:06<pokechu22>https://www.wikidata.org/wiki/Q131462554 (not that that will do much good now)
21:13:09pixel leaves [Error from remote client]
21:54:10yasomi quits [Remote host closed the connection]
22:03:50yasomi (yasomi) joins
22:12:54etnguyen03 (etnguyen03) joins
22:13:19BlueMaxima joins
22:14:49riteo quits [Remote host closed the connection]
22:46:42pokechu22 quits [Ping timeout: 260 seconds]
22:47:25DogsRNice joins
22:48:57pokechu22 (pokechu22) joins
22:59:31qinplus_mobile quits [Client Quit]
23:05:21<Ryz>So, regarding trying to be open and accessible; I noticed that potentially, one of the reasons that people who haven't been in IRC for too long is because they're accessing it on a mobile device, which is different than just being on a personal computer like a desktop or laptop
23:05:51kokos- quits [Quit: ZNC - https://znc.in]
23:05:51katia_ quits [Quit: ZNC - https://znc.in]
23:06:18<Ryz>Reason for saying this is I don't really know if the connection breaks if the tab goes inactive after X amount of time, since I personally never accessed IRC through a mobile device like a smartphone or tablet, or at least that's how I remember
23:08:11kokos- joins
23:08:31<Ryz>I'm not sure if it is covered on the wiki yet regarding access to IRC channels, but I feel that in general, if wanting input and answers, especially those from the outside that never been here, it would be strongly suggested to stay in IRC
23:08:55<Ryz>There have been many countless instances where the person just leaves
23:09:52<Ryz>Before an answer arrives to them accordingly oo;
23:15:23<steering>Ryz: plenty of people just leave even on a pc :P
23:16:29<steering>while its possible that browsers might kill the tab, most people probably just get tired of waiting and close the tab... or back in the day, they'd get tired of waiting and close whatever IRC client they downloaded and used for probably the first time ever :P
23:16:46<Ryz>I feel it's moreso on mobile, because of the tabs having to be deloaded
23:17:53<Ryz>Like, basically not their own volition to leave like that
23:18:07PredatorIWD2 quits [Read error: Connection reset by peer]
23:21:27etnguyen03 quits [Client Quit]
23:22:35etnguyen03 (etnguyen03) joins
23:23:23PredatorIWD2 joins
23:24:39katia_ (katia) joins
23:41:27<nightpool>Yeah, most people do not have access to a bouncer or persistent connection. They're using mobile, or a laptop. This is a persistent issue with accessibility on IRC, it really only works for people who only desktops and also never turn them off.
23:41:33<nightpool>Even desktop chrome unloads tabs these days if they're inactive
23:41:59<nightpool>I use hackint's matrix bridge for this reason, we might recommend on the wiki that people try that
23:54:17BornOn420 quits [Ping timeout: 276 seconds]
23:55:05BornOn420 (BornOn420) joins