00:04:36fuzzy80211 quits [Read error: Connection reset by peer]
00:04:42fuzzy8021 (fuzzy80211) joins
00:08:26nepeat quits [Quit: ZNC - https://znc.in]
00:10:10AlsoHP_Archivist joins
00:10:11nepeat (nepeat) joins
00:13:57HP_Archivist quits [Ping timeout: 255 seconds]
00:21:39HP_Archivist (HP_Archivist) joins
00:23:02ave quits [Client Quit]
00:23:52ave (ave) joins
00:24:14nepeat quits [Client Quit]
00:25:12AlsoHP_Archivist quits [Ping timeout: 255 seconds]
00:25:58nepeat (nepeat) joins
00:35:19HP_Archivist quits [Ping timeout: 255 seconds]
00:38:52HP_Archivist (HP_Archivist) joins
00:39:52<h2ibot>TheTechRobo edited Post News (+146): https://wiki.archiveteam.org/?diff=53484&oldid=52312
00:54:32etnguyen03 (etnguyen03) joins
01:33:32DogsRNice_ joins
01:37:25DogsRNice__ quits [Ping timeout: 255 seconds]
01:37:52DogsRNice_ quits [Ping timeout: 255 seconds]
01:50:54DogsRNice joins
01:52:37HP_Archivist quits [Client Quit]
02:01:28Ruthalas598 quits [Quit: END OF LINE]
02:17:06Ruthalas598 (Ruthalas) joins
02:37:28etnguyen03 quits [Remote host closed the connection]
03:03:01<@hook54321>ah, if it was a private beta then that's fair
03:32:10Guest54 quits [Client Quit]
03:49:07grid joins
04:01:50DogsRNice quits [Read error: Connection reset by peer]
04:33:55qwertyasdfuiopghjkl quits [Client Quit]
04:55:14qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
05:11:50<that_lurker>Entire Independent Board of Directors of 23andMe Resigns https://investors.23andme.com/news-releases/news-release-details/independent-directors-23andme-resign-board/ https://news.ycombinator.com/item?id=41573034
05:38:01charlotte_ joins
05:39:04StarletCharlotte quits [Ping timeout: 260 seconds]
05:41:54BlueMaxima quits [Read error: Connection reset by peer]
05:44:19qwertyasdfuiopghjkl quits [Ping timeout: 260 seconds]
05:51:56charlotte__ joins
05:54:49charlotte_ quits [Ping timeout: 255 seconds]
06:14:45cow_2001 quits [Quit: ✡]
06:16:06cow_2001 joins
06:18:59charlotte_ joins
06:18:59grid quits [Client Quit]
06:21:49charlotte__ quits [Ping timeout: 255 seconds]
06:42:08<pabs>Barto JAA - re mastodon JS, a wrapper around zygolophodon can do small parts of it (but not a whole site I think) https://github.com/jwilk/zygolophodon https://paste.debian.net/hidden/622ccf51/
06:43:18<@JAA>pabs: Embeds are beginning to require JS as well.
06:43:37<pabs>hmm, got an example?
06:43:43<@JAA>mastodon.social
06:43:50<@JAA>It's this PR, I think: https://github.com/mastodon/mastodon/pull/31766
06:43:57<pabs>this works without JS https://mozilla.social/@mozilla/113153943609185249/embed
06:44:13<@JAA>Yes, they're probably not running the bleeding edge.
06:45:06<@JAA>The PR was only merged 6 days ago and isn't in a release yet. But mastodon.social runs it already, it seems.
06:45:45<@JAA>It looks like there'll be a new release soon, and then it'll spread to most instances quickly.
06:46:29<pabs>crap
06:47:17<pabs>hmm, zygolophodon does still work with mastodon.social. maybe I can modify it to output API URLs instead
06:48:54<@JAA>Rewriting the URLs should be trivial.
06:56:13<pabs>aha, it has --debug-http already
06:57:42<pabs>does 2 requests for individual posts: /api/v1/statuses/113082066860765988 /api/v1/statuses/113082066860765988/context
06:59:14<pabs>and 3 for users: /api/v1/accounts/lookup?acct=mozilla /api/v1/accounts/110306602663312748/statuses?pinned=true /api/v1/accounts/110306602663312748/statuses?exclude_replies=true&limit=40
06:59:23<pabs>(plus pagination I guess)
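A minimal sketch of the URL rewriting discussed above, assuming Mastodon web URLs of the form `https://<host>/@<user>` and `https://<host>/@<user>/<status id>` (optionally followed by `/embed`). The API paths are the ones pabs lists from the `--debug-http` output; the function names are hypothetical and are not part of zygolophodon. Pagination is not handled, and the numeric account ID has to come from the lookup response.

```python
from urllib.parse import quote, urlsplit


def _base_and_segments(url):
    """Split a web URL into its origin and its non-empty path segments."""
    parts = urlsplit(url)
    base = f"{parts.scheme}://{parts.netloc}"
    segments = [s for s in parts.path.split("/") if s]
    return base, segments


def status_api_urls(post_url):
    """Return the two API URLs seen for an individual post."""
    base, segments = _base_and_segments(post_url)
    # Assumed layout: ["@user", "<numeric id>"], optionally followed by "embed".
    if len(segments) < 2 or not segments[0].startswith("@") or not segments[1].isdigit():
        raise ValueError(f"not a recognised post URL: {post_url}")
    status_id = segments[1]
    return [
        f"{base}/api/v1/statuses/{status_id}",
        f"{base}/api/v1/statuses/{status_id}/context",
    ]


def account_lookup_url(profile_url):
    """Return the lookup URL that resolves a username to an account ID."""
    base, segments = _base_and_segments(profile_url)
    if not segments or not segments[0].startswith("@"):
        raise ValueError(f"not a recognised profile URL: {profile_url}")
    acct = segments[0][1:]
    return f"{base}/api/v1/accounts/lookup?acct={quote(acct)}"


def account_status_urls(base, account_id):
    """Return the two status-listing URLs seen for a user (first page only)."""
    prefix = f"{base}/api/v1/accounts/{account_id}/statuses"
    return [
        f"{prefix}?pinned=true",
        f"{prefix}?exclude_replies=true&limit=40",
    ]
```

For example, `status_api_urls("https://mozilla.social/@mozilla/113153943609185249/embed")` yields the `/api/v1/statuses/113153943609185249` URL and its `/context` counterpart on the same host.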
07:05:49Unholy236192464537713 (Unholy2361) joins
07:11:06Unholy2361924645377135 (Unholy2361) joins
07:14:44Unholy236192464537713 quits [Ping timeout: 260 seconds]
07:14:44Unholy2361924645377135 is now known as Unholy236192464537713
07:40:45qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
08:08:17loug83 joins
08:31:52Doran quits [Ping timeout: 255 seconds]
08:32:02<monoxane>I wonder if it would be worth writing a thing that scrapes the raw ActivityPub APIs instead of trying to go through the JS UI
08:32:26<monoxane>it would violate the "preserve original content" rules though
08:42:48<magmaus3>monoxane: one potential problem is that some instances require authorized fetches, which would require the scraper to have an instance. (btw, that also means that it would be possible to prevent scraping, which is both a good and a bad thing)
08:56:58lennier2 quits [Read error: Connection reset by peer]
08:57:14lennier2 joins
09:15:47Island quits [Read error: Connection reset by peer]
09:34:22awauwa joins
11:00:02Bleo18260072271962 quits [Quit: The Lounge - https://thelounge.chat]
11:01:34Bleo18260072271962 joins
11:05:31Doran (Doranwen) joins
11:55:25SkilledAlpaca4 quits [Quit: SkilledAlpaca4]
11:57:24SkilledAlpaca4 joins
12:09:16BornOn420 quits [Quit: Textual IRC Client: www.textualapp.com]
12:09:51f_ quits [Remote host closed the connection]
12:09:55f_ (funderscore) joins
12:10:39f_ quits [Remote host closed the connection]
12:10:43f_ (funderscore) joins
12:11:33f_ quits [Remote host closed the connection]
12:12:37f_ (funderscore) joins
12:19:32BornOn420 (BornOn420) joins
12:46:37f_ quits [Remote host closed the connection]
12:46:40f_ (funderscore) joins
12:49:59sec^nd quits [Ping timeout: 260 seconds]
12:50:38sec^nd (second) joins
13:29:01Guest54 joins
13:58:34benjins3 quits [Ping timeout: 255 seconds]
14:03:24f_ quits [Remote host closed the connection]
14:03:27f_ (funderscore) joins
14:04:06Xanthon joins
14:05:43f_ quits [Remote host closed the connection]
14:05:45f_ (funderscore) joins
14:39:00HiccupJul (HiccupJul) joins
14:42:44<HiccupJul>Would it be okay to make a "List of websites not captured correctly by the Wayback Machine" page on the wiki, like the exclusions page? Don't have many examples right now, but there are a few. Although I guess it can be somewhat worked around by using archivebot.
14:43:49<HiccupJul>The one I was thinking of was this: https://www.electricsheep.co.jp/blog.php?id=431
14:43:54f_ quits [Remote host closed the connection]
14:43:56f_ (funderscore) joins
14:43:58<@arkiver>HiccupJul: that sounds nice, i'm guessing final call would be with JAA ^
14:44:18<HiccupJul>https://wiki.archiveteam.org/index.php/How_to_use_our_wiki this says to be bold but yeah I was wondering about his opinion
14:44:54<HiccupJul>webpage is the blog of the Gimmick! (famous NES game) developer, has behind the scenes info and such. blog pages only load if you navigate to the main page first.
14:45:54<HiccupJul>doesn't work through Save Page Now at the very least
14:47:15<HiccupJul>i'm asking on #archivebot if someone can try it through archivebot
14:48:06<h2ibot>MihaiArchive1 edited WikiTeam (+3, /* Wiki dumps */): https://wiki.archiveteam.org/?diff=53486&oldid=53483
14:48:07<h2ibot>MihaiArchive1 edited Wikimedia Commons (+57): https://wiki.archiveteam.org/?diff=53487&oldid=49964
14:49:06<h2ibot>Awauwa edited Deathwatch (+198, added mozilla.social): https://wiki.archiveteam.org/?diff=53488&oldid=53463
14:49:17loug83 quits [Read error: Connection reset by peer]
14:49:43loug83 joins
15:01:54HiccupJul_ joins
15:04:43HiccupJul quits [Ping timeout: 255 seconds]
15:14:37<@JAA>HiccupJul_: How would you define 'correctly'?
15:15:15<HiccupJul_>good question
15:15:36<HiccupJul_>but ones that don't have any of the content, like in this case, should probably be recorded
15:16:02<@JAA>The content not being displayed doesn't necessarily mean it wasn't captured though.
15:17:10<@JAA>I know there are sites that can be captured, all the relevant data is captured, but then something breaks on playback. If you know the API URL, you can still get the content back.
15:18:46<@JAA>The SPN 'just' does a MITM proxy to capture the network traffic. The WBM dynamically rewrites things, which sometimes breaks due to how the target site's JS is written.
15:19:16<HiccupJul_>huh
15:19:27Xanthon leaves
15:19:41<HiccupJul_>how can i check that for myself?
15:19:55<@JAA>There's no generic way. It depends on the individual site.
15:20:22<@JAA>You might be able to see something in the SPN output when using the submission form (rather than /save/URL).
15:21:04<@JAA>I see that https://www.electricsheep.co.jp/blog.php?id=431 returns a message about requiring cookies, so that's different, I guess.
15:21:19BornOn420 quits [Remote host closed the connection]
15:21:46<HiccupJul_>yeah i think it's a server-side thing
15:22:03<HiccupJul_>ah i thought you meant the wayback machine api
15:22:13<@JAA>POST requests frequently break, but the failure mode varies. For example, it might only generate one capture per hour, and the playback then doesn't load the correct data.
15:22:32<@JAA>Ah, sorry, no, I mean the target site's.
15:23:30<HiccupJul_>yeah looking in chrome devtools network log, loading the page in incognito, i don't see the page content
15:23:39<HiccupJul_>so i think it is a server-side check of some kind
15:24:21<@JAA>Yeah
15:24:36<HiccupJul_>maybe the wiki page should just list things like that which save page now can't handle, e.g. navigating to home page first. bit of an obscure requirement though
15:27:10<@JAA>Yeah, I feel like there are too many different failure modes here to document them in a sensible manner. Maybe a list of those failure modes could be useful though.
15:32:11<@JAA>And then we can add a couple examples to each failure mode.
15:36:20f_ quits [Remote host closed the connection]
15:36:23f_ (funderscore) joins
15:37:32f_ quits [Remote host closed the connection]
15:37:34f_ (funderscore) joins
15:39:14f_ quits [Remote host closed the connection]
15:39:16f_ (funderscore) joins
15:41:06<HiccupJul_>side question: is there a way to view the metadata of IA items (like https://archive.org/metadata/whatever) after the item is taken down?
15:42:53loug831 joins
15:43:25loug83 quits [Ping timeout: 255 seconds]
15:47:28loug831 quits [Ping timeout: 255 seconds]
15:49:40<@arkiver>HiccupJul_: no
15:52:33<HiccupJul_>ah, didn't think so. do you know if there's any third party backup of that metadata being made?
16:04:39loug83 joins
16:06:09pabs quits [Ping timeout: 260 seconds]
16:09:04<@arkiver>i don't think so
16:14:59pabs (pabs) joins
16:16:51M--mlv|m quits [*.net *.split]
16:16:51Valkum|m quits [*.net *.split]
16:16:51rain|m quits [*.net *.split]
16:16:51hillow596|m quits [*.net *.split]
16:16:51username675f|m quits [*.net *.split]
16:16:51AntoninDelFabbro|m quits [*.net *.split]
16:16:51kaz__|m quits [*.net *.split]
16:16:51noxious quits [*.net *.split]
16:16:51Passiing|m quits [*.net *.split]
16:16:51Cronfox|m quits [*.net *.split]
16:16:51ram|m quits [*.net *.split]
16:16:51Peetz0r|m quits [*.net *.split]
16:16:51that_lurker|m quits [*.net *.split]
16:16:51nano412510 quits [*.net *.split]
16:16:51Misty|m quits [*.net *.split]
16:16:51yetanotherarchiver|m quits [*.net *.split]
16:16:51l0rd_enki|m quits [*.net *.split]
16:16:51will|m quits [*.net *.split]
16:16:51jevinskie quits [*.net *.split]
16:16:51vics quits [*.net *.split]
16:16:51Fijxu|m quits [*.net *.split]
16:16:51v1cs quits [*.net *.split]
16:16:51supermariofan67|m quits [*.net *.split]
16:16:51trumad|m quits [*.net *.split]
16:16:51gwetchen|m quits [*.net *.split]
16:16:51NickS|m quits [*.net *.split]
16:16:51haha-whered-it-go|m quits [*.net *.split]
16:16:51joepie91|m quits [*.net *.split]
16:16:51EvanBoehs|m quits [*.net *.split]
16:16:51Adamvoltagex|m quits [*.net *.split]
16:16:51nosamu|m quits [*.net *.split]
16:16:51superusercode quits [*.net *.split]
16:16:51Cydog|m quits [*.net *.split]
16:16:51GRBaset quits [*.net *.split]
16:16:51lasdkfj|m quits [*.net *.split]
16:16:51GhostIsBeHere|m quits [*.net *.split]
16:16:51bogsen quits [*.net *.split]
16:16:51CrispyAlice2 quits [*.net *.split]
16:16:51pannekoek11|m quits [*.net *.split]
16:16:51jwoglom|m quits [*.net *.split]
16:16:51akaibu|m quits [*.net *.split]
16:16:51madpro|m quits [*.net *.split]
16:16:51Froxcey|m quits [*.net *.split]
16:16:51Ruk8 quits [*.net *.split]
16:16:51vexr quits [*.net *.split]
16:16:51Video quits [*.net *.split]
16:16:51mikolaj|m quits [*.net *.split]
16:16:51phaeton quits [*.net *.split]
16:16:51Nulo|m quits [*.net *.split]
16:16:51yzqzss quits [*.net *.split]
16:16:51qyxojzh|m quits [*.net *.split]
16:16:51s-crypt|m|m quits [*.net *.split]
16:16:51hlgs|m quits [*.net *.split]
16:16:51thermospheric quits [*.net *.split]
16:16:51cmostracker|m quits [*.net *.split]
16:16:51noobirc|m quits [*.net *.split]
16:16:51coro quits [*.net *.split]
16:16:51manu|m quits [*.net *.split]
16:16:51t3chler|m quits [*.net *.split]
16:16:51masterx244|m quits [*.net *.split]
16:16:51Hans5958 quits [*.net *.split]
16:16:52iCesenberk|m quits [*.net *.split]
16:16:52octylFractal|m quits [*.net *.split]
16:16:52wrangle|m quits [*.net *.split]
16:16:52tech234a|m quits [*.net *.split]
16:16:52andrewvieyra|m quits [*.net *.split]
16:16:52Roki_100|m quits [*.net *.split]
16:16:52finalti|m quits [*.net *.split]
16:16:52moe-a-m|m quits [*.net *.split]
16:16:52Thibaultmol quits [*.net *.split]
16:16:52schwarzkatz|m quits [*.net *.split]
16:16:52jackt1365|m quits [*.net *.split]
16:16:52saouroun|m quits [*.net *.split]
16:16:52nstrom|m quits [*.net *.split]
16:16:52Exorcism|m quits [*.net *.split]
16:16:52nyuuzyou quits [*.net *.split]
16:16:52mpeter|m quits [*.net *.split]
16:16:52MaxG quits [*.net *.split]
16:16:52Minkafighter|m quits [*.net *.split]
16:16:52Tom|m1 quits [*.net *.split]
16:16:52alexshpilkin quits [*.net *.split]
16:16:52Fletcher quits [*.net *.split]
16:16:52ragu|m quits [*.net *.split]
16:16:52tomodachi94 quits [*.net *.split]
16:16:52MinePlayersPEMyNey|m quits [*.net *.split]
16:16:52flashfire42|m quits [*.net *.split]
16:16:52rewby|m quits [*.net *.split]
16:16:52theblazehen|m quits [*.net *.split]
16:16:52x9fff00 quits [*.net *.split]
16:16:52xxia|m quits [*.net *.split]
16:16:52audrooku|m quits [*.net *.split]
16:16:52Vokun quits [*.net *.split]
16:16:52mind_combatant quits [*.net *.split]
16:16:52DigitalDragon quits [*.net *.split]
16:16:52@Sanqui|m quits [*.net *.split]
16:16:52britmob|m quits [*.net *.split]
16:16:52igneousx quits [*.net *.split]
16:16:52Ajay quits [*.net *.split]
16:18:46rewby|m joins
16:23:09BornOn420 (BornOn420) joins
16:26:29Sanqui|m (Sanqui) joins
16:26:29@ChanServ sets mode: +o Sanqui|m
16:26:31joepie91|m joins
16:26:32pannekoek11|m joins
16:26:33Ruk8 (Ruk8) joins
16:26:33tech234a|m joins
16:26:33MinePlayersPEMyNey|m joins
16:26:33xxia|m joins
16:26:33tomodachi94 (tomodachi94) joins
16:26:33thermospheric (Thermospheric) joins
16:26:33hlgs|m joins
16:26:33mpeter|m joins
16:26:33andrewvieyra|m joins
16:26:33theblazehen|m joins
16:26:33nstrom|m joins
16:26:33saouroun|m joins
16:26:33mind_combatant joins
16:26:34haha-whered-it-go|m joins
16:26:34jackt1365|m joins
16:26:34t3chler|m joins
16:26:34schwarzkatz|m joins
16:26:34x9fff00 (x9fff00) joins
16:26:34madpro|m joins
16:26:34DigitalDragon (DigitalDragon) joins
16:26:34ragu|m joins
16:26:34britmob|m joins
16:26:34audrooku|m joins
16:26:34NickS|m joins
16:26:34Ajay joins
16:26:34igneousx (igneousx) joins
16:26:34akaibu|m joins
16:26:34masterx244|m joins
16:26:34Froxcey|m joins
16:26:34GRBaset (GRBaset) joins
16:26:34Minkafighter|m joins
16:26:34lasdkfj|m joins
16:26:34finalti|m joins
16:26:34Thibaultmol joins
16:26:34manu|m joins
16:26:34Fletcher (Fletcher) joins
16:26:34yzqzss (yzqzss) joins
16:26:35CrispyAlice2 (CrispyAlice2) joins
16:26:35Roki_100|m joins
16:26:35jevinskie (jevinskie) joins
16:26:35gwetchen|m joins
16:26:35wrangle|m joins
16:26:35will|m joins
16:26:35Hans5958 (Hans5958) joins
16:26:35moe-a-m|m joins
16:26:35vexr joins
16:26:35superusercode (superusercode) joins
16:26:35MaxG joins
16:26:35jwoglom|m joins
16:26:35Exorcism|m joins
16:26:35Cydog|m joins
16:26:35trumad|m joins
16:26:35nosamu|m joins
16:26:35noobirc|m joins
16:26:35cmostracker|m joins
16:26:35Tom|m1 joins
16:26:35Misty|m joins
16:26:35Vokun (Vokun) joins
16:26:35phaeton (phaeton) joins
16:26:35Video joins
16:26:35alexshpilkin joins
16:26:35flashfire42|m (flashfire42) joins
16:26:35EvanBoehs|m joins
16:26:35octylFractal|m joins
16:26:35coro joins
16:26:36qyxojzh|m joins
16:26:36Nulo|m joins
16:26:36s-crypt|m|m joins
16:26:36iCesenberk|m joins
16:26:36that_lurker|m joins
16:26:36supermariofan67|m joins
16:26:36yetanotherarchiver|m joins
16:26:36vics joins
16:26:36bogsen (bogsen) joins
16:26:36v1cs joins
16:26:37Fijxu|m joins
16:26:37Adamvoltagex|m joins
16:26:37GhostIsBeHere|m joins
16:26:37mikolaj|m joins
16:26:37l0rd_enki|m joins
16:26:37nyuuzyou joins
16:28:07Peetz0r|m joins
16:28:08nano412510 (nano412510) joins
16:28:08Passiing|m joins
16:28:08ram|m joins
16:28:08kaz__|m joins
16:28:09Cronfox|m joins
16:28:09noxious joins
16:28:09username675f|m joins
16:28:09AntoninDelFabbro|m joins
16:28:09rain|m joins
16:28:09hillow596|m joins
16:28:09Valkum|m joins
16:28:24Dango360_ quits [Remote host closed the connection]
16:28:46Dango360_ (Dango360) joins
16:34:25Dango360 (Dango360) joins
16:37:25Dango360_ quits [Ping timeout: 255 seconds]
16:41:25HiccupJul__ joins
16:44:04HiccupJul_ quits [Ping timeout: 260 seconds]
16:48:18BornOn420 quits [Remote host closed the connection]
16:48:48BornOn420 (BornOn420) joins
16:52:15HiccupJul__ quits [Client Quit]
16:55:18<nulldata>monoxane - Grabbing the API results via AB wouldn't violate a "preserve original content" rule. It's not ideal and wouldn't be easy to browse, but it's not making up or modifying content
17:02:51MrMcNuggets (MrMcNuggets) joins
17:20:40benjins3 joins
17:21:21benjins3 quits [Remote host closed the connection]
17:21:39benjins3 joins
17:28:55lflare quits [Quit: Bye]
17:32:50IDK (IDK) joins
17:37:39lflare (lflare) joins
17:40:11M--mlv|m joins
17:57:16<@JAA>(Faking HTML pages using the API data would however be bad.)
18:08:44MrMcNuggets quits [Client Quit]
18:10:32kokos- quits [Remote host closed the connection]
18:10:32katia_ quits [Remote host closed the connection]
18:11:10thuban quits [Quit: brb, fixing my shit]
18:27:56Guest54 quits [Client Quit]
18:33:36Guest54 joins
18:34:51sec^nd quits [Ping timeout: 260 seconds]
18:37:37sec^nd (second) joins
19:07:38charlotte_ is now known as StarletCharlotte
19:09:08sec^nd quits [Remote host closed the connection]
19:09:32Island joins
19:09:45sec^nd (second) joins
19:14:00BearFortress quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]
19:20:46awauwa quits [Quit: Client closed]
19:23:14kokos- joins
19:29:28katia_ (katia) joins
19:39:21<magmaus3>JAA: i'm assuming that adding additional js to make the contents readable would still violate the rule, right?
19:40:46<@JAA>Naturally
19:40:52<@JAA>Any modification at all does.
19:41:17<@JAA>However, you could have an external page that fetches the API response from the WBM and renders it however you like.
19:41:42kokos- quits [Ping timeout: 255 seconds]
19:41:47<steering>^ and then capture that in the WBM! :P
19:42:09katia_ quits [Ping timeout: 255 seconds]
19:45:51<@JAA>Why yes, I've done that before (due to CORS). :-D
19:46:23<@JAA>Well, not quite that, but same principle: https://web.archive.org/web/20211001003631id_/https://ia801403.us.archive.org/33/items/picosong.com_finder/index.html
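A rough sketch of the "external renderer" idea, assuming the archived API response is a Mastodon status object with `account.acct`, `created_at` and an HTML `content` field. The URL shape with the `id_` suffix after the timestamp is copied from the picosong link above; the function names are hypothetical, and Python stands in here for what would be JavaScript on an actual external page. The archived record itself is left untouched, and only the local rendering differs.

```python
from html.parser import HTMLParser


def wayback_raw_url(timestamp, original_url):
    """Build a Wayback Machine URL in the same 'id_' form as the link above."""
    return f"https://web.archive.org/web/{timestamp}id_/{original_url}"


class _TextExtractor(HTMLParser):
    """Collect only the text nodes of an HTML fragment."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def strip_tags(html_text):
    """Return the text content of an HTML fragment with the tags removed."""
    parser = _TextExtractor()
    parser.feed(html_text)
    parser.close()
    return "".join(parser.chunks).strip()


def render_status(status):
    """Render an already-fetched status dict as one line of plain text."""
    author = status["account"]["acct"]
    created = status["created_at"]
    return f"{author} ({created}): {strip_tags(status['content'])}"
```

Fetching is deliberately left out: the caller would retrieve the JSON from the URL returned by `wayback_raw_url`, parse it, and pass the resulting dict to `render_status`.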
20:21:04kokos- joins
20:23:08katia_ (katia) joins
20:26:10thuban (thuban) joins
20:40:54Unholy236192464537713 quits [Ping timeout: 260 seconds]
20:42:20etnguyen03 (etnguyen03) joins
21:03:04katocala quits [Ping timeout: 260 seconds]
21:03:55katocala joins
21:09:15loug83 quits [Client Quit]
21:29:55katocala quits [Ping timeout: 255 seconds]
21:30:08katocala joins
22:31:50<TheTechRobo>magmaus3: Depends on whether it's in the WARC or not. If you're modifying the WARC record, not allowed. But the Wayback Machine adding special code to fix the page would be fine.
22:32:25<@JAA>Right, yes, but we have no influence over that.
22:36:46<magmaus3>TheTechRobo: good to know :3
22:37:01etnguyen03 quits [Client Quit]
22:44:56beastbg8 (beastbg8) joins
22:45:03Dango360_ (Dango360) joins
22:47:16BlueMaxima joins
22:48:27Dango360 quits [Ping timeout: 255 seconds]
23:03:53Bleo18260072271962 quits [Quit: The Lounge - https://thelounge.chat]