| 00:05:48 | | qwertyasdfuiopghjkl quits [Remote host closed the connection] |
| 00:19:00 | | BlueMaxima_ joins |
| 00:22:20 | | BlueMaxima quits [Ping timeout: 265 seconds] |
| 00:22:39 | | HackMii quits [Ping timeout: 252 seconds] |
| 00:24:02 | | HackMii (hacktheplanet) joins |
| 00:27:50 | | qwertyasdfuiopghjkl joins |
| 00:50:58 | <tech234a> | https://bugs.python.org/ is being migrated to GitHub issues |
| 00:51:15 | <tech234a> | site is currently read-only |
| 00:51:20 | | jacobk quits [Ping timeout: 265 seconds] |
| 00:51:30 | <TheTechRobo> | tech234a: I think that's already in deathwatch? |
| 00:51:40 | <TheTechRobo> | yeh |
| 00:51:42 | <TheTechRobo> | *yeah |
| 00:52:12 | <tech234a> | Yeah it is, I hadn't remembered seeing it in IRC |
| 00:52:15 | <tech234a> | Thanks |
| 00:57:25 | | nepeat quits [Client Quit] |
| 01:03:01 | | dm4v_ joins |
| 01:03:33 | | dm4v quits [Ping timeout: 265 seconds] |
| 01:03:33 | | dm4v_ is now known as dm4v |
| 01:03:34 | | dm4v is now authenticated as dm4v |
| 01:03:34 | | dm4v quits [Changing host] |
| 01:03:34 | | dm4v (dm4v) joins |
| 01:18:37 | | Arcorann (Arcorann) joins |
| 01:21:38 | | nepeat (nepeat) joins |
| 01:52:13 | | pabs quits [Remote host closed the connection] |
| 01:53:34 | | pabs (pabs) joins |
| 02:40:40 | | xkey quits [Quit: WeeChat 3.3] |
| 02:40:59 | | xkey (xkey) joins |
| 02:58:56 | | Doranwen quits [Ping timeout: 265 seconds] |
| 02:59:13 | | Doranwen (Doranwen) joins |
| 03:01:47 | | JackThompson joins |
| 03:14:49 | | HackMii quits [Remote host closed the connection] |
| 03:16:02 | | HackMii (hacktheplanet) joins |
| 03:17:26 | | HP_Archivist quits [Ping timeout: 265 seconds] |
| 03:20:15 | | jacobk joins |
| 04:35:44 | | tbc1887 (tbc1887) joins |
| 04:42:35 | | tbc1887 quits [Client Quit] |
| 05:37:07 | | tzt quits [Ping timeout: 265 seconds] |
| 05:43:39 | | tzt (tzt) joins |
| 06:08:38 | | Ruthalas quits [Client Quit] |
| 06:17:06 | | tzt quits [Ping timeout: 265 seconds] |
| 06:20:08 | | jacobk quits [Ping timeout: 265 seconds] |
| 06:26:15 | | qwertyasdfuiopghjkl quits [Client Quit] |
| 06:42:47 | | Ruthalas (Ruthalas) joins |
| 06:48:50 | <Ryz> | So a bit weird: because of YouTube's take on TikTok with their 'Shorts' feature, older videos that would have counted as a short before the feature existed are retroactively treated as Shorts; one example is this YouTube channel: https://www.youtube.com/channel/UCLA1RbPxnAdpfSkBGesLcaA/videos |
| 08:46:46 | | jtagcat6 quits [Client Quit] |
| 08:47:07 | | jtagcat6 (jtagcat) joins |
| 08:48:02 | | JackThompson quits [Ping timeout: 265 seconds] |
| 08:49:22 | | Deviro joins |
| 08:50:15 | <Deviro> | is this place active? i could use some help with archiving news posted on steam |
| 09:06:52 | <spirit> | https://wiki.debian.org/GettingHelpOnIrc |
| 09:13:46 | <Deviro> | i run a small archiving project on a game called PR BF2, im collecting all the info i can on it and its subsequent spiritual successors. ive been archiving the news from steam but i think they changed something recently and archive.org doesnt seem to show the text and images on a post. it just shows the steam logo buffering and probably trying to |
| 09:13:46 | <Deviro> | fetch content or something: https://web.archive.org/web/20220409084424/https://store.steampowered.com/news/app/736590/view/3135073898851129243 ; this wasnt a problem before, i just wanted to ask if you guys know if anything changed within steam and if there's a workaround |
| 09:35:02 | <spirit> | ha, no way! can you help ME out with this by any chance? https://www.realitymod.com/forum/showthread.php?t=151811 |
| 09:35:19 | <spirit> | for the web archive, i have no clue but surely someone else might be able to help |
| 09:53:38 | | wyatt8740 quits [Ping timeout: 265 seconds] |
| 09:53:53 | | wyatt8740 joins |
| 10:02:34 | <Deviro> | that's awesome, sadly i dont have old pr installers, except i managed to snag the first few releases a few months ago, but im sure you have those already |
| 10:03:17 | | cronfox (Cronfox) joins |
| 10:05:09 | <Deviro> | im more into the timeline aspect of it, im sure you've seen this already, but in case you haven't: https://www.realitymod.com/forum/showthread.php?t=148376 |
| 10:15:38 | <cronfox> | nickserv |
| 10:22:04 | | Meem quits [Remote host closed the connection] |
| 10:38:32 | <spirit> | cheers! nah, i am not interested in the mod/game itself tbh, just a file hoarder ;) |
| 10:42:35 | | Iki1 quits [Ping timeout: 265 seconds] |
| 11:53:15 | | syntaxx quits [Client Quit] |
| 11:53:56 | | syntaxx (syntaxx) joins |
| 11:55:50 | | cronfox quits [Client Quit] |
| 11:55:56 | | cronfox (Cronfox) joins |
| 12:23:03 | <Deviro> | ive asked around for help with your problem, here's another question - does your fileplanet archive contain the forums as well? ive been looking for something from 2004 as well :P |
| 12:27:07 | | syntaxx2 (syntaxx) joins |
| 12:27:07 | | syntaxx quits [Read error: Connection reset by peer] |
| 12:27:08 | | syntaxx2 is now known as syntaxx |
| 12:40:33 | <spirit> | not THAT archive, but you might like https://gamespy-archives.quaddicted.com/ |
| 12:41:16 | <spirit> | https://gamespy-archives.quaddicted.com/111archives/ specifically |
| 12:41:48 | <spirit> | that page is off and on as i forget to enable the server sometimes |
| 12:41:52 | <spirit> | best grab all you want |
| 12:41:58 | | Megame (Megame) joins |
| 12:47:17 | | syntaxx quits [Ping timeout: 265 seconds] |
| 12:52:14 | <Deviro> | i looked at it a while ago but it only contains posts from 2007 onward i think, or i might not know how to search things, it's always a mess |
| 13:02:02 | | syntaxx (syntaxx) joins |
| 13:09:37 | <spirit> | :D |
| 13:11:37 | <thuban> | Deviro: i haven't reverse-engineered this in detail, but it looks like steam relies on javascript in a way that doesn't play nice with the wayback machine. the wbm tries to rewrite urls in order to use its own versions of page resources, but depending on how xhr urls are constructed, this can go wrong |
| 13:11:45 | <thuban> | (if the wbm can't 'see' a url in the code, it can't rewrite it, and then when the javascript runs, it'll request the original web resource instead of the wbm version and probably get blocked by the content security policy) |
| 13:11:59 | <thuban> | the good news is that the data is actually all there! |
| 13:12:45 | <thuban> | the js isn't used to _fetch_ the contents of the news posts, just to render stuff that's already there |
| 13:13:29 | | BlueMaxima_ quits [Read error: Connection reset by peer] |
| 13:14:48 | <thuban> | if you view source (including in the wbm version), it's all in the "data-patnereventstore" attribute of the "application_config" div; looks like bbcode within json within html-entity encoding |
| 13:15:57 | <thuban> | *partnereventstore |
| 13:16:23 | <spirit> | those archives are a bit finicky, yeah. best just extract either the 7z or a warc and grep through for the text you are looking for |
| 13:16:31 | <spirit> | there is one html file per thread page iirc |
| 13:19:44 | <Deviro> | that makes a lot of sense thuban, im not programmer savvy, thanks for the help, im sure this will be resolved at some point, but i hate not being able to archive things properly, thanks for the help, the data being there at least is a good thing |
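A minimal sketch of the extraction thuban describes, assuming the "application_config" div is identified by its `id` attribute and that the "data-partnereventstore" attribute holds JSON once the HTML entities are decoded. The class and function names are hypothetical, and the layout of the decoded JSON (where the BBCode body sits) is not given in the discussion, so the code only returns the parsed structure.

```python
import json
from html.parser import HTMLParser


class ConfigAttrParser(HTMLParser):
    """Collects one attribute from the first element with a given id."""

    def __init__(self, element_id, attr_name):
        super().__init__()
        self.element_id = element_id
        self.attr_name = attr_name
        self.value = None

    def handle_starttag(self, tag, attrs):
        # HTMLParser hands over attribute values with HTML entities
        # already decoded, so no separate unescape step is needed here.
        attr_map = dict(attrs)
        if self.value is None and attr_map.get("id") == self.element_id:
            self.value = attr_map.get(self.attr_name)


def extract_event_store(page_html):
    """Return the decoded JSON from the news page, or None if absent."""
    # Assumption: the div carries id="application_config".
    parser = ConfigAttrParser("application_config", "data-partnereventstore")
    parser.feed(page_html)
    if parser.value is None:
        return None
    return json.loads(parser.value)
```

Feeding it the saved page source (from the live site or the Wayback Machine copy) should give back nested lists and dicts that can then be searched for the BBCode text of a post.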
| 13:37:25 | | syntaxx quits [Ping timeout: 265 seconds] |
| 13:46:09 | | syntaxx (syntaxx) joins |
| 14:02:41 | | Arcorann quits [Ping timeout: 265 seconds] |
| 14:06:08 | | spirit quits [Quit: Leaving] |
| 14:09:00 | | HP_Archivist (HP_Archivist) joins |
| 14:41:42 | | jacobk joins |
| 14:56:07 | | tzt (tzt) joins |
| 16:09:06 | | marked2 joins |
| 16:21:04 | | JackThompson joins |
| 16:44:21 | <h2ibot> | TheTechRobo edited Telegram (+0, todo => inprogress): https://wiki.archiveteam.org/?diff=48461&oldid=48165 |
| 16:50:46 | | JackThompson quits [Client Quit] |
| 17:11:35 | | JackThompson joins |
| 17:15:53 | | Megame quits [Client Quit] |
| 17:42:24 | | Hackerpcs quits [Quit: Hackerpcs] |
| 17:43:44 | | jcotton42 joins |
| 17:45:16 | <jcotton42> | Hi, I'm trying to look for a pre-rewrite version of fanfic from fanfiction.net. There are a couple archives on fanfiction.net, but all the ones I've been able to check so far are too new |
| 17:45:23 | | Hackerpcs (Hackerpcs) joins |
| 17:45:43 | <jcotton42> | The only one I haven't checked yet is the Archive Team fanfiction swipe, https://archive.org/details/archiveteam_fanfiction but I can't figure out how to find the story in there, or if it is even in there |
| 17:46:54 | <jcotton42> | I grabbed all the cdx.idx and megawarc.json files and tried grepping them for keywords (author's name, story ID) but couldn't find anything that looked right |
| 17:46:57 | <jcotton42> | The URL is https://www.fanfiction.net/s/3148571/1/Ultimate-Danny-Phantom btw |
| 17:47:15 | <jcotton42> | (I got a few matches on the author's name, Grumbles, but it seems to just be their user page) |
| 17:47:38 | <jcotton42> | So I joined here as I saw on the archive team wiki that this was the IRC channel for this project |
| 17:51:08 | <jcotton42> | dropping for a second to switch to my phone |
| 17:51:16 | | jcotton42 leaves |
| 17:51:44 | | jcotton42 joins |
| 17:52:34 | <@JAA> | jcotton42: https://web.archive.org/web/20120413073613/http://www.fanfiction.net/s/3148571/1/Ultimate_Danny_Phantom is the earliest WBM snapshot I can find. |
| 17:52:42 | <@JAA> | That's from the archiveteam_fanfiction data. |
| 17:54:44 | <jcotton42> | Huh weird I couldn't get way back itself to work when I tried earlier |
| 17:54:55 | <jcotton42> | Thanks, this looks like what I need |
| 17:55:30 | <@JAA> | Dashes vs underscores in the slug. FFN apparently changed the URL format at some point. |
| 17:56:46 | <jcotton42> | Ahhh |
| 17:56:58 | <jcotton42> | good to know |
| 18:03:59 | <jcotton42> | Thanks again |
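Since the title slug changed between underscores and dashes, one way to sidestep the mismatch is to look up captures by the part of the URL that did not change. The sketch below builds a prefix query for the Wayback Machine CDX server; the function name is hypothetical, and it assumes story URLs follow the `/s/<story id>/<chapter>/<slug>` shape seen in this conversation.

```python
from urllib.parse import urlencode, urlsplit


def cdx_prefix_query(story_url, limit=10):
    """Build a CDX query URL that ignores the title slug of a story URL."""
    parts = urlsplit(story_url)
    segments = [s for s in parts.path.split("/") if s]
    # Keep "s/<story id>/<chapter>/" and drop the slug that follows.
    prefix = parts.netloc + "/" + "/".join(segments[:3]) + "/"
    params = {
        "url": prefix,
        "matchType": "prefix",  # match every capture starting with the prefix
        "output": "json",
        "limit": limit,
    }
    return "https://web.archive.org/cdx/search/cdx?" + urlencode(params)
```

Both the dashed and the underscored form of the story URL produce the same query, so the result list should include whichever slug was actually captured.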
| 18:16:24 | | jcotton42 leaves |
| 18:26:19 | <@rewby> | And that's one of the rare moments where we hear about somebody using an AT dataset. |
| 18:27:33 | | HP_Archivist quits [Ping timeout: 265 seconds] |
| 18:27:40 | | Iki joins |
| 19:32:47 | | AlsoHP_Archivist joins |
| 19:47:47 | | AlsoHP_Archivist quits [Ping timeout: 265 seconds] |
| 20:03:44 | | Iki quits [Ping timeout: 265 seconds] |
| 20:09:50 | | Mateon1 quits [Remote host closed the connection] |
| 20:10:03 | | Mateon1 joins |
| 20:26:12 | | nyany quits [Quit: (516): and then you went into taco bell without pants...and surprisingly you weren't the only one there without pants] |
| 20:38:23 | | Megame (Megame) joins |
| 20:43:18 | | AlsoHP_Archivist joins |
| 20:48:52 | | nyany (nyany) joins |
| 21:08:14 | <Ryz> | Sadness, the 'Save to the Wayback Machine' extension is being discontinued (at least on Google Chrome) :c |
| 21:08:48 | <Ryz> | Apparently there's the official version, but skeptical if it's better than what I was accustomed to workflow wise~ |
| 21:18:18 | <Ryz> | Welp, this is seriously seriously got good for my archiving productivity; the official version doesn't give me the information on when it was last archived when clicking the extension button |
| 21:19:02 | <Ryz> | Instead I would have to click on the calendar button and it would have to open up a tab to show when was the last time it was archived :/ |
| 21:20:33 | <Ryz> | *seriously seriously not good |
| 21:26:21 | <Ryz> | I'm assuming there's something holding back from updating VerifiedJ? s: |
| 21:27:37 | <Ryz> | I stopped archiving via that extension a long time ago, but I extensively use that thing nearly every day for doing ignores for ArchiveBot jobs by checking the last time it was archived |
| 21:28:00 | <Ryz> | Which is really inconvenient when using the official version |
| 21:45:45 | | Deviro quits [Remote host closed the connection] |
| 21:46:09 | | Deviro joins |
| 21:46:24 | | Deviro quits [Remote host closed the connection] |
| 21:54:48 | | Iki joins |
| 21:56:59 | <Craigle> | Ryz: Are you talking about this one? https://github.com/internetarchive/wayback-machine-webextension |
| 21:57:28 | <Craigle> | Because it definitely shows when a page was last archived without doing more than clicking on the extension button |
| 22:00:52 | <Ryz> | You sure? I clicked on the official extension button and it only shows something like this: https://transfer.archivete.am/pQBWO/2022-04-09_14-59-55.png |
| 22:01:14 | <Ryz> | And not something like what the unofficial extension does: https://transfer.archivete.am/myM3O/2022-04-09_15-00-49.png |
| 22:04:48 | <Ryz> | Cragile ^ |
| 22:04:52 | <Ryz> | *Craigle ^ |
| 22:06:20 | <Craigle> | Yes, if you click the "Settings" icon and then check the box for "Wayback Machine Count" it will display it |
| 22:06:26 | <Craigle> | https://transfer.archivete.am/inline/DkVoR/ext.JPG |
| 22:08:21 | <Craigle> | I manually install it from source on github because the Chrome Store version is/was an older one. But the Chrome Store version likely has that feature as well |
| 22:09:31 | <Ryz> | I clicked the option, here's what I got: https://transfer.archivete.am/kik1o/2022-04-09_15-08-06.png |
| 22:10:05 | <Ryz> | It only says how many times that page was saved :C |
| 22:10:46 | <Craigle> | Is that the one from the Chrome Store, or did you load it from Github? |
| 22:11:01 | <Ryz> | I got it from the Google Chrome store~ |
| 22:12:59 | <Craigle> | I suspect it may be an older version then. The one I loaded from Github a month or so ago shows "Version - 3.0.0.17" |
| 22:13:21 | <Craigle> | I believe that was why I loaded it manually. I don't think the Chrome Store version was at 3.0 |
| 22:13:31 | <Ryz> | Whaaa? I just installed it from Google Chrome, which I was assuming it was the latest version ><; |
| 22:14:38 | | Megame quits [Client Quit] |
| 22:14:55 | <Craigle> | It shows version 3.0 in the Chrome Store. I wouldn't think the functionality would be that different |
| 22:16:23 | <Craigle> | I'm running Brave and not straight Chrome, but since it's the same engine, I wouldn't expect that to make a difference either |
| 22:17:24 | <marked2> | I just installed from the Chrome store. It did show last archive time at first. and now it's stopped. More often than not it doesn't show it so far. |
| 22:19:32 | <Craigle> | I wonder if it's just a little slow pulling the data |
| 22:20:36 | <Craigle> | I just loaded a couple of sites fresh and it shows the count and date for me. That's really strange that it's not consistent |
| 22:20:41 | <Ryz> | For the reference URL I use: https://www.youtube.com/user/TheRunawayGuys/videos |
| 22:21:50 | <Ryz> | Uuuuuuugh, I'm actually upset that this might ruin my workflow s: |
| 22:22:56 | <Craigle> | Comes right up for me: https://transfer.archivete.am/inline/cRDCJ/Capture.JPG |
| 22:23:58 | <marked2> | turning on that setting "Wayback Machine Count" makes it much more consistent |
| 22:24:23 | <Ryz> | That's the weird thing, I did turn on "Wayback Machine Count" oo; |
| 22:25:03 | <Craigle> | I only have that and the replace 404 pages settings turned on. No other ones. I am logged in if that makes a difference |
| 22:27:52 | <Ryz> | Not logged in at all for me |
| 22:29:06 | <marked2> | yeah that's interesting. If I'm logged in I get times. If I'm logged out, I only get counts, and it requires an extra click. |
| 22:29:30 | <VerifiedJ> | You need to be logged into the IA for the official extension to show last saved dates for some reason |
| 22:29:43 | <VerifiedJ> | Hi Ryz |
| 22:30:16 | <Ryz> | Nooooooo seriously? :C |
| 22:30:20 | <Ryz> | Heya VerifiedJ |
| 22:30:26 | <Craigle> | If I recall you also get rate limited pretty steeply if you're not logged in |
| 22:30:37 | <Craigle> | I think that was why I logged in in the first place |
| 22:31:40 | <Ryz> | Rate limited deeply is a big oofer, since I also constantly check for dates~ |
| 22:31:50 | | lennier1 quits [Client Quit] |
| 22:32:18 | | lennier1 (lennier1) joins |
| 22:46:11 | <Ryz> | So yeah, still pretty annoyed this happened; gonna be a real hell impact for my productivity... |
| 22:48:30 | <Ryz> | No way to fix this VerifiedJ? |
| 22:55:28 | <marked2> | I think I found the API call https://github.com/internetarchive/wayback-machine-webextension/blob/a4f96e4a47917d79afd32d49dc1871b848b236d6/webextension/scripts/utils.js#L341 |
| 23:00:06 | | BlueMaxima joins |
| 23:02:09 | <VerifiedJ> | Ryz, I have an idea. I'll email you about it tomorrow. its midnight here and i need sleep. |
| 23:02:19 | <Ryz> | Oh! |
| 23:02:26 | <Ryz> | Huh, VerifiedJ, |
| 23:02:39 | <Ryz> | Uhh, I haven't heard from you when I sent two messages like...uhh, long long ago ><; |
| 23:03:04 | <Ryz> | Also, could just DM/PM me via IRC~ |
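For checking the last capture date without the extension, a small script against the public Wayback Machine availability endpoint is one option. This sketch is not taken from the extension's source and may not be the call marked2 linked; the function names are hypothetical, and it assumes that a query without a `timestamp` parameter returns the most recent capture as `closest`.

```python
import json
from datetime import datetime
from urllib.parse import urlencode
from urllib.request import urlopen


def availability_url(page_url):
    """Build the availability API request URL for a page."""
    return "https://archive.org/wayback/available?" + urlencode({"url": page_url})


def closest_capture_time(response):
    """Pull the capture time out of a decoded availability response."""
    closest = response.get("archived_snapshots", {}).get("closest")
    if not closest or not closest.get("available"):
        return None
    # Wayback timestamps use the form YYYYMMDDhhmmss.
    return datetime.strptime(closest["timestamp"], "%Y%m%d%H%M%S")


def last_capture(page_url, timeout=30):
    """Fetch and parse the availability response (needs network access)."""
    with urlopen(availability_url(page_url), timeout=timeout) as resp:
        return closest_capture_time(json.load(resp))
```

Calling `last_capture("https://www.youtube.com/user/TheRunawayGuys/videos")` would return a `datetime` or `None`; whether anonymous requests are rate limited the way the extension's are is not something this sketch addresses.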
| 23:15:00 | | E joins |
| 23:18:49 | <E> | If I want to run a Warrior instance (or similar) for a few hours a day on a scheduled basis, is there a recommended way to do it? Thank you! |
| 23:23:53 | <TheTechRobo> | E: Hm, you could set up a scheduled task (e.g. crontab on Unix, Task Scheduler/whatever on Windows, not sure on Mac) to start and stop the Docker container manually... (you can run the Warrior via Docker, as that's how the virtual machine works iirc) |
| 23:24:08 | <TheTechRobo> | Not an expert at this kind of stuff tho |
| 23:24:45 | <E> | Thank you! Yeah, I was thinking of doing something like that, just not familiar with Docker |
| 23:26:48 | | Arcorann (Arcorann) joins |
| 23:38:04 | <@JAA> | I think orchestration software like K8s has such things, but that's almost definitely overkill here. I'd also go with a small script with cron or systemd or whatever your system is already using. |
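The "small script with cron" idea could look roughly like the sketch below: a scheduler runs it periodically, and it starts or stops an existing Docker container depending on the hour. The container name, the hours, and the function names are all hypothetical placeholders, and it assumes the Warrior container has already been created under that name.

```python
import subprocess
from datetime import datetime

CONTAINER = "archiveteam-warrior"  # hypothetical container name
START_HOUR, STOP_HOUR = 1, 5       # hypothetical window: 01:00 to 05:00


def desired_action(hour, start_hour=START_HOUR, stop_hour=STOP_HOUR):
    """Return "start" inside the window and "stop" outside it."""
    if start_hour <= stop_hour:
        inside = start_hour <= hour < stop_hour
    else:
        # The window wraps past midnight, e.g. 22:00 to 02:00.
        inside = hour >= start_hour or hour < stop_hour
    return "start" if inside else "stop"


def apply_schedule():
    """Run `docker start` or `docker stop` according to the current hour."""
    action = desired_action(datetime.now().hour)
    # check=False: a non-zero exit from docker is not treated as fatal here.
    subprocess.run(["docker", action, CONTAINER], check=False)
```

Saved as a script that ends with a call to `apply_schedule()`, this could be run hourly from cron or a systemd timer; two plain scheduler entries calling `docker start` and `docker stop` directly would do the same job without any script.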
| 23:38:39 | | qwertyasdfuiopghjkl joins |
| 23:39:39 | | E quits [Remote host closed the connection] |
| 23:42:45 | | TheTechRobo leaves [Leaving] |
| 23:43:15 | | TheTechRobo joins |
| 23:43:15 | | TheTechRobo is now authenticated as TheTechRobo |
| 23:49:49 | | E joins |
| 23:53:39 | | E quits [Remote host closed the connection] |