00:02:54march_happy quits [Read error: Connection reset by peer]
00:03:10march_happy (march_happy) joins
00:11:43jassdf joins
00:12:06jassdf quits [Remote host closed the connection]
00:35:53bonga joins
00:37:32<bonga>Let's change the default warrior project
00:37:49<bonga>Maybe to Google drive or mediafire
00:38:06<@JAA>Both of those are dormant at the moment.
00:38:23<bonga>Or Google sites or gamejolt
00:38:35<@JAA>Google Sites is also dormant, and Game Jolt is done for the time being.
00:39:12<@JAA>And there is a reason why Reddit is the default project: it needs the workers.
00:44:11<h2ibot>Wickedplayer494 edited Archiveteam:IRC (-5, Switching -BS to be main channel, considering…): https://wiki.archiveteam.org/?diff=48162&oldid=46616
00:45:11<h2ibot>Wickedplayer494 edited Current Projects (-10, Remove "primarily" as the EFnet transition is…): https://wiki.archiveteam.org/?diff=48163&oldid=48161
01:00:02dm4v quits [Client Quit]
01:00:05chrismeller (chrismeller) joins
01:02:33dm4v joins
01:02:35dm4v quits [Changing host]
01:02:35dm4v (dm4v) joins
01:24:09HP_Archiv joins
01:24:52HP_Archiv quits [Read error: Connection reset by peer]
01:25:59HP_Archivist quits [Ping timeout: 265 seconds]
01:42:22<h2ibot>SuperAnime9001 edited Miraheze (+128, Note Miraheze's regular deletion of "inactive"…): https://wiki.archiveteam.org/?diff=48164&oldid=48123
01:43:22<h2ibot>TheTechRobo edited Telegram (+32, snscrape can archive telegram channels): https://wiki.archiveteam.org/?diff=48165&oldid=47507
01:43:33Ruthalas (Ruthalas) joins
01:49:21benjins quits [Ping timeout: 252 seconds]
01:49:34benjins joins
02:04:26dm4v quits [Ping timeout: 240 seconds]
02:04:34dm4v joins
02:04:36dm4v quits [Changing host]
02:04:36dm4v (dm4v) joins
02:38:51Adr joins
02:51:26datechnoman quits [Quit: The Lounge - https://thelounge.chat]
02:53:49datechnoman (datechnoman) joins
03:00:41bs2k (bs2k) joins
03:16:46lennier1 quits [Client Quit]
03:17:09Ruthalas quits [Ping timeout: 265 seconds]
03:20:32lennier1 (lennier1) joins
03:26:49superkuh quits [Ping timeout: 265 seconds]
03:40:46<lennier1>sheezy.art shutting down March 1: https://sheezy.art/news/updates/2022-01-03/sheezy-art-is-shutting-down
03:41:11katocala quits [Ping timeout: 252 seconds]
03:42:21katocala joins
03:43:30<@OrIdow6>Site will freeze on February 1, so we can target in between if it's big
03:46:45<h2ibot>Lennier1 edited Deathwatch (+184, Add sheezy.art): https://wiki.archiveteam.org/?diff=48166&oldid=48160
03:46:56Ruthalas (Ruthalas) joins
03:50:39bonga quits [Remote host closed the connection]
03:51:37bonga joins
03:57:00<monika>https://sheezy.art/about ironic
03:59:45<@OrIdow6>Browsing by new, size does not look awful, 10s per day recently at most
04:01:10<@OrIdow6>"Sooner or later, they all will be gone"
04:07:34<@OrIdow6>POST seems to be used in most/all places for pagination, though; playback won't be nice
04:14:54march_happy quits [Remote host closed the connection]
04:22:03march_happy (march_happy) joins
04:32:49<@OrIdow6>Interestingly enough, there's an SPN1 link in the footer of every page
04:33:07<@OrIdow6>"archive this page"
04:34:30<@OrIdow6>This seems like a kind of site where the management is likely to be either very open to help or openly opposed to saving it; that's a point in favor of the former
04:37:22<@OrIdow6>Makes me think this might be subject to a mass SPN attempt
04:38:03<@OrIdow6>lennier1 or anyone else: how much are you in contact with this site's users? Would you know if there was an attempt to mass SPN everything?
04:38:11<appledash>wtf is SPN?
04:38:36<@OrIdow6>Save Page Now
04:38:44<appledash>Ahhh
04:41:12<lennier1>Not much. Flashpoint mentioned it, but just in the context of a small number of Flash animations.
04:45:00Adr quits [Remote host closed the connection]
04:47:24qwertyasdfuiopghjkl joins
04:48:03march_happy quits [Remote host closed the connection]
04:48:24<@OrIdow6>Oh
04:57:21march_happy (march_happy) joins
05:02:00<Jake>I'd probably get someone to reach out to them? I imagine they'd be open to an archiving attempt by us?
05:02:36HP_Archivist (HP_Archivist) joins
05:29:44superkuh joins
05:31:17hackbug77 joins
05:31:37hackbug77 quits [Remote host closed the connection]
06:07:40LegitSi joins
06:28:23Lord_Nightmare quits [Read error: Connection reset by peer]
06:30:39Lord_Nightmare (Lord_Nightmare) joins
06:44:53Lord_Nightmare quits [Read error: Connection reset by peer]
06:46:56Lord_Nightmare (Lord_Nightmare) joins
06:57:58Lord_Nightmare quits [Read error: Connection reset by peer]
07:01:21Lord_Nightmare (Lord_Nightmare) joins
07:11:03benjins quits [Read error: Connection reset by peer]
07:26:19katocala quits [Ping timeout: 252 seconds]
07:31:49katocala joins
08:34:09<pabs>thuban: are those wget issues reported somewhere?
08:50:36<Jake>My understanding is that it's not 'issues' but rather a lack of testing and validation on ARM.
08:53:36Adr joins
08:54:11froschgrosch joins
08:58:05<pabs>hmm, wget seems to work fine on my local 32-bit ARM-based router, both in TurrisOS (based on OpenWRT) and Debian. or is the testing/validation needed in warrior?
08:58:53<froschgrosch>i imagine it is just less reliable on arm-based architectures. And sometimes there isn't enough time to capture the same thing twice, so it needs to preferrably work on the first pass
09:00:23<Jake>It's not specifically warrior, but our modified version of wget, and my understand is that we're just being cautious, trying to double check that the output is what is expected.
09:05:21<@OrIdow6>Yes
09:07:00<@OrIdow6>I wasn't part of it, but at one point several months ago there was a spate of problems that approached being integrity issues due to shoddy design/programming in with WGET
09:07:08<@OrIdow6>IIRC
09:17:00<@OrIdow6>Skimming my logs, some metadata was getting truncated, and it was discovered that wget wasn't checking for errors in a bunch of function calls etc
09:19:41<@OrIdow6>So at present our reason for having confidence in its output is less "it's robust" and more "we've been running it under these restricted conditions for 10 years, if there was a problem we would've seen it by now"
09:27:01froschgrosch quits [Read error: Connection reset by peer]
09:35:05katocala quits [Remote host closed the connection]
09:36:39katocala joins
09:50:46march_happy quits [Ping timeout: 240 seconds]
09:51:04march_happy (march_happy) joins
10:01:13ThreeHM quits [Ping timeout: 265 seconds]
10:02:52ThreeHM (ThreeHeadedMonkey) joins
10:14:31Adr quits [Remote host closed the connection]
10:31:11nico_32 quits [Ping timeout: 265 seconds]
10:33:06pabs quits [Ping timeout: 240 seconds]
10:36:59pabs (pabs) joins
10:38:38nico_32 (nico) joins
10:49:20benjins joins
10:49:24benjins quits [Remote host closed the connection]
10:51:41benjins joins
10:51:54benjins quits [Remote host closed the connection]
10:53:25benjins joins
11:17:16sonick quits [Client Quit]
12:09:45sonick (sonick) joins
12:28:06<sonick>March 31 at noon, two Japanese hosting services, Webcrow and Star Server Free, will be shutting down.
12:28:36<sonick>These services are run by the same company, and both services are available for Wordpress, Mediawiki, dokuwiki, and various other PHP software, so you may need to set up a proper ignore list for each website.
12:30:25<sonick>Some people are collecting the websites now, and according to their data there are about 8k subdomains.
12:32:13<sonick>Also, Webcrow and Star server free do not seem to have good server performance and cannot handle too many requests at the same time.
12:33:18<sonick>Is it possible to get 8k websites with AB?
12:33:40qwertyasdfuiopghjkl quits [Remote host closed the connection]
12:33:50<@rewby>Not with AB but depending on the situation we have other tools
12:36:50<sonick>If using a non-AB tool, can we handle encountering mediawiki, etc.?
12:37:23BlueMaxima quits [Read error: Connection reset by peer]
12:37:24<@rewby>Probably.
12:37:51<@rewby>I'm not an expert in them, but I know we often use qwarc or write warrior projects for things like this
12:38:38<sonick>I understand. Thank you.
12:46:04<@arkiver>sonick: do you have a list of sites?
12:46:09<@arkiver>and any more information on them?
12:48:10<sonick>https://geolog.mydns.jp/webcrow/all_202109101922.txt
12:48:10<sonick>https://geolog.mydns.jp/webcrow/all.txt
12:48:10<sonick>This is the list of URLs that have been collected so far by the collection project (https://geolog.mydns.jp/webcrow/).
12:49:59qwertyasdfuiopghjkl joins
12:50:32<sonick>This page lists the types of domains available for Star server free: https://www.webcrow.jp/service/original_domain.php
12:51:16<sonick>Webcrow is in the format of *.webcrow.jp.
12:52:19<LegitSi>If you don't mind me asking, where did you get that information? I can't find it anywhere.
12:52:47<LegitSi>Nevermind, I saw it from the link you just sent.
12:54:45<sonick>The WARC, which was fetched partly by grab-site in a previous experiment, is uploaded here: https://archive.org/details/webcrows_warc
12:55:28<LegitSi>Japanese to English translation is finicky, but from what I can tell, Star Server Free might not actually be shut down, and it seems that the data can still be accessed in some way until September 30th, 2022.
12:55:50<LegitSi>Please tell me if I'm wrong, for I have zero experience with Japanese to English translation.
12:58:44<sonick>That's probably wrong. According to the shutdown announcement, the web site will be shut down at noon on March 31, but only FTP connections and data transfer for one' s own web site will be possible until September 30.
13:00:09<LegitSi>Wouldn't it be still possible then to transfer the data somehow?
13:00:15<LegitSi>Somehow, some way?
13:00:27<LegitSi>Or rather, grab the data for archival?
13:01:15<@rewby>For archival we'd rather have it in the "original" format rather than some export
13:05:05<sonick>Users can make FTP connections to their own websites, but not to others. Also, dynamic things like php will not work. So, we need to complete the fetching by March 31 using HTTP(S) connection.
13:05:18<LegitSi>ah ok
13:05:44<pabs>was https://bugs.llvm.org/ archived? I note it is read-only since November 26, 2021
13:07:19Arcorann quits [Ping timeout: 252 seconds]
14:07:41froschgrosch joins
14:32:16bs2k quits [Client Quit]
14:56:14billysnemesis joins
14:58:39<billysnemesis>Hi! I have an interest in Windows CE stuff. The only website that has software for late-90's HPCs is HPCFactor. Problem: They've put all this SHAREWARE behind a paywall.
14:58:47froschgrosch2 joins
14:58:50<billysnemesis>and FREEWARE.
14:59:41pi joins
14:59:46<billysnemesis>HPCFactor is demanding 10 british pounds to access the only source for Windows CE freeware/shareware.
14:59:49pi
14:59:56pi
15:00:02pi leaves
15:00:05<billysnemesis>?
15:01:07<billysnemesis>Anyone want to archive HPCFactor's software and driver collection?
15:02:08<billysnemesis>https://www.hpcfactor.com/info/help/membership/why/
15:02:24<billysnemesis>"There is no conspiracy here" lmfao
15:04:03<thuban>billysnemesis: if you have access to the walled-off content, i recommend uploading anything whose license permits redistribution to archive.org. as a rule, archiveteam doesn't touch stuff behind login walls, much less paywalls.
15:05:05<billysnemesis>well it's all freeware and shareware.
15:05:14<billysnemesis>i don't got an account
15:05:16froschgrosch2 quits [Client Quit]
15:05:54<froschgrosch>ignore pi please, I'm trying to get weechat to run on it but haven't figure out things yet
15:06:29<thuban>billysnemesis: then i guess you'd better make one, or find someone who has
15:07:33froschgrosch quits [Client Quit]
15:08:17pi joins
15:08:24pi
15:08:34pi is now known as froschgrosch
15:09:35<billysnemesis>well shiz i tried going on twitter "galacticfurball"s discord but they demand i sign up on discord and give them my lifes story
15:10:57<billysnemesis>that person seems like they'd be a good lead
15:15:57<billysnemesis>but i'm not giving discord my personal info and going like "oh hai i'm he him i like shoving cassiopeias up my bum and mongolian edm rap nightcore" just for a chance to get software for my half-broken win ce handheld
15:16:49<billysnemesis>i don't shove things up my bum or listwn to edm rap nightcore or mongolian throat singing
15:17:20<billysnemesis>i don't judge.... except if it's edm
15:20:00<thuban>then i guess you'd better make an account, or find someone else who has
15:20:19<thuban>people here may be willing to help you automate downloads or uploads, once you have access, but we can't help you with strangers not letting you into their clubs
15:22:09<billysnemesis>alright
15:35:52billysnemesis quits [Remote host closed the connection]
15:37:45<thuban>it looks like the forum.chip.de archivebot job is unlikely to finish in time. JAA, have we considered this as a qwarc candidate?
15:40:26<thuban>threads are numbered sequentially and accessible by id alone (with a link rel="canonical" for the slug), pagination is pretty straightforward.
15:59:20<@JAA>pabs: Yes, I archived bugs.llvm.org through AB last month.
15:59:45<@JAA>thuban: Yes, I'm aware of that and will be running it with qwarc. Just haven't set it up yet.
16:00:41<@JAA>Ditto for the Brigitte forums since they may well purge account names etc. at some undefined time and it's really a bit too large for AB anyway.
16:01:13<thuban>oh, great!
16:01:29bonga quits [Ping timeout: 252 seconds]
16:01:46chrismeller quits [Ping timeout: 240 seconds]
16:01:51bonga joins
16:16:31VerifiedJ quits [Quit: The Lounge - https://thelounge.chat]
16:17:12VerifiedJ (VerifiedJ) joins
16:25:38HP_Archivist quits [Read error: Connection reset by peer]
16:30:47march_happy quits [Ping timeout: 265 seconds]
16:36:46HP_Archivist (HP_Archivist) joins
17:08:08Iki joins
17:46:06Stiletto quits [Ping timeout: 240 seconds]
17:48:08bonga quits [Ping timeout: 265 seconds]
17:52:24bonga joins
17:57:18bonga quits [Ping timeout: 265 seconds]
17:57:43bonga joins
18:14:59HackMii quits [Remote host closed the connection]
18:16:53HackMii (hacktheplanet) joins
18:48:55Mateon1 quits [Remote host closed the connection]
18:50:05Mateon1 joins
19:27:11eroc1990 quits [Quit: Ping timeout (120 seconds)]
19:28:22eroc1990 (eroc1990) joins
19:35:08Stiletto joins
19:49:49systwi quits [Read error: Connection reset by peer]
19:50:28systwi (systwi) joins
19:50:50ave quits [Quit: Ping timeout (120 seconds)]
19:51:11ave (ave) joins
19:56:39DogsRNice (Webuser299) joins
19:57:06DogsRNice quits [Remote host closed the connection]
19:57:30DogsRNice (Webuser299) joins
20:13:48spirit quits [Quit: Leaving]
21:00:58bonga quits [Ping timeout: 265 seconds]
21:01:20bonga joins
21:18:17Ruthalas quits [Client Quit]
21:18:36Ruthalas (Ruthalas) joins
21:38:06bonga quits [Ping timeout: 240 seconds]
21:40:11bonga joins
21:45:51qwertyasdfuiopghjkl quits [Remote host closed the connection]
21:47:56qwertyasdfuiopghjkl joins
21:51:06march_happy (march_happy) joins
22:04:34froschgrosch
22:22:39Ruthalas quits [Ping timeout: 265 seconds]
22:23:51Ruthalas (Ruthalas) joins
22:32:16Arcorann (Arcorann) joins
23:28:04BlueMaxima joins
23:33:13march_happy quits [Ping timeout: 252 seconds]
23:37:05wyatt8740 quits [Ping timeout: 265 seconds]
23:39:08lukash7 quits [Quit: Ping timeout (120 seconds)]
23:39:27lukash7 joins
23:56:41lennier1 quits [Ping timeout: 252 seconds]
23:57:45lennier1 (lennier1) joins