| 00:07:02 | | benjinsm joins |
| 00:10:24 | | benjins quits [Ping timeout: 265 seconds] |
| 00:11:29 | | benjinsm is now known as benjins |
| 00:11:30 | | benjins is now authenticated as benjins |
| 00:13:11 | | Megame quits [Ping timeout: 252 seconds] |
| 00:14:22 | | BigBrain quits [Remote host closed the connection] |
| 00:14:41 | | BigBrain (bigbrain) joins |
| 00:31:24 | | treora quits [Remote host closed the connection] |
| 00:31:25 | | treora joins |
| 00:47:11 | <pabs> | https://apnews.com/article/saudi-arabia-death-sentence-twitter-a2b5549806605d1d21f332ac4c36e43f |
| 00:47:24 | <pabs> | death sentence for twitter/youtube stuff |
| 00:52:30 | | Bleo quits [Quit: Ping timeout (120 seconds)] |
| 00:52:42 | | Bleo joins |
| 00:57:00 | <nicolas17> | youtube project is weird, my VPS downloaded 33GB and uploaded 6GB |
| 00:57:19 | <nicolas17> | is it downloading videos and throwing them away based on some criteria instead of uploading? |
| 00:58:35 | <audrooku|m> | I was under the impression it only scraped metadata, if itd grabbing the youtube webpage for example it can theow away all of the html and just upload a json zst |
| 00:58:50 | <audrooku|m> | It's* throw* |
| 00:59:04 | | systwi_ quits [Client Quit] |
| 00:59:05 | | nothere quits [Quit: Leaving] |
| 00:59:09 | <@JAA> | → #down-the-tube |
| 01:06:16 | <@JAA> | pabs: Hmm, can't find him, but it might all be in Arabic of course. |
| 01:07:12 | | gfhh1 joins |
| 01:09:17 | | gfhh quits [Ping timeout: 252 seconds] |
| 01:34:25 | | PredatorIWD__ joins |
| 01:34:53 | | gfhh joins |
| 01:35:10 | | gfhh1 quits [Read error: Connection reset by peer] |
| 01:35:10 | | DLoader_ joins |
| 01:35:10 | | DLoader_ quits [Excess Flood] |
| 01:35:10 | | DLoader_ joins |
| 01:35:26 | <nicolas17> | audrooku|m: it's certainly making 100-1000MB downloads |
| 01:35:51 | | sec^nd quits [Ping timeout: 245 seconds] |
| 01:36:20 | | sec^nd (second) joins |
| 01:36:47 | | DLoader quits [Ping timeout: 252 seconds] |
| 01:36:53 | | DLoader_ is now known as DLoader |
| 01:37:53 | | PredatorIWD_ quits [Ping timeout: 265 seconds] |
| 01:47:56 | | drunkmoon joins |
| 01:56:10 | | wizbot joins |
| 02:01:10 | | nothere joins |
| 02:05:58 | | sec^nd quits [Remote host closed the connection] |
| 02:06:30 | | sec^nd (second) joins |
| 02:15:38 | | cas joins |
| 02:28:09 | | cas quits [Ping timeout: 265 seconds] |
| 02:28:09 | | drunkmoon quits [Ping timeout: 265 seconds] |
| 02:30:05 | | dumbgoy_ quits [Ping timeout: 265 seconds] |
| 02:49:40 | | drunkmoon joins |
| 02:51:44 | <pabs> | https://www.gamingonlinux.com/2023/08/dev-of-shadow-tactics-desperados-iii-shadow-gambit-the-cursed-crew-shutting-down/ |
| 03:05:58 | | wyatt8750 quits [Remote host closed the connection] |
| 03:11:11 | | wyatt8740 joins |
| 03:12:01 | | wyatt8740 quits [Client Quit] |
| 03:12:14 | <fireonlive> | 05:29:51 PM -+rss:#hackernews- Keisan Casio is shutting down: https://keisan.casio.com/keisan/abolition.php https://news.ycombinator.com/item?id=37328669 |
| 03:13:01 | | wyatt8740 joins |
| 03:15:35 | <fireonlive> | https://news.ycombinator.com/item?id=37328669#37330950 |
| 03:15:45 | <fireonlive> | "Looks like the Japanese forum is going offline too: https://keisan.casio.jp/keisan/user_forum/" |
| 03:16:07 | | yts98 leaves |
| 03:16:15 | | yts98 joins |
| 03:30:30 | | Doranwen quits [Ping timeout: 265 seconds] |
| 03:33:27 | | DogsRNice quits [Read error: Connection reset by peer] |
| 03:36:08 | | icedice quits [Ping timeout: 252 seconds] |
| 03:44:09 | | decagon__ joins |
| 03:47:54 | | krvme quits [Ping timeout: 265 seconds] |
| 03:50:30 | | yts98 leaves |
| 03:50:36 | | yts98 joins |
| 03:54:51 | | Doranwen (Doranwen) joins |
| 03:54:57 | | yts98 leaves |
| 03:55:00 | | yts98 joins |
| 04:05:46 | | etnguyen03 (etnguyen03) joins |
| 04:10:10 | | cas joins |
| 04:17:33 | | dumbgoy_ joins |
| 04:39:32 | | cas quits [Remote host closed the connection] |
| 04:40:58 | | etnguyen03 quits [Client Quit] |
| 04:54:29 | <pokechu22> | Larsenv: looks like the forum in question is https://discussions.apple.com/browse - that's probably worth saving, but we might want to wait a few days for the Gabon stuff to settle down (and the other urgent stuff too). It doesn't sound like they'll be immediately deleting the forum on October 1, just stopping employee posting on it |
| 04:55:03 | <pokechu22> | I don't recognize the forum software in use so it might be a bit of a mess for archivebot |
| 04:55:55 | <nicolas17> | probably custom? |
| 04:58:47 | <Larsenv> | What is Gabon |
| 04:58:56 | <pokechu22> | Do we actually know how big it is? |
| 04:59:02 | <pokechu22> | https://en.wikipedia.org/wiki/2023_Gabonese_coup_d%27%C3%A9tat |
| 04:59:49 | | cas joins |
| 05:01:19 | <Larsenv> | It's probably 20 years of history! |
| 05:02:39 | <cas> | oh? what is? |
| 05:04:14 | <nicolas17> | https://discussions.apple.com/browse |
| 05:07:10 | <fireonlive> | damn it |
| 05:07:40 | <fireonlive> | but yeah, it's been around forever it feels |
| 05:08:15 | <fireonlive> | hmmmm |
| 05:08:18 | <fireonlive> | https://discussions.apple.com/browse?page=1&sortBy=latestActivityOldest |
| 05:08:21 | <fireonlive> | not encouraging... |
| 05:08:34 | <fireonlive> | latest activity: oldest is something from 5 months ago |
| 05:09:00 | <fireonlive> | did they swap software/turn on auto-prune at some point? |
| 05:10:04 | <pokechu22> | The only provided sitemap in https://discussions.apple.com/sitemap-index.xml is https://discussions.apple.com/sitemap-fca81a5d-1.xml for recent posts |
| 05:10:49 | <pokechu22> | and https://discussions.apple.com/browse?page=103&sortBy=dateCreatedNewest is the last page given that way... |
| 05:11:04 | <pokechu22> | same with https://discussions.apple.com/browse?page=103&sortBy=latestActivityNewest |
| 05:11:47 | | BlueMaxima quits [Read error: Connection reset by peer] |
| 05:11:54 | <fireonlive> | oh ok, there's this from https://discussions.apple.com/thread/8261972 2018 |
| 05:11:58 | <fireonlive> | so it exists somewhere |
| 05:12:27 | <fireonlive> | https://discussions.apple.com/thread/1000000 is from 2007 |
| 05:13:07 | <nicolas17> | if I sort by "recently created", it only lets me get to page 103 |
| 05:13:32 | <nicolas17> | which has posts from *one day ago* so the forum seems to get 2000 threads per day? |
| 05:13:33 | <cptcobalt> | I’m sad that the 10000 or whatever post is not from someone ID hunting |
| 05:14:56 | <nicolas17> | anyway if threads have sequential IDs, why bother with /browse |
| 05:15:06 | <fireonlive> | yeah seems to be |
| 05:15:07 | <fireonlive> | https://discussions.apple.com/thread/200002 |
| 05:15:12 | <fireonlive> | there's some threads that ask me for login |
| 05:15:15 | <fireonlive> | and some just 404 |
| 05:15:15 | <fireonlive> | but |
| 05:15:28 | <fireonlive> | randomly playing with numbers |
| 05:16:26 | <fireonlive> | so many unanswered questions :p |
| 05:16:29 | <pokechu22> | Looking at the recent ones they don't seem to be fully sequential, in fact https://discussions.apple.com/thread/255097976 is listed before https://discussions.apple.com/thread/255097973 - but there might be some kind of spam queue that needs manual approval to escape or something? |
| 05:16:53 | <fireonlive> | there's also user profiles, https://discussions.apple.com/profile/OmegaOSX/participation |
| 05:18:04 | <fireonlive> | also 200001's reply links have 'answerid' params.. 987106022 for the first reply |
| 05:18:28 | <fireonlive> | but can be ignored i think |
| 06:10:00 | | cas quits [Ping timeout: 265 seconds] |
| 06:11:33 | <h2ibot> | FireonLive edited Xuite (+4, not so "Xuite" news, Xuite is offline): https://wiki.archiveteam.org/?diff=50630&oldid=50629 |
| 06:12:33 | <h2ibot> | Yts98 edited Xuite (+77, Xuite goes offline): https://wiki.archiveteam.org/?diff=50631&oldid=50630 |
| 06:13:33 | <h2ibot> | FireonLive edited Current Projects (+30, move Xuite to done): https://wiki.archiveteam.org/?diff=50632&oldid=50617 |
| 06:14:56 | <fireonlive> | auto project is Xuite still, dunno if there's a better choice |
| 06:18:51 | <flashfire42> | are any other projects currently active and feeding items? telegram maybe? |
| 06:19:34 | <h2ibot> | FireonLive edited Current Projects (+18, linkify YouTube's 'selected videos'): https://wiki.archiveteam.org/?diff=50633&oldid=50632 |
| 06:20:13 | <imer> | telegram is empty again |
| 06:20:24 | <imer> | reddit once arkiver gets time to look at the image stuff I guess |
| 06:21:36 | <fireonlive> | imgur is going to have a temporary startup as well |
| 06:22:00 | <fireonlive> | but yeah reddit will need quite the backlog crunch |
| 06:22:03 | <flashfire42> | /tableflip I guess I will find something to feed into something then |
| 06:22:24 | <flashfire42> | do you want more youtube videos? Mediafire chews it up way too quickly and I have exhausted my telegram stuff |
| 06:23:34 | <fireonlive> | if they fit within scope i don't see the harm, but i assume we'll chew though them quite quickly |
| 06:25:57 | <nicolas17> | there's still some youtube reclaims |
| 06:26:23 | <nicolas17> | and I have a ton of free bandwidth left on my VPS so I'm getting my money's worth :P |
| 06:28:44 | | nicolas17 quits [Client Quit] |
| 06:29:16 | | yts98 leaves |
| 06:33:57 | <fireonlive> | :D |
| 06:42:01 | | yts98 joins |
| 06:54:22 | <flashfire42> | so youtube is the only one running? plus the dregs of gfycat? |
| 06:55:04 | <imer> | telegram is also running, but empty currently |
| 06:55:12 | <imer> | but thats about it yeah |
| 07:05:02 | | Unholy2361316618085 quits [Remote host closed the connection] |
| 07:06:28 | | cas joins |
| 07:06:36 | | Unholy2361316618085 (Unholy2361) joins |
| 07:24:26 | | dumbgoy_ quits [Ping timeout: 265 seconds] |
| 07:24:54 | | Arcorann (Arcorann) joins |
| 07:30:18 | | Unholy2361316618085 quits [Remote host closed the connection] |
| 07:30:40 | | Unholy2361316618085 (Unholy2361) joins |
| 07:38:56 | | cas quits [Ping timeout: 265 seconds] |
| 07:49:41 | | nulldata quits [Ping timeout: 252 seconds] |
| 07:52:43 | | nulldata (nulldata) joins |
| 08:00:56 | <h2ibot> | JAABot edited CurrentWarriorProject (-4): https://wiki.archiveteam.org/?diff=50634&oldid=50582 |
| 08:01:39 | | JohnnyJ joins |
| 08:18:50 | | kiskaLogBot quits [Ping timeout: 252 seconds] |
| 08:19:05 | | kiskaLogBot joins |
| 08:19:59 | | icedice (icedice) joins |
| 08:50:57 | | kiskaLogBot quits [Ping timeout: 265 seconds] |
| 09:10:09 | <flashfire42> | Is that rearchive everything command still active on telegram? |
| 09:10:33 | | kiskaLogBot joins |
| 09:18:19 | | Exorcism (exorcism) joins |
| 09:21:24 | | kiskaLogBot quits [Ping timeout: 265 seconds] |
| 09:22:03 | | kiskaLogBot joins |
| 09:23:03 | | cas joins |
| 09:26:29 | | kiskaLogBot quits [Ping timeout: 252 seconds] |
| 09:26:50 | | kiskaLogBot joins |
| 09:30:06 | | kiryu quits [Ping timeout: 265 seconds] |
| 09:31:06 | | kiryu joins |
| 09:36:52 | | kiryu quits [Ping timeout: 265 seconds] |
| 09:37:11 | | kiryu joins |
| 09:38:35 | | kiskaLogBot quits [Ping timeout: 252 seconds] |
| 09:38:54 | | kiskaLogBot joins |
| 09:44:07 | | kiryu quits [Ping timeout: 265 seconds] |
| 09:44:52 | | kiryu joins |
| 09:59:49 | | toss (toss) joins |
| 10:00:01 | | railen63 quits [Remote host closed the connection] |
| 10:00:18 | | railen63 joins |
| 10:06:47 | <cas> | http://littlewitchacademia.jp/tv1st/ Requesting to queue this site for archival, if possible. Not necessarily urgent, but bandai namco deleted their page pertaining the anime a while back (https://web.archive.org/web/20180119174059/https://en.bandainamcoent.eu/little-witch-academia/little-witch-academia) so I hope the site as-is can be archived to |
| 10:06:48 | <cas> | preserves its content. |
| 10:17:59 | | Carnildo_again joins |
| 10:17:59 | | Carnildo quits [Read error: Connection reset by peer] |
| 10:29:47 | | toss quits [Client Quit] |
| 10:49:02 | <flashfire42> | JAA do you have the way to turn off the rearchive everything thing on telegram? |
| 10:49:32 | <@JAA> | flashfire42: No, also wrong channel. |
| 10:49:59 | <flashfire42> | you and I both know things get buried in #telegrab I wanted to be sure it was seen |
| 10:50:22 | <@JAA> | I read my pings. :-) |
| 10:50:50 | <@JAA> | (And I expect others to do so, too, regardless of other noise.) |
| 10:53:10 | <@rewby> | nstrom|m: I don't see an announcement on dedipath's site about the shutdown |
| 10:57:58 | | railen63 quits [Remote host closed the connection] |
| 10:58:01 | | railen69 joins |
| 10:58:23 | <@JAA> | Apparently just an email, and people on LET are reporting sender verification failures: https://lowendtalk.com/discussion/188358/dedipath-closure-of-business |
| 10:58:36 | | Exorcism quits [Remote host closed the connection] |
| 11:00:18 | | Exorcism (exorcism) joins |
| 11:01:40 | <nstrom|m> | was just about to post that over here. yeah I got the email from them because I have a server as well, can verify the contents of that post |
| 11:11:44 | <flashfire42> | if we need more work for warriors poke me when I am not working or sleeping I will toss some more into whatever you want |
| 11:20:01 | | Exorcism quits [Ping timeout: 245 seconds] |
| 11:29:00 | | kiryu quits [Ping timeout: 265 seconds] |
| 11:29:16 | | kiryu joins |
| 11:31:50 | <Exorcism|TheLounge> | https://twitter.com/ActiveTK5929/status/1696840098862809583 |
| 11:31:50 | <eggdrop> | nitter: https://nitter.net/ActiveTK5929/status/1696840098862809583 |
| 11:34:34 | <bladem> | Dedipath confirmed via email to LES moderator that they are shutting down entirely today: https://lowendspirit.com/discussion/comment/148585/#Comment_148585 |
| 11:51:14 | | cas quits [Ping timeout: 265 seconds] |
| 11:57:19 | | Mateon1 quits [Read error: Connection reset by peer] |
| 12:00:12 | <@JAA> | VirMach is using DediPath for a lot of their locations. I wonder what other providers either have their stuff colo'd there or are just reselling. |
| 12:04:50 | <nstrom|m> | I *think* dedipath didn't own the colocation facilities, just had ASN (as35913) & hardware |
| 12:04:53 | <nstrom|m> | I could be wrong on that though |
| 12:06:15 | <nstrom|m> | I know there were definitely resellers of dedipath VPSes. .ethernetservers.com was reselling dedipath in NJ but recently switched providers there after a datacenter fire |
| 12:08:28 | <nstrom|m> | they had space in 10 datacenters and their ASN announces a pretty big chunk of addresses so I'm sure there will be lots of affected customers in any case |
| 12:09:02 | <nstrom|m> | (they = dedipath in above) |
| 12:10:55 | <@JAA> | nstrom|m: They mention colo stuff in the email though...? |
| 12:11:22 | <@JAA> | 'In regards to our colocation customers if you are in the following locations please send a ticket to ...' |
| 12:11:56 | <nstrom|m> | yeah I think the companies they tell colo customers to contact are the companies that actually own the datacenters |
| 12:12:10 | <nstrom|m> | so it's probably something like "move your stuff off of the dedipath racks onto some other rack in the same facility if you want to stay" |
| 12:12:31 | <nstrom|m> | if I had to guess |
| 12:12:36 | <nstrom|m> | I'm not a colo customer there |
| 12:13:29 | <@JAA> | Ah |
| 12:13:55 | <@rewby> | DC's usually only like to sell in units of 1 or more racks |
| 12:13:58 | <@rewby> | As in, entire racks |
| 12:14:12 | <@rewby> | So there's an ecosystem of companies that basically sublease racks |
| 12:14:24 | <@rewby> | They get a rack in a DC (or even an entire cage) |
| 12:14:31 | <@rewby> | And then rent out individual RU with power and network |
| 12:14:54 | <@rewby> | So if dedipath had colo customers, this is likely what they did |
| 12:15:11 | <@rewby> | The DC would just drop power and maybe one or two fiber feeds into the rack |
| 12:15:33 | <@rewby> | And then the company puts in a top of rack, a PDU and brings more ips |
| 12:27:45 | | Mateon1 joins |
| 12:32:16 | | Mateon1 quits [Remote host closed the connection] |
| 12:32:29 | | Mateon1 joins |
| 12:36:47 | | Mateon1 quits [Ping timeout: 252 seconds] |
| 12:44:56 | <@JAA> | Oh yeah, as someone pointed out, LET still has a DediPath ad at the bottom. Beautiful. |
| 12:48:13 | | etnguyen03 (etnguyen03) joins |
| 12:55:17 | | Mateon1 joins |
| 12:59:53 | | wickedplayer494 quits [Ping timeout: 252 seconds] |
| 13:03:10 | | Mateon1 quits [Read error: Connection reset by peer] |
| 13:07:01 | | Mateon1 joins |
| 13:14:11 | | AmAnd0A quits [Ping timeout: 252 seconds] |
| 13:14:26 | | AmAnd0A joins |
| 13:24:51 | | qwertyasdfuiopghjkl quits [Client Quit] |
| 13:41:19 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
| 13:46:05 | | bf_ quits [Ping timeout: 252 seconds] |
| 13:54:52 | | erkinalp joins |
| 13:57:14 | | bf_ joins |
| 14:14:23 | <immibis> | "Summer Sale - 20% off select dedicated servers from just $36/m + save 20% on all VPS and web hosting! Click Here to Save Now!" |
| 14:14:36 | <immibis> | save because they will never deliver any service or bill you? |
| 14:23:08 | <qyxojzh|m> | <immibis> ""Summer Sale - 20% off select..." <- Is that a scam email or a genuine sale? |
| 14:23:47 | | bf_ quits [Remote host closed the connection] |
| 14:24:07 | | bf_ joins |
| 14:25:22 | <immibis> | that's the banner at the top of dedipath.com |
| 14:25:26 | | bf_ quits [Remote host closed the connection] |
| 14:25:33 | | cas joins |
| 14:25:36 | <immibis> | which is apparently going out of business today |
| 14:25:44 | <immibis> | according to the discussion above |
| 14:27:48 | | bf_ joins |
| 14:42:20 | | Arcorann quits [Ping timeout: 265 seconds] |
| 14:48:14 | | gfhh quits [Ping timeout: 252 seconds] |
| 15:04:07 | <@arkiver> | 44 TB of items with WARCs in the archiveteam collections have mediatype 'data' - this will make them unavailable in the Wayback Machine |
| 15:04:13 | <@arkiver> | i'm moving these to mediatype=web |
| 15:04:46 | <@arkiver> | after this i'll do a check to see if all WARCs have actually been derived - i already see quite some that have not been derived |
| 15:22:27 | | etnguyen03 quits [Ping timeout: 265 seconds] |
| 15:25:27 | | etnguyen03 (etnguyen03) joins |
| 15:43:56 | <@arkiver> | now rederiving some 113 TB of items with WARCs that may not be correctly indexed yet |
| 15:50:12 | <@arkiver> | (that is 30k items) |
| 15:50:37 | <@JAA> | Yay, more work for IA's overloaded systems. :-) |
| 15:51:49 | <TheTechRobo> | https://lounge.thetechrobo.ca/uploads/1900690ac0ae6650/image.png |
| 15:51:55 | | dumbgoy_ joins |
| 15:51:57 | <@arkiver> | or... more records in the Wayback Machine without having to upload new data :P |
| 15:52:06 | <@arkiver> | more completeness yay |
| 15:52:30 | <@JAA> | But great to fix those. I hope they weren't intentionally marked with a different mediatype. Probably not many of those in the AT collection though. |
| 15:53:11 | <@arkiver> | most of these were 2014 or 2015 items from 'before AB' |
| 15:53:16 | <@arkiver> | usually uploaded by a single user |
| 15:53:35 | <@JAA> | Ah, the dark ages. |
| 15:53:49 | <@arkiver> | from https://archive.org/details/archiveteam_earlywarcs |
| 15:53:52 | <fireonlive> | 😰 |
| 15:54:02 | <@arkiver> | they were initially uploaded as mediatype=data |
| 15:54:26 | <@arkiver> | (there were also some cases in which other items seemed to have been accidentally uploaded as mediatype=data) |
| 15:54:38 | <@arkiver> | but those are indeed pretty old, ~9 years or so |
| 15:55:40 | <@arkiver> | at this point in time, WARCs that should not go into the Wayback Machine are in https://archive.org/details/warczone |
| 15:58:05 | <@arkiver> | meanwhile we also still have https://archive.org/details/archiveteam-mobileme-hero , which contains 282 TB of WARCs... inside tar files :/ so also not in the Wayback Machine |
| 15:58:30 | <@arkiver> | basically that entire project is invisible to most users |
| 15:58:52 | <fireonlive> | hey my warcs are in the zone t_t |
| 15:59:04 | <fireonlive> | and i thought i was special! :P |
| 15:59:21 | <fireonlive> | but hmm interesting |
| 15:59:56 | <fireonlive> | “This massive collection represents one of the largest projects Archive Team may ever do: Over 272 terabytes…” |
| 15:59:58 | <fireonlive> | :) |
| 16:00:11 | <TheTechRobo> | lmao |
| 16:00:30 | <TheTechRobo> | urls project: 5.03PiB |
| 16:02:15 | <@JAA> | I've uploaded WARCs in TARs before, precisely so they don't accidentally get derived and put in the WBM. |
| 16:03:41 | <@arkiver> | but i think we want the mobileme collection to be indexed? |
| 16:04:35 | <@JAA> | Assuming there were no auth shenanigans involved, probably. |
| 16:06:20 | | etnguyen03 quits [Ping timeout: 252 seconds] |
| 16:13:18 | <@HCross> | oh arkiver I see you found the rest of XS4ALL :P |
| 16:20:29 | <@HCross> | is there a way to unpack those TAR files inside the IA, or do we need something to pull them down and reprocess them |
| 16:26:15 | | cas quits [Ping timeout: 265 seconds] |
| 16:30:14 | <@arkiver> | HCross: might have to pull them down :/ |
| 16:30:47 | <@arkiver> | there's more than just WARCs. it might be best to pull them down, just pull out the WARCs and put those WARCs back in together with tars containing leftovers |
| 16:41:47 | <fireonlive> | xs4all now that’s a name i haven’t heard in ages |
| 16:45:19 | <@rewby> | Me and HCross made an effort to archive their ISP hosting |
| 16:45:28 | <@rewby> | Was a bit of a pain |
| 16:45:31 | <@rewby> | And completeness is ??? |
| 16:45:34 | <@rewby> | But it's something |
| 16:48:40 | <@arkiver> | it's a very good "something" :) |
| 16:48:45 | <@rewby> | ? |
| 16:48:52 | <@rewby> | It was me having a go at heritrix |
| 16:49:06 | <@arkiver> | how was the experience? |
| 16:49:11 | <@rewby> | Good and bad? |
| 16:49:22 | <@rewby> | It needed some code tweaks to do what I wanted it to do |
| 16:49:33 | <@rewby> | And I ended up doing some cursed sharding |
| 16:49:39 | <@rewby> | To make it go faster |
| 16:49:57 | <@rewby> | Took a while to get my head around |
| 16:50:02 | <@rewby> | So really not any worse or better than most tools |
| 16:50:21 | <@rewby> | I ended up modifying some of the code around frontier management |
| 16:50:54 | <@rewby> | Because I needed it to do some cursed things to deal with xs4all |
| 16:52:06 | <@rewby> | One thing that was a pain was that at some point xs4all converted from xs4all.net/~user (or something similar) to user.xs4all.net (or something) (I forget the exact subdomains involved, but they went from ~user to subdomain) |
| 16:52:10 | <@rewby> | And while the redirect existed |
| 16:52:19 | <@rewby> | I had to hack a few checks out of the frontier code |
| 16:52:34 | <@rewby> | Because I didn't want to grab many outlinks |
| 16:52:55 | <@rewby> | But I needed it to go through to the ~user links on a different subdomain |
| 16:52:59 | <@rewby> | And follow the redirects |
| 16:54:26 | <@rewby> | It ended up working reasonably well |
| 16:54:37 | <@rewby> | Also iframes did some cursed stuff I dont' remember |
| 16:54:50 | <@JAA> | Are those patches/hacks available somewhere? |
| 16:54:59 | <@rewby> | No, I don't even know if I still have them |
| 16:55:04 | <@JAA> | Oof |
| 16:55:10 | <@rewby> | I mean, it's not that hard |
| 16:55:14 | <@rewby> | The source code is on github |
| 16:55:20 | <@rewby> | Find the frontier code and just fuck around and find out |
| 16:55:39 | <@rewby> | I will point out I never touched the actual retrieval and warc writing |
| 16:55:51 | <@rewby> | I just fucked around with link discovery and frontier |
| 16:56:08 | <@JAA> | Yeah, I just like this ideal world where the code for the archival is also freely available for anyone to figure out why the data is the way it is. |
| 16:56:17 | <@rewby> | Yeah I get that |
| 16:56:31 | <@rewby> | But genuinely, it was a copy paste hack job in a bunch of places |
| 16:56:34 | <fireonlive> | ahh :) |
| 16:57:16 | <@rewby> | also, I can't actually share the seed url list |
| 16:57:32 | <fireonlive> | ah yes a .nl isp :3. probably saw it on irc way back when |
| 17:06:37 | | DogsRNice joins |
| 17:10:28 | | etnguyen03 (etnguyen03) joins |
| 17:57:44 | | wizbot quits [Read error: Connection reset by peer] |
| 17:57:46 | | wizbot_ joins |
| 18:08:56 | | Carnildo_again is now known as Carnildo |
| 18:17:14 | | iCaotix quits [Ping timeout: 252 seconds] |
| 18:23:33 | | iCaotix joins |
| 18:38:39 | | emberquill08 quits [Quit: The Lounge - https://thelounge.chat] |
| 18:39:23 | | emberquill08 (emberquill) joins |
| 18:40:31 | | PredatorIWD__ quits [Read error: Connection reset by peer] |
| 18:47:08 | | AmAnd0A quits [Read error: Connection reset by peer] |
| 18:47:50 | | AmAnd0A joins |
| 19:05:31 | | qwertyasdfuiopghjkl quits [Remote host closed the connection] |
| 19:07:50 | | etnguyen03 quits [Ping timeout: 252 seconds] |
| 19:12:06 | | sec^nd quits [Ping timeout: 245 seconds] |
| 19:14:14 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
| 19:24:02 | | Darken (Darken) joins |
| 19:31:25 | | Darken quits [Remote host closed the connection] |
| 19:53:10 | | nicolas17 joins |
| 20:00:16 | <h2ibot> | 0KepOnline edited VHS on YouTube (+253, Added Ukrainian VHS on YT): https://wiki.archiveteam.org/?diff=50635&oldid=37007 |
| 20:00:17 | <h2ibot> | 0KepOnline edited Local TV News (+2215, /* Europe */ Ukraine): https://wiki.archiveteam.org/?diff=50636&oldid=48641 |
| 20:02:21 | <h2ibot> | Rexma edited List of websites excluded from the Wayback Machine/Partial exclusions/Twitter accounts (+38, add account): https://wiki.archiveteam.org/?diff=50637&oldid=49984 |
| 20:02:22 | <h2ibot> | Rob Kam edited WikiTeam (+142, The MediaWiki comparison of wiki farms is more…): https://wiki.archiveteam.org/?diff=50638&oldid=50537 |
| 20:04:14 | <fireonlive> | https://x.com/toadsanime/status/1697286315094511759?s=12 |
| 20:04:15 | <eggdrop> | nitter: https://nitter.net/toadsanime/status/1697286315094511759 |
| 20:05:20 | <fireonlive> | "Volition staff on Twitter are reporting that parent company Embracer has just closed the 30-year-old studio behind Saint's Row and Red Faction, with mass layoffs #VolitionJobs" |
| 20:08:15 | | systwi_ (systwi) joins |
| 20:09:26 | | systwi quits [Ping timeout: 252 seconds] |
| 20:15:13 | | systwi_ is now known as systwi |
| 20:17:08 | | tertu quits [Ping timeout: 252 seconds] |
| 20:17:17 | | erkinalp quits [Ping timeout: 265 seconds] |
| 20:19:28 | | tertu (tertu) joins |
| 20:36:37 | | nicolas17 quits [Ping timeout: 265 seconds] |
| 20:40:56 | | nicolas17 joins |
| 20:41:03 | <nicolas17> | ran telegram at high concurrency... pop went the modem |
| 20:52:22 | <@arkiver> | :P |
| 20:52:40 | <@arkiver> | well at time you're getting the maximum out of your modem :) |
| 20:57:38 | | etnguyen03 (etnguyen03) joins |
| 21:03:41 | | iCaotix quits [Ping timeout: 265 seconds] |
| 21:08:38 | | iCaotix joins |
| 21:11:33 | <nulldata> | fireonlive - that's so fucking sad |
| 21:14:04 | <fireonlive> | yeah :( |
| 21:14:16 | <nulldata> | Even sadder that the Saints Row 2 PC patch IdolNinja was so passionate about and organized before he died of cancer very likely won't see the light of day either |
| 21:16:49 | <fireonlive> | goddamn |
| 21:18:19 | | DogsRNice_ joins |
| 21:21:05 | | iCaotix quits [Ping timeout: 265 seconds] |
| 21:22:32 | | DogsRNice quits [Ping timeout: 265 seconds] |
| 21:24:24 | | iCaotix joins |
| 21:51:13 | | katocala quits [Remote host closed the connection] |
| 22:20:15 | | katocala joins |
| 22:20:31 | | katocala is now authenticated as katocala |
| 22:22:46 | <h2ibot> | Vokunal edited ZOWA (+52, added source): https://wiki.archiveteam.org/?diff=50639&oldid=50612 |
| 22:39:36 | | wickedplayer494 joins |
| 22:39:46 | | wickedplayer494 is now authenticated as wickedplayer494 |
| 23:42:40 | <nulldata> | It's almost as if one big company spending billions in debt buying up all the other publishers and developers isn't such a great thing for the stability of everyone involved 🤔 |
| 23:43:23 | <flashfire42|m> | Just gave telegram a bit more work. Will queue more soon |
| 23:48:30 | | nicolas17 quits [Ping timeout: 265 seconds] |
| 23:53:03 | | nicolas17 joins |