00:24:33<protondonor>wondering if I should set up a tracker for a website near and dear to my heart even though it doesn't have any signs of going down soon
00:24:49<TheTechRobo>what site?
00:24:52<protondonor>conworkshop.com
00:24:56<protondonor>needs a login to view anything
00:25:33<TheTechRobo>Login-walls are beyond Archive Team's scope, IIRC, so you'd need to do archival yourself
00:25:51<TheTechRobo>(or with others, but it wouldn't be an official project afaik)
00:26:06<protondonor>oh interesting, so there's no way to give a tracker login access?
00:27:30<TheTechRobo>With archiveteam it's _possible_ but not normally done
00:27:57<TheTechRobo>Things like accounts getting banned, and the Wayback Machine doesn't really ingest logged-in sites
00:28:24<protondonor>I'm part of the mod team so I could definitely talk to the rest of the mods about an alternate way of archiving/preserving the website
00:28:30<@JAA>We've even done it before in special cases (SPUF comes to mind), but generally, we only archive things that are publicly accessible.
00:28:59<protondonor>it's just gone down unexpectedly before leaving a lot of users in the lurch
00:29:06<TheTechRobo>SPUF?
00:29:52<@JAA>Steam Powered User Forums
00:29:58<TheTechRobo>Ah
00:30:05<TheTechRobo>It wasn't ingested into the WBM, was it?
00:30:11<TheTechRobo>I've never seen auth'ed stuff in there
00:30:18<@JAA>it was.
00:30:18C4K3 joins
00:30:23<TheTechRobo>Huh
00:30:28<TheTechRobo>TIL
00:30:46<@JAA>The forums were taken down before the shutdown date, but we were given an account to archive the rest of it. We still only archived things that were publicly accessible prior to the shutdown though.
00:31:29<protondonor>hmm. so would it be possible to give the WBM an account to archive the site or is that only for really special cases?
00:32:25<protondonor>oh wait there is actually stuff you could view without a login
00:33:36Arcorann quits [Ping timeout: 265 seconds]
00:35:49Arcorann (Arcorann) joins
00:36:59<@JAA>TheTechRobo: Random logged-in example (see top right of the page): https://web.archive.org/web/20170601231440/http://forums.steampowered.com/forums/showthread.php?mode=hybrid&t=2058997
00:40:44<protondonor>I guess I could back up all the things you can view without a login through WBM
00:40:47<protondonor>if it doesn't have them already
00:41:31<protondonor>yeah it's got em. even saved the fonts which is nice
00:42:24<protondonor>unrelated question, anyone got advice on archiving a wiki with 100k+ images? the job keeps stopping in the middle due to network flakes and doesn't resume well
00:44:49Jonboy345 joins
01:00:47chrismeller (chrismeller) joins
01:01:00chrismeller quits [Remote host closed the connection]
01:01:23chrismeller (chrismeller) joins
01:01:33jacobk quits [Ping timeout: 252 seconds]
01:03:46sonick quits [Client Quit]
01:07:31lennier1 quits [Client Quit]
01:07:56protondonor quits [Ping timeout: 244 seconds]
01:07:59lennier1 (lennier1) joins
01:12:20Jonboy345 quits [Read error: Connection reset by peer]
01:33:49bonga quits [Ping timeout: 252 seconds]
01:35:50sallysue quits [Ping timeout: 244 seconds]
01:40:02Iki1 joins
01:43:43Iki quits [Ping timeout: 252 seconds]
02:00:30pabs quits [Quit: Don't rest until all the world is paved in moss and greenery.]
02:06:43pabs (pabs) joins
02:08:18jacobk joins
02:09:18eroc1990 quits [Ping timeout: 265 seconds]
02:25:01eroc1990 (eroc1990) joins
02:32:36protondonor joins
03:19:06lennier1 quits [Ping timeout: 240 seconds]
03:20:02BurritoSOFT (BurritoSOFT) joins
03:20:37lennier2 joins
03:24:20lennier2 is now known as lennier1
03:33:26ThreeHM quits [Ping timeout: 240 seconds]
03:33:49ThreeHM (ThreeHeadedMonkey) joins
03:34:20sallysue joins
03:36:57march_happy quits [Remote host closed the connection]
03:39:13jacobk quits [Ping timeout: 252 seconds]
03:43:26march_happy (march_happy) joins
03:48:06protondonor quits [Ping timeout: 244 seconds]
03:50:06ThreeHM quits [Ping timeout: 240 seconds]
03:50:28ThreeHM (ThreeHeadedMonkey) joins
04:20:01<@OrIdow6>I thought I said it, but I never did: it looked like Duolingo forums's size was on the order of magnitude of the highest ID, but I was unable to tell exactly how large it was, since response code weirdness obscured whether the whole space was used
04:20:15<@OrIdow6>Used up to that point
04:21:48<@JAA>Is the 'response code weirdness' the same thing I described ~12 hours ago? HTTP 500 on comments, HTTP 200 on discussions.
04:23:28<@JAA>As far as I can tell, pretty much the entire range is in use, but since there are more comments than discussions, most IDs don't actually work.
04:25:17protondonor joins
04:28:30<@OrIdow6>From my terminal logs there were 404s, 500s, and 200s, what the 404s mean I don't remember
04:29:08<@OrIdow6>THe weirdness was that either 404s or 500s sometimes (seemingly randomly) came in as 200, and the body was a JSON object explaining the error
04:30:35<@OrIdow6>Or maybe it was just the straight error and not a JSON object
04:31:43<@OrIdow6>I only got a few examples (it looks like) for false 200s above the range (normally 404s), but my histograms of response codes for lower IDs suggested it was happening elsewhere
04:35:23<@OrIdow6>The moral of this story is that whatever scans these IDs and makes warcs should check the body
04:40:07<@JAA>I don't suppose you have any examples of the actual responses in those edge cases?
04:40:43jacobk joins
04:41:14<@OrIdow6>As far as I can tell, no
04:43:09<@OrIdow6>Trying to run something to get an example npow
04:47:39<@OrIdow6>https://forum-api.duolingo.com/comments/61157435?_=1643330843259 - 500 - body "500 Internal Server Error" - now gives a normal 404, came in a spate with several others
04:48:04<@OrIdow6>Not status code mismatch but something
04:49:32<@JAA>Mhm, although that's outside the range.
04:49:47<@JAA>IDs go to almost 56 million so far.
04:51:15<@JAA>On another note, does anyone know where the 'sentence discussions' are? Also just normal discussions in the forums or is that something separate?
05:00:52<protondonor>I think those may be on individual sentences
05:01:10<protondonor>I haven't used duolingo in a while but I recall there being a discussion on each example/test sentence
05:05:58<@OrIdow6>I know, I'm checking outside of the range to get what should be 404s
05:06:07sallysue quits [Ping timeout: 244 seconds]
05:06:18<@JAA>Ah, right.
05:12:04<@JAA>protondonor: Yeah, you're right, there's a 'discuss' button on every question/response. And yeah, it simply leads to a normal-looking forum discussion.
05:13:35<@JAA>Well, except that comments are overlayed with a 'Create a profile to join in on discussions!' prompt, but the content is there and it works exactly the same way on the backend as far as I can see.
05:25:09idkwhatiwadfklj joins
05:26:39idkwhatiwadfklj quits [Remote host closed the connection]
05:42:04DogsRNice quits [Read error: Connection reset by peer]
06:33:27BlueMaxima quits [Read error: Connection reset by peer]
08:51:37chrismeller quits [Read error: Connection reset by peer]
08:54:50chrismeller (chrismeller) joins
09:15:27march_happy quits [Ping timeout: 252 seconds]
09:15:56march_happy (march_happy) joins
11:29:18rellem (chrismeller) joins
11:32:57chrismeller quits [Ping timeout: 252 seconds]
11:36:01chris__ (chrismeller) joins
11:40:07rellem quits [Ping timeout: 265 seconds]
11:59:17chris__ quits [Client Quit]
11:59:34chrismeller (chrismeller) joins
12:05:44thuban quits [Ping timeout: 265 seconds]
12:09:01thuban joins
12:35:05sonick (sonick) joins
12:39:04c41e3a (c41e3a) joins
12:41:50c41e3a_ (c41e3a) joins
12:43:26c41e3a quits [Ping timeout: 240 seconds]
13:10:03fds joins
13:16:29AnotherIki joins
13:19:41Iki1 quits [Ping timeout: 265 seconds]
13:26:56Arcorann quits [Ping timeout: 265 seconds]
13:27:16c41e3a_ quits [Client Quit]
13:50:55qwertyasdfuiopghjkl joins
14:11:08Jonboy345 joins
14:53:36fds quits [Remote host closed the connection]
14:55:48Jonboy345 quits [Read error: Connection reset by peer]
14:58:38Jonboy345 joins
15:13:41Iki1 joins
15:15:16c41e3a (c41e3a) joins
15:16:15AnotherIki quits [Ping timeout: 252 seconds]
15:36:02c41e3a quits [Client Quit]
16:02:34chrismeller quits [Ping timeout: 265 seconds]
16:03:12Jonboy345 quits [Client Quit]
16:30:00Hackerpcs quits [Quit: Hackerpcs]
16:32:17Hackerpcs (Hackerpcs) joins
16:43:06march_happy quits [Ping timeout: 240 seconds]
16:55:46wyatt8740 quits [Ping timeout: 240 seconds]
16:56:47wyatt8740 joins
17:11:06thuban quits [Ping timeout: 240 seconds]
17:17:36thuban joins
17:23:51thuban quits [Ping timeout: 252 seconds]
17:25:23lennier1 quits [Client Quit]
17:26:33Jonboy345 joins
17:27:41lennier1 (lennier1) joins
17:28:18thuban joins
17:46:22HP_Archivist (HP_Archivist) joins
17:55:57Jonboy345 quits [Read error: Connection reset by peer]
18:05:29chrismeller (chrismeller) joins
18:05:42chrismeller quits [Remote host closed the connection]
18:06:49chrismeller (chrismeller) joins
18:07:12chrismeller quits [Remote host closed the connection]
18:08:19chrismeller (chrismeller) joins
18:08:42chrismeller quits [Remote host closed the connection]
18:09:49chrismeller (chrismeller) joins
18:10:12chrismeller quits [Remote host closed the connection]
18:10:35chrismeller (chrismeller) joins
18:55:26Jonboy345 joins
19:04:26chrismeller quits [Ping timeout: 240 seconds]
19:05:19Jonboy3451 joins
19:07:26Jonboy345 quits [Ping timeout: 240 seconds]
19:10:55VerifiedJ quits [Quit: The Lounge - https://thelounge.chat]
19:11:41VerifiedJ (VerifiedJ) joins
19:21:51Jonboy3451 quits [Client Quit]
19:40:54<IDK>is there a place where I can collabaratly archive discord messages
19:41:02<IDK>like a discord database
19:48:09<AK>#discard, not sure what really happened in there though
19:50:54<IDK>AK: I dont think the project active tho
19:51:10<@JAA>That's the channel for anything Discord-related, anyway.
19:56:35dffghfdghfg joins
19:56:49DogsRNice (Webuser299) joins
19:56:52dffghfdghfg quits [Remote host closed the connection]
20:12:27viridian (c41e3a) joins
20:27:55jacobk quits [Ping timeout: 265 seconds]
20:33:39viridian quits [Changing host]
20:33:39viridian (c41e3a) joins
20:36:17viridian quits [Changing host]
20:36:17viridian (c41e3a) joins
20:36:29viridian quits [Changing host]
20:36:29viridian (c41e3a) joins
20:36:36viridian quits [Changing host]
20:36:36viridian (c41e3a) joins
20:46:54jacobk joins
20:53:16viridian quits [Changing host]
20:53:16viridian (viridian) joins
21:09:39viridian quits [Client Quit]
21:18:59randomuser62342 joins
21:43:07tymscar joins
21:43:40<tymscar>hello. Just found out about this. Apparently there is a huge backup of xfire data
21:43:57<tymscar>I was an avid user of the service when I was a kid, so it would mean the world to me to get back some of my videos
21:44:35<tymscar>I found the internet archive but is there any better way to actually look throuhg it without downloading terrabytes of videos to see which one is mine?
21:44:44<tymscar>I did find my username in the list of usernames
21:46:49BlueMaxima joins
21:50:06<@OrIdow6>tymscar: Looking at the wiki, your name in that list doesn't mean your user page was saved, that was for internal use
21:50:12<@OrIdow6>Try https://web.archive.org/web/20140910184849/http://social.xfire.com/users/{your username} ?
21:50:13march_happy (march_happy) joins
21:50:30<tymscar>oh, ill check that
21:50:55<tymscar>sadly theres nothing there
21:51:44<tymscar>`This page is not available on the web
21:52:33<@OrIdow6>Wiki claims there was "no time" to get the user profiles, so scanning might be your only option
21:52:51<tymscar>how would I do that?
21:53:37<@OrIdow6>Look through the warcs, possibly parse out the username listing on videos and screenshots (or just use the whole page), match against your username
21:54:32<tymscar>but for me to look through them, do I need to download them
21:54:41<@OrIdow6>Yes
21:55:39<tymscar>ok, thank you. sadly, im not sure if thats even possible. theyre 25gig files, and even with my 1gig internet, it downloads less than 10kbps
21:56:47<@rewby>Was there anything in the URL that identified the username?
21:57:06<@OrIdow6>rewby: From what I've seen they're like http://crash.xfire.com/video/52aa7a/
21:57:15<@rewby>Ah, yeah that sucks
21:57:34<tymscar>If they wouldve announced it 1 month before I wouldve saved
21:57:45<tymscar>but I found out the site is gone 2 hours after it closed
21:57:51<tymscar>and I had thousands of videos
21:58:00<@rewby>If you know the urls, you can get them easily... Maybe bookmarks from somewhere?
21:58:36<tymscar>well I stopped using xfire and moved to skype in 2009
21:58:45<tymscar>theres like no chance of having any bookmarks ;(
22:00:04<@rewby>Hm
22:00:19<@rewby>Maybe the wayback machine's search engine can pull something up?
22:01:27<tymscar>I tried my username, but the archize ar gzipped
22:01:31<tymscar>so sadly not
22:03:50<@rewby>That's the IA search
22:03:55<@rewby>But I think the wbm has its own search engine
22:04:00<@rewby>Which looks inside of the archives
22:04:11Stiletto quits [Remote host closed the connection]
22:05:08<tymscar>do you have a link?
22:05:44<tymscar>https://archive.org/web/
22:05:45<tymscar>this one?
22:06:22<tymscar>https://web.archive.org/web/*/http://classic.xfire.com/profile/timichiciy*
22:06:24<tymscar>sadly nothing
22:07:23Arcorann (Arcorann) joins
22:08:10superkuh quits [Remote host closed the connection]
22:09:01superkuh joins
22:09:35superkuh_ joins
22:11:57Stiletto joins
22:13:31superkuh quits [Ping timeout: 252 seconds]
22:15:33tymscar quits [Remote host closed the connection]
22:16:37randomuser62342 quits [Remote host closed the connection]
22:29:30DogsRNice_ (Webuser299) joins
22:29:39jacobk quits [Ping timeout: 252 seconds]
22:31:26DogsRNice quits [Ping timeout: 240 seconds]
22:31:49Iki1 quits [Read error: Connection reset by peer]
22:39:23dvd (dvd) joins
23:07:03HP_Archivist quits [Ping timeout: 252 seconds]
23:21:57Iki joins
23:46:11jacobk joins
23:56:43daxxy quits [Ping timeout: 265 seconds]