00:00:03dm4v quits [Client Quit]
00:01:35dm4v joins
00:01:37dm4v quits [Changing host]
00:01:37dm4v (dm4v) joins
00:33:16Wingy quits [Remote host closed the connection]
00:34:07Wingy (Wingy) joins
01:02:53dm4v quits [Client Quit]
01:03:25dm4v joins
01:03:27dm4v quits [Changing host]
01:03:27dm4v (dm4v) joins
01:43:06fuzzy8021 (fuzzy8021) joins
02:08:44<h2ibot>JustAnotherArchivist edited Political parties/Switzerland (+301, /* Freiheitliche Bewegung Schweiz FBS */ Add FBS): https://wiki.archiveteam.org/?diff=47308&oldid=47197
02:15:48wyatt8750 joins
02:16:46<h2ibot>JustAnotherArchivist edited Elections (+124, Add 2021-11-28 Swiss votes): https://wiki.archiveteam.org/?diff=47309&oldid=47170
02:18:23wyatt8740 quits [Ping timeout: 258 seconds]
02:25:41Wingy quits [Remote host closed the connection]
02:26:31Wingy (Wingy) joins
02:35:13lennier1 quits [Quit: Going offline, see ya! (www.adiirc.com)]
02:35:43lennier1 (lennier1) joins
02:48:52<h2ibot>JustAnotherArchivist edited Coronavirus (+84): https://wiki.archiveteam.org/?diff=47310&oldid=47207
02:48:53<h2ibot>JustAnotherArchivist created Elections/2021 November Swiss votes (+560, Initial skeleton page): https://wiki.archiveteam.org/?title=Elections/2021%20November%20Swiss%20votes
03:02:35qw3rty__ joins
03:06:18qw3rty_ quits [Ping timeout: 258 seconds]
03:10:33<pabs>http://libregraphicsmag.com/ ended in 2016 https://libregraphicsmag.com/2016/03/an-announcement/index.html
03:10:49<pabs>probably worth archiving if there hasn't been an archive run since then
03:13:00<pabs>the source code for each edition is on gitlab.com, but the blog and PDFs and wiki would be good to archive too
03:17:01qwertyasdfuiopghjkl joins
03:17:49<@JAA>Looks like the active WordPress server is already down and they converted it to a static page on Dec 2019. But yeah.
03:18:02<@JAA>in*
03:19:02<@JAA>Threw it into AB.
03:22:44<@JAA>An update on DemoDrop: the downloads are such a mess that it's not really worth the effort. I've been downloading the streams, but I'm currently not on track to grab everything in time because their server is either a potato or overloaded by others also trying to download everything. The response time of the playaback API is several seconds currently...
03:24:23<@JAA>I have an idea though that I'll try tomorrow.
03:25:53Wingy quits [Remote host closed the connection]
03:26:44Wingy (Wingy) joins
03:27:58tzt quits [Ping timeout: 252 seconds]
03:29:07tzt (tzt) joins
03:45:34tzt quits [Ping timeout: 252 seconds]
03:46:55TheTechRobo quits [Ping timeout: 265 seconds]
03:48:36TheTechRobo (TheTechRobo) joins
03:56:35tzt (tzt) joins
04:15:41TheTechRobo quits [Ping timeout: 258 seconds]
04:16:09TheTechRobo (TheTechRobo) joins
04:37:00qwertyasdfuiopghjkl6 joins
04:38:12qwertyasdfuiopghjkl quits [Ping timeout: 244 seconds]
04:40:13tzt quits [Ping timeout: 258 seconds]
05:01:45qwertyasdfuiopghjkl6 is now known as qwertyasdfuiopghjkl
05:12:34mutantmonkey quits [Remote host closed the connection]
05:12:34sec^nd quits [Remote host closed the connection]
05:13:24sec^nd (second) joins
05:13:53mutantmonkey (mutantmonkey) joins
05:22:22tzt (tzt) joins
05:39:56BlueMaxima quits [Client Quit]
08:42:06HackMii quits [Remote host closed the connection]
08:43:45HackMii (hacktheplanet) joins
09:12:09HackMii quits [Remote host closed the connection]
09:12:30HackMii (hacktheplanet) joins
09:14:56wizards joins
09:17:28HackMii quits [Remote host closed the connection]
09:17:43HackMii (hacktheplanet) joins
09:17:45wizards_ quits [Ping timeout: 258 seconds]
09:28:52<systwi>OrIdow6: I ran the `curl' command it gave me, and I appended "-o- > output.html". The result was a 2-byte file, simply "{}" and nothing more.
09:29:48wizards_ joins
09:32:02wizards__ joins
09:32:42wizards quits [Ping timeout: 258 seconds]
09:35:22wizards_ quits [Ping timeout: 252 seconds]
10:13:52kiskaLogBot quits [Ping timeout: 252 seconds]
10:21:26HackMii quits [Remote host closed the connection]
10:22:22HackMii (hacktheplanet) joins
10:26:08HackMii quits [Remote host closed the connection]
10:26:41HackMii (hacktheplanet) joins
10:28:54wizards joins
10:29:48kiskaLogBot joins
10:31:28wizards__ quits [Ping timeout: 265 seconds]
10:34:22kiskaLogBot quits [Ping timeout: 265 seconds]
10:34:32kiskaLogBot joins
10:51:47IDK (IDK) joins
10:54:24<IDK>Roblox is down
10:54:43<IDK>https://usercontent.irccloud-cdn.com/file/RlRMU9Vg/IMG_0458.jpeg
10:55:02<IDK>Looks like a serious bug or r u dead attack
11:12:11<Sanqui>"completely wreck a website and bring it down" nice flowery language, not much in terms of explanation lol
11:17:32<russss>to use the technical term, "rekd"
11:17:40qwertyasdfuiopghjkl quits [Remote host closed the connection]
11:21:28<IDK>The word DDoS just explained it
11:21:39<IDK>I love watching roblox going down
11:21:49<IDK>Especially those big ones
11:23:11<IDK>https://usercontent.irccloud-cdn.com/file/9slD0Yg4/image.png
11:23:46<IDK>a server error is the end of roblox
11:36:23<@OrIdow6>systwi: Sounds like an empty JSON response
11:36:49<@OrIdow6>Maybe -v on curl could help you? Response codes as well as verify headers etc are being passed correctly
11:37:08<@OrIdow6>If it genuinely fails on CURL they could be looking at TLS or something like that
11:39:30<@OrIdow6>Sanqui: Yeah, doesn't exactly make me the most confident in it, haha
11:39:31<h2ibot>Sanqui created Webzone.ee (+449, Created page with "{{Infobox project | title =…): https://wiki.archiveteam.org/?title=Webzone.ee
11:39:53<@OrIdow6>Has a bit of a "they used coding and algorithms so the drones don't crash into each other" vibe to me
11:40:06<Sanqui>nice timing OrIdow6
11:40:10<Sanqui>two pings one second apart :P
11:40:37<@OrIdow6>I'm psychick
11:41:31<Sanqui>my search engine script is evolving, I can now loop over search engines, search terms, and domains of interest, then merge it all and derive parent urls
11:43:24<@OrIdow6>Do you have problems with the Google robot detection? I hear of that a lot
11:43:31<Sanqui>I'm scraping Bing and Yahoo at the moment
11:43:35<@OrIdow6>Oh
11:43:51<Sanqui>probably gonna try google in a bit too
11:44:06<Sanqui>I'm using this one https://github.com/tasos-py/Search-Engines-Scraper/
11:44:16<Sanqui>which works surprisingly well, for being an obscure project
11:52:27<@OrIdow6>Guess you've gotten lucky
11:53:45<Sanqui>I'm at 2403 webzone.ee URLs :D
12:06:36<h2ibot>Sanqui edited Webzone.ee (+53): https://wiki.archiveteam.org/?diff=47313&oldid=47312
12:11:24katocala quits [Remote host closed the connection]
12:44:57katocala joins
12:47:52qwertyasdfuiopghjkl joins
12:49:05Wingy quits [Remote host closed the connection]
12:50:07Wingy (Wingy) joins
12:58:00katocala quits [Remote host closed the connection]
13:08:22katocala joins
13:17:35qwertyasdfuiopghjkl quits [Client Quit]
13:17:58qwertyasdfuiopghjkl joins
13:30:16HP_Archivist (HP_Archivist) joins
13:58:06Jonboy3451 joins
13:59:30Arcorann quits [Ping timeout: 258 seconds]
13:59:34<IDK>https://www.washingtonpost.com/technology/2021/10/28/facebook-meta-name-change/
14:00:03<IDK>Facebook is changing its name to Meta???
14:01:42<h3ndr1k>Maybe the same as the Google / Alphabet split.
14:01:48Jonboy345 quits [Ping timeout: 258 seconds]
14:02:44<IDK>idk really
14:09:47Mateon2 joins
14:11:28Mateon1 quits [Ping timeout: 252 seconds]
14:11:28Mateon2 is now known as Mateon1
14:26:40Iki joins
14:56:45paul2520 (paul2520) joins
15:40:06Iki quits [Remote host closed the connection]
16:01:40AlsoHP_Archivist joins
16:04:05HP_Archivist quits [Ping timeout: 258 seconds]
16:13:29AlsoHP_Archivist quits [Client Quit]
16:13:47HP_Archivist (HP_Archivist) joins
16:19:25HP_Archivist quits [Ping timeout: 258 seconds]
17:05:03Wingy quits [Remote host closed the connection]
17:05:55Wingy (Wingy) joins
17:28:11<@JAA>Facebook, Inc. is changing its name. Facebook the website will remain Facebook, I bet.
17:28:17<@JAA>I.e. yes, it's like Google/Alphabet.
17:31:39<@JAA>systwi (Cc OrIdow6): IPv4 vs IPv6 is also one to look out for that I've been bitten by before. Add -4 or -6 to the curl command to force it to connect with the same protocol as the browser. You might want to use --resolve to force it to use the same IP as well. There are also plenty of ways how such repeated requests could get blocked, like constantly changing cookies/tokens. Then it becomes a real
17:31:45<@JAA>pain.
17:36:28HP_Archivist (HP_Archivist) joins
18:04:33Wingy quits [Remote host closed the connection]
18:05:24Wingy (Wingy) joins
18:07:11paul2520 quits [Remote host closed the connection]
18:20:03Wingy quits [Remote host closed the connection]
18:20:54Wingy (Wingy) joins
18:42:08Wingy quits [Client Quit]
18:44:04Wingy (Wingy) joins
18:46:56Wingy quits [Remote host closed the connection]
18:47:42Wingy (Wingy) joins
19:01:30Wingy quits [Remote host closed the connection]
19:12:07LeGoupil joins
19:18:53Wingy (Wingy) joins
19:37:28<TheTechRobo>JAA: What,s the difference between wget, wget-at, and wpull? Why wget-at instead of wpull?
19:37:58<TheTechRobo>Tell me if I'm asking too many questions, I'm just curious :-)
19:38:40<@JAA>TheTechRobo: wget-at = modified wget with Lua hooks, zstd compression, and other stuff for customising crawls. wpull = complete reimplementation of wget's behaviour in Python, also highly customisable.
19:39:26<@JAA>wpull's code is currently in a pretty sorry state. Doesn't even support Python 3.7, for example. There are countless bugs as well. I've been meaning to tackle that for some time, but other things keep coming up.
19:39:37<@JAA>Since Python 3.6 will reach EOL at the end of the year, it's getting more urgent though.
19:39:54Wingy quits [Ping timeout: 258 seconds]
19:40:29<@JAA>ArchiveBot uses wpull, and it works only because of how it's used. Straight wpull 2.x is virtually unusable.
19:40:49<@JAA>grab-site uses a fork of wpull, by the way, with various changes that aren't backward-compatible.
20:00:58Wingy (Wingy) joins
20:20:20Wingy quits [Read error: Connection reset by peer]
20:21:06Wingy (Wingy) joins
20:39:25TheTechRobo3641 joins
20:43:09TheTechRobo quits [Ping timeout: 258 seconds]
21:07:47Wingy quits [Remote host closed the connection]
21:08:32Wingy (Wingy) joins
21:30:32LeGoupil quits [Remote host closed the connection]
21:40:21driib71 (driib) joins
21:43:34driib7 quits [Ping timeout: 252 seconds]
21:43:34driib71 is now known as driib7
21:55:18<@JAA>So on DemoDrop, turns out that it's the actual stream server (actually a proxy to AWS S3 or similar) which is the limiting factor. No way around it unless the load (from other mass-downloaders?) reduces.
21:55:39TheTechRobo3641 is now known as TheTechRobo
21:56:34<@JAA>My current estimate is that I'd manage about 85% of the streams if the rate stays constant.
21:56:58<TheTechRobo>Any way us mere mortals can help? :-)
21:57:20<@JAA>Only the server operator can do anything to help this.
21:57:45<TheTechRobo>Well, if I can help out, ping me
22:20:04Wingy quits [Remote host closed the connection]
22:20:54Wingy (Wingy) joins
23:16:27Matthww quits [Quit: Ping timeout (120 seconds)]
23:16:43Matthww joins
23:21:03BlueMaxima joins
23:25:06Arcorann (Arcorann) joins
23:35:05Wingy quits [Remote host closed the connection]
23:35:53Wingy (Wingy) joins
23:49:13qwertyasdfuiopghjkl quits [Remote host closed the connection]