| 00:06:04 | | jacobk quits [Ping timeout: 240 seconds] |
| 00:06:24 | | qwertyasdfuiopghjkl quits [Ping timeout: 265 seconds] |
| 00:24:09 | <Ryz> | Want this stuff to be arrrcccccccchived, website's been around since 1996 damnit <#>; |
| 00:24:15 | <Ryz> | Old loooooooot |
| 00:29:05 | | Guac joins |
| 00:31:48 | | wyatt8750 quits [Ping timeout: 255 seconds] |
| 00:32:39 | | wyatt8740 joins |
| 00:41:28 | | fuzzy8021 quits [Killed (NickServ (GHOST command used by fuzzy802!~fuzzy8021@173.224.25.67))] |
| 00:41:34 | | fuzzy8021 (fuzzy8021) joins |
| 00:52:06 | <Guac> | Anyone here know someone who tries to keep archives of prominent alt-right figures' online communications (including YouTube vids where applicable)? Trying to find a copy of a YouTube vid from 2017ish from an account that was suspended by 2019 |
| 01:06:22 | <joepie91|m> | usually searching for the raw youtube ID (without the rest of the URL) tends to turn up archives for me, or at least something like a title that I can use to find a copy elsewhere |
| 01:18:36 | | march_happy quits [Read error: Connection reset by peer] |
| 01:29:43 | | thetechrobo_ (TheTechRobo) joins |
| 01:53:38 | | jacobk joins |
| 01:59:03 | | march_happy (march_happy) joins |
| 02:04:03 | | le0n quits [Ping timeout: 255 seconds] |
| 02:40:04 | | k joins |
| 02:40:41 | | k quits [Remote host closed the connection] |
| 03:03:54 | | ThreeHM quits [Ping timeout: 255 seconds] |
| 03:04:14 | | ThreeHM (ThreeHeadedMonkey) joins |
| 03:18:03 | | michaelblob_ (michaelblob) joins |
| 03:21:40 | | michaelblob quits [Ping timeout: 265 seconds] |
| 04:02:54 | | Billy549 (Billy549) joins |
| 04:03:18 | | Stiletto quits [Ping timeout: 255 seconds] |
| 04:03:33 | <Billy549> | Good very early morning! Just wondering if any planned action is coming for GameFAQs? :) |
| 04:08:54 | <@OrIdow6> | I don't believe so Billy549 |
| 04:10:16 | <Billy549> | OrIdow6, I see - I know some people were trying to archive it directly through Web Archive, but seems GameFAQs already blocked the IPs |
| 04:11:01 | <@OrIdow6> | I don't think we've seen an indication of changes to the site yet |
| 04:11:51 | <Billy549> | That's a fair point, resources are spread too thin anyway I suppose - this just reminds me I need to set Warrior back up on my machine so gonna do that now. Thanks for the info at least :) |
| 04:15:18 | <@OrIdow6> | You're welcome |
| 04:16:05 | <@OrIdow6> | Please do tell us if the site (or any other sites) undergo substantial changes in the future etc. |
| 04:18:01 | <Billy549> | Of course :) |
| 04:19:35 | | Stiletto joins |
| 04:21:45 | | Billy549 quits [Client Quit] |
| 04:55:56 | <Guac> | Hm. Good bit of advice joepie91|m though it mostly turned up research papers that referenced the youtube link. It did also lead to a forum post that had some bits of description of part of the vid. |
| 05:21:04 | | mutantm0nkey quits [Remote host closed the connection] |
| 05:21:44 | | mutantm0nkey (mutantmonkey) joins |
| 05:27:27 | | mutantm0nkey quits [Ping timeout: 255 seconds] |
| 05:34:12 | | le0n (le0n) joins |
| 05:36:13 | | mutantm0nkey (mutantmonkey) joins |
| 06:25:04 | | qwertyasdfuiopghjkl joins |
| 07:05:21 | | pabs quits [Quit: Don't rest until all the world is paved in moss and greenery.] |
| 07:11:08 | | pabs (pabs) joins |
| 07:23:40 | | Guac quits [Remote host closed the connection] |
| 07:30:35 | | march_happy quits [Ping timeout: 265 seconds] |
| 07:31:20 | | march_happy (march_happy) joins |
| 07:48:00 | | march_happy quits [Read error: Connection reset by peer] |
| 07:48:42 | | march_happy (march_happy) joins |
| 07:57:44 | | BlueMaxima quits [Read error: Connection reset by peer] |
| 08:00:27 | | Arcorann quits [Ping timeout: 255 seconds] |
| 08:08:46 | | nikow3 joins |
| 08:09:42 | | nikow2 quits [Remote host closed the connection] |
| 08:12:26 | | Arcorann (Arcorann) joins |
| 08:19:24 | | march_happy quits [Ping timeout: 265 seconds] |
| 08:19:43 | | march_happy (march_happy) joins |
| 08:40:40 | | march_happy quits [Ping timeout: 265 seconds] |
| 08:41:13 | | march_happy (march_happy) joins |
| 08:46:48 | | march_happy quits [Ping timeout: 255 seconds] |
| 08:47:19 | | march_happy (march_happy) joins |
| 09:12:12 | | Mateon2 joins |
| 09:12:29 | | Gereon6200 (Gereon) joins |
| 09:12:36 | | Chris50104 (Chris5010) joins |
| 09:12:41 | | Jake1 (Jake) joins |
| 09:13:15 | | balrog quits [Quit: Bye] |
| 09:13:15 | | birdjj quits [Client Quit] |
| 09:13:15 | | Chris5010 quits [Client Quit] |
| 09:13:15 | | Gereon620 quits [Client Quit] |
| 09:13:15 | | Jake quits [Quit: Ping timeout (120 seconds)] |
| 09:13:15 | | Mateon1 quits [Remote host closed the connection] |
| 09:13:15 | | shoghicp quits [Excess Flood] |
| 09:13:15 | | Chris50104 is now known as Chris5010 |
| 09:13:15 | | Gereon6200 is now known as Gereon620 |
| 09:13:15 | | Mateon2 is now known as Mateon1 |
| 09:13:15 | | Jake1 is now known as Jake |
| 09:13:16 | | qwertyasdfuiopghjkl quits [Client Quit] |
| 09:13:20 | | birdjj joins |
| 09:13:27 | | Stiletto quits [Remote host closed the connection] |
| 09:13:31 | | shoghicp (shoghicp) joins |
| 09:14:03 | | balrog (balrog) joins |
| 09:17:56 | | Stiletto joins |
| 09:19:22 | | dm4v_ joins |
| 09:19:22 | | dm4v quits [Client Quit] |
| 09:19:36 | | dm4v_ is now known as dm4v |
| 09:25:11 | | Iki1 joins |
| 09:27:16 | | Iki quits [Ping timeout: 240 seconds] |
| 09:40:16 | | Jake quits [Client Quit] |
| 09:40:29 | | Jake (Jake) joins |
| 09:41:58 | | Jake quits [Client Quit] |
| 09:42:10 | | Jake (Jake) joins |
| 11:03:34 | | qwertyasdfuiopghjkl joins |
| 11:31:46 | | march_happy quits [Ping timeout: 265 seconds] |
| 11:32:14 | | march_happy (march_happy) joins |
| 12:26:24 | | march_happy quits [Ping timeout: 255 seconds] |
| 12:27:27 | | march_happy (march_happy) joins |
| 12:28:46 | | qw3rty_ joins |
| 12:31:42 | | qw3rty quits [Ping timeout: 265 seconds] |
| 13:02:42 | | Arcorann quits [Remote host closed the connection] |
| 13:33:16 | | jacobk quits [Ping timeout: 240 seconds] |
| 14:11:21 | | fenugrec joins |
| 14:15:38 | | shoghicp quits [Remote host closed the connection] |
| 14:15:55 | | shoghicp (shoghicp) joins |
| 14:17:06 | <thuban> | fenugrec: that wiki page is rather old (for one thing, we usually suggest https://github.com/ArchiveTeam/grab-site rather than wget these days). unfortunately, we don't currently have a good mechanism for bypassing that type of cloudflare protection--the best i can suggest is setting a browser user-agent and a generous timeout and hoping that it doesn't trigger. |
| 14:24:27 | <thuban> | we have definitely had Discussions™ on the increasing prevalence of this and the practicality of various possible workarounds, but nothing concrete yet. may i ask what forum? |
| 14:27:18 | <thuban> | (the message about timestamping is harmless--'-m' is shorthand for several options, including timestamping, but it's the others we care(d) about) |
| 14:28:15 | <fenugrec> | thuban, thanks. Will try grabsite later today. I might try through a proxy too although IME that usually triggers clownflare even more. The forum is forum.tek.com |
| 14:30:35 | <fenugrec> | they already bodged previous migrations / updates , there are many broken links already, so I'm fairly certain they will not even try to migrate it again to a new platform |
| 14:37:22 | | march_happy quits [Ping timeout: 265 seconds] |
| 14:38:11 | | march_happy (march_happy) joins |
| 14:38:26 | <thuban> | shutdown date is "before the end of the year", for reference https://forum.tek.com/viewtopic.php?f=583&p=291164 |
| 14:38:28 | <thuban> | (Jake: want to try that go crawler of yours?) |
| 14:42:41 | <h2ibot> | Switchnode edited Deathwatch (+183, /* 2022 */ add tektronix forums): https://wiki.archiveteam.org/?diff=49051&oldid=49049 |
| 14:43:20 | <betamax> | Do we have a way to archive specific sub-forums of a vBulletin forum? |
| 14:43:51 | <betamax> | The "Vintage Radio Forums" have a section for members to sell or give away equipment to each other |
| 14:44:03 | <betamax> | but it seems it will now be subject to a 90-day deletion rule: https://www.vintage-radio.net/forum/showthread.php?t=194887 |
| 14:44:45 | <betamax> | Any way that the specific sub-forum in question ( https://www.vintage-radio.net/forum/forumdisplay.php?f=27 ) can be archived? |
| 14:45:13 | <betamax> | This is going to happen imminently, if it hasn't already happened. |
| 14:48:33 | <thuban> | betamax: i don't think there's a _good_ way, but i've done it on xenforo by (iirc) spidering just the thread list pages for that subforum, ignoring everything else, and then extracting the threads from the list of ignored urls and queueing each individually with no-parent |
| 14:54:00 | <thuban> | ah, yep: https://hackint.logs.kiska.pw/archiveteam-bs/20210430#c283892 |
| 14:54:53 | <thuban> | vbulletin has equivalent url structure |
| 15:00:09 | | ivan (ivan) joins |
| 15:00:41 | <ivan> | Guac should've been linked https://findyoutubevideo.thetechrobo.ca/ |
| 15:03:00 | | jacobk joins |
| 15:21:45 | <thuban> | hmmmm, no it doesn't actually. (sorry, the forum i checked that i thought was vbulletin is actually also xenforo now.) its threads are 'showthread.php?t=threadid&page=pageid', so --no-parent won't prevent wpull from recursing into other threads or indeed most of the forum ui (you can manage the latter with aggressive ignores but not the former). |
| 15:24:29 | <thuban> | write a qwarc script, i guess? (if you're not too concerned about page requisites) |
| 15:24:47 | <@JAA> | betamax: Two-step process: run a crawl with tight ignores that only permits forumdisplay.php with f=27. Then extract thread links from that and retrieve those, either by building a list with all pages (I think vB always links the last page, so you can generate the missing page URLs) or in theory by further ignores that only permit the relevant thread IDs. |
| 15:25:11 | <@JAA> | Also, looks like the deletion already happened, oldest thread I'm shown is from July. |
| 15:34:02 | | mutantm0nkey quits [Remote host closed the connection] |
| 15:35:45 | | mutantm0nkey (mutantmonkey) joins |
| 15:40:19 | <thuban> | yeah, that's probably a better idea actually. still some scripting involved, but probably less, plus you get page requisites |
| 15:40:24 | <thuban> | (to be clear, when i said you 'can't' manage thread selection with ignores, i meant _a priori_) |
| 15:40:37 | <thuban> | i would just generate the missing pages, since that way you can use --1 and not have to worry about ignoring every other damn thing |
| 16:12:24 | | @OrIdow6 quits [Quit: Quitting.] |
| 16:12:45 | | OrIdow6 (OrIdow6) joins |
| 16:12:45 | | @ChanServ sets mode: +o OrIdow6 |
| 16:12:46 | | CraftByte quits [Quit: Ping timeout (120 seconds)] |
| 16:13:01 | | CraftByte (DragonSec|CraftByte) joins |
| 16:14:52 | | jacobk quits [Ping timeout: 240 seconds] |
| 16:16:48 | | march_happy quits [Ping timeout: 255 seconds] |
| 16:17:41 | | march_happy (march_happy) joins |
| 16:19:49 | | jacobk joins |
| 16:21:25 | <@JAA> | Yep, agreed. The scripting can be a relatively simple grep + awk or similar. In case there's Transfer-Encoding, `warc-tiny dump-responses` (from my little-things repo) handles that. |
| 16:22:30 | <thuban> | https://transfer.archivete.am/sEcYz/generate_vbulletin_pageurls.sh |
| 16:23:39 | | ivan leaves |
| 16:25:06 | <thuban> | (using gs-dump-urls, for convenience) |
| 16:25:39 | <@JAA> | Oh right, the URLs are in the DB anyway, yeah, that's even easier. :-) |
| 16:26:15 | | Stiletto quits [Ping timeout: 255 seconds] |
| 17:15:25 | | dm4v quits [Ping timeout: 265 seconds] |
| 17:27:27 | | tech_exorcist (tech_exorcist) joins |
| 17:28:54 | | dm4v joins |
| 17:49:30 | | sec^nd quits [Ping timeout: 255 seconds] |
| 17:49:42 | | mutantm0nkey quits [Remote host closed the connection] |
| 17:50:14 | | sec^nd (second) joins |
| 17:50:45 | | mutantm0nkey (mutantmonkey) joins |
| 17:56:15 | | jacobk quits [Ping timeout: 255 seconds] |
| 17:58:27 | | jacobk joins |
| 18:19:39 | | jacobk quits [Ping timeout: 255 seconds] |
| 18:41:44 | | michaelblob_ quits [Read error: Connection reset by peer] |
| 18:43:39 | | michaelblob (michaelblob) joins |
| 18:54:39 | | tech_exorcist_ (tech_exorcist) joins |
| 18:55:05 | | tech_exorcist quits [Remote host closed the connection] |
| 19:08:28 | | Gaelan quits [Ping timeout: 240 seconds] |
| 19:20:21 | <tzt> | https://www.theverge.com/2022/10/4/23387510/facebook-meta-bulletin-newsletter-substack-shutdown |
| 19:20:58 | <tzt> | bulletin.com is shutting down 'early 2023' |
| 19:23:58 | | Gaelan (Gaelan) joins |
| 19:38:51 | | sec^nd quits [Ping timeout: 255 seconds] |
| 19:41:26 | | sec^nd (second) joins |
| 19:45:20 | | jacobk joins |
| 19:50:04 | | jacobk quits [Ping timeout: 240 seconds] |
| 19:50:33 | | jacobk joins |
| 19:51:23 | | tech_exorcist_ quits [Remote host closed the connection] |
| 20:19:05 | | qwertyasdfuiopghjkl quits [Ping timeout: 265 seconds] |
| 20:25:39 | | geezabiscuit quits [Ping timeout: 255 seconds] |
| 20:39:41 | | geezabiscuit (geezabiscuit) joins |
| 21:16:03 | | sec^nd quits [Ping timeout: 255 seconds] |
| 21:20:00 | | sec^nd (second) joins |
| 21:35:10 | | BPCZ quits [Remote host closed the connection] |
| 21:36:06 | | BPCZ (BPCZ) joins |
| 21:43:04 | | sec^nd quits [Remote host closed the connection] |
| 21:44:56 | <@arkiver> | tzt: please add it to the deathwatch page! |
| 21:44:59 | | sec^nd (second) joins |
| 21:44:59 | <@arkiver> | or someone else ^\ |
| 21:45:03 | <@arkiver> | or someone else ^ |
| 21:46:59 | <@JAA> | Oh wait, '*by* early 2023', not 'in'. So we should move quickly on that one. |
| 21:47:13 | <h2ibot> | JustAnotherArchivist edited Deathwatch (+172, /* 2023 */ Add Bulletin): https://wiki.archiveteam.org/?diff=49052&oldid=49051 |
| 21:48:13 | <h2ibot> | JustAnotherArchivist edited Deathwatch (+3, /* Pining for the Fjords (Dying) */ Move…): https://wiki.archiveteam.org/?diff=49053&oldid=49052 |
| 21:49:13 | <h2ibot> | IDKhowToEdit edited Twitter (-72, Removes ChromeBot): https://wiki.archiveteam.org/?diff=49054&oldid=48258 |
| 21:49:14 | <h2ibot> | Usernam edited List of websites excluded from the Wayback Machine/Partial exclusions (+39): https://wiki.archiveteam.org/?diff=49055&oldid=49016 |
| 22:03:47 | <Jake> | thuban: yes. Will give it a shot tonight. |
| 22:18:36 | | thetechrobo_ is now known as TheTechRobo |
| 22:56:17 | | Ono joins |
| 22:56:43 | <Ono> | Does this work |
| 22:56:48 | | Arcorann (Arcorann) joins |
| 22:56:56 | | Ono quits [Remote host closed the connection] |
| 22:57:08 | | Ono joins |
| 22:57:17 | | Ono quits [Remote host closed the connection] |
| 22:57:34 | <@JAA> | oh no |
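
Note on the vBulletin discussion (15:24 to 16:25): the "generate the missing pages" step that JAA and thuban describe could look roughly like the sketch below. It is not the contents of generate_vbulletin_pageurls.sh, which is not reproduced in this log. It assumes you already have a plain list of URLs discovered while crawling the sub-forum listing, that thread links follow the 'showthread.php?t=threadid&page=pageid' shape thuban mentions, that the listing links each thread's last page, and that page 1 is addressed without a page parameter. The function name and the thread IDs in the usage example are made up for illustration.

```python
from urllib.parse import urlsplit, parse_qs

# Base URL taken from the sub-forum link quoted in the log; adjust for other forums.
BASE = "https://www.vintage-radio.net/forum/showthread.php"


def thread_page_urls(urls, base=BASE):
    """Return one URL per page for every thread found in `urls`.

    Assumption: the highest page number seen for a thread is its last page.
    """
    last_page = {}  # thread ID -> highest page number seen
    for url in urls:
        parts = urlsplit(url)
        if not parts.path.endswith("/showthread.php"):
            continue  # skip forumdisplay.php, member pages, etc.
        query = parse_qs(parts.query)
        thread = query.get("t", [""])[0]
        if not thread.isdecimal():
            continue  # skip p=postid style links and malformed IDs
        page = query.get("page", ["1"])[0]
        page_number = int(page) if page.isdecimal() else 1
        thread_id = int(thread)
        last_page[thread_id] = max(last_page.get(thread_id, 1), page_number)

    result = []
    for thread_id in sorted(last_page):
        # Assumption: page 1 has no page parameter.
        result.append(f"{base}?t={thread_id}")
        for page_number in range(2, last_page[thread_id] + 1):
            result.append(f"{base}?t={thread_id}&page={page_number}")
    return result


if __name__ == "__main__":
    # Hypothetical thread IDs, for illustration only.
    seen = [
        "https://www.vintage-radio.net/forum/forumdisplay.php?f=27",
        "https://www.vintage-radio.net/forum/showthread.php?t=100",
        "https://www.vintage-radio.net/forum/showthread.php?t=100&page=3",
        "https://www.vintage-radio.net/forum/showthread.php?t=42",
    ]
    for line in thread_page_urls(seen):
        print(line)
```

The resulting list can then be fetched one URL at a time without recursion, which is the point thuban makes about not having to ignore the rest of the forum UI.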