00:06:28<gamer191-1|m>If you take out "&asset_types=videos", then I assume it will also include photos, documents and campaign materials?
00:06:39<pokechu22>Yes
00:07:32<pokechu22>also looks like https://www.farmtransparency.org/gallery?asset_types=videos&p=2 works for pagination compared to https://www.farmtransparency.org/gallery?asset_types=videos and doesn't require that header, so I can just throw those in and discover things that way
00:17:28CuppyMan quits [Quit: Connection closed for inactivity]
00:34:16useretail quits [Quit: Leaving]
00:47:46<pokechu22>JAA: would qwarc be usable to grab things like https://www.farmtransparency.org/scripts/asset-display?p=3&asset_types=videos and https://www.farmtransparency.org/scripts/asset-popup?id=skfjn8ldvu28i3z8wrin&type=photos with X-Requested-With: XMLHttpRequest while *not* extracting embedded images/outlinks? (Those URLs seem necessary to navigate the site with javascript but have
00:47:48<pokechu22>alternative forms that archivebot can process)
00:51:43Guest58 quits [Quit: My Mac has gone to sleep. ZZZzzz…]
00:53:00Guest58 joins
00:57:12<@JAA>pokechu22: Sure. qwarc does no extraction whatsoever on its own.
01:08:30ATinySpaceMarine joins
01:10:44ATinySpaceMarine quits [Read error: Connection reset by peer]
01:10:54ATinySpaceMarine joins
01:15:15<pokechu22>JAA: https://transfer.archivete.am/T3y6Q/www.farmtransparency.org_scripts_asset-display_urls.txt
01:15:16<eggdrop>inline (for browser viewing): https://transfer.archivete.am/inline/T3y6Q/www.farmtransparency.org_scripts_asset-display_urls.txt
01:16:07<pokechu22>will prepare a list of asset-popup URLs after the job finishes
01:18:54<pabs>magmaus3: can you put the kitsunes.club/etc shutdowns into Deathwatch?
01:20:18<cruller>For now, I extracted links from https://www.farmtransparency.org/gallery? and https://www.farmtransparency.org/scripts/asset-display?p={2..294}& : https://transfer.archivete.am/11qwn6/Farm_Transparency_Repository_urls.txt
01:20:18<eggdrop>inline (for browser viewing): https://transfer.archivete.am/inline/11qwn6/Farm_Transparency_Repository_urls.txt
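(Editor's sketch, not from the log.) The page range cruller describes, p=2 through 294, can be enumerated with a few lines of Python. The function name `asset_display_urls` and its parameters are hypothetical. The endpoint, the `asset_types` parameter, and the `X-Requested-With` header come from the messages above. Nothing here touches the network.

```python
from urllib.parse import urlencode

# Header pokechu22 reports is needed for the /scripts/ endpoints.
XHR_HEADERS = {"X-Requested-With": "XMLHttpRequest"}

BASE = "https://www.farmtransparency.org/scripts/asset-display"


def asset_display_urls(first_page=2, last_page=294, asset_types=None):
    """Build the paginated asset-display URLs (hypothetical helper).

    The default page range mirrors the {2..294} range quoted in the log;
    the real upper bound may change as the site changes.
    """
    urls = []
    for page in range(first_page, last_page + 1):
        params = {"p": page}
        if asset_types:
            params["asset_types"] = asset_types
        urls.append(BASE + "?" + urlencode(params))
    return urls
```

Calling `asset_display_urls(3, 3, "videos")` yields the same URL pokechu22 quoted earlier for page 3 of the videos listing.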
01:21:11<pokechu22>I started a job using https://transfer.archivete.am/4N1aQ/www.farmtransparency.org_seed_urls.txt which should have archivebot extract those itself (and I also included some basic faceting in that)
01:21:13<eggdrop>inline (for browser viewing): https://transfer.archivete.am/inline/4N1aQ/www.farmtransparency.org_seed_urls.txt
01:21:41Island joins
01:25:12ATinySpaceMarine quits [Client Quit]
01:25:23<pokechu22>the videos part is slightly annoying - it would require reading the page HTML to determine the resolution (e.g. <a href="javascript:void(0);" class="download" data-size="source_2704">Source (2.44gb)</a> on https://www.farmtransparency.org/videos?id=p7tyen34at but <a href="javascript:void(0);" class="download" data-size="source_1280">Source (8.71mb)</a> on
01:25:25<pokechu22>https://www.farmtransparency.org/videos?id=l5b22836ku). The POST to https://www.farmtransparency.org/videos?id=p7tyen34at&action=download then redirects to the video download (which I'm not sure about the rate-limiting of).
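(Editor's sketch, not from the log.) Reading the `data-size` value out of a video page, as described above, could look roughly like this with the Python standard library. The names `DownloadSizeParser` and `source_size` are hypothetical. The sketch assumes the download links always look like the two `<a class="download" data-size="...">` examples quoted in the message and that the wanted value starts with `source_`, which the log itself treats as uncertain.

```python
from html.parser import HTMLParser


class DownloadSizeParser(HTMLParser):
    """Collect data-size values from <a class="download"> elements."""

    def __init__(self):
        super().__init__()
        self.sizes = []

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        classes = (attributes.get("class") or "").split()
        if tag == "a" and "download" in classes and attributes.get("data-size"):
            self.sizes.append(attributes["data-size"])


def source_size(html):
    """Return the first data-size beginning with 'source_', or None."""
    parser = DownloadSizeParser()
    parser.feed(html)
    for size in parser.sizes:
        if size.startswith("source_"):
            return size
    return None
```

Returning None rather than guessing leaves room to handle pages without a source download separately.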
01:25:34ATinySpaceMarine joins
01:28:17<pokechu22>I *think* that if we generate WARCs of the POST to the download at source resolution, that would replay properly on web.archive.org (assuming it ignores POST data, and just grabs whatever POST happened at that URL). Though the redirect aspect would also be inconvenient.
01:29:25<@JAA>I believe that's what the WBM does for POST, yeah.
01:29:42<pokechu22>It seems like the video URLs are valid for 24 hours, and I don't think it's tied to a specific IP
01:30:00<pokechu22>so the POST part could happen on one machine and then archivebot could handle downloading
01:31:52andrew (andrew) joins
01:32:36<@JAA>Ugh, it opens a new tab, I hate that.
01:33:31<pokechu22>Yeah - you can get devtools open by using work offline (see 23:35) but it's still awkward
01:34:14<@JAA>You can also interrupt the page loading and then trigger it again via the dev tools.
01:34:19<@JAA>But yeah
01:35:38<pokechu22>the relevant headers seem to be the cookie 2620368ghwahw90w and the csrf_token post data, both of which can come from any page load (including the main page) and don't change, or at least last for a while
01:37:16<pokechu22>curl --verbose 'https://www.farmtransparency.org/videos?id=p7tyen34at&action=download' -H 'Cookie: 2620368ghwahw90w=XXXX' --data-raw 'size=source_2704&source=full&csrf_token=YYYY' and then the location header
01:37:36<@JAA>Yeah
01:38:51<@JAA>And yeah, looks like 24 hours validity, and they do work from another IP.
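(Editor's sketch, not from the log.) The curl command above maps onto a POST request like the one built below. The function name `build_download_request` is hypothetical. The cookie name, form fields, and URL shape are copied from pokechu22's command, with XXXX and YYYY remaining placeholders. The sketch only constructs the request. Note that `urllib.request.urlopen` follows redirects by default, so reading the `Location` header would need a handler that stops at the redirect; that part is left out here.

```python
import urllib.parse
import urllib.request

COOKIE_NAME = "2620368ghwahw90w"  # cookie name as quoted in the log


def build_download_request(video_id, size, cookie_value, csrf_token):
    """Build (but do not send) the POST that triggers the download redirect."""
    query = urllib.parse.urlencode({"id": video_id, "action": "download"})
    url = "https://www.farmtransparency.org/videos?" + query
    body = urllib.parse.urlencode(
        {"size": size, "source": "full", "csrf_token": csrf_token}
    ).encode("ascii")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Cookie": COOKIE_NAME + "=" + cookie_value},
        method="POST",  # a GET only redirects back to the video page
    )
```

For example, `build_download_request("p7tyen34at", "source_2704", "XXXX", "YYYY")` reproduces the request from the curl line.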
01:39:20<pokechu22>JAA: if I got a list of sizes with videos, is there a format for that which would be easy for you to use to POST for the redirects?
01:41:09BennyOtt quits [Ping timeout: 260 seconds]
01:42:34<@JAA>A list of 'videoid size' would work. Or I could extract the size value from the video pages if it's easy to tell which one we want (always source_*?).
01:43:22<@JAA>asset-display is done.
01:44:12<pokechu22>oh, I was thinking I needed to wait for archivebot to finish to make a list for asset-popup but I can just use cruller's list
01:44:37<pokechu22>I'm thinking always source_ but one video had 720p be bigger than source_
01:44:46<@JAA>I guess I can extract it from the asset-display WARC, too?
01:44:57<pokechu22>Oh, yeah
01:45:06<@JAA>Yeah, I'm wondering whether that is a reencode with different settings.
01:46:56<@JAA>I get 1258 video IDs from the asset-display WARC, so I guess that's correct.
01:47:50<pokechu22>ffprobe says the source video is "h264 (High) (avc1 / 0x31637661), yuv420p, 1280x720 [SAR 1:1 DAR 16:9], 1825 kb/s, 25 fps, 25 tbr, 12800 tbn, 50 tbc (default)" while the 720p one is "h264 (High) (avc1 / 0x31637661), yuv420p(tv, bt709), 1280x720, 2187 kb/s, 25 fps, 25 tbr, 25 tbn, 50 tbc (default)", which is a higher bitrate but different in other ways I don't understand
01:50:18<gamer191-1|m>So I guess either the site is wrong about which one is the source, or the 720p one has been upscaled
01:50:21<gamer191-1|m>Which video is it btw?
01:51:53<@JAA>It's not upscaled; they're both 720p.
01:52:18<pokechu22>https://www.farmtransparency.org/videos?id=l5b22836ku
01:52:37<@JAA>But reencoded with better settings perhaps, which means it'll encode the noise and artefacts from the 'source'.
01:53:06<@JAA>The source file has the encoder settings metadata stripped, sadly.
01:54:57<pokechu22>It's also possible that the original news broadcast spent longer encoding it better while vimeo's reencode was done faster and thus larger for the same quality... I think?
01:57:53<@JAA>Yeah, that's also plausible.
01:59:45<pokechu22>JAA: https://transfer.archivete.am/OFjmP/www.farmtransparency.org_scripts_asset-popup_urls.txt.zst
02:00:56<pokechu22>I don't think you can extract from the asset-display WARC since that's just pagination and doesn't include URLs, but you should be able to extract from the asset-popup warc? (unless you're just using asset-display to list it and then downloading separately, instead of using the WARC itself to determine the data-size values)
02:03:17<@JAA>I'm getting the video IDs from asset-display, and then I'll fetch the video page to extract the right size parameter for each video.
02:03:37<pokechu22>Alright
02:07:05etnguyen03 (etnguyen03) joins
02:08:56nine quits [Quit: See ya!]
02:09:08nine joins
02:09:09nine quits [Changing host]
02:09:09nine (nine) joins
02:13:12adryd01 (adryd) joins
02:26:26rohvani joins
02:30:12<@JAA>Hmm, something isn't right.
02:30:49<@JAA>I'm sending the right things, I think, but I get a redirect back to the video page.
02:31:42<@JAA>Oh, I guess it would help to use POST rather than GET.
02:33:02ericgallager quits [Quit: This computer has gone to sleep]
02:34:10ericgallager joins
02:38:15<@JAA>asset-popup is done.
02:46:22<@JAA>Looks like the download endpoint is very rate-limited.
02:46:55Guest58 quits [Client Quit]
02:48:42<@JAA>After 10 successful ones, it started redirecting to the video page instead.
02:49:37<pokechu22>Hmm, maybe the csrf key expired?
02:49:50<pokechu22>or are you still getting the same one?
02:50:16<gamer191-1|m>“I’m sending the right things…” out of interest, how did you fix it? I was having the same issue earlier, using Firefox’s “copy as curl” function
02:50:21<@JAA>Still the same, and the cookie hasn't changed either.
02:50:42<@JAA>But I extract the token from every video page anyway, so if it did change, I'd pick it up.
02:50:55<@JAA>And the cookie handling happens automatically by aiohttp.
02:51:23<@JAA>gamer191-1|m: 02:31:42 <@JAA> Oh, I guess it would help to use POST rather than GET.
02:52:19<gamer191-1|m>Can you download through your browser or same issue? Perhaps you’d need to use incognito to get a new token, but that’s obviously too cumbersome to do every 10 videos unless it can be automated
02:55:59<@JAA>Clearing the cookie jar doesn't seem to help.
02:56:31<@JAA>gamer191-1|m: I'm running this on a server and can't easily check with a browser through there.
02:57:25<gamer191-1|m>@JAA: “Please mirror our stuff, but also, we'll make it as hard as we can.”
02:57:25<gamer191-1|m>Reminds me of Kick.com advising users who wish to download their data to use yt-dlp, but then enabling Cloudflare TLS fingerprinting🤣
02:58:07etnguyen03 quits [Client Quit]
02:58:27<gamer191-1|m>“I'm running this on a server and can't easily check with a browser through there.” Can you try switching between IPv4 and IPv6 to see if it’s IP-based?
03:00:28etnguyen03 (etnguyen03) joins
03:00:40<@JAA>I did get another batch of 10 when I mentioned the cookie jar, so at least it expires somewhat quickly.
03:02:23<@JAA>IPv4/v6 selection isn't currently implemented in qwarc, and I'm not sure aiohttp even exposes it.
03:03:24<@JAA>Ah, it's in the TCPConnector, yeah, that'd require some patching.
03:04:36<@JAA>FWIW, both those runs that got download URLs were over IPv6.
03:17:12etnguyen03 quits [Remote host closed the connection]
03:23:31<@JAA>Those were less than 10 minutes apart, but 30 minutes later now I'm still blocked. :-|
03:47:43DogsRNice quits [Read error: Connection reset by peer]
04:00:08Guest58 joins
04:00:53Radzig quits [Quit: ZNC 1.10.1 - https://znc.in]
04:01:53Radzig joins
04:17:58clubsecret joins
04:23:23clubsecret quits [Client Quit]
04:29:02Shjosan quits [Quit: Am sleepy (-, – )…zzzZZZ]
04:29:33Shjosan (Shjosan) joins
04:35:37Guest58 quits [Client Quit]
04:41:37<fuzzy80211>need ips JAA?
04:54:55LddPotato quits [Read error: Connection reset by peer]
04:55:36LddPotato (LddPotato) joins
05:10:37fuzzy80211 quits [Read error: Connection reset by peer]
05:11:22fuzzy80211 (fuzzy80211) joins
05:20:18Guest58 joins
05:27:31BennyOtt (BennyOtt) joins
06:15:34Guest58 quits [Client Quit]
07:07:31HP_Archivist (HP_Archivist) joins
07:38:58ericgallager quits [Quit: This computer has gone to sleep]
07:40:49Guest58 joins
07:43:15IDK (IDK) joins
07:45:35abirkill (abirkill) joins
07:55:47<@JAA>fuzzy80211: It seems that I can maybe get around it with IPv6, but I'll get back to you if that doesn't work out.
08:08:02qw3rty_ joins
08:10:14Doranwen quits [Ping timeout: 260 seconds]
08:11:59qw3rty quits [Ping timeout: 260 seconds]
08:16:55qw3rty__ joins
08:20:14qw3rty_ quits [Ping timeout: 240 seconds]
08:26:04Wake6 joins
08:30:04Wake quits [Ping timeout: 260 seconds]
08:30:04Wake6 is now known as Wake
08:41:14ducky quits [Remote host closed the connection]
08:43:44ducky (ducky) joins
09:18:57qw3rty_ joins
09:22:14qw3rty__ quits [Ping timeout: 240 seconds]
09:35:18yasomimi (yasomi) joins
09:35:34yasomi quits [Ping timeout: 240 seconds]
09:36:06yasomimi is now known as yasomi
09:43:29qw3rty__ joins
09:47:04qw3rty_ quits [Ping timeout: 260 seconds]
09:50:20<h2ibot>Manu edited Discourse/archived (+83, Queued forums.suse.com): https://wiki.archiveteam.org/?diff=56907&oldid=56881
09:55:21<h2ibot>Manu edited Discourse/archived (+95, Queued forums.developer.nvidia.com): https://wiki.archiveteam.org/?diff=56908&oldid=56907
09:59:25MrMcNuggets (MrMcNuggets) joins
10:00:55Doranwen (Doranwen) joins
10:02:14midou quits [Ping timeout: 240 seconds]
10:02:20IDK quits [Client Quit]
10:02:36LunarianBunny1147 (LunarianBunny1147) joins
10:10:14Doranwen quits [Ping timeout: 240 seconds]
10:10:17Doran (Doranwen) joins
10:11:00BornOn420_ (BornOn420) joins
10:12:03midou joins
10:15:02Doranwen (Doranwen) joins
10:15:39Doran quits [Ping timeout: 260 seconds]
10:17:57SootBector (SootBector) joins
10:46:54monoxane quits [Ping timeout: 240 seconds]
10:47:17SootBector quits [Remote host closed the connection]
10:47:17BornOn420_ quits [Remote host closed the connection]
10:47:52BornOn420_ (BornOn420) joins
10:48:24SootBector (SootBector) joins
11:00:06Bleo182600722719623455222 quits [Quit: The Lounge - https://thelounge.chat]
11:00:34midou quits [Ping timeout: 240 seconds]
11:02:50Bleo182600722719623455222 joins
11:03:54ducky quits [Ping timeout: 240 seconds]
11:09:27nfriedly quits [Remote host closed the connection]
11:11:11BornOn420 quits [Quit: Textual IRC Client: www.textualapp.com]
11:20:28monoxane (monoxane) joins
11:21:25ATinySpaceMarine quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]
11:21:39ducky (ducky) joins
11:28:54ducky quits [Ping timeout: 240 seconds]
11:46:17ducky (ducky) joins
11:49:34monoxane quits [Ping timeout: 260 seconds]
11:50:54ducky quits [Ping timeout: 240 seconds]
12:02:15monoxane (monoxane) joins
12:09:41Dada joins
12:22:28ericgallager joins
12:23:59Snivy quits [Ping timeout: 260 seconds]
12:29:31ducky (ducky) joins
12:30:59ducky quits [Remote host closed the connection]
12:31:23ducky (ducky) joins
12:36:14ducky quits [Ping timeout: 260 seconds]
12:42:39<cruller>Japanese Q&A site "peing.net" will shut down on 2025/8/29 at 12:00 (JST). Announcement: https://peing.net/ja/landing_pages/28
12:48:13<cruller>ugh, they disallow ia_archiver https://peing.net/robots.txt
12:50:31<justauser|m>I think when the website closes down, its robots.txt disappears and materials become visible in Wayback.
12:51:17fea joins
12:56:46IDK (IDK) joins
12:59:20fea quits [Client Quit]
13:03:28midou joins
13:07:34midou quits [Ping timeout: 240 seconds]
13:15:08<cruller>In any case, archiving is important. I'll do some more research on the service.
13:24:19midou joins
13:28:09kansei quits [Ping timeout: 260 seconds]
13:29:32kansei (kansei) joins
13:41:52adryd01 quits [Quit: Ping timeout (120 seconds)]
13:42:11adryd01 (adryd) joins
13:42:13TheTechRobo quits [Quit: Ping timeout (120 seconds)]
13:42:13Pedrosso quits [Quit: Ping timeout (120 seconds)]
13:42:14ScenarioPlanet quits [Quit: Ping timeout (120 seconds)]
13:42:32Pedrosso joins
13:42:37ScenarioPlanet (ScenarioPlanet) joins
13:43:30TheTechRobo (TheTechRobo) joins
13:43:54tek_dmn quits [Ping timeout: 240 seconds]
13:43:55NatTheCat3 (NatTheCat) joins
13:44:14NatTheCat quits [Ping timeout: 240 seconds]
13:44:14NatTheCat3 is now known as NatTheCat
13:45:11tek_dmn (tek_dmn) joins
14:06:18Dada quits [Remote host closed the connection]
14:08:48Dada joins
14:14:59<nstrom|m>I can't seem to post in #archivebot but fwiw looks like job 9dhhxf8jfmlq6vn10eo0ljph is pulling some files from open s3 bucket https://4dn-open-data-public.s3.amazonaws.com/ , might be worth running that separately if the job doesn't get everything
14:15:15<nstrom|m>cause I hear you guys like open buckets /shrug
14:20:19<anonymoususer852>"I can't seem to post in #archivebot ..." Correct me if I am wrong, but it is possible to talk in #archivebot, just not run commands.
14:21:01<nstrom|m>yeah I know I should be able to but element / matrix seemed to be having issues
14:21:19<nstrom|m>just seemed to be in that channel though
14:21:25<anonymoususer852>Ah I see.
14:22:38<anonymoususer852>Yeah you are in that channel, I checked before posting my reply. I use a regular IRC client, connected directly to hackint network, and am able to post there.
14:31:55<nstrom|m>probably just matrix being matrix
14:35:08<h2ibot>Anonymoususer852 edited Talk:Glitch (+1674, Updated discussion to where the project stands…): https://wiki.archiveteam.org/?diff=56909&oldid=56837
14:41:04jinn6 quits [Ping timeout: 260 seconds]
14:41:17jinn6 joins
14:43:34notSokar joins
14:45:44Sokar quits [Ping timeout: 260 seconds]
14:47:05ummmSokar joins
14:49:14notSokar quits [Ping timeout: 260 seconds]
14:51:50<pabs>IIRC #archivebot gets too many messages, matrix can't keep up
15:06:23<@JAA>cruller: Please add it to Deathwatch!
15:21:18tertu2 (tertu) joins
15:23:54tertu quits [Ping timeout: 240 seconds]
15:24:11ATinySpaceMarine joins
15:25:41ATinySpaceMarine quits [Client Quit]
15:26:12ATinySpaceMarine joins
15:27:24aninternettroll quits [Remote host closed the connection]
15:29:54aninternettroll (aninternettroll) joins
15:34:15pseudorizer quits [Quit: ZNC 1.10.1 - https://znc.in]
15:36:17<h2ibot>Cruller edited Deathwatch (+200, Add Peing): https://wiki.archiveteam.org/?diff=56910&oldid=56886
15:36:30<@JAA>:-)
15:36:48pseudorizer (pseudorizer) joins
15:37:12<cruller>Fortunately, it seems that questions that have been answered are given a URL of https://peing.net/ja/qs/{sequential number} for sharing. (I checked this using my own account.)
15:38:34jinn6 quits [Ping timeout: 240 seconds]
15:40:11<cruller>The latest number is 1189399360.
15:42:09<cruller>Unanswered questions can also be viewed by typing in the URL directly, but are probably not linked from anywhere.
15:45:41<@JAA>1.2 billion questions? Seems unlikely... Or do they not start at 0?
15:47:19<h2ibot>Cruller edited List of websites excluded from the Wayback Machine (+21, Add Peing): https://wiki.archiveteam.org/?diff=56911&oldid=56884
15:50:28<cruller>JAA: I haven't looked into the details, but I saw https://peing.net/ja/qs/118
15:51:42<cruller>A sampling survey may be necessary...
15:59:45notSokar joins
16:01:34ummmSokar quits [Ping timeout: 240 seconds]
16:02:18Sokar joins
16:03:01<cruller>Oh, unanswered questions also get a sequential number.
16:04:29notSokar quits [Ping timeout: 260 seconds]
16:06:45jinn6 joins
16:15:23<cruller>Assuming that spammers are sending a large number of messages, 1.2 billion may not be surprising. On Peing, you can ask questions even if you don't have an account, and I've never heard of rate limits or CAPTCHAs.
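(Editor's sketch, not from the log.) The sampling survey cruller suggests could start from something like the following. Both function names are hypothetical. The URL pattern and the latest ID (1189399360) are taken from the messages above. The estimate assumes IDs are sampled uniformly and that a "hit" means the question page exists. The code only picks IDs and does arithmetic; it makes no requests.

```python
import random

LATEST_ID = 1189399360  # latest question number reported in the log


def sample_question_urls(max_id, count, seed=None):
    """Pick `count` distinct IDs uniformly from 1..max_id and build their URLs."""
    rng = random.Random(seed)
    ids = rng.sample(range(1, max_id + 1), count)
    return ["https://peing.net/ja/qs/%d" % i for i in sorted(ids)]


def estimate_total(max_id, hits, sampled):
    """Scale the hit rate in the sample up to the whole ID range."""
    return max_id * hits / sampled
```

If, say, half of a sample turned out to exist, `estimate_total` would put the total at about half of the ID range; a real run would also want a margin of error.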
16:18:21aninternettroll quits [Remote host closed the connection]
16:20:54aninternettroll (aninternettroll) joins
16:23:44anarcat quits [Ping timeout: 260 seconds]
16:30:53notSokar joins
16:32:54Sokar quits [Ping timeout: 240 seconds]
16:33:04qw3rty__ quits [Ping timeout: 260 seconds]
16:37:09notSokar quits [Ping timeout: 260 seconds]
16:38:59Sokar joins
16:44:13qw3rty joins
16:46:56qw3rty_ joins
16:48:49qw3rty quits [Ping timeout: 260 seconds]
16:52:40jinn6 quits [Client Quit]
16:52:55jinn6 joins
16:59:16anarcat (anarcat) joins
17:01:45<@JAA>I got the download URLs for 964 videos from farmtransparency.org; the other 294 failed, most seem to be YouTube videos but there were a few other failures, too.
17:01:52<@JAA>1.53 TiB or so for the ones I got.
17:11:53<@JAA>First ones are running in AB, will start more as slots become available.
17:14:33<h2ibot>HadeanEon edited Deaths in 2000 (+16686, BOT - Updating page: {{saved}} (115),…): https://wiki.archiveteam.org/?diff=56912&oldid=56494
17:14:34<h2ibot>HadeanEon edited Deaths in 2000/list (+1357, BOT - Updating list): https://wiki.archiveteam.org/?diff=56913&oldid=56495
17:17:37NeonGlitch (NeonGlitch) joins
17:18:09<NeonGlitch>Heyo! Is https://www.openculture.com/2025/08/2178-occult-books-now-digitized-put-online.html an ArchiveTeam crawl opportunity?
17:18:14notSokar joins
17:20:34<h2ibot>HadeanEon edited Deaths in 2001 (+16214, BOT - Updating page: {{saved}} (7),…): https://wiki.archiveteam.org/?diff=56914&oldid=56496
17:20:34Sokar quits [Ping timeout: 240 seconds]
17:20:35<h2ibot>HadeanEon edited Deaths in 2001/list (+1229, BOT - Updating list): https://wiki.archiveteam.org/?diff=56915&oldid=56497
17:25:09ummmSokar joins
17:27:54notSokar quits [Ping timeout: 260 seconds]
17:28:59ericgallager quits [Quit: This computer has gone to sleep]
17:29:35<h2ibot>HadeanEon edited Deaths in 2002 (+18403, BOT - Updating page: {{saved}} (3),…): https://wiki.archiveteam.org/?diff=56916&oldid=56498
17:29:36<h2ibot>HadeanEon edited Deaths in 2002/list (+1373, BOT - Updating list): https://wiki.archiveteam.org/?diff=56917&oldid=56499
17:35:36<h2ibot>HadeanEon edited Deaths in 2003 (-623, BOT - Updating page: {{saved}} (5),…): https://wiki.archiveteam.org/?diff=56918&oldid=56242
17:35:37<h2ibot>HadeanEon edited Deaths in 2003/list (-52, BOT - Updating list): https://wiki.archiveteam.org/?diff=56919&oldid=56243
17:43:37<h2ibot>HadeanEon edited Deaths in 2004 (+834, BOT - Updating page: {{saved}} (6),…): https://wiki.archiveteam.org/?diff=56920&oldid=55479
17:43:38<h2ibot>HadeanEon edited Deaths in 2004/list (+58, BOT - Updating list): https://wiki.archiveteam.org/?diff=56921&oldid=56500
17:46:28notSokar joins
17:47:14ericgallager joins
17:48:54ummmSokar quits [Ping timeout: 260 seconds]
17:55:39<h2ibot>HadeanEon edited Deaths in 2005/list (+17, BOT - Updating list): https://wiki.archiveteam.org/?diff=56922&oldid=56350
17:58:49NeonGlitch quits [Ping timeout: 260 seconds]
18:01:12NeonGlitch (NeonGlitch) joins
18:11:41<h2ibot>HadeanEon edited Deaths in 2007 (-1413, BOT - Updating page: {{saved}} (5),…): https://wiki.archiveteam.org/?diff=56923&oldid=56418
18:11:42<h2ibot>HadeanEon edited Deaths in 2007/list (-58, BOT - Updating list): https://wiki.archiveteam.org/?diff=56924&oldid=56419
18:15:25Sokar joins
18:17:34notSokar quits [Ping timeout: 240 seconds]
18:32:45<h2ibot>HadeanEon edited Deaths in 2009 (+274, BOT - Updating page: {{saved}} (9),…): https://wiki.archiveteam.org/?diff=56925&oldid=55650
18:32:46<h2ibot>HadeanEon edited Deaths in 2009/list (+21, BOT - Updating list): https://wiki.archiveteam.org/?diff=56926&oldid=55651
18:46:47<h2ibot>HadeanEon edited Deaths in 2010 (+473, BOT - Updating page: {{saved}} (204),…): https://wiki.archiveteam.org/?diff=56927&oldid=56351
18:46:48<h2ibot>HadeanEon edited Deaths in 2010/list (+38, BOT - Updating list): https://wiki.archiveteam.org/?diff=56928&oldid=56352
18:46:58notSokar joins
18:48:54Sokar quits [Ping timeout: 240 seconds]
18:52:34Sokar joins
18:53:39notSokar quits [Ping timeout: 260 seconds]
18:58:32<c3manu>NeonGlitch: it kinda reads like more books would be added in the future?
18:59:49<h2ibot>HadeanEon edited Deaths in 2011 (+55519, BOT - Updating page: {{saved}} (204),…): https://wiki.archiveteam.org/?diff=56929&oldid=56501
18:59:50<h2ibot>HadeanEon edited Deaths in 2011/list (+4496, BOT - Updating list): https://wiki.archiveteam.org/?diff=56930&oldid=56502
19:03:20<NeonGlitch>c3manu: Definitely a possibility.
19:08:39ericgallager quits [Client Quit]
19:11:18ericgallager joins
19:16:52<h2ibot>HadeanEon edited Deaths in 2012 (+50538, BOT - Updating page: {{saved}} (193),…): https://wiki.archiveteam.org/?diff=56931&oldid=56503
19:16:53<h2ibot>HadeanEon edited Deaths in 2012/list (+4064, BOT - Updating list): https://wiki.archiveteam.org/?diff=56932&oldid=56504
19:21:06<c3manu>NeonGlitch: "We are now pleased to report that the first 2,178 books from the Ritman project have come available in their online reading room."
19:21:31<c3manu>NeonGlitch: do you happen to know more about the project?
19:22:17<NeonGlitch>c3manu: I can find out more from folks who work on it. It's not at risk, so probably not ArchiveTeam worthy, although I thought there might be value in a shallow crawl to have a sitemap on hand for future archival needs.
19:22:49h2ibot quits [Remote host closed the connection]
19:23:33<c3manu>NeonGlitch: to me it definitely looks useful. but the page also seems somewhat JS-heavy, so it might not be that fun dealing with it ^^"
19:24:17<NeonGlitch>c3manu: That's fair, can punt on it for now. Sorry to have bothered!
19:24:37<c3manu>i'm asking because a) if there's a plan already for the next batch that is being worked on, i would wait with throwing it in, or b) if the books are available elsewhere in a more accessible form it might be preferable to fetch that and do the shallow thing you suggested
19:24:58<c3manu>NeonGlitch: no it's all good, thanks for telling us about it :)
19:25:14<c3manu>and anything i just said is just my personal opinion, too ;)
19:28:39@rewby quits [Ping timeout: 260 seconds]
19:32:45h2ibot (h2ibot) joins
19:32:46<c3manu>NeonGlitch: just from https://embassyofthefreemind.com/en/library/269-ritman-research-institute and the links in the article it doesn't sound like a short-term project. so waiting for "the right time" will probably have us waiting forever :)
19:46:02<h2ibot>HadeanEon edited Deaths in 2013 (+58012, BOT - Updating page: {{saved}} (211),…): https://wiki.archiveteam.org/?diff=56933&oldid=56505
19:46:03<h2ibot>HadeanEon edited Deaths in 2013/list (+4487, BOT - Updating list): https://wiki.archiveteam.org/?diff=56934&oldid=56506
19:58:14Wohlstand (Wohlstand) joins
20:03:04<h2ibot>HadeanEon edited Deaths in 2014 (+66380, BOT - Updating page: {{saved}} (15),…): https://wiki.archiveteam.org/?diff=56935&oldid=56507
20:03:05<h2ibot>HadeanEon edited Deaths in 2014/list (+4802, BOT - Updating list): https://wiki.archiveteam.org/?diff=56936&oldid=56508
20:32:17notSokar joins
20:33:54Sokar quits [Ping timeout: 240 seconds]
20:50:12<h2ibot>HadeanEon edited Deaths in 2016 (-1240, BOT - Updating page: {{saved}} (131),…): https://wiki.archiveteam.org/?diff=56937&oldid=56061
20:50:13<h2ibot>HadeanEon edited Deaths in 2016/list (-120, BOT - Updating list): https://wiki.archiveteam.org/?diff=56938&oldid=56250
20:59:25Jens quits []
21:00:11Jens (JensRex) joins
21:05:24MrMcNuggets quits [Quit: WeeChat 4.3.2]
21:08:33<@JAA>For the record, I'm first running all of those farmtransparency.org videos through AB and only then looking at the ones that failed.
21:12:14<pokechu22>oh, I should also note that I ignored the vimeo embeds on the AB job in case there's a smarter way of handling them later
21:13:01<pokechu22>I doubt we can do it in a way that would make videos play back on web.archive.org (but hopefully we've done the POSTs right so that the download buttons will work)
21:13:15dabs joins
21:14:38Wake1 joins
21:17:54Wake quits [Ping timeout: 240 seconds]
21:17:54Wake1 is now known as Wake
21:20:04aninternettroll quits [Ping timeout: 260 seconds]
21:22:10ummmSokar joins
21:23:54notSokar quits [Ping timeout: 240 seconds]
21:24:59rewby (rewby) joins
21:24:59@ChanServ sets mode: +o rewby
21:26:13Sokar joins
21:27:39ummmSokar quits [Ping timeout: 260 seconds]
21:29:25aninternettroll (aninternettroll) joins
21:33:20<h2ibot>HadeanEon edited Deaths in 2017 (+427, BOT - Updating page: {{saved}} (373),…): https://wiki.archiveteam.org/?diff=56939&oldid=56511
21:33:21<h2ibot>HadeanEon edited Deaths in 2017/list (+31, BOT - Updating list): https://wiki.archiveteam.org/?diff=56940&oldid=56512
21:39:42etnguyen03 (etnguyen03) joins
21:45:12ericgallager quits [Client Quit]
21:48:58Wohlstand quits [Client Quit]
21:52:54Wohlstand (Wohlstand) joins
21:56:58ericgallager joins
22:02:01<Barto>https://social.anoxinon.de/@Codeberg/115033790447125787 interesting...
22:12:27<h2ibot>HadeanEon edited Deaths in 2018 (+102317, BOT - Updating page: {{saved}} (456),…): https://wiki.archiveteam.org/?diff=56941&oldid=56513
22:12:28<h2ibot>HadeanEon edited Deaths in 2018/list (+8310, BOT - Updating list): https://wiki.archiveteam.org/?diff=56942&oldid=56514
22:17:05lennier2 joins
22:19:34lennier2_ quits [Ping timeout: 240 seconds]
22:22:25ericgallager quits [Client Quit]
22:30:36etnguyen03 quits [Client Quit]
22:37:11Dada quits [Remote host closed the connection]
22:39:27Wohlstand quits [Client Quit]
22:54:15ericgallager joins
22:59:29SootBector quits [Remote host closed the connection]
23:00:39SootBector (SootBector) joins
23:04:34<h2ibot>HadeanEon edited Deaths in 2019 (-315, BOT - Updating page: {{saved}} (488),…): https://wiki.archiveteam.org/?diff=56943&oldid=56253
23:04:35<h2ibot>HadeanEon edited Deaths in 2019/list (-53, BOT - Updating list): https://wiki.archiveteam.org/?diff=56944&oldid=56358
23:06:17dabs quits [Read error: Connection reset by peer]
23:30:19Wohlstand (Wohlstand) joins
23:40:54tzt quits [Ping timeout: 240 seconds]
23:42:55etnguyen03 (etnguyen03) joins
23:45:40<h2ibot>HadeanEon edited Deaths in 2020/list (-2, BOT - Updating list): https://wiki.archiveteam.org/?diff=56945&oldid=56516
23:47:48lennier2_ joins
23:48:10tzt (tzt) joins
23:50:13Wohlstand quits [Client Quit]
23:50:34lennier2 quits [Ping timeout: 260 seconds]
23:53:54tzt quits [Ping timeout: 240 seconds]
23:54:33etnguyen03 quits [Client Quit]
23:54:45tzt (tzt) joins