00:06:28 | <gamer191-1|m> | If you take out "&asset_types=videos", then I assume it will also include photos, documents and campaign materials? |
00:06:39 | <pokechu22> | Yes |
00:07:32 | <pokechu22> | also looks like https://www.farmtransparency.org/gallery?asset_types=videos&p=2 works for pagination compared to https://www.farmtransparency.org/gallery?asset_types=videos and doesn't require that header, so I can just throw those in and discover things that way |
00:17:28 | | CuppyMan quits [Quit: Connection closed for inactivity] |
00:34:16 | | useretail quits [Quit: Leaving] |
00:47:46 | <pokechu22> | JAA: would qwarc be usable to grab things like https://www.farmtransparency.org/scripts/asset-display?p=3&asset_types=videos and https://www.farmtransparency.org/scripts/asset-popup?id=skfjn8ldvu28i3z8wrin&type=photos with X-Requested-With: XMLHttpRequest while *not* extracting embedded images/outlinks? (Those URLs seem necessary to navigate the site with javascript but have |
00:47:48 | <pokechu22> | alternative forms that archivebot can process) |
00:51:43 | | Guest58 quits [Quit: My Mac has gone to sleep. ZZZzzz…] |
00:53:00 | | Guest58 joins |
00:57:12 | <@JAA> | pokechu22: Sure. qwarc does no extraction whatsoever on its own. |
01:08:30 | | ATinySpaceMarine joins |
01:10:44 | | ATinySpaceMarine quits [Read error: Connection reset by peer] |
01:10:54 | | ATinySpaceMarine joins |
01:15:15 | <pokechu22> | JAA: https://transfer.archivete.am/T3y6Q/www.farmtransparency.org_scripts_asset-display_urls.txt |
01:15:16 | <eggdrop> | inline (for browser viewing): https://transfer.archivete.am/inline/T3y6Q/www.farmtransparency.org_scripts_asset-display_urls.txt |
01:16:07 | <pokechu22> | will prepare a list of asset-popup URLs after the job finishes |
01:18:54 | <pabs> | magmaus3: can you put the kitsunes.club/etc shutdowns into Deathwatch? |
01:20:18 | <cruller> | For now, I extracted links from https://www.farmtransparency.org/gallery? and https://www.farmtransparency.org/scripts/asset-display?p={2..294}& : https://transfer.archivete.am/11qwn6/Farm_Transparency_Repository_urls.txt |
01:20:18 | <eggdrop> | inline (for browser viewing): https://transfer.archivete.am/inline/11qwn6/Farm_Transparency_Repository_urls.txt |
01:21:11 | <pokechu22> | I started a job using https://transfer.archivete.am/4N1aQ/www.farmtransparency.org_seed_urls.txt which should have archivebot extract those itself (and I also included some basic faceting in that) |
01:21:13 | <eggdrop> | inline (for browser viewing): https://transfer.archivete.am/inline/4N1aQ/www.farmtransparency.org_seed_urls.txt |
01:21:41 | | Island joins |
01:25:12 | | ATinySpaceMarine quits [Client Quit] |
01:25:23 | <pokechu22> | the videos part is slightly annoying - it would require reading the page HTML to determine the resolution (e.g. <a href="javascript:void(0);" class="download" data-size="source_2704">Source (2.44gb)</a> on https://www.farmtransparency.org/videos?id=p7tyen34at but <a href="javascript:void(0);" class="download" data-size="source_1280">Source (8.71mb)</a> on |
01:25:25 | <pokechu22> | https://www.farmtransparency.org/videos?id=l5b22836ku). The POST to https://www.farmtransparency.org/videos?id=p7tyen34at&action=download then redirects to the video download (which I'm not sure about the rate-limiting of). |
01:25:34 | | ATinySpaceMarine joins |
01:28:17 | <pokechu22> | I *think* that if we generate WARCs of the POST to the download at source resolution, that would replay properly on web.archive.org (assuming it ignores POST data, and just grabs whatever POST happened at that URL). Though the redirect aspect would also be inconvenient. |
01:29:25 | <@JAA> | I believe that's what the WBM does for POST, yeah. |
01:29:42 | <pokechu22> | It seems like the video URLs are valid for 24 hours, and I don't think it's tied to a specific IP |
01:30:00 | <pokechu22> | so the POST part could happen on one machine and then archivebot could handle downloading |
01:31:52 | | andrew (andrew) joins |
01:32:36 | <@JAA> | Ugh, it opens a new tab, I hate that. |
01:33:31 | <pokechu22> | Yeah - you can get devtools open by using work offline (see 23:35) but it's still awkward |
01:34:14 | <@JAA> | You can also interrupt the page loading and then trigger it again via the dev tools. |
01:34:19 | <@JAA> | But yeah |
01:35:38 | <pokechu22> | the relevant headers seem to be the cookie 2620368ghwahw90w and the csrf_token post data, both of which can come from any page load (including the main page) and don't change, or at least last for a while |
01:37:16 | <pokechu22> | curl --verbose 'https://www.farmtransparency.org/videos?id=p7tyen34at&action=download' -H 'Cookie: 2620368ghwahw90w=XXXX' --data-raw 'size=source_2704&source=full&csrf_token=YYYY' and then the location header |
01:37:36 | <@JAA> | Yeah |
01:38:51 | <@JAA> | And yeah, looks like 24 hours validity, and they do work from another IP. |
01:39:20 | <pokechu22> | JAA: if I got a list of sizes with videos, is there a format for that would be easy to use for you to POST the redirects? |
01:41:09 | | BennyOtt quits [Ping timeout: 260 seconds] |
01:42:34 | <@JAA> | A list of 'videoid size' would work. Or I could extract the size value from the video pages if it's easy to tell which one we want (always source_*?). |
01:43:22 | <@JAA> | asset-display is done. |
01:44:12 | <pokechu22> | oh, I was thinking I needed to wait for archivebot to finish to make a list for asset-popup but I can just use cruller's list |
01:44:37 | <pokechu22> | I'm thinking always source_ but one video had 720p be bigger than source_ |
01:44:46 | <@JAA> | I guess I can extract it from the asset-display WARC, too? |
01:44:57 | <pokechu22> | Oh, yeah |
01:45:06 | <@JAA> | Yeah, I'm wondering whether that is a reencode with different settings. |
01:46:56 | <@JAA> | I get 1258 video IDs from the asset-display WARC, so I guess that's correct. |
01:47:50 | <pokechu22> | ffprobe says the source video is "h264 (High) (avc1 / 0x31637661), yuv420p, 1280x720 [SAR 1:1 DAR 16:9], 1825 kb/s, 25 fps, 25 tbr, 12800 tbn, 50 tbc (default)" while the 720p one is "h264 (High) (avc1 / 0x31637661), yuv420p(tv, bt709), 1280x720, 2187 kb/s, 25 fps, 25 tbr, 25 tbn, 50 tbc (default)", which is a higher bitrate but different in other ways I don't understand |
01:50:18 | <gamer191-1|m> | So I guess either the site is wrong about which one is the source, or the 720p one has been upscaled |
01:50:21 | <gamer191-1|m> | Which video is it btw? |
01:51:53 | <@JAA> | It's not upscaled; they're both 720p. |
01:52:18 | <pokechu22> | https://www.farmtransparency.org/videos?id=l5b22836ku |
01:52:37 | <@JAA> | But reencoded with better settings perhaps, which means it'll encode the noise and artefacts from the 'source'. |
01:53:06 | <@JAA> | The source file has the encoder settings metadata stripped, sadly. |
01:54:57 | <pokechu22> | It's also possible that the original news broadcast spent longer encoding it better while vimeo's reencode was done faster and thus larger for the same quality... I think? |
01:57:53 | <@JAA> | Yeah, that's also plausible. |
01:59:45 | <pokechu22> | JAA: https://transfer.archivete.am/OFjmP/www.farmtransparency.org_scripts_asset-popup_urls.txt.zst |
02:00:56 | <pokechu22> | I don't think you can extract from the asset-display WARC since that's just pagination and doesn't include URLs, but you should be able to extract from the asset-popup warc? (unless you're just using asset-display to list it and then downloading separately, instead of using the WARC itself to determine the data-size values) |
02:03:17 | <@JAA> | I'm getting the video IDs from asset-display, and then I'll fetch the video page to extract the right size parameter for each video. |
02:03:37 | <pokechu22> | Alright |
02:07:05 | | etnguyen03 (etnguyen03) joins |
02:08:56 | | nine quits [Quit: See ya!] |
02:09:08 | | nine joins |
02:09:09 | | nine is now authenticated as nine |
02:09:09 | | nine quits [Changing host] |
02:09:09 | | nine (nine) joins |
02:13:12 | | adryd01 (adryd) joins |
02:26:26 | | rohvani joins |
02:30:12 | <@JAA> | Hmm, something isn't right. |
02:30:49 | <@JAA> | I'm sending the right things, I think, but I get a redirect back to the video page. |
02:31:42 | <@JAA> | Oh, I guess it would help to use POST rather than GET. |
02:33:02 | | ericgallager quits [Quit: This computer has gone to sleep] |
02:34:10 | | ericgallager joins |
02:38:15 | <@JAA> | asset-popup is done. |
02:46:22 | <@JAA> | Looks like the download endpoint is very rate-limited. |
02:46:55 | | Guest58 quits [Client Quit] |
02:48:42 | <@JAA> | After 10 successful ones, it started redirecting to the video page instead. |
02:49:37 | <pokechu22> | Hmm, maybe the csrf key expired? |
02:49:50 | <pokechu22> | or are you still getting the same one? |
02:50:16 | <gamer191-1|m> | “I’m sending the right things…" out if interest, how did you fix it? I was having the same issue earlier, using Firefox’s “copy as curl” function |
02:50:21 | <@JAA> | Still the same, and the cookie hasn't changed either. |
02:50:42 | <@JAA> | But I extract the token from every video page anyway, so if it did change, I'd pick it up. |
02:50:55 | <@JAA> | And the cookie handling happens automatically by aiohttp. |
02:51:23 | <@JAA> | gamer191-1|m: 02:31:42 <@JAA> Oh, I guess it would help to use POST rather than GET. |
02:52:19 | <gamer191-1|m> | Can you download through your browser or same issue? Perhaps you’d need to use incognito to get a new token, but that’s obviously too cumbersome to do every 10 videos unless it can be automated |
02:55:59 | <@JAA> | Clearing the cookie jar doesn't seem to help. |
02:56:31 | <@JAA> | gamer191-1|m: I'm running this on a server and can't easily check with a browser through there. |
02:57:25 | <gamer191-1|m> | @JAA: “Please mirror our stuff, but also, we'll make it as hard as we can.” |
02:57:25 | <gamer191-1|m> | Reminds me of Kick.com advising users who wish to download their data to use yt-dlp, but then enabling Cloudflare TLS fingerprinting🤣 |
02:58:07 | | etnguyen03 quits [Client Quit] |
02:58:27 | <gamer191-1|m> | “I'm running this on a server and can't easily check with a browser through there.” Can you try switching between IPv4 and IPv6 to see if it’s IP-based? |
03:00:28 | | etnguyen03 (etnguyen03) joins |
03:00:40 | <@JAA> | I did get another batch of 10 when I mentioned the cookie jar, so at least it expires somewhat quickly. |
03:02:23 | <@JAA> | IPv4/v6 selection isn't currently implemented in qwarc, and I'm not sure aiohttp even exposes it. |
03:03:24 | <@JAA> | Ah, it's in the TCPConnector, yeah, that'd require some patching. |
03:04:36 | <@JAA> | FWIW, both those runs that got download URLs were over IPv6. |
03:17:12 | | etnguyen03 quits [Remote host closed the connection] |
03:23:31 | <@JAA> | Those were less than 10 minutes apart, but 30 minutes later now I'm still blocked. :-| |
03:47:43 | | DogsRNice quits [Read error: Connection reset by peer] |
04:00:08 | | Guest58 joins |
04:00:53 | | Radzig quits [Quit: ZNC 1.10.1 - https://znc.in] |
04:01:53 | | Radzig joins |
04:17:58 | | clubsecret joins |
04:23:23 | | clubsecret quits [Client Quit] |
04:29:02 | | Shjosan quits [Quit: Am sleepy (-, – )…zzzZZZ] |
04:29:33 | | Shjosan (Shjosan) joins |
04:35:37 | | Guest58 quits [Client Quit] |
04:41:37 | <fuzzy80211> | need ips JAA? |
04:54:55 | | LddPotato quits [Read error: Connection reset by peer] |
04:55:36 | | LddPotato (LddPotato) joins |
05:10:37 | | fuzzy80211 quits [Read error: Connection reset by peer] |
05:11:22 | | fuzzy80211 (fuzzy80211) joins |
05:20:18 | | Guest58 joins |
05:27:31 | | BennyOtt (BennyOtt) joins |
06:15:34 | | Guest58 quits [Client Quit] |
07:07:31 | | HP_Archivist (HP_Archivist) joins |
07:38:58 | | ericgallager quits [Quit: This computer has gone to sleep] |
07:40:49 | | Guest58 joins |
07:43:15 | | IDK (IDK) joins |
07:45:35 | | abirkill (abirkill) joins |
07:55:47 | <@JAA> | fuzzy80211: It seems that I can maybe get around it with IPv6, but I'll get back to you if that doesn't work out. |
08:08:02 | | qw3rty_ joins |
08:10:14 | | Doranwen quits [Ping timeout: 260 seconds] |
08:11:59 | | qw3rty quits [Ping timeout: 260 seconds] |
08:16:55 | | qw3rty__ joins |
08:20:14 | | qw3rty_ quits [Ping timeout: 240 seconds] |
08:26:04 | | Wake6 joins |
08:30:04 | | Wake quits [Ping timeout: 260 seconds] |
08:30:04 | | Wake6 is now known as Wake |
08:41:14 | | ducky quits [Remote host closed the connection] |
08:43:44 | | ducky (ducky) joins |
09:18:57 | | qw3rty_ joins |
09:22:14 | | qw3rty__ quits [Ping timeout: 240 seconds] |
09:35:18 | | yasomimi (yasomi) joins |
09:35:34 | | yasomi quits [Ping timeout: 240 seconds] |
09:36:06 | | yasomimi is now known as yasomi |
09:43:29 | | qw3rty__ joins |
09:47:04 | | qw3rty_ quits [Ping timeout: 260 seconds] |
09:50:20 | <h2ibot> | Manu edited Discourse/archived (+83, Queued forums.suse.com): https://wiki.archiveteam.org/?diff=56907&oldid=56881 |
09:55:21 | <h2ibot> | Manu edited Discourse/archived (+95, Queued forums.developer.nvidia.com): https://wiki.archiveteam.org/?diff=56908&oldid=56907 |
09:59:25 | | MrMcNuggets (MrMcNuggets) joins |
10:00:55 | | Doranwen (Doranwen) joins |
10:02:14 | | midou quits [Ping timeout: 240 seconds] |
10:02:20 | | IDK quits [Client Quit] |
10:02:36 | | LunarianBunny1147 (LunarianBunny1147) joins |
10:10:14 | | Doranwen quits [Ping timeout: 240 seconds] |
10:10:17 | | Doran (Doranwen) joins |
10:11:00 | | BornOn420_ (BornOn420) joins |
10:12:03 | | midou joins |
10:15:02 | | Doranwen (Doranwen) joins |
10:15:39 | | Doran quits [Ping timeout: 260 seconds] |
10:17:57 | | SootBector (SootBector) joins |
10:46:54 | | monoxane quits [Ping timeout: 240 seconds] |
10:47:17 | | SootBector quits [Remote host closed the connection] |
10:47:17 | | BornOn420_ quits [Remote host closed the connection] |
10:47:52 | | BornOn420_ (BornOn420) joins |
10:48:24 | | SootBector (SootBector) joins |
11:00:06 | | Bleo182600722719623455222 quits [Quit: The Lounge - https://thelounge.chat] |
11:00:34 | | midou quits [Ping timeout: 240 seconds] |
11:02:50 | | Bleo182600722719623455222 joins |
11:03:54 | | ducky quits [Ping timeout: 240 seconds] |
11:09:27 | | nfriedly quits [Remote host closed the connection] |
11:11:11 | | BornOn420 quits [Quit: Textual IRC Client: www.textualapp.com] |
11:20:28 | | monoxane (monoxane) joins |
11:21:25 | | ATinySpaceMarine quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.] |
11:21:39 | | ducky (ducky) joins |
11:28:54 | | ducky quits [Ping timeout: 240 seconds] |
11:46:17 | | ducky (ducky) joins |
11:49:34 | | monoxane quits [Ping timeout: 260 seconds] |
11:50:54 | | ducky quits [Ping timeout: 240 seconds] |
12:02:15 | | monoxane (monoxane) joins |
12:09:41 | | Dada joins |
12:22:28 | | ericgallager joins |
12:23:59 | | Snivy quits [Ping timeout: 260 seconds] |
12:29:31 | | ducky (ducky) joins |
12:30:59 | | ducky quits [Remote host closed the connection] |
12:31:23 | | ducky (ducky) joins |
12:36:14 | | ducky quits [Ping timeout: 260 seconds] |
12:42:39 | <cruller> | Japanese Q&A site "peing.net" will be terminated at 2025/8/29 12:00 (JST). Announcement: https://peing.net/ja/landing_pages/28 |
12:48:13 | <cruller> | ugh, they disallow ia_archiver https://peing.net/robots.txt |
12:50:31 | <justauser|m> | I think when the website closes down, its robots.txt disappears and materials become visible in Wayback. |
12:51:17 | | fea joins |
12:56:46 | | IDK (IDK) joins |
12:59:20 | | fea quits [Client Quit] |
13:03:28 | | midou joins |
13:07:34 | | midou quits [Ping timeout: 240 seconds] |
13:15:08 | <cruller> | In any case, archiving is important. I'll do some more research on the service. |
13:24:19 | | midou joins |
13:28:09 | | kansei quits [Ping timeout: 260 seconds] |
13:29:32 | | kansei (kansei) joins |
13:41:52 | | adryd01 quits [Quit: Ping timeout (120 seconds)] |
13:42:11 | | adryd01 (adryd) joins |
13:42:13 | | TheTechRobo quits [Quit: Ping timeout (120 seconds)] |
13:42:13 | | Pedrosso quits [Quit: Ping timeout (120 seconds)] |
13:42:14 | | ScenarioPlanet quits [Quit: Ping timeout (120 seconds)] |
13:42:32 | | Pedrosso joins |
13:42:37 | | ScenarioPlanet (ScenarioPlanet) joins |
13:43:30 | | TheTechRobo (TheTechRobo) joins |
13:43:54 | | tek_dmn quits [Ping timeout: 240 seconds] |
13:43:55 | | NatTheCat3 (NatTheCat) joins |
13:44:14 | | NatTheCat quits [Ping timeout: 240 seconds] |
13:44:14 | | NatTheCat3 is now known as NatTheCat |
13:45:11 | | tek_dmn (tek_dmn) joins |
14:06:18 | | Dada quits [Remote host closed the connection] |
14:08:48 | | Dada joins |
14:14:59 | <nstrom|m> | I can't seem to post in #archivebot but fwiw looks like job 9dhhxf8jfmlq6vn10eo0ljph is pulling some files from open s3 bucket https://4dn-open-data-public.s3.amazonaws.com/ , might be worth running that separately if the job doesn't get everything |
14:15:15 | <nstrom|m> | cause I hear you guys like open buckets /shrug |
14:20:19 | <anonymoususer852> | "I can't seem to post in #archivebot ..." Correct me if I am wrong, but it is possible to talk in #archivebot, just not run commands. |
14:21:01 | <nstrom|m> | yeah I know I should be able to but element / matrix seemed to be having issues |
14:21:19 | <nstrom|m> | just seemed to be in that channel though |
14:21:25 | <anonymoususer852> | Ah I see. |
14:22:38 | <anonymoususer852> | Yeah you are in that channel, I checked before posting my reply. I use a regular IRC client, connected directly to hackint network, and am able to post there. |
14:31:55 | <nstrom|m> | probably just matrix being matrix |
14:35:08 | <h2ibot> | Anonymoususer852 edited Talk:Glitch (+1674, Updated discussion to where the project stands…): https://wiki.archiveteam.org/?diff=56909&oldid=56837 |
14:41:04 | | jinn6 quits [Ping timeout: 260 seconds] |
14:41:17 | | jinn6 joins |
14:43:34 | | notSokar joins |
14:45:44 | | Sokar quits [Ping timeout: 260 seconds] |
14:47:05 | | ummmSokar joins |
14:49:14 | | notSokar quits [Ping timeout: 260 seconds] |
14:51:50 | <pabs> | IIRC #archivebot gets too many messages, matrix can't keep up |
15:06:23 | <@JAA> | cruller: Please add it to Deathwatch! |
15:21:18 | | tertu2 (tertu) joins |
15:23:54 | | tertu quits [Ping timeout: 240 seconds] |
15:24:11 | | ATinySpaceMarine joins |
15:25:41 | | ATinySpaceMarine quits [Client Quit] |
15:26:12 | | ATinySpaceMarine joins |
15:27:24 | | aninternettroll quits [Remote host closed the connection] |
15:29:54 | | aninternettroll (aninternettroll) joins |
15:34:15 | | pseudorizer quits [Quit: ZNC 1.10.1 - https://znc.in] |
15:36:17 | <h2ibot> | Cruller edited Deathwatch (+200, Add Peing): https://wiki.archiveteam.org/?diff=56910&oldid=56886 |
15:36:30 | <@JAA> | :-) |
15:36:48 | | pseudorizer (pseudorizer) joins |
15:37:12 | <cruller> | Fortunately, it seems that questions that have been answered are given a URL of https://peing.net/ja/qs/{sequential number} for sharing. (I checked this using my own account.) |
15:38:34 | | jinn6 quits [Ping timeout: 240 seconds] |
15:40:11 | <cruller> | The latest number is 1189399360. |
15:42:09 | <cruller> | Unanswered questions can also be viewed by typing in the URL directly, but are probably not linked from anywhere. |
15:45:41 | <@JAA> | 1.2 billion questions? Seems unlikely... Or do they not start at 0? |
15:47:19 | <h2ibot> | Cruller edited List of websites excluded from the Wayback Machine (+21, Add Peing): https://wiki.archiveteam.org/?diff=56911&oldid=56884 |
15:50:28 | <cruller> | JAA: I haven't looked into the details, but I saw https://peing.net/ja/qs/118 |
15:51:42 | <cruller> | A sampling survey may be necessary... |
15:59:45 | | notSokar joins |
16:01:34 | | ummmSokar quits [Ping timeout: 240 seconds] |
16:02:18 | | Sokar joins |
16:03:01 | <cruller> | Oh, unanswered questions can also get sequential number. |
16:04:29 | | notSokar quits [Ping timeout: 260 seconds] |
16:06:45 | | jinn6 joins |
16:15:23 | <cruller> | Assuming that spammers are sending a large number of messages, 1.2 billion may not be surprising. On Peing, you can ask questions even if you don't have an account, and I've never heard of rate limits or CAPTCHAs. |
16:18:21 | | aninternettroll quits [Remote host closed the connection] |
16:20:54 | | aninternettroll (aninternettroll) joins |
16:23:44 | | anarcat quits [Ping timeout: 260 seconds] |
16:30:53 | | notSokar joins |
16:32:54 | | Sokar quits [Ping timeout: 240 seconds] |
16:33:04 | | qw3rty__ quits [Ping timeout: 260 seconds] |
16:37:09 | | notSokar quits [Ping timeout: 260 seconds] |
16:38:59 | | Sokar joins |
16:44:13 | | qw3rty joins |
16:46:56 | | qw3rty_ joins |
16:48:49 | | qw3rty quits [Ping timeout: 260 seconds] |
16:52:40 | | jinn6 quits [Client Quit] |
16:52:55 | | jinn6 joins |
16:59:16 | | anarcat (anarcat) joins |
17:01:45 | <@JAA> | I got the download URLs for 964 videos from farmtransparency.org; the other 294 failed, most seem to be YouTube videos but there were a few other failures, too. |
17:01:52 | <@JAA> | 1.53 TiB or so for the ones I got. |
17:11:53 | <@JAA> | First ones are running in AB, will start more as slots become available. |
17:14:33 | <h2ibot> | HadeanEon edited Deaths in 2000 (+16686, BOT - Updating page: {{saved}} (115),…): https://wiki.archiveteam.org/?diff=56912&oldid=56494 |
17:14:34 | <h2ibot> | HadeanEon edited Deaths in 2000/list (+1357, BOT - Updating list): https://wiki.archiveteam.org/?diff=56913&oldid=56495 |
17:17:37 | | NeonGlitch (NeonGlitch) joins |
17:18:09 | <NeonGlitch> | Heyo! Is https://www.openculture.com/2025/08/2178-occult-books-now-digitized-put-online.html an ArchiveTeam crawl opportunity? |
17:18:14 | | notSokar joins |
17:20:34 | <h2ibot> | HadeanEon edited Deaths in 2001 (+16214, BOT - Updating page: {{saved}} (7),…): https://wiki.archiveteam.org/?diff=56914&oldid=56496 |
17:20:34 | | Sokar quits [Ping timeout: 240 seconds] |
17:20:35 | <h2ibot> | HadeanEon edited Deaths in 2001/list (+1229, BOT - Updating list): https://wiki.archiveteam.org/?diff=56915&oldid=56497 |
17:25:09 | | ummmSokar joins |
17:27:54 | | notSokar quits [Ping timeout: 260 seconds] |
17:28:59 | | ericgallager quits [Quit: This computer has gone to sleep] |
17:29:35 | <h2ibot> | HadeanEon edited Deaths in 2002 (+18403, BOT - Updating page: {{saved}} (3),…): https://wiki.archiveteam.org/?diff=56916&oldid=56498 |
17:29:36 | <h2ibot> | HadeanEon edited Deaths in 2002/list (+1373, BOT - Updating list): https://wiki.archiveteam.org/?diff=56917&oldid=56499 |
17:35:36 | <h2ibot> | HadeanEon edited Deaths in 2003 (-623, BOT - Updating page: {{saved}} (5),…): https://wiki.archiveteam.org/?diff=56918&oldid=56242 |
17:35:37 | <h2ibot> | HadeanEon edited Deaths in 2003/list (-52, BOT - Updating list): https://wiki.archiveteam.org/?diff=56919&oldid=56243 |
17:43:37 | <h2ibot> | HadeanEon edited Deaths in 2004 (+834, BOT - Updating page: {{saved}} (6),…): https://wiki.archiveteam.org/?diff=56920&oldid=55479 |
17:43:38 | <h2ibot> | HadeanEon edited Deaths in 2004/list (+58, BOT - Updating list): https://wiki.archiveteam.org/?diff=56921&oldid=56500 |
17:46:28 | | notSokar joins |
17:47:14 | | ericgallager joins |
17:48:54 | | ummmSokar quits [Ping timeout: 260 seconds] |
17:55:39 | <h2ibot> | HadeanEon edited Deaths in 2005/list (+17, BOT - Updating list): https://wiki.archiveteam.org/?diff=56922&oldid=56350 |
17:58:49 | | NeonGlitch quits [Ping timeout: 260 seconds] |
18:01:12 | | NeonGlitch (NeonGlitch) joins |
18:11:41 | <h2ibot> | HadeanEon edited Deaths in 2007 (-1413, BOT - Updating page: {{saved}} (5),…): https://wiki.archiveteam.org/?diff=56923&oldid=56418 |
18:11:42 | <h2ibot> | HadeanEon edited Deaths in 2007/list (-58, BOT - Updating list): https://wiki.archiveteam.org/?diff=56924&oldid=56419 |
18:15:25 | | Sokar joins |
18:17:34 | | notSokar quits [Ping timeout: 240 seconds] |
18:32:45 | <h2ibot> | HadeanEon edited Deaths in 2009 (+274, BOT - Updating page: {{saved}} (9),…): https://wiki.archiveteam.org/?diff=56925&oldid=55650 |
18:32:46 | <h2ibot> | HadeanEon edited Deaths in 2009/list (+21, BOT - Updating list): https://wiki.archiveteam.org/?diff=56926&oldid=55651 |
18:46:47 | <h2ibot> | HadeanEon edited Deaths in 2010 (+473, BOT - Updating page: {{saved}} (204),…): https://wiki.archiveteam.org/?diff=56927&oldid=56351 |
18:46:48 | <h2ibot> | HadeanEon edited Deaths in 2010/list (+38, BOT - Updating list): https://wiki.archiveteam.org/?diff=56928&oldid=56352 |
18:46:58 | | notSokar joins |
18:48:54 | | Sokar quits [Ping timeout: 240 seconds] |
18:52:34 | | Sokar joins |
18:53:39 | | notSokar quits [Ping timeout: 260 seconds] |
18:58:32 | <c3manu> | NeonGlitch: it kinda reads like more books would be added in the future? |
18:59:49 | <h2ibot> | HadeanEon edited Deaths in 2011 (+55519, BOT - Updating page: {{saved}} (204),…): https://wiki.archiveteam.org/?diff=56929&oldid=56501 |
18:59:50 | <h2ibot> | HadeanEon edited Deaths in 2011/list (+4496, BOT - Updating list): https://wiki.archiveteam.org/?diff=56930&oldid=56502 |
19:03:20 | <NeonGlitch> | c3manu: Definitely a possibility. |
19:08:39 | | ericgallager quits [Client Quit] |
19:11:18 | | ericgallager joins |
19:16:52 | <h2ibot> | HadeanEon edited Deaths in 2012 (+50538, BOT - Updating page: {{saved}} (193),…): https://wiki.archiveteam.org/?diff=56931&oldid=56503 |
19:16:53 | <h2ibot> | HadeanEon edited Deaths in 2012/list (+4064, BOT - Updating list): https://wiki.archiveteam.org/?diff=56932&oldid=56504 |
19:21:06 | <c3manu> | NeonGlitch: "We are now pleased to report that the first 2,178 books from the Ritman project have come available in their online reading room." |
19:21:31 | <c3manu> | NeonGlitch: do you happen to know more about the project? |
19:22:17 | <NeonGlitch> | c3manu: I can find out more from folks who work on it. It's not at risk, so probably not ArchiveTeam worthy, although I thought there might be value in a shallow crawl to have a sitemap on hand for future archival needs. |
19:22:49 | | h2ibot quits [Remote host closed the connection] |
19:23:33 | <c3manu> | NeonGlitch: to me it definitely looks useful. but the page also seems somewhat JS-heavy, so it could not be that fun dealing with it ^^" |
19:24:17 | <NeonGlitch> | c3manu: That's fair, can punt on it for now. Sorry to have bothered! |
19:24:37 | <c3manu> | i'm asking because a) if there's a plan already for the next batch that is being worked on, i would wait with throwing it in, or b) if the books are available elsewhere in a more accessible form it might be preferable to fetch that and do the shallow thing you suggested |
19:24:58 | <c3manu> | NeonGlitch: no it's all good, thanks for telling us about it :) |
19:25:14 | <c3manu> | and anything i just said is just my personal opinion, too ;) |
19:28:39 | | @rewby quits [Ping timeout: 260 seconds] |
19:32:45 | | h2ibot (h2ibot) joins |
19:32:46 | <c3manu> | NeonGlitch: just from https://embassyofthefreemind.com/en/library/269-ritman-research-institute and the links in the article it doesn't sound like a short-term project. so waiting for "the right time" will probably have us waiting forever :) |
19:46:02 | <h2ibot> | HadeanEon edited Deaths in 2013 (+58012, BOT - Updating page: {{saved}} (211),…): https://wiki.archiveteam.org/?diff=56933&oldid=56505 |
19:46:03 | <h2ibot> | HadeanEon edited Deaths in 2013/list (+4487, BOT - Updating list): https://wiki.archiveteam.org/?diff=56934&oldid=56506 |
19:58:14 | | Wohlstand (Wohlstand) joins |
20:03:04 | <h2ibot> | HadeanEon edited Deaths in 2014 (+66380, BOT - Updating page: {{saved}} (15),…): https://wiki.archiveteam.org/?diff=56935&oldid=56507 |
20:03:05 | <h2ibot> | HadeanEon edited Deaths in 2014/list (+4802, BOT - Updating list): https://wiki.archiveteam.org/?diff=56936&oldid=56508 |
20:32:17 | | notSokar joins |
20:33:54 | | Sokar quits [Ping timeout: 240 seconds] |
20:50:12 | <h2ibot> | HadeanEon edited Deaths in 2016 (-1240, BOT - Updating page: {{saved}} (131),…): https://wiki.archiveteam.org/?diff=56937&oldid=56061 |
20:50:13 | <h2ibot> | HadeanEon edited Deaths in 2016/list (-120, BOT - Updating list): https://wiki.archiveteam.org/?diff=56938&oldid=56250 |
20:59:25 | | Jens quits [] |
21:00:11 | | Jens (JensRex) joins |
21:05:24 | | MrMcNuggets quits [Quit: WeeChat 4.3.2] |
21:08:33 | <@JAA> | For the record, I'm first running all of those farmtransparency.org videos through AB and only then looking at the ones that failed. |
21:12:14 | <pokechu22> | oh, I should also note that I ignored the vimeo embeds on the AB job in case there's a smarter way of handling them later |
21:13:01 | <pokechu22> | I doubt we can do it in a way that would make videos play back on web.archive.org (but hopefully we've done the POSTs right so that the download buttons will work) |
21:13:15 | | dabs joins |
21:14:38 | | Wake1 joins |
21:17:54 | | Wake quits [Ping timeout: 240 seconds] |
21:17:54 | | Wake1 is now known as Wake |
21:20:04 | | aninternettroll quits [Ping timeout: 260 seconds] |
21:22:10 | | ummmSokar joins |
21:23:54 | | notSokar quits [Ping timeout: 240 seconds] |
21:24:59 | | rewby (rewby) joins |
21:24:59 | | @ChanServ sets mode: +o rewby |
21:26:13 | | Sokar joins |
21:27:39 | | ummmSokar quits [Ping timeout: 260 seconds] |
21:29:25 | | aninternettroll (aninternettroll) joins |
21:33:20 | <h2ibot> | HadeanEon edited Deaths in 2017 (+427, BOT - Updating page: {{saved}} (373),…): https://wiki.archiveteam.org/?diff=56939&oldid=56511 |
21:33:21 | <h2ibot> | HadeanEon edited Deaths in 2017/list (+31, BOT - Updating list): https://wiki.archiveteam.org/?diff=56940&oldid=56512 |
21:39:42 | | etnguyen03 (etnguyen03) joins |
21:45:12 | | ericgallager quits [Client Quit] |
21:48:58 | | Wohlstand quits [Client Quit] |
21:52:54 | | Wohlstand (Wohlstand) joins |
21:56:58 | | ericgallager joins |
22:02:01 | <Barto> | https://social.anoxinon.de/@Codeberg/115033790447125787 interesting... |
22:12:27 | <h2ibot> | HadeanEon edited Deaths in 2018 (+102317, BOT - Updating page: {{saved}} (456),…): https://wiki.archiveteam.org/?diff=56941&oldid=56513 |
22:12:28 | <h2ibot> | HadeanEon edited Deaths in 2018/list (+8310, BOT - Updating list): https://wiki.archiveteam.org/?diff=56942&oldid=56514 |
22:17:05 | | lennier2 joins |
22:19:34 | | lennier2_ quits [Ping timeout: 240 seconds] |
22:22:25 | | ericgallager quits [Client Quit] |
22:30:36 | | etnguyen03 quits [Client Quit] |
22:37:11 | | Dada quits [Remote host closed the connection] |
22:39:27 | | Wohlstand quits [Client Quit] |
22:54:15 | | ericgallager joins |
22:59:29 | | SootBector quits [Remote host closed the connection] |
23:00:39 | | SootBector (SootBector) joins |
23:04:34 | <h2ibot> | HadeanEon edited Deaths in 2019 (-315, BOT - Updating page: {{saved}} (488),…): https://wiki.archiveteam.org/?diff=56943&oldid=56253 |
23:04:35 | <h2ibot> | HadeanEon edited Deaths in 2019/list (-53, BOT - Updating list): https://wiki.archiveteam.org/?diff=56944&oldid=56358 |
23:06:17 | | dabs quits [Read error: Connection reset by peer] |
23:30:19 | | Wohlstand (Wohlstand) joins |
23:40:54 | | tzt quits [Ping timeout: 240 seconds] |
23:42:55 | | etnguyen03 (etnguyen03) joins |
23:45:40 | <h2ibot> | HadeanEon edited Deaths in 2020/list (-2, BOT - Updating list): https://wiki.archiveteam.org/?diff=56945&oldid=56516 |
23:47:48 | | lennier2_ joins |
23:48:10 | | tzt (tzt) joins |
23:50:13 | | Wohlstand quits [Client Quit] |
23:50:34 | | lennier2 quits [Ping timeout: 260 seconds] |
23:53:54 | | tzt quits [Ping timeout: 240 seconds] |
23:54:33 | | etnguyen03 quits [Client Quit] |
23:54:45 | | tzt (tzt) joins |