00:03:36<h2ibot>HadeanEon edited Deaths in 2020 (+74096, BOT - Updating page: {{saved}} (47),…): https://wiki.archiveteam.org/?diff=55142&oldid=54754
00:03:37<h2ibot>HadeanEon edited Deaths in 2020/list (+5961, BOT - Updating list): https://wiki.archiveteam.org/?diff=55143&oldid=54843
00:05:35Wohlstand quits [Quit: Wohlstand]
00:09:29NeonGlitch (NeonGlitch) joins
00:29:41<h2ibot>HadeanEon edited Deaths in 2021 (+51965, BOT - Updating page: {{saved}} (50),…): https://wiki.archiveteam.org/?diff=55144&oldid=55034
00:29:42<h2ibot>HadeanEon edited Deaths in 2021/list (+4478, BOT - Updating list): https://wiki.archiveteam.org/?diff=55145&oldid=55035
00:31:36NeonGlitch quits [Client Quit]
00:35:20moth_ joins
00:46:18Wohlstand (Wohlstand) joins
00:56:12Wohlstand quits [Client Quit]
00:57:19gust quits [Quit: Leaving]
01:00:46<h2ibot>HadeanEon edited Deaths in 2022 (+14603, BOT - Updating page: {{saved}} (214),…): https://wiki.archiveteam.org/?diff=55146&oldid=54775
01:00:47<h2ibot>HadeanEon edited Deaths in 2022/list (+3132, BOT - Updating list): https://wiki.archiveteam.org/?diff=55147&oldid=55036
01:22:06riteo quits [Ping timeout: 260 seconds]
01:31:51<h2ibot>HadeanEon edited Deaths in 2023 (+3890, BOT - Updating page: {{saved}} (177),…): https://wiki.archiveteam.org/?diff=55148&oldid=55037
01:31:52<h2ibot>HadeanEon edited Deaths in 2023/list (+1909, BOT - Updating list): https://wiki.archiveteam.org/?diff=55149&oldid=55038
01:32:39riteo (riteo) joins
01:33:14IDK quits [Quit: Connection closed for inactivity]
01:54:43PredatorIWD258 joins
01:55:56PredatorIWD25 quits [Ping timeout: 260 seconds]
01:55:56PredatorIWD258 is now known as PredatorIWD25
01:56:17JDev joins
02:00:56<h2ibot>HadeanEon edited Deaths in 2024 (-4620, BOT - Updating page: {{saved}} (200),…): https://wiki.archiveteam.org/?diff=55150&oldid=55039
02:00:57<h2ibot>HadeanEon edited Deaths in 2024/list (+2151, BOT - Updating list): https://wiki.archiveteam.org/?diff=55151&oldid=55040
02:13:58<h2ibot>HadeanEon edited Deaths in 2025 (-1013, BOT - Updating page: {{saved}} (102),…): https://wiki.archiveteam.org/?diff=55152&oldid=55115
02:14:53dendory quits [Quit: The Lounge - https://thelounge.chat]
02:24:32hackbug quits [Remote host closed the connection]
02:24:49hackbug (hackbug) joins
02:27:46lunik1 quits [Quit: :x]
02:28:31lunik1 joins
02:42:04lunik1 quits [Client Quit]
02:42:34lunik1 joins
02:49:32moth_ quits [Ping timeout: 250 seconds]
02:49:38moth_ joins
02:52:08benjins3 quits [Ping timeout: 250 seconds]
03:09:59lennier2_ joins
03:12:56lennier2 quits [Ping timeout: 250 seconds]
03:47:06Webuser334279 joins
03:47:45<Webuser334279>would you like to append the support of fediverse?
03:56:42<nicolas17>read this first https://wiki.archiveteam.org/index.php/Fediverse
04:19:54BlueMaxima quits [Read error: Connection reset by peer]
04:33:12Webuser334279 quits [Client Quit]
04:36:43<@JAA>Arzon image archival is going well so far. There's 15 million of them, so it'll take a bit. ETA 35 hours, 500-600 GiB
05:01:26<h2ibot>JustAnotherArchivist edited アルゾン (-82): https://wiki.archiveteam.org/?diff=55153&oldid=54906
05:12:19lflare quits [Quit: Bye]
05:12:41lflare (lflare) joins
05:29:41chains joins
05:35:55<tzt>is there anyway to archive this site: https://ardata.cy.gov.tw? it is a database of political donations in taiwan, but it has captchas for everything except the internal API, it is at risk due to the government department running it having its budget cut https://www.taipeitimes.com/News/front/archives/2025/03/19/2003833668
05:43:18^ quits [Read error: Connection reset by peer]
05:45:10^ (^) joins
06:17:16moth_ quits [Ping timeout: 260 seconds]
06:22:31^ quits [Ping timeout: 260 seconds]
06:22:34^ (^) joins
06:29:40^ quits [Ping timeout: 250 seconds]
06:30:08^ (^) joins
06:31:15benjins3 joins
06:37:28^ quits [Ping timeout: 250 seconds]
06:37:44^ (^) joins
06:51:06^ quits [Ping timeout: 260 seconds]
07:05:11VoynichCR (VoynichCR) joins
07:10:10VoynichCR quits [Client Quit]
07:11:58^ (^) joins
07:16:14chains quits [Client Quit]
07:19:04^ quits [Ping timeout: 250 seconds]
07:19:36^ (^) joins
07:20:53<gareth48|m>JAA: how did the VKET image archive go? I think I saw you mention the imagespace above earlier
07:35:58^ quits [Ping timeout: 250 seconds]
07:36:42^ (^) joins
07:36:47egallager quits [Quit: This computer has gone to sleep]
07:44:11^ quits [Ping timeout: 260 seconds]
08:04:00ScenarioPlanet quits [Quit: Ping timeout (120 seconds)]
08:04:15TheTechRobo quits [Quit: Ping timeout (120 seconds)]
08:04:23abirkill quits [Quit: Let us prepare to grapple with the ineffable itself, and see if we may not eff it after all.]
08:04:52ScenarioPlanet (ScenarioPlanet) joins
08:04:54abirkill (abirkill) joins
08:04:54TheTechRobo (TheTechRobo) joins
08:05:17SpikedCola quits [Remote host closed the connection]
08:05:24Xe quits [Read error: Connection reset by peer]
08:05:29SpikedCola joins
08:05:55yasomi (yasomi) joins
08:06:51ScenarioPlanet quits [Client Quit]
08:06:57^ (^) joins
08:07:15ScenarioPlanet (ScenarioPlanet) joins
08:07:46egallager joins
08:08:36Pedrosso quits [Quit: Ping timeout (120 seconds)]
08:08:53Pedrosso joins
08:09:08ScenarioPlanet quits [Client Quit]
08:09:16tzt quits [Ping timeout: 260 seconds]
08:09:29ScenarioPlanet (ScenarioPlanet) joins
08:10:03tzt (tzt) joins
08:10:08yasomi quits [Read error: Connection reset by peer]
08:11:01chrismrtn quits [Ping timeout: 260 seconds]
08:11:24chrismrtn (chrismrtn) joins
08:14:31^ quits [Ping timeout: 260 seconds]
08:17:02Soulflare quits [Quit: http://drsclan.net]
08:18:37Soulflare joins
08:19:26yasomi (yasomi) joins
08:20:19^ (^) joins
08:23:20@arkiver quits [Remote host closed the connection]
08:23:45arkiver (arkiver) joins
08:23:45@ChanServ sets mode: +o arkiver
08:27:56^ quits [Ping timeout: 260 seconds]
08:41:15^ (^) joins
08:47:41@arkiver quits [Remote host closed the connection]
08:48:35arkiver (arkiver) joins
08:48:35@ChanServ sets mode: +o arkiver
09:16:25Webuser556988 joins
09:16:32Webuser556988 quits [Client Quit]
09:18:39grill (grill) joins
09:36:12<h2ibot>Exorcism edited Tubelious (+1): https://wiki.archiveteam.org/?diff=55154&oldid=51069
09:50:14<h2ibot>Exorcism edited WeVidi (+32): https://wiki.archiveteam.org/?diff=55155&oldid=50512
09:55:56JDev quits [Quit: Ooops, wrong browser tab.]
10:08:11charlotte_ joins
10:08:48<charlotte_>Would anyone here happen to know what this script does? https://gitea.arpa.li/JustAnotherArchivist/little-things/src/branch/master/s3-bucket-find-direct-url
10:09:23<charlotte_>Nothing I've tried to use it for seems to work and there are no examples provided.
10:09:47<c3manu>tzt: oof, that’s looking rough. all JS, and you seem to get nothing from *any* page without entering one
10:10:10<charlotte_>https://lfcdownload.leapfrog.com has https://lfcdownload.leapfrog.com.s3.amazonaws.com/ associated wih it but feeding it into the script doesn't return that.
10:11:43<charlotte_>https://lfecontent.leapfrog.com/ also returns nothing.
10:11:57<charlotte_>What DOES it work with? How does it work? What does it return?
10:12:40<c3manu>tzt: looks like individual downloads do work though. at least the PDF on https://ardata.cy.gov.tw/news/latestNews/125 did
10:15:22<c3manu>tzt: creating a list from the API should be possible
10:17:35<c3manu>tzt: but without being able to archive an index it’s kinda meh. somebody else has any ideas for https://ardata.cy.gov.tw?
10:17:58charlotte_ is now known as StarletCharlotte
10:26:15Snivy quits [Quit: The Lounge - https://thelounge.chat]
10:26:56grill quits [Ping timeout: 260 seconds]
10:27:01Snivy (Snivy) joins
10:28:42grill (grill) joins
10:41:00<c3manu>tzt: i started creating the lists, but for individuals it seems like i’d have 5600 pages to flip through individually :|
10:41:05<c3manu>eeh manually, i mean
10:53:41<c3manu>nvm, i confused the number of items with the number of pages. it’s just 12.
11:00:02Bleo18260072271962345 quits [Quit: The Lounge - https://thelounge.chat]
11:02:46Bleo18260072271962345 joins
11:04:24grill quits [Ping timeout: 250 seconds]
11:05:02<katato>c3manu: from what i understood, the captcha is completely local and likely embedded into one of js scripts, without javascript its likely nothing since it seems to be a single-page-app
11:06:12grill (grill) joins
11:06:34yasomi is now known as Xe
11:06:36<c3manu>yeah. new tab, start at main page with captcha prompt again
11:07:39<katato>give me a few moments
11:07:56<c3manu>without being able to grab the index, i think it would make sense to create a separate IA item as well. i might get back at you for help with the description (so people looking for it can actually find it)
11:15:21StarletCharlotte quits [Ping timeout: 260 seconds]
11:18:10Ketchup902 quits [Remote host closed the connection]
11:18:19Ketchup901 (Ketchup901) joins
11:22:32BornOn420 quits [Remote host closed the connection]
11:22:45sec^nd quits [Remote host closed the connection]
11:23:04BornOn420 (BornOn420) joins
11:23:06sec^nd (second) joins
11:24:31<h2ibot>Exorcism edited EraCast (+32): https://wiki.archiveteam.org/?diff=55156&oldid=50517
11:32:03<katato>c3manu: sorry for the wait, i messed around and came up with this one-liner that gets rid of captcha once put into console. luckily the framework they use is running in development mode, which enables us the internal "ng" tool
11:32:04<katato>ng.probe(document.querySelector('app-home')).componentInstance.state.acceptTos();ng.probe(document.querySelector('ngb-modal-window')).componentInstance.dismiss("bye")
11:32:44<katato>the first one disables the captcha guard, the second one disposes of the modal. the first one is important to make links active
11:34:00SkilledAlpaca418962 quits [Quit: SkilledAlpaca418962]
11:34:28SkilledAlpaca418962 joins
11:36:22<c3manu>katato: thanks! i don’t know how that’s helping with AB though
11:36:38egallager quits [Quit: This computer has gone to sleep]
11:39:53<katato>yea, its unfortunately not of much help since its something that has to be injected beforehand
11:49:30<c3manu>the last list is currently running. i found files for individuals, groups, groups yearly, elections, and the PDF attachments on the news pages
11:49:36<c3manu>is there anything that i missed?
11:49:53<c3manu>oh and the single PDF with the instructions
12:00:43StarletCharlotte joins
12:15:11StarletCharlotte quits [Remote host closed the connection]
12:15:24StarletCharlotte joins
12:21:29charlotte_ joins
12:24:11StarletCharlotte quits [Ping timeout: 260 seconds]
12:28:54charlotte_ quits [Ping timeout: 250 seconds]
12:30:56charlotte_ joins
12:40:24beardicus quits [Quit: bye]
12:41:38beardicus (beardicus) joins
12:44:37FiTheArchiver joins
12:53:56charlotte_ quits [Ping timeout: 260 seconds]
12:56:16lukash98 quits [Ping timeout: 260 seconds]
13:00:43charlotte_ joins
13:16:36T31M quits [Quit: ZNC - https://znc.in]
13:16:41grill quits [Ping timeout: 260 seconds]
13:16:55T31M joins
13:17:26charlotte_ quits [Ping timeout: 250 seconds]
13:18:19<@Fusl>!ao < https://fusl.phoenix.arpa.li/file/PW_vos-IxTW32LEb89dK0XKowxPynHl8/posts.cv_everyone_urls3_sorted_excl_urls2.txt --explain "https://posts.cv/ posts.cv links + outlinks since last crawl"
13:18:33<@Fusl>that is not archivebot
13:18:36<@Fusl>i was tricked
13:26:20charlotte_ joins
13:32:47scurvy_duck joins
13:47:01charlotte_ quits [Ping timeout: 260 seconds]
13:52:34th3z0l4_ joins
13:52:51th3z0l4 quits [Ping timeout: 260 seconds]
13:54:06charlotte_ joins
14:16:46charlotte_ quits [Ping timeout: 260 seconds]
14:19:50scurvy_duck quits [Ping timeout: 250 seconds]
14:45:47charlotte_ joins
14:46:34scurvy_duck joins
14:48:34gamer191 joins
14:48:51<gamer191>Why are some channels (including this one) publicly logged, but not bridged to Matrix?
14:49:41<katia>whether the hackint<>matrix bridge works is not up to archiveteam
14:49:44Riku_V quits [Ping timeout: 250 seconds]
14:49:55<katia>but only up to hackint staff
14:50:00Riku_V (riku) joins
14:50:12<katia>well, and matrix itself
14:50:31<katia>matrix tends to fall over often, so maybe it did just that?
14:51:12<gamer191>this channel isn't bridged at all as far as I can tell
14:52:13<gamer191>Like it's not included in the list of channels in Matrix's "explore public rooms" and searching "archiveteam" for it brings up no results
14:52:27lennier2 joins
14:52:31<gamer191>*searching "archiveteam-bs" brings up no results
14:52:53<gamer191>(Searching "archiveteam" does bring up lots of other channels, that was a typo)
14:53:42gamer191 quits [Client Quit]
14:53:54lennier2__ joins
14:55:16lennier2_ quits [Ping timeout: 260 seconds]
14:56:32gamer191 joins
14:56:34<gamer191>(irc keeps disconnecting for some reason, so I'll just monitor the logs)
14:56:38gamer191 quits [Client Quit]
14:56:40lennier2 quits [Ping timeout: 250 seconds]
14:57:01Riku_V quits [Ping timeout: 260 seconds]
14:57:33Riku_V (riku) joins
14:57:34<@imer>there does look to be matrix people in here, although not a clue how any of that works
15:00:41<TheTechRobo>there is an unofficial ArchiveTeam matrix group which has a bunch of the channels, I'm not sure how you join it
15:01:04<TheTechRobo>however all channels on hackint are bridged to Matrix unless the channel specifically opts out. See https://hackint.org/transport/matrix
15:01:23<nightpool>all hackint channels are accessible/bridged, you just have to join them directly #archiveteam-bs:hackint.org
15:01:27<TheTechRobo>so I guess join #archiveteam-bs:hackint.org ?
15:01:34<nightpool>yeah
15:01:39<TheTechRobo>🥷
15:01:50<nightpool>I am currently using matrix, fwiw
15:02:33grill (grill) joins
15:05:39hackbug quits [Remote host closed the connection]
15:08:41charlotte_ quits [Ping timeout: 260 seconds]
15:11:52hackbug (hackbug) joins
15:13:39hackbug quits [Remote host closed the connection]
15:21:54hackbug (hackbug) joins
15:22:19charlotte_ joins
15:25:59<Fijxu|m>there is a way to tell the warrior to save all the temporal data that is going to be uploaded to archive.org on RAM?
15:27:41<Fijxu|m>because I noticed that it uses the disk drive to save the files and then upload them to the archive, but I don't want to write the files to disk to prevent it from wearing out after processing a lot of data
15:29:23hackbug quits [Read error: Connection reset by peer]
15:29:24<glassy>i dont believe thats possible Fijxu|m each "job" the worker gets can equate to a lot of URLs which is packaged as a WARC. If it was to upload every single file individually (from RAM) this would slow things down further
15:32:09lennier2 joins
15:34:10hackbug (hackbug) joins
15:34:56lennier2__ quits [Ping timeout: 260 seconds]
15:38:17<TheTechRobo>Fijxu|m: Depends on how you're running the Warrior. If you're using docker/podman, you can do a bind-mount of /home/warrior/data/projects to a tmpfs (cf. https://github.com/ArchiveTeam/warrior-dockerfile/issues/83).
15:38:49hackbug quits [Read error: Connection reset by peer]
15:39:04<Fijxu|m>cool, I was thinking about something like that
15:44:51charlotte_ quits [Ping timeout: 260 seconds]
15:45:05hackbug (hackbug) joins
15:55:41egallager joins
16:08:31moth_ joins
16:08:32moth_ quits [Client Quit]
16:18:07Webuser101156 joins
16:19:17Webuser101156 quits [Client Quit]
16:21:10scurvy_duck quits [Ping timeout: 250 seconds]
16:23:49BennyOtt quits [Quit: ZNC 1.9.1 - https://znc.in]
16:24:27BennyOtt (BennyOtt) joins
16:30:00BennyOtt quits [Client Quit]
16:31:49BennyOtt (BennyOtt) joins
16:34:55scurvy_duck joins
16:40:16scurvy_duck quits [Ping timeout: 260 seconds]
16:49:26<h2ibot>Himond000 edited Deathwatch (+190, /* 2025 */ add fc2web.com): https://wiki.archiveteam.org/?diff=55157&oldid=55069
16:50:12gaz joins
16:51:45SootBector (SootBector) joins
17:10:22scurvy_duck joins
17:19:04Gadelhas5628737 joins
17:27:34<@JAA>charlotte_: s3-bucket-find-direct-url only works with open buckets that get served from a custom domain.
17:29:16<katia>(they since parted)
17:32:06<@JAA>(Oh well)
17:36:42<katia>)))))))))
17:38:36aninternettroll quits [Ping timeout: 260 seconds]
17:42:50aninternettroll (aninternettroll) joins
17:47:21aninternettroll quits [Ping timeout: 260 seconds]
17:53:36<h2ibot>Bzc6p edited Indafotó (+168, /* Progress */ last update before shutdown): https://wiki.archiveteam.org/?diff=55158&oldid=55055
17:53:37aninternettroll (aninternettroll) joins
17:58:26aninternettroll quits [Ping timeout: 260 seconds]
18:07:04aninternettroll (aninternettroll) joins
18:09:31Wohlstand (Wohlstand) joins
18:09:56myself quits [Read error: Connection reset by peer]
18:10:46myself (myself) joins
18:19:26scurvy_duck quits [Ping timeout: 260 seconds]
18:25:13<FiTheArchiver>idk where to bring attention to this, but disney still have all of their old websites up on disney.io and more specifically this subdomain http://go-60de6c82-be11-98e1-4d6c-c65a234eee95.disney.io/ i have a huge interest in disney channel especially the old websites so i thought id ask is there any possible way to archive this site somehow?
18:26:30<katia>this seems to just redirect
18:26:39<@imer>404 on .io for me
18:26:43<FiTheArchiver>no hold up ill show an example
18:26:49<FiTheArchiver>http://go-60de6c82-be11-98e1-4d6c-c65a234eee95.disney.io/disneyvideos/liveaction/hannahmontana/index.html
18:26:57<@imer>riight
18:27:03<FiTheArchiver>they still have this up, u can find a few by doing site:disney.io on google too
18:28:36<katia>if you can compile a list of such links i can add them to archivebot for you FiTheArchiver
18:29:06<FiTheArchiver>ok, will do!!
18:30:04<@imer>http://go-60de6c82-be11-98e1-4d6c-c65a234eee95.disney.io/disneyvideos/liveaction/ smells like S3/object storage, probably no way to get a listing though
18:30:48<FiTheArchiver>on wayback machine the link has like thousands of archived video files from 2023 so im not sure if someone already uncovered all of this
18:34:51<FiTheArchiver>i think there's so little of these links i could probably just put them through save page now myself i was just wondering if archivebot could run the whole site but since there's no specific listing and u have to dig yourself i doubt it
18:35:04<FiTheArchiver>thank u for the help though!!
18:38:06Ketchup901 quits [Remote host closed the connection]
18:38:23Ketchup901 (Ketchup901) joins
18:42:13VoynichCR (VoynichCR) joins
18:48:34VoynichCR quits [Client Quit]
18:58:31scurvy_duck joins
19:11:10Ketchup901 quits [Remote host closed the connection]
19:11:16Ketchup901 (Ketchup901) joins
19:19:33devkev quits [Quit: The Lounge - https://thelounge.chat]
19:21:36egallager quits [Quit: This computer has gone to sleep]
19:27:35devkev (devkev) joins
19:37:37Doranwen quits [Read error: Connection reset by peer]
19:38:15Doranwen (Doranwen) joins
19:43:18loug83181422 joins
19:48:09Ketchup901 quits [Remote host closed the connection]
19:48:15Ketchup901 (Ketchup901) joins
20:35:58loug83181422 quits [Client Quit]
20:44:43charlotte_ joins
20:51:06scurvy_duck quits [Ping timeout: 260 seconds]
20:59:15BlueMaxima joins
21:44:45BearFortress quits []
21:50:30charlotte_ quits [Ping timeout: 250 seconds]
21:53:48charlotte_ joins
22:08:43<@JAA>Arzon images are still on track, about halfway done now after 18 hours.
22:16:24<h2ibot>Pokechu22 edited FTP/List (+83, ftp://www.istorichka.ru/): https://wiki.archiveteam.org/?diff=55159&oldid=50861
22:18:40grill quits [Ping timeout: 250 seconds]
22:20:39grill (grill) joins
22:28:43BearFortress joins
23:05:28grill quits [Ping timeout: 250 seconds]
23:13:12BlueMaxima quits [Read error: Connection reset by peer]
23:20:38gaz quits [Quit: Ooops, wrong browser tab.]
23:49:01charlotte_ quits [Ping timeout: 260 seconds]
23:57:38Webuser432312 joins
23:58:52egallager joins
23:59:37Webuser432312 quits [Client Quit]