00:01:32APOLLO03 quits [Read error: Connection reset by peer]
00:03:15APOLLO03 joins
00:03:39Yakov8 is now known as Yakov
00:05:09<klea>2026-02-26 00:03:56 <nulldata> https://learn.redhat.com/ <- Shutting down March 31, 2026. Running in AB just gives HTTP code 202. https://learn.redhat.com/t5/Red-Hat-Learning-Community-News/Evolving-how-we-learn-together/ba-p/57899 (forwarded from #archiveteam)
00:05:33<nicolas17>fuck sake
00:06:04<nicolas17>pokechu22: au site again not wanting to load in my browser, what's the max page number?
00:06:09<nicolas17>I'm at 2300
00:06:24<pokechu22>https://www.classification.gov.au/classification-ratings/latest-classification-decisions?field_rating%5B1%5D=1&page=192046
00:06:38<pokechu22>so you're at 1%
00:06:47<billybobbyjoe>nicolas17: 3840935 total lmao
00:07:58<nicolas17>billybobbyjoe: no like, pages in the *list*
00:08:12<nicolas17>there's 20 movies/games/things listed in each page
00:08:40<billybobbyjoe>well yes; each movie/game/thing is also, itself, a page.
00:09:25<billybobbyjoe>so there are 3840935 title pages AND 190447
00:09:35<billybobbyjoe>pages for the list
00:12:28etnguyen03 quits [Client Quit]
00:13:21<billybobbyjoe>not to mention the numerous filters, each base list page can be filtered in 16 different combinations (base included)
00:14:10tekulvw (tekulvw) joins
00:15:03<billybobbyjoe>and i don't wanna show up here just to burden yall but unfortunately in my 10 minutes of searching i've found numerous other extremely important .gov.au domains almost entirely absent lol
00:16:04<nicolas17>I'm only collecting the list to get the URLs of all the titles
00:16:14<billybobbyjoe>right ok
00:16:36<nicolas17>to actually archive the list for WBM it has to be done differently
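As a rough cross-check of the figures quoted above, here is a minimal sketch of the pagination arithmetic. It takes the 3840935 total and the 20 entries per list page from the channel, and it assumes the `page` query parameter is zero-indexed (an assumption, not something confirmed in the log). The function names are hypothetical.

```python
# Pagination arithmetic for the list pages discussed above.
# TOTAL_TITLES and PER_PAGE are the figures quoted in the channel;
# the zero-indexed `page` parameter is an assumption.
TOTAL_TITLES = 3840935
PER_PAGE = 20


def page_count(total, per_page):
    """Number of list pages needed to show `total` entries."""
    return -(-total // per_page)  # integer ceiling division


def last_page_index(total, per_page):
    """Index of the final page if numbering starts at 0 (assumed)."""
    return page_count(total, per_page) - 1


def progress_percent(current_page, total, per_page):
    """Share of list pages fetched so far, as a percentage."""
    return 100 * current_page / page_count(total, per_page)


if __name__ == "__main__":
    print(page_count(TOTAL_TITLES, PER_PAGE))
    print(last_page_index(TOTAL_TITLES, PER_PAGE))
    print(round(progress_percent(2300, TOTAL_TITLES, PER_PAGE), 1))
```

Under those assumptions the last page index works out to 192046, which matches the `page=192046` in the URL posted earlier, and page 2300 is a little over 1% of the way through.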
00:19:03tekulvw quits [Ping timeout: 272 seconds]
00:24:45<billybobbyjoe>legislation.gov.au, the literal national register of laws for the entire nation: 170795 pieces of legislation, only 1731 total captures
00:24:46<billybobbyjoe>data.gov.au, the national collated register of open data: 109,441 datasets, 6021 total captures
00:24:46<billybobbyjoe>aph.gov.au, anything parliament house related: 7749 total captures, 1.8 million individual Hansard pages alone (i'd guess hansard is only ~20 percent of that whole domain)
00:24:46<billybobbyjoe>i could go on really forever but yeah, full sweep of the entire gov.au domain would be ideal fr
00:24:46<billybobbyjoe>i say that as if i have any idea what the fuck i'm doing lmao
00:25:15<billybobbyjoe>if yall want help i will offer it but someone will need to explain this all to me because i have no idea what half of this means
00:25:31<billybobbyjoe>willing to learn but very confused haha
00:28:01<billybobbyjoe>willing to commit some time to scan all these subdomains for search records
00:28:38<billybobbyjoe>the ones i listed are the ones i use on a regular basis and know off the top of my head so who truly knows how wide reaching this is
00:31:02APOLLO03 quits [Client Quit]
00:31:43<pokechu22>If you can identify important sites, that would be helpful. I do see https://www.legislation.gov.au/gazettes/historic/2004 has a few captures and https://www.legislation.gov.au/files/gazettes/historic/2004/2004GN01.pdf has none
00:33:19APOLLO03 joins
00:33:52<pokechu22>https://www.legislation.gov.au/ doesn't seem to be akamai so I can probably run that one in archivebot directly - the site seems to be somewhat scripty but also has versions that work without javascript
00:35:00tekulvw (tekulvw) joins
00:40:05tekulvw quits [Ping timeout: 268 seconds]
00:46:02etnguyen03 (etnguyen03) joins
00:52:45tekulvw joins
00:57:21tekulvw quits [Ping timeout: 268 seconds]
00:57:27etnguyen03 quits [Client Quit]
00:59:37<klea>2026-02-26 00:58:03 <Slimm> https://www.theverge.com/news/884824/corsair-ending-drop-shopping-site drop.com will be ceasing operations next month and likely purged/shut down (forums, site, reviews, etc) (forwarded from #archiveteam)
01:02:45Arcorann__ quits [Ping timeout: 272 seconds]
01:12:44Express826 joins
01:13:19Express826 quits [Client Quit]
01:17:28APOLLO03 quits [Client Quit]
01:17:53APOLLO03 joins
01:20:12lennier2_ quits [Read error: Connection reset by peer]
01:20:28lennier2_ joins
01:20:55tekulvw (tekulvw) joins
01:29:21tekulvw quits [Ping timeout: 272 seconds]
01:30:33tekulvw (tekulvw) joins
01:38:52tekulvw quits [Remote host closed the connection]
01:39:09tekulvw (tekulvw) joins
01:54:05tekulvw quits [Ping timeout: 268 seconds]
02:14:29sec^nd quits [Remote host closed the connection]
02:14:52tekulvw (tekulvw) joins
02:14:59sec^nd (second) joins
02:21:05billybobbyjoe quits [Quit: Ooops, wrong browser tab.]
02:25:05tekulvw quits [Ping timeout: 272 seconds]
02:26:21pabs quits [Ping timeout: 272 seconds]
02:27:10sec^nd quits [Remote host closed the connection]
02:27:41sec^nd (second) joins
02:34:10tekulvw (tekulvw) joins
02:42:49tekulvw quits [Ping timeout: 272 seconds]
02:49:32tekulvw (tekulvw) joins
02:56:51etnguyen03 (etnguyen03) joins
02:58:50tekulvw quits [Ping timeout: 268 seconds]
03:09:05APOLLO03 quits [Client Quit]
03:09:28APOLLO03 joins
03:10:10etnguyen03 quits [Client Quit]
03:18:00etnguyen03 (etnguyen03) joins
03:21:52tekulvw (tekulvw) joins
03:25:20pabs (pabs) joins
03:26:35tekulvw quits [Ping timeout: 268 seconds]
03:31:57tekulvw joins
03:36:42<h2ibot>Pokechu22 edited ArchiveBot/Ignore (+49, /* Pinterest */ s.pinimg.com/webapp too): https://wiki.archiveteam.org/?diff=60556&oldid=60495
03:39:49tekulvw quits [Ping timeout: 272 seconds]
03:42:15PredatorIWD253 joins
03:44:53PredatorIWD25 quits [Ping timeout: 272 seconds]
03:44:53PredatorIWD253 is now known as PredatorIWD25
03:51:59tekulvw (tekulvw) joins
03:56:10etnguyen03 quits [Client Quit]
03:59:27tekulvw quits [Ping timeout: 272 seconds]
04:00:38legoktm quits [Quit: http://quassel-irc.org - Chat comfortably. Anywhere.]
04:00:41legoktm joins
04:01:28etnguyen03 (etnguyen03) joins
04:10:18etnguyen03 quits [Remote host closed the connection]
04:16:11APOLLO03 quits [Client Quit]
04:17:06APOLLO03 joins
04:19:02tekulvw (tekulvw) joins
04:26:41tekulvw quits [Ping timeout: 272 seconds]
04:30:22tekulvw (tekulvw) joins
04:33:42nine quits [Quit: See ya!]
04:33:55nine joins
04:33:56nine quits [Changing host]
04:33:56nine (nine) joins
04:36:05Arcorann__ (Arcorann) joins
04:37:17Island_ quits [Read error: Connection reset by peer]
04:37:30tekulvw quits [Ping timeout: 268 seconds]
05:04:38n9nes quits [Ping timeout: 268 seconds]
05:05:13n9nes joins
05:13:50APOLLO03 quits [Read error: Connection reset by peer]
05:14:42DogsRNice quits [Read error: Connection reset by peer]
05:14:57APOLLO03 joins
05:18:12<nicolas17>I misread the number and thought I was almost done with the classification ratings
05:18:16<nicolas17>I am in fact almost 10% done
05:30:43<Hans5958>Is #losttenure official? Why is it not announced on #archiveteam (or mentioned on the MOTD; it still refers to #archiveteam-bs)
05:31:16<nicolas17>what
05:31:37<nicolas17>we don't put every new project channel in the MOTD
05:32:45<Hans5958>I mentioned it since it still says "We know about the Tenor API → #archiveteam-bs"
05:33:02<Hans5958>If it is official then it should point to #losttenure, no?
05:33:16SootBector quits [Remote host closed the connection]
05:33:26<Hans5958>(on #archiveteam)
05:34:33SootBector (SootBector) joins
05:35:55<nicolas17>oh
05:36:13<nicolas17>I missed that part x_x
05:40:19APOLLO03 quits [Client Quit]
05:41:23APOLLO03 joins
05:42:25eythian quits [Quit: http://quassel-irc.org - Chat comfortabel. Waar dan ook.]
05:43:59eythian joins
05:50:55pabs quits [Ping timeout: 272 seconds]
05:52:06pabs (pabs) joins
06:03:29APOLLO03 quits [Client Quit]
06:03:30LddPotato quits [Read error: Connection reset by peer]
06:04:24LddPotato (LddPotato) joins
06:04:29APOLLO03 joins
06:07:57<pabs>hmm, archive.today is returning "Server Error" for me on archival, anyone else?
06:10:19<pabs>nicolas17: NLA uses Brozzler btw, so they can get JSy things
06:11:29APOLLO03 quits [Client Quit]
06:12:54APOLLO03 joins
06:13:57<pokechu22>Same
06:14:26nexussfan quits [Quit: Konversation terminated!]
06:21:48LddPotato quits [Read error: Connection reset by peer]
06:23:08LddPotato (LddPotato) joins
06:30:52lennier2_ quits [Read error: Connection reset by peer]
06:31:08lennier2_ joins
06:35:06LddPotato quits [Read error: Connection reset by peer]
06:35:50LddPotato (LddPotato) joins
06:45:52tekulvw (tekulvw) joins
06:51:05tekulvw quits [Ping timeout: 272 seconds]
06:51:57LddPotato quits [Read error: Connection reset by peer]
06:52:35LddPotato (LddPotato) joins
07:00:12APOLLO03 quits [Client Quit]
07:00:31APOLLO03 joins
07:01:05tekulvw (tekulvw) joins
07:03:36LddPotato quits [Read error: Connection reset by peer]
07:04:03LddPotato (LddPotato) joins
07:06:07tekulvw quits [Ping timeout: 268 seconds]
07:11:59Webuser613121 joins
07:12:14Webuser613121 quits [Client Quit]
07:36:57tekulvw (tekulvw) joins
07:41:45tekulvw quits [Ping timeout: 272 seconds]
08:03:03lennier2 joins
08:03:17APOLLO03 quits [Ping timeout: 272 seconds]
08:03:39APOLLO03 joins
08:06:27lennier2_ quits [Ping timeout: 272 seconds]
08:06:56fangfufu_ joins
08:09:01fangfufu quits [Ping timeout: 268 seconds]
08:09:43<nicolas17>ok my curl requests are not working anymore
08:10:00GodzFire quits [Quit: Ooops, wrong browser tab.]
08:12:43<nicolas17>got it working again... will do only 2 concurrent instead of 3
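The log only says the requests were made with curl and that concurrency was dropped from 3 to 2. As a hedged illustration of capping concurrent fetches, here is a minimal Python sketch. `fetch_all` and its `fetch` callback are hypothetical names, and the thread pool is a stand-in technique, not the setup actually used.

```python
# Bounded-concurrency sketch: at most `max_workers` fetches run at once.
# `fetch` is any callable taking a URL; it is a placeholder here,
# not the curl invocation from the log.
from concurrent.futures import ThreadPoolExecutor


def fetch_all(urls, fetch, max_workers=2):
    """Apply `fetch` to every URL with a fixed worker limit.

    Results come back in the same order as `urls`.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(fetch, urls))


if __name__ == "__main__":
    # Stand-in fetch function so the example runs without network access.
    print(fetch_all(["a", "b", "c"], str.upper, max_workers=2))
```

Lowering `max_workers` is the only knob needed to back off when a server starts refusing requests.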
08:23:25APOLLO03a joins
08:23:53APOLLO03 quits [Read error: Connection reset by peer]
08:34:24APOLLO03a quits [Read error: Connection reset by peer]
08:35:49APOLLO03 joins
08:43:28APOLLO03 quits [Client Quit]
08:43:54APOLLO03 joins
09:06:33<TheoH7>Just got round to uploading the archive I did of https://community.jisc.ac.uk to IA:
09:06:38<TheoH7>https://archive.org/details/community.jisc.ac.uk-2026-02-16-8e903462-00000.warc
09:09:47<TheoH7>This is the one where I'd successfully got links to https://community.ja.net to redirect to the same domain as that was just an alias for the same IPs.
09:11:00APOLLO03a joins
09:11:18APOLLO03 quits [Ping timeout: 268 seconds]
09:27:31ducky quits [Ping timeout: 272 seconds]
09:29:39ducky (ducky) joins
09:38:10tekulvw (tekulvw) joins
09:45:53tekulvw quits [Ping timeout: 272 seconds]
09:48:18APOLLO03a quits [Read error: Connection reset by peer]
09:52:27nulldata-alt1 quits [Quit: Ping timeout (120 seconds)]
09:52:46APOLLO03 joins
10:07:25Guest quits [Ping timeout: 272 seconds]
10:07:41Washuu joins
10:15:14arch quits [Remote host closed the connection]
10:15:46arch (arch) joins
10:18:01APOLLO03 quits [Client Quit]
10:18:50APOLLO03 joins
10:25:37Guest joins
10:30:51Guest quits [Ping timeout: 272 seconds]
10:33:59APOLLO03 quits [Client Quit]
10:35:28APOLLO03 joins
10:46:24Dada joins
10:48:45Guest joins
10:55:31Guest quits [Ping timeout: 268 seconds]
11:01:23Guest joins
11:01:28croissant_ joins
11:05:23croissant quits [Ping timeout: 268 seconds]
11:05:50tekulvw (tekulvw) joins
11:06:37Guest quits [Ping timeout: 268 seconds]
11:07:49APOLLO03 quits [Client Quit]
11:09:28APOLLO03 joins
11:10:45tekulvw quits [Ping timeout: 272 seconds]
11:11:53Guest joins
11:17:05Guest quits [Ping timeout: 272 seconds]
11:20:15APOLLO03 quits [Client Quit]
11:20:46APOLLO03 joins
11:31:29APOLLO03 quits [Client Quit]
11:33:45APOLLO03 joins
11:42:35APOLLO03 quits [Client Quit]
11:42:52APOLLO03 joins
12:00:01Bleo1826007227196234552220 quits [Quit: The Lounge - https://thelounge.chat]
12:02:43Bleo1826007227196234552220 joins
12:09:36tekulvw (tekulvw) joins
12:12:26khaoohs__ quits [Read error: Connection reset by peer]
12:13:05khaoohs__ joins
12:14:43tekulvw quits [Ping timeout: 272 seconds]
12:15:06atphoenix_ quits [Read error: Connection reset by peer]
12:15:41atphoenix_ (atphoenix) joins
12:16:26Snivy quits [Quit: Ping timeout (120 seconds)]
12:18:18Snivy (Snivy) joins
12:22:43<@Fusl>nulldata: re https://learn.redhat.com/ looks like it wants not only proper UA but also cookies that are generated on first visit
12:49:44Washuu quits [Client Quit]
13:12:44cyanbox quits [Read error: Connection reset by peer]
13:37:24xtheaurisx joins
13:37:36xtheaurisx quits [Client Quit]
13:40:51Arcorann__ quits [Ping timeout: 272 seconds]
13:49:06APOLLO03 quits [Client Quit]
13:50:11APOLLO03 joins
14:05:27APOLLO03 quits [Read error: Connection reset by peer]
14:07:53APOLLO03 joins
14:17:47midou quits [Ping timeout: 268 seconds]
14:22:41midou joins
14:38:20<@arkiver>on learn.redhat.com, should be good with AB *i think*
14:41:55<masterx244|m>sometimes a sacrificial first URL (with some meaningless extra parameters) to prime the cookies can be a trick there. had to do that on a crawl once, too: i copied the first URL of the URL list and added a garbage parameter, so the unparameterized one got fetched with the right flags set. there was an ad interstitial on the first visit of that site that had to be skipped since it was useless for archival, and putting that onto a different URL
14:41:55<masterx244|m>was the easiest method
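The priming trick described above can be sketched roughly as follows. This is a minimal illustration, not the tooling used in that crawl: the helper names and the `_prime` parameter are hypothetical, and it only builds the URL list, leaving the actual fetching and cookie handling to the crawler.

```python
# Build a URL list whose first entry is a "sacrificial" copy of the first
# real URL with a meaningless query parameter added. The idea is that the
# first-visit cookie/interstitial lands on the throwaway URL.
# The parameter name "_prime" is an arbitrary placeholder.
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def with_dummy_param(url, name="_prime", value="1"):
    """Return `url` with one extra query parameter appended."""
    parts = urlsplit(url)
    query = parse_qsl(parts.query, keep_blank_values=True)
    query.append((name, value))
    return urlunsplit(parts._replace(query=urlencode(query)))


def prepend_priming_url(urls):
    """Put a sacrificial variant of the first URL ahead of the real list."""
    urls = list(urls)
    if not urls:
        return []
    return [with_dummy_param(urls[0])] + urls


if __name__ == "__main__":
    for u in prepend_priming_url(["https://example.com/a?x=1",
                                  "https://example.com/b"]):
        print(u)
```

The real first URL stays in the list unchanged, so it is still fetched once the cookies from the throwaway request are in place.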
14:45:23FiTheArchiver joins
14:52:17FiTheArchiver quits [Client Quit]
14:54:01simon816 quits [Quit: ZNC 1.10.1 - https://znc.in]
14:54:10^ quits [Ping timeout: 268 seconds]
14:54:14^ (^) joins
15:00:43simon816 (simon816) joins
15:02:11^ quits [Ping timeout: 268 seconds]
15:02:12midou quits [Ping timeout: 268 seconds]
15:02:17^ (^) joins
15:05:35catbottom quits [Quit: ZNC 1.9.1+deb2+b3 - https://znc.in]
15:06:51catbottom joins
15:10:11tekulvw (tekulvw) joins
15:15:13tekulvw quits [Ping timeout: 272 seconds]
15:28:59APOLLO03 quits [Client Quit]
15:30:13APOLLO03 joins
15:30:26<@arkiver>we have a little emergency project coming for https://numerabilis.u-paris.fr/medica/bibliotheque-numerique/
15:30:32<@arkiver>shutting down on the 28th
15:31:47Nekroschizofrenetyk joins
15:31:48<eggdrop>[tell] Nekroschizofrenetyk: [2026-02-21T12:13:57Z] <justauser> https://www.olawsky.de/schlesien/forum.html works for me. Want an AB run?
15:32:01<justauser>Actually, already started.
15:32:13<Nekroschizofrenetyk>Hi
15:32:40<Nekroschizofrenetyk>direct links to separate messages work for you? Like this: https://www.olawsky.de/forum/messages/10462.html
15:32:47<Nekroschizofrenetyk>Yeah, that would be great!
15:32:51<justauser>Should be all done already.
15:33:19<justauser>About 1G, list of URLs saved here: https://archive.org/download/archiveteam_archivebot_go_20260224194814_be59de7b/www.olawsky.de-inf-20260224-172857-bfa95-meta.warc.gz
15:34:16<Nekroschizofrenetyk>oh, yes
15:34:24<Nekroschizofrenetyk>great, fantastic!
15:34:32<Nekroschizofrenetyk>by the way
15:34:46<Nekroschizofrenetyk>are you still going to run the Archiwum Allegro project?
15:34:55<Nekroschizofrenetyk>because it might be pointless
15:35:18<justauser>I think it concluded with "potato".
15:35:20<Nekroschizofrenetyk>urls of archival auctions redirect to current ones or similar
15:38:43Nekroschizofrenetyk quits [Client Quit]
15:50:39lennier2_ joins
15:52:19APOLLO03 quits [Client Quit]
15:53:38APOLLO03 joins
15:53:51lennier2 quits [Ping timeout: 272 seconds]
16:04:15iPwnedYourIOTSmartdog quits [Quit: Ping timeout (120 seconds)]
16:04:29iPwnedYourIOTSmartdog joins
16:07:46VerifiedJ quits [Remote host closed the connection]
16:08:31VerifiedJ (VerifiedJ) joins
16:10:19ducky quits [Ping timeout: 272 seconds]
16:20:08Island joins
16:28:07ducky (ducky) joins
16:32:00tekulvw (tekulvw) joins
16:41:21tekulvw quits [Ping timeout: 272 seconds]
16:58:16APOLLO03 quits [Client Quit]
16:58:42APOLLO03 joins
17:03:33tekulvw (tekulvw) joins
17:08:36tekulvw quits [Ping timeout: 268 seconds]
17:15:50tekulvw (tekulvw) joins
17:20:37tekulvw quits [Ping timeout: 272 seconds]
17:21:13Webuser726658 joins
17:23:12lennier2_ quits [Read error: Connection reset by peer]
17:23:26lennier2_ joins
17:30:48APOLLO03 quits [Ping timeout: 268 seconds]
17:32:01APOLLO03 joins
17:44:39Wohlstand (Wohlstand) joins
17:47:57tekulvw (tekulvw) joins
17:52:55tekulvw quits [Ping timeout: 272 seconds]
17:54:30APOLLO03 quits [Client Quit]
17:55:07APOLLO03 joins
17:56:12rover joins
17:57:59roverinexile quits [Ping timeout: 272 seconds]
18:08:13APOLLO03 quits [Client Quit]
18:08:51APOLLO03 joins
18:08:56DogsRNice joins
18:25:22tekulvw (tekulvw) joins
18:32:28tekulvw quits [Ping timeout: 268 seconds]
18:34:09APOLLO03 quits [Client Quit]
18:34:58APOLLO03 joins
18:51:56tekulvw (tekulvw) joins
18:56:53tekulvw quits [Ping timeout: 272 seconds]
19:10:22APOLLO03a joins
19:10:42APOLLO03 quits [Ping timeout: 268 seconds]
19:15:15aninternettroll quits [Ping timeout: 272 seconds]
19:21:44tekulvw (tekulvw) joins
19:21:48APOLLO03a quits [Ping timeout: 268 seconds]
19:23:08APOLLO03 joins
19:24:40aninternettroll (aninternettroll) joins
19:26:39tekulvw quits [Ping timeout: 272 seconds]
19:26:58<@arkiver>imer: for whenever you are around, we have a (i think not huge) project coming up with close deadline for Medica - Bibliothèque Numérique. i made the tracker under "medica"
19:27:10<@arkiver>whenever possible could add a target under "medica" with
19:27:13<@arkiver>archiveteam_medica
19:27:16<@arkiver>medica_
19:27:31<@imer>yep, i'll set that up soon
19:27:32<@arkiver>Archive Team Medica Bibliothèque numérique:
19:27:36<@arkiver>thanks a lot!
19:27:41<@arkiver>i will be starting this after sleep
19:27:51<@arkiver>so no need to get it up immediately
19:27:53<@arkiver>good night :)
19:28:02<@arkiver>or day, or something :)
19:32:37DogsRNice_ joins
19:35:22DogsRNice quits [Ping timeout: 268 seconds]
19:36:01tekulvw (tekulvw) joins
19:36:57<nicolas17>oh no
19:37:11<nicolas17>the classification.gov.au website added more titles