00:01:05Island joins
00:12:50andrew quits [Quit: Ping timeout (120 seconds)]
00:13:11andrew (andrew) joins
00:20:07kitonthe2et joins
00:23:54fishingforsoup joins
00:25:20kitonthe2et quits [Ping timeout: 240 seconds]
00:39:49<pabs>qwertyasdfuiopghjkl arkiver - btw channel for that is #justsolve, JAA pinged the admin too
00:40:28<nicolas17>I agree with adding it to #nodeping, and that doesn't even need to wait for the site to be back
00:43:32parfait (kdqep) joins
00:48:58fishingforsoup quits [Client Quit]
00:49:11fishingforsoup joins
00:49:13<qwertyasdfuiopghjkl>nicolas17: I think I've seen notifications for it in #nodeping before, it's just not detecting the outage this time because it's still giving a 200 OK status code.
00:49:39<nicolas17>oh
00:49:41<nicolas17>oof
00:49:43<nicolas17>fucking PHP
00:50:35<nicolas17>how is "require(): Unable to allocate memory for pool" not a fatal error returning 500?
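The failure mode described here (an error page served with 200 OK, so a status-only monitor sees nothing wrong) can be caught by also inspecting the response body. A minimal sketch, not how #nodeping is actually configured: the first pattern is the message quoted above, the second is an assumed generic PHP error banner, and `looks_down` is a hypothetical helper name.

```python
import re

# First pattern is quoted in the log; the second is an assumed
# generic PHP error banner and may need adjusting per site.
ERROR_PATTERNS = [
    re.compile(r"Unable to allocate memory for pool"),
    re.compile(r"<b>(?:Fatal error|Warning)</b>:"),
]


def looks_down(status: int, body: str) -> bool:
    """Return True if a response should be counted as an outage."""
    if status >= 500:
        return True
    # A 200 OK can still carry an error page, so check the body too.
    return any(p.search(body) for p in ERROR_PATTERNS)
```

Fetching the page is left out on purpose; any HTTP client that exposes the status code and the decoded body can feed this check.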
00:53:48Wohlstand (Wohlstand) joins
01:06:26<nicolas17>https://i.imgflip.com/888wmc.jpg
01:20:09<nicolas17>JAA: ^
01:27:08<fireonlive>thanks mediawiki for that 200 :p
01:28:26<nicolas17>good thing argenteam supports content-encoding because that was 3.14GiB of HTML
01:50:51<h2ibot>Nicolas17v2 created ARGENTeaM (+2540, Create page): https://wiki.archiveteam.org/?title=ARGENTeaM
01:50:52<h2ibot>JustAnotherArchivist changed the user rights of User:Nicolas17v2
01:51:19qwertyasdfuiopghjkl quits [Client Quit]
01:53:27Church quits [Ping timeout: 252 seconds]
01:53:27kpcyrd quits [Ping timeout: 252 seconds]
01:53:27fluke quits [Ping timeout: 252 seconds]
01:54:40kpcyrd (kpcyrd) joins
01:54:44fluke joins
01:54:51Church (Church) joins
01:54:56<Vokun>Could someone put this in AB? https://everynoise.com The owner said he's not sure how much longer he's going to run the site
01:55:02<Vokun>My source https://www.reddit.com/r/Archiveteam/comments/18az1cu/everynoisecom_might_go_down_soon
02:06:00<pokechu22>Done but it looks like there was a job in 2022, will check what happened with that
02:10:33Earendil7 (Earendil7) joins
02:25:44<fireonlive>for project channels with h2ibot or similar (blogger/pastebin/youtube/telegram/etc), would it be a good idea to have ‘the bot’ in a separate side channel (per project) while discussion for the project continues in the main channel? especially for when the bot can get somewhat "noisy" for some with a bunch of additions and some discussion may be
02:25:45<fireonlive>missed cc JAA arkiver Vokun
02:26:56<thuban>i've contemplated that myself
02:26:57<project10>my +1 on that fwiw ... but I might just be 'holding it wrong'
02:26:57<fireonlive>e.g. something like #telegrab-queue or #telegrab-bot
02:26:58<h2ibot>JustAnotherArchivist edited ARGENTeaM (+7, Datetimeify): https://wiki.archiveteam.org/?diff=51256&oldid=51255
02:27:40<nicolas17>maybe the bot should reply in NOTICEs too?
02:28:27<fireonlive>perhaps… but would still clutter i suppose (and require more code changes than just moving a channel)
02:28:58<h2ibot>Nicolas17v2 created Argenteam (+23, Add redirect): https://wiki.archiveteam.org/?title=Argenteam
02:32:08<@JAA>Moving the channel is a one-line diff in a config file.
02:32:20<@JAA>NOTICE isn't supported at all by the code currently.
02:32:27<thuban>oof
02:32:48<thuban>i filter several of the bots
02:32:53<thuban>but (a) that doesn't work for everyone
02:32:59<thuban>and (b) the ones that have h2ibot talk to itself mostly have to be stateful, so either i let them potentially interfere with one another or i make a new bit for every bot and copy/paste all the triggers
02:33:24<thuban>will probably have to write a full-on script at some point
02:33:38<fireonlive>indeed re: filters not working for everyone (so notice may not make a difference there)
02:34:03<fireonlive>even just people popping in to be a bit buried
02:34:12<project10>thuban: is this for weechat?
02:34:20<thuban>project10: yeah
02:35:18<fireonlive>ever since thuban brought it up i try to prefix everything eggdrop says haha
02:35:49<@JAA>It'd be nice if we could associate responses directly with the bot command message, but that's not something hackint (or most clients) currently supports.
02:36:12<fireonlive>yeah… would be nice to leap forward ircv3 a bit
02:38:52Inti83 joins
02:42:27<thuban>(if we don't move channels, it might be nice to have the bot accept a 'cc' argument to repeat back along with the name of the submitted item, then cc the original submitter's nick when it submits to itself)
02:43:00mindstrut1 joins
02:43:07mindstrut quits [Read error: Connection reset by peer]
02:48:31qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
02:50:23<@JAA>That would be something for arkiver to implement. h2ibot just forwards messages, it doesn't do any message handling itself.
02:51:22<thuban>yes, that's what i meant by 'the bot'. sorry for unclearness
02:51:45<nicolas17>oh hey argenteam has an API...
02:54:27<fireonlive>we love to see it
02:58:16kitonthenet joins
03:02:20kitonthenet quits [Ping timeout: 240 seconds]
03:05:49<flashfire42>That question just comes from my pastebin stuff XD
03:07:20kiryu quits [Ping timeout: 240 seconds]
03:07:35kiryu joins
03:07:36kiryu quits [Changing host]
03:07:36kiryu (kiryu) joins
03:12:17Wohlstand quits [Client Quit]
03:14:52nic90 quits [Quit: The Lounge - https://thelounge.chat]
03:15:46nic90 (nic) joins
03:19:00Inti83 quits [Ping timeout: 265 seconds]
03:22:04kitonthenet joins
03:26:55kitonthenet quits [Ping timeout: 272 seconds]
03:44:39DogsRNice quits [Read error: Connection reset by peer]
03:50:54BlueMaxima quits [Read error: Connection reset by peer]
04:08:36aninternettroll quits [Read error: Connection reset by peer]
04:08:39aninternettroll (aninternettroll) joins
04:27:34kitonthe1et joins
04:31:50kitonthe1et quits [Ping timeout: 240 seconds]
04:33:23<h2ibot>Nicolas17v2 edited ARGENTeaM (+1071, Explain API and tvshows): https://wiki.archiveteam.org/?diff=51258&oldid=51256
04:39:06AlsoHP_Archivist quits [Client Quit]
04:39:27HP_Archivist (HP_Archivist) joins
04:54:24nicolas17 reads AB manual
04:54:41<nicolas17>so !ao gets *only* the given URL, it doesn't even recurse into images or stylesheets?
04:55:05<fireonlive>url + dependencies i believe
04:55:14<@JAA>URL + page requisites, yes
04:55:43<nicolas17>if I "!ao < somelist" and the list has multiple URLs with the same prerequisites, I would hope it gets those prerequisites only once
04:56:03<nicolas17>but independent "!ao someurl" would get those multiple times right?
04:56:08<@JAA>Correct
04:58:25<nicolas17>so I could make a huge list with URLs like https://argenteam.net/movie/148927 and then !ao< the list, it won't recurse into links but it will get images and stylesheets, and it will get them only once
04:58:54<nicolas17>what about redirects? if I add A and B, and A redirects to B, will it request B only once?
05:01:37<@JAA>No, redirects aren't deduped.
05:02:00<@JAA>Cf. https://github.com/ArchiveTeam/wpull/issues/431
05:08:19kitonthe1et joins
05:16:50kitonthe1et quits [Ping timeout: 240 seconds]
05:25:19<pokechu22>However, the no-parent rule can lead to some complications with that too: if the initial URL was https://example.com/dira/url and https://example.com/dira/url redirects to https://example.com/dirb/url then that redirect's target *won't* be saved. This applies to !ao https://example.com/dira/url or !a https://example.com/dira/url or an !ao < list or an !a < list (but if it's
05:25:22<pokechu22>discovered from https://example.com/ or a similar URL without a subdirectory, or if it's offsite, then the no-parent rule doesn't matter)
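The no-parent behaviour described above can be approximated as a path-prefix check. This is a sketch of the rule as explained in the messages, not wpull's or ArchiveBot's actual code; `blocked_by_no_parent` is a hypothetical name, and treating the "directory" as everything up to the last slash of the starting URL's path is an assumption.

```python
from urllib.parse import urlsplit
import posixpath


def blocked_by_no_parent(start_url: str, candidate_url: str) -> bool:
    """Return True if candidate_url falls outside start_url's directory."""
    start, cand = urlsplit(start_url), urlsplit(candidate_url)
    if start.netloc != cand.netloc:
        # Offsite targets are not subject to the no-parent rule.
        return False
    # '/dira/url' -> '/dira/'; '/' stays '/'.
    base_dir = posixpath.dirname(start.path or "/")
    if not base_dir.endswith("/"):
        base_dir += "/"
    return not (cand.path or "/").startswith(base_dir)
```

With a start URL of https://example.com/dira/url, a redirect to https://example.com/dirb/url is reported as blocked, while the same target discovered from https://example.com/ is not.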
05:38:56<nicolas17>okay!
05:39:13<nicolas17>JAA: I think here's all webpages https://transfer.archivete.am/L9wLu/argenteam.net_webpages.txt.zst
05:40:28<nicolas17>/tv/$id URLs redirect to the first episode of the TV show, so that will be fetched twice, but seems acceptable
06:03:18Arcorann (Arcorann) joins
06:03:21HP_Archivist quits [Client Quit]
06:05:47<tech234a>One of Mozilla's (partially) public telemetry websites is being replaced with a private version on December 15 https://groups.google.com/a/mozilla.org/g/firefox-dev/c/kNuk69n7nhc
06:06:21<tech234a>The attached doc says the data will be available elsewhere but I can't seem to find it in the other location yet
06:08:09<nicolas17>ok time to pass out
06:08:10atphoenix quits [Remote host closed the connection]
06:08:47<h2ibot>Tech234a edited Deathwatch (+221, /* 2023 */ Add Mozilla telemetry): https://wiki.archiveteam.org/?diff=51259&oldid=51253
06:08:55atphoenix (atphoenix) joins
06:09:05<nicolas17>oh I forgot to add argenteam to deathwatch
06:11:13ScenarioPlanet quits [Client Quit]
06:12:20<@JAA>nicolas17: AB job for that is running now.
06:14:48<h2ibot>FireonLive edited Mailman2 (+1224, add asterisk (most/(all?) to be discontinued…): https://wiki.archiveteam.org/?diff=51260&oldid=51145
06:17:14pabs goes to AB the digium lists fireonlive
06:17:23<pabs>fireonlive: can you also add them to deathwatch?
06:17:35<fireonlive>ah sure :)
06:21:50<h2ibot>FireonLive edited Deathwatch (+400, add Sangoma/Digium/Asterisk): https://wiki.archiveteam.org/?diff=51261&oldid=51259
06:23:17<nicolas17>I was wondering why the queue kept *growing* on that AB job... prerequisites :P
06:25:30<@JAA>page requisites*
06:25:36<nicolas17>that
06:30:07<nicolas17>note that to discover these URLs I ran multiple threads as fast as they could go and reached 50 req/s
06:31:13kitonthe1et joins
06:31:22<nicolas17>50 req/s would probably be way too much here, just saying if you want to raise it up a notch... the server can cope :P
06:33:56<fireonlive>con 99999999
06:37:33kitonthe1et quits [Ping timeout: 272 seconds]
06:39:53datechnoman quits [Quit: The Lounge - https://thelounge.chat]
06:40:28datechnoman (datechnoman) joins
06:44:41Island quits [Read error: Connection reset by peer]
07:02:05<h2ibot>PaulWise edited Mailman2 (-1233, digium lists in progress): https://wiki.archiveteam.org/?diff=51262&oldid=51260
07:05:45<fireonlive>=] tks
07:10:20Barto quits [Ping timeout: 240 seconds]
07:10:43Barto (Barto) joins
07:16:40<@JAA>nicolas17: So the /tv/ID URLs redirect to /episode/something, which means the redirect isn't followed as pokechu22 explained above. But since the latter URL is also in the list, that should be fine.
07:25:02<@arkiver>JAA: did we make WARCs of uloz.to ?
07:25:35<@JAA>arkiver: It didn't sound like WARCs were being made since pro accounts were involved.
07:25:38<@JAA>Sanqui: ^
07:25:46<@arkiver>ah i see
07:25:51<@arkiver>where is the data now?
07:50:47<@JAA>Transition of power in Argentina is on Sunday, so we should probably start doing something with that.
07:51:06<@JAA>nicolas17: Any progress on setting something up for the geofenced sites?
08:10:37<@arkiver>JAA: i guess that "something" would be archiving various government sites in #archivebot ?
08:10:58<@arkiver>especially the departments he wants to cut out or greatly reduce funds for
08:11:39<@JAA>arkiver: Yeah, plus special handling for sites that are being annoying, like Buttflare or geofencing to Argentina.
08:13:39Dango360 quits [Read error: Connection reset by peer]
08:19:10Dango360 (Dango360) joins
08:34:56<@Sanqui>arkiver: we downloaded with pro accounts (otherwise the download speeds were abysmal and captchas were all over), using an off the shelf tool because there wasn't time for a warc friendly solution. we currently have 40+ TB of data across multiple people and computers and are coordinating deduplication and concatenation using a few scripts, it's a work in progress. I'll try to get a wiki article up though
08:36:52<@arkiver>Sanqui: sounds pretty good, do you have examples of the data?
08:37:10<@arkiver>it sounds like you're still doing some cleanup, so examples might come later i guess
08:38:36<@Sanqui>arkiver: https://www.ejha.cz/ulozto/list.php
08:39:22<@Sanqui>spoiler: there's a lot of warez
08:40:02<@Sanqui>but also a lot of videos, music, video game saves and mods, other legal data.
08:40:38<@Sanqui>page's not loading for me now, can write again when it is
09:02:12Naruyoko quits [Remote host closed the connection]
09:02:27<@arkiver>Sanqui: hah, i see
09:02:31<@arkiver>but still good it was saved
09:02:35Naruyoko joins
09:02:42<@Sanqui>seems to be loading now
09:02:49<@arkiver>what were your further plans for this data?
09:03:09<@Sanqui>good question
09:03:13pokechu22 quits [Ping timeout: 272 seconds]
09:04:28<@Sanqui>we (czechoslovak game archive) are considering subtly hinting that we downloaded a crapload of data from ulož.to and that if somebody is looking for something specific for research purposes to contact us
09:05:03<@Sanqui>and of course, if it's suitable/desirable we would also put it on IA
09:05:27<@Sanqui>would have to determine the best process+formats for that
10:00:04Bleo1826 quits [Client Quit]
10:01:20Bleo1826 joins
10:07:11Naruyoko quits [Ping timeout: 272 seconds]
10:45:18Naruyoko joins
10:50:48Naruyoko quits [Read error: Connection reset by peer]
10:53:54Naruyoko joins
10:56:42Naruyoko quits [Remote host closed the connection]
10:57:03Naruyoko joins
10:59:51Gereon9 (Gereon) joins
11:02:17Gereon95 quits [Ping timeout: 272 seconds]
11:25:37gfhh1 joins
11:28:53gfhh quits [Ping timeout: 272 seconds]
11:51:38qwertyasdfuiopghjkl quits [Remote host closed the connection]
11:54:22qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
12:02:09icedice (icedice) joins
12:13:10gfhh2 joins
12:17:01gfhh1 quits [Ping timeout: 272 seconds]
12:18:50parfait quits [Ping timeout: 240 seconds]
12:24:22cas joins
12:25:43<cas>hello, I'm trying to download this video, and I noticed that in "about this capture", it is claimed that this video is archived by archiveteam. However, the wayback machine claims that this video is not archived. I would like to ask if there's any way to fetch the full video for downloading, as I'm interested in its content.
12:25:53<cas>https://web.archive.org/web/20220103091055/https://www.youtube.com/watch?v=OQktVBtbygI
12:26:12<cas>forgot to post the video link
12:45:06bob joins
12:45:51bob quits [Remote host closed the connection]
12:55:50Arcorann quits [Ping timeout: 240 seconds]
12:56:56kitonthenet joins
12:57:28Inti83 joins
13:05:20kitonthenet quits [Ping timeout: 240 seconds]
13:08:26itachi1706 quits [Quit: Bye :P]
13:08:55itachi1706 (itachi1706) joins
13:09:04<TheTechRobo>cas: Looks like that video's watch page was saved, but not the video itself.
13:27:25<Inti83>Hi, re requests for EndOfTerm, they are already announcing they will stop financing a set of human rights sites, ex clandestine concentration camps which are now cultural centers for memory and justice. The sites are https://www.exccdolimpo.org.ar and https://memoriaexatletico.blogspot.com. I added them and a couple more to the Argentine Wiki page
13:27:26<Inti83>as well as some comments
13:28:41<Inti83>The new president takes office on 10th december. They also announced to stop financing the ministry for gender and lgtb+ which has a lot of resources - specifically this editorial https://editorial.mingeneros.gob.ar/
13:37:29tertu2 quits [Quit: so long...]
13:43:01tertu (tertu) joins
13:45:02vitzli (vitzli) joins
13:54:31ScenarioPlanet (ScenarioPlanet) joins
13:55:11qwertyasdfuiopghjkl quits [Remote host closed the connection]
14:07:09Inti83 quits [Ping timeout: 265 seconds]
14:16:17kitonthe1et joins
14:21:50kitonthe1et quits [Ping timeout: 240 seconds]
14:37:50eroc1990 quits [Ping timeout: 240 seconds]
15:00:50BearFortress quits [Ping timeout: 240 seconds]
15:06:14pepito joins
15:06:47pepito quits [Remote host closed the connection]
15:25:33Inti83 joins
15:30:41VerifiedJ quits [Quit: The Lounge - https://thelounge.chat]
15:31:12VerifiedJ (VerifiedJ) joins
15:42:27BearFortress joins
16:09:54Island joins
16:13:48vitzli quits [Client Quit]
16:19:49cas quits [Remote host closed the connection]
16:32:40parfait (kdqep) joins
16:50:23DogsRNice joins
16:52:17Inti83 quits [Remote host closed the connection]
17:12:10monoxane quits [Quit: Ping timeout (120 seconds)]
17:22:58<nicolas17>oh no
17:23:20<nicolas17>https://argenteam.net/movie/148671/Pain.Hustlers.%282023%29 --> https://foro.argenteam.net/viewtopic.php?f=11&t=185066 --> https://argenteam.net/movie/148671
17:23:29<nicolas17>the official blog posts have links without the slug
17:24:04<nicolas17>er forum posts
17:24:07<nicolas17>I'm not awake yet
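Since the forum posts link to movie pages without the slug, one way to cover both forms is to add the slugless variant of every movie URL to the list. A minimal sketch: the URL shape is inferred only from the two links quoted above, and `slugless` and `with_slugless_variants` are hypothetical helper names.

```python
import re

# Assumed shape: https://argenteam.net/movie/<numeric id>/<slug>
_MOVIE_WITH_SLUG = re.compile(r"^(https://argenteam\.net/movie/\d+)/[^/]+$")


def slugless(url):
    """Return the URL without its trailing slug, or None if it has none."""
    match = _MOVIE_WITH_SLUG.match(url)
    return match.group(1) if match else None


def with_slugless_variants(urls):
    """Yield each URL plus its slugless form, without duplicates, in order."""
    seen = set()
    for url in urls:
        for candidate in (url, slugless(url)):
            if candidate and candidate not in seen:
                seen.add(candidate)
                yield candidate
```

URLs that do not match the assumed pattern pass through unchanged, so the helper can be run over a mixed list.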
17:28:20<fireonlive>🐌
17:55:41kitonthenet joins
17:58:01monoxane (monoxane) joins
18:02:50kitonthenet quits [Ping timeout: 240 seconds]
18:18:05c3manu (c3manu) joins
18:20:50BearFortress quits [Read error: Connection reset by peer]
18:21:16BearFortress joins
18:22:50Dango360 quits [Ping timeout: 240 seconds]
18:23:05bleb quits [Ping timeout: 272 seconds]
18:26:09cm joins
18:27:54Dango360 (Dango360) joins
18:31:29kitonthe2et joins
18:40:49kitonthe2et quits [Ping timeout: 272 seconds]
18:47:16Hackerpcs quits [Quit: Hackerpcs]
18:49:48Hackerpcs (Hackerpcs) joins
18:52:51kitonthe2et joins
19:02:46mindstrut1 quits [Read error: Connection reset by peer]
19:03:02mindstrut1 joins
19:03:40lennier2_ joins
19:06:20lennier2 quits [Ping timeout: 240 seconds]
19:09:19kitonthe2et quits [Ping timeout: 272 seconds]
19:11:26pokechu22 (pokechu22) joins
19:18:17Craigle quits [Quit: The Lounge - https://thelounge.chat]
19:18:48Craigle (Craigle) joins
19:20:13kitonthenet joins
19:25:50kitonthenet quits [Ping timeout: 240 seconds]
19:43:32<@JAA>Sanqui: FYI, I just tried to archive the page with AB, and it went down again immediately. I think it's the people here using The ~~Lounge~~ Clownge with link prefetching enabled.
19:44:14<@Sanqui>*shrug* it's not mine anyway, but I suppose I should've PM'd it to arkiver instead :)
19:45:50<@JAA>It does seem to be hosted on a Z3 or something.
19:47:02<that_lurker>can you add it in a https://transfer.archivete.am/ txt file so that the url is not posted in the channel?
19:55:30atphoenix quits [Remote host closed the connection]
19:56:11atphoenix (atphoenix) joins
20:09:48pokechu22 quits [Client Quit]
20:16:30abirkill- (abirkill) joins
20:18:39BlueMaxima joins
20:18:59abirkill quits [Ping timeout: 272 seconds]
20:18:59abirkill- is now known as abirkill
20:19:35pokechu22 (pokechu22) joins
20:23:59<@JAA>I guess that would be a workaround, but ugh.
20:25:06sepro quits [Quit: Bye!]
20:26:31sepro (sepro) joins
20:26:36<TheTechRobo>I can disable link prefetching
20:27:03<nicolas17>every single user in the channel using thelounge would need to disable it
20:27:29<TheTechRobo>Right
20:27:50<TheTechRobo>I wonder if ircv3 could add link embeds so this wasn't necessary
20:28:02<@JAA>I'll just grab-site it.
20:33:24<fireonlive>tfw JAA disses your IRC client again
20:33:30<fireonlive>:'(
20:34:04<that_lurker>Well I tried making a feature request, but they did not get my point apparently https://github.com/thelounge/thelounge/issues/4805
20:43:50Wohlstand (Wohlstand) joins
20:48:04<Pedrosso>How's the argentina end-of-term project going?
20:51:12<that_lurker>At least it's interesting to know that thelounge makes link preview requests as twitterbot :P
20:55:09<fireonlive>https://dl.fireon.live/irc/tltest
20:55:29<fireonlive>Mozilla/5.0 (compatible; The Lounge IRC Client; +https://github.com/thelounge/thelounge) facebookexternalhit/1.1 Twitterbot/1.0
20:56:55<@JAA>Pedrosso: It isn't yet, but I'll get started on it tonight.
20:57:08<Pedrosso>Awesome
20:58:00<that_lurker>I completely forgot how hard it is to argue the point of your feature requests in the land of github :P
20:58:29<fireonlive>(44 requests total to that url btw)
20:58:43<fireonlive>(39 from The Lounge)
20:58:52<that_lurker>we need 25 more request
20:58:53<that_lurker>s
20:58:53<nicolas17>now post it on mastodon and see how much worse it gets
20:59:08<fireonlive>that_lurker: indeed :3
22:07:22kitonthe1et joins
22:11:50kitonthe1et quits [Ping timeout: 240 seconds]
22:13:21kitonthenet joins
22:31:23Lord_Nightmare quits [Quit: ZNC - http://znc.in]
22:35:14Lord_Nightmare (Lord_Nightmare) joins
22:38:42<nicolas17>https://argenteam.net/api/v1/movie/596 aaaugh
22:55:58<nicolas17>https://argenteam.net/api/v1/movie/148898 many affected
22:56:40<fireonlive>oh no
22:59:07c3manu quits [Remote host closed the connection]
23:01:10<@JAA>Oof