00:02:25Naruyoko5 quits [Remote host closed the connection]
00:02:44Naruyoko5 joins
00:03:18Naruyoko5 quits [Client Quit]
00:05:56etnguyen03 quits [Client Quit]
00:06:24Island joins
00:06:44Naruyoko joins
00:09:11Doomaholic quits [Read error: Connection reset by peer]
00:09:56Doomaholic (Doomaholic) joins
00:11:58<pabs>JAA: affected domains/urls btw https://transfer.archivete.am/uiUDG/archive-linuxtag.org
00:11:58<eggdrop>inline (for browser viewing): https://transfer.archivete.am/inline/uiUDG/archive-linuxtag.org
00:28:49<h2ibot>PaulWise edited ArchiveBot/Monitoring (+0, typo): https://wiki.archiveteam.org/?diff=54051&oldid=54003
00:30:05<k>OrIdow6: what would you replace wiki with
00:31:05<@JAA>szczot3k: I just remembered that I saw another option for rendering Wikipedia links somewhere before: prefixing them with the pseudo-namespace 'wp', i.e. 'wp:Example'. I think that might've been on other Wikimedia sites.
00:33:02BlueMaxima quits [Read error: Connection reset by peer]
00:36:18loug8318142 quits [Quit: The Lounge - https://thelounge.chat]
00:40:13etnguyen03 (etnguyen03) joins
00:44:56<pabs>does AT have any site classifiers? ie visit URL, check what type of software the page uses (blogger, wordpress, mediawiki, mailman, cgit etc), find the "primary" URL of the instance, generate an appropriate AB/WikiBot/etc job command, stash it away.
00:46:42lennier2 joins
00:46:57<pabs>I wrote one for wikibot and was thinking about generalising it and integrating it into anything that already exists
00:46:58<pabs>https://transfer.archivete.am/mKDCg/check-wikis-emit-wikibot-cmds
00:46:59<eggdrop>inline (for browser viewing): https://transfer.archivete.am/inline/mKDCg/check-wikis-emit-wikibot-cmds
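A minimal sketch of the classification step pabs describes, assuming the page advertises its software in a `<meta name="generator">` tag (MediaWiki, WordPress, Blogger and cgit commonly do, but not every instance does). This is not pabs's script: the names `GeneratorParser`, `KNOWN_SOFTWARE` and `classify_html` are hypothetical, and it uses only the standard library instead of requests and BeautifulSoup so that it runs offline. Finding the "primary" URL and generating bot commands are left out.

```python
from html.parser import HTMLParser


class GeneratorParser(HTMLParser):
    """Collects the content of every <meta name="generator"> tag."""

    def __init__(self):
        super().__init__()
        self.generators = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        # Attribute values can be None for valueless attributes.
        if (attr.get("name") or "").lower() == "generator":
            self.generators.append((attr.get("content") or "").lower())


# Hypothetical mapping from generator-string prefixes to software labels.
KNOWN_SOFTWARE = {
    "mediawiki": "mediawiki",
    "wordpress": "wordpress",
    "blogger": "blogger",
    "cgit": "cgit",
}


def classify_html(html):
    """Return a software label for the page, or None if unrecognised."""
    parser = GeneratorParser()
    parser.feed(html)
    for generator in parser.generators:
        for prefix, label in KNOWN_SOFTWARE.items():
            if generator.startswith(prefix):
                return label
    return None
```

A caller would fetch the page itself and pass the body in, for example `classify_html(response_text)`. Sites that hide or customise the generator tag would need extra heuristics.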
00:48:04Ketchup901 quits [Ping timeout: 276 seconds]
00:49:52lennier2_ quits [Ping timeout: 260 seconds]
00:51:00<@OrIdow6>k: The same wiki but with some of the garbage removed
00:51:11<@OrIdow6>I'm being hyperbolic with "destroy"
00:51:45<@JAA>pabs: #// has code to queue to various other DPoS projects (Telegram, Pastebin, Imgur, etc.). That's the only thing I'm aware of.
00:54:08Ketchup901 (Ketchup901) joins
00:56:00<nicolas17>click https://wiki.archiveteam.org/index.php/Special:Random and think "is this accurate/updated?" :P
01:23:07superkuh quits [Ping timeout: 260 seconds]
02:03:15<nicolas17>W2 broke?
02:03:30<nicolas17>https://wiki.archiveteam.org/index.php/YouTube#Stories
02:03:42<nicolas17>> “Stories” was a feature of YouTube which works in a similar way to , and .
02:06:56<@JAA>Maybe we should go through everything ATrescue touched back then and make sure it's as intended.
02:10:46<nicolas17>oh good looks like youtube is one of the few pages using it
02:11:59<nicolas17>JAA: if I change this {{W2}} usage to be {{Wikipedia}} instead, it will look ugly
02:12:16<nicolas17>> works in a similar way to "Instagram Stories" at Wikipedia, "Facebook Stories" at Wikipedia and "Snapchat" at Wikipedia.
02:12:36<nicolas17>W2 was useful for that
02:15:00<nicolas17>ATrescue was right (?)
02:15:14<@JAA>nicolas17: That's why szczot3k suggested changing Template:Wikipedia.
02:15:28<@JAA>They were right that the existing template was odd in some circumstances.
02:15:38<@JAA>They tried to 'fix' it in the worst possible way.
02:16:39<nicolas17>well that paragraph of the youtube page is unreadable meanwhile
02:18:25sec^nd quits [Ping timeout: 276 seconds]
02:25:20sec^nd (second) joins
03:08:23lennier2 quits [Ping timeout: 265 seconds]
03:09:14lennier2 joins
03:09:26etnguyen03 quits [Client Quit]
03:16:04etnguyen03 (etnguyen03) joins
03:32:07<eggdrop>[remind] OrIdow6: dw2
03:32:20etnguyen03 quits [Client Quit]
03:37:52night quits [Ping timeout: 260 seconds]
03:38:45night joins
03:40:34etnguyen03 (etnguyen03) joins
03:51:37etnguyen03 quits [Remote host closed the connection]
04:12:14DogsRNice joins
04:53:18Webuser511904 joins
05:05:42<@OrIdow6>!remindme 1h dw2
05:05:42<eggdrop>[remind] ok, i'll remind you at 2024-12-16T06:05:42Z
05:05:48<@OrIdow6>This is how I operate the alarm on my phone to wake up BTW
05:06:26DogsRNice quits [Read error: Connection reset by peer]
05:22:46Webuser511904 quits [Client Quit]
05:50:18midou quits [Ping timeout: 265 seconds]
06:05:43<eggdrop>[remind] OrIdow6: dw2
06:31:06<@OrIdow6>Alright let's do this
06:31:52<@OrIdow6>2024-12-15: untergrund.net - almost all have been run in AB
06:32:16<@OrIdow6>You know what I'll just list the pending ones
06:36:46<@OrIdow6>vector.co.jp: run in AB I think? attn c3manu
06:39:03<@OrIdow6>2024-12-26: dokuso.co.jp: there was a job in AB for the main domain but, aside from that using JS heavily, I don't see anything having been done with subdomains
06:43:43<@OrIdow6>2024-12-31: techable.jp - running in AB and looks like it will finish soon
06:44:30<@OrIdow6>2024-12-31: tomlehrersongs.com: run well but just pointing out for those new here that this was added to DW about 4 years ago
06:45:03<@JAA>End of 2024 seemed so far away at the time.....
06:45:35<@OrIdow6>2024-12-31: Google Jamboard: Needs to be done by me, google stuff is a pain but it should be simple (if ugly) enough to filter all the other millions of types of documents out and just do those
06:45:45<@OrIdow6>Well, doesn't need to be done by me, but I said I'd do it
06:46:23<@OrIdow6>JAA: Yeah it's really weird to see that finally resolve
06:46:38<@JAA>The last untergrund.net job finished by now, by the way.
06:46:39<@OrIdow6>It did! Like 5% of a lifetime that's been on there
06:46:48<@JAA>STOP!
06:46:48<@OrIdow6>Ah ok
06:47:00<@JAA>:-P
06:49:14<@OrIdow6>Hahaha
06:50:05<@JAA>Next you'll tell me that it was added closer to Obama's presidency than to today.
06:54:28<@OrIdow6>'End of the year' (gives a Cloudflare error right now): SteamRep - main site run but that_lurker you mentioned in July "do[ing] a seeded !a with those targetting only the https://forums.steamrep.com/members/ link. Or an !ao" - is that needed?
06:56:03<@JAA>Main site is Buttflare'd.
06:56:08<that_lurker>yeah
06:57:32<that_lurker>forums was kinda the only thing you would get and if I remember correctly the members pages were on the main buttflare protected js hell site
07:00:10<@OrIdow6>'End of the year': AnnoZone: looks like the forums and filehost ran well, wiki was run in wikibot I think but I only see a single aborted job (and not for any subdomains) in AB? attn arkiver pabs
07:02:57<pokechu22>I'm not sure if annozone.de is complete or not - there were issues with session IDs on the forums and we ultimately ended up ignoring any URL with session IDs, which may have ignored valid ones (but keeping them wouldn't have been feasible)
07:05:51Unholy23619246453771312 (Unholy2361) joins
07:08:19<@JAA>AnnoZone isn't shutting down anymore; it was taken over by a new owner.
07:08:58<@JAA>Looks like the forums have already been migrated to a new system.
07:10:09<@JAA>Relevant announcement: https://www.annozone.de/forum/thread/17942-die-annozone-zieht-um-neuer-kapitaen-an-bord/
07:10:26<@OrIdow6>Thanks JAA, sounds good
07:10:43<@JAA>We should probably still run the wikis through AB though.
07:14:22<@OrIdow6>Yeah
07:14:46@OrIdow6 hopes this wall of text doesn't defeat its purpose by drowning out all the individual items
07:14:55<@OrIdow6>Sometime soon: Vice.com: AB jobs in February, when it was announced that they were going to stop posting to the site, but they've kept making posts since then, what's up with that?
07:17:04<@OrIdow6>Same thing with https://www.lifehacker.com.au/ , licensed from the same company but shutting down due to a different reason - looks kinda spammy though so maybe they've just stopped publishing *useful* stuff there
07:18:07<@JAA>Hmm, Wikipedia says that 'Vice has since partnered with other media companies, such as Savage Ventures, to distribute its content.'
07:18:26<@JAA>https://www.axios.com/2024/05/09/vice-media-relaunch-savage-ventures
07:27:25<@OrIdow6>Unknown: Genshin channel: seems they had some drama in January but are fine now?
07:27:38<@OrIdow6>Fine for now, that is
07:30:56<@OrIdow6>Several false alarms like this that I'm going to skip over
07:33:34<@OrIdow6>Unknown: booru.org: site and owner strike one as old and fragile but no reason to think it is at risk at the end of 2024 in particular
07:34:28<@OrIdow6>Unknown: https://todayonline.com/: site is still up, a few weeks' articles were posted between the AB job and the freeze if we want to bother with that?
07:34:30<@OrIdow6>And that's it.
07:36:30<@OrIdow6>... wait
07:36:53<@OrIdow6>Forgot about stuff shutting 0.01 seconds into 2025
07:38:55midou joins
07:40:00<@OrIdow6>2025-01-?: fx-rashinban: Doesn't look like we've done anything, site superficially looks like it will run in AB well
07:46:03<@OrIdow6>'early 2025': Foursquare City Guide: Doesn't look like we've done anything
07:48:28<@OrIdow6>2025-01-01: Unmanned Spaceflight: AB job in progress, do you expect this will finish in time thuban?
07:48:45<@OrIdow6>And THAT's it.
07:54:12<szczot3k>JAA: are you talking about interwiki? That's how it's done with Wikipedia:Example, can also create an interwiki for wp:Example.
07:54:32lukash98 quits [Ping timeout: 260 seconds]
07:54:40<szczot3k>Soo... do we have at least some consensus for the <sup> template? I can remove the space.
08:01:52<@JAA>szczot3k: Ah, right. And the Wikimedia projects all have interwiki links for WP, WCOM, WD, and whatnot.
08:02:24lukash98 joins
08:20:18i_have_n0_idea9 quits [Quit: The Lounge - https://thelounge.chat]
08:20:48i_have_n0_idea9 (i_have_n0_idea) joins
08:21:47<szczot3k>Huh, it should be on [[Special:Interwiki]]
08:21:53<szczot3k>I don't see one
08:23:36<@JAA>Looks like it's an extension that needs to be enabled.
08:26:37Wohlstand (Wohlstand) joins
08:37:40loug8318142 joins
08:39:49<szczot3k>The wiki also needs an update
08:40:24<szczot3k>JAA: https://wiki.archiveteam.org/index.php/Special:MostInterwikis although this Special exists
08:40:43<szczot3k>But yeah, there's no interwiki extension
08:40:45<szczot3k>Weird.
09:09:10<VannevarK>Hmm, wrt this FileFront/GameFront thing, I can't find any of the filenames in the URLs in the archives. I did find links for the download IDs though, however every one goes to a 404 page. So it's looking like this content from 04-08 has been purged, or at least the content that I'm looking for (some Halo 2 mods for original Xbox). Just in case
09:09:10<VannevarK>that information is of interest to anybody
09:21:17<szczot3k>https://wiki.archiveteam.org/index.php/User:Szczot3k/Sandbox/Wikipedia without the space. Will work on the documentation of the template, so it doesn't look like total shit
09:21:46<szczot3k>!remindme 6h wikipedia template docs
09:21:46<eggdrop>[remind] ok, i'll remind you at 2024-12-16T15:21:46Z
09:21:48<szczot3k>But after work :)
09:23:30Webuser358633 joins
09:24:00Webuser358633 quits [Client Quit]
10:22:00<thuban>< pabs> does AT have any site classifiers? ie visit URL, check what type of software the page uses (blogger, wordpress, mediawiki, mailman, cgit etc), find the "primary" URL of the instance, generate an appropriate AB/WikiBot/etc job command, stash it away.
10:22:13<thuban>i have a blogger detector lying around (although it doesn't generate commands and archival method for blogger currently depends on whether there's a custom domain)
10:22:26<thuban><@OrIdow6> 2025-01-01: Unmanned Spaceflight: AB job in progress, do you expect this will finish in time thuban?
10:22:28<thuban>yes, i do
10:22:54<@OrIdow6>Ok, thanks
10:24:04<thuban>(there was also a prior job in october--although it's much smaller than the current one, plus the metadata seems to be missing, so idk whether it was aborted or much more aggressively filtered or what)
10:28:28<@OrIdow6>I wonder if anyone's ever analyzed DW and seen what the most common days, days of the week, etc are
10:28:38<@OrIdow6>I certainly have my guesses
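The analysis OrIdow6 wonders about can be sketched in a few lines. This is only an illustration: the sample dates are the fully specified ones from the Deathwatch run-through earlier in this log, and the names `WEEKDAYS`, `SAMPLE_DATES` and `weekday_counts` are hypothetical. A real analysis would parse every dated entry from the Deathwatch wiki page instead.

```python
from collections import Counter
from datetime import date

WEEKDAYS = ["Monday", "Tuesday", "Wednesday", "Thursday",
            "Friday", "Saturday", "Sunday"]

# Dates copied from the run-through above; not a full Deathwatch export.
SAMPLE_DATES = [
    date(2024, 12, 15),
    date(2024, 12, 26),
    date(2024, 12, 31),
    date(2024, 12, 31),
    date(2024, 12, 31),
    date(2025, 1, 1),
]


def weekday_counts(dates):
    """Count how often each weekday name occurs in the given dates."""
    # date.weekday() returns 0 for Monday through 6 for Sunday.
    return Counter(WEEKDAYS[d.weekday()] for d in dates)
```

Calling `weekday_counts(SAMPLE_DATES).most_common()` lists weekdays from most to least frequent. The same `Counter` approach works for day-of-month or month by swapping the key expression.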
10:48:22Wohlstand quits [Ping timeout: 260 seconds]
10:59:55Island quits [Read error: Connection reset by peer]
11:34:31colla joins
11:34:51<pabs>thuban: what did you use to write it? mine is Python, requests and BeautifulSoup
11:34:59<thuban>ditto
11:35:08<pabs>:)
11:36:55<thuban>https://transfer.archivete.am/inline/PUhGC/blogspot-checker.py
11:41:05<thuban>(because the html templating is fully general, it's impossible to be 100% specific _or_ sensitive with blogspot, but this worked fine in practice)
12:00:02Bleo182600722719623 quits [Quit: The Lounge - https://thelounge.chat]
12:02:42Bleo182600722719623 joins
12:09:12chains joins
12:20:07ymgve joins
12:23:27ymgve_ quits [Ping timeout: 260 seconds]
12:39:23SkilledAlpaca418962 quits [Quit: SkilledAlpaca418962]
12:49:42G4te_Keep3r3492415 quits [Ping timeout: 260 seconds]
12:49:42G4te_Keep3r3492415 joins
13:17:02lennier2_ joins
13:17:29lflare quits [Killed (palermo.hackint.org (Nickname regained by services))]
13:17:31lflare (lflare) joins
13:17:33lunik112 joins
13:17:37useretail_ joins
13:17:37lukash989 joins
13:17:39driib9 (driib) joins
13:17:41G4te_Keep3r34924156 joins
13:17:46G4te_Keep3r3492415 quits [*.net *.split]
13:17:46loug8318142 quits [*.net *.split]
13:17:46lukash98 quits [*.net *.split]
13:17:46midou quits [*.net *.split]
13:17:46night quits [*.net *.split]
13:17:46lennier2 quits [*.net *.split]
13:17:46Doomaholic quits [*.net *.split]
13:17:46DopefishJustin quits [*.net *.split]
13:17:46useretail quits [*.net *.split]
13:17:46Radzig quits [*.net *.split]
13:17:46adamus1red quits [*.net *.split]
13:17:46Kenshin quits [*.net *.split]
13:17:46Stagnant_ quits [*.net *.split]
13:17:46@imer quits [*.net *.split]
13:17:46@kaz quits [*.net *.split]
13:17:46bilboed0 quits [*.net *.split]
13:17:46igloo22225 quits [*.net *.split]
13:17:46Gadelhas562873 quits [*.net *.split]
13:17:46lunik11 quits [*.net *.split]
13:17:46ThetaDev quits [*.net *.split]
13:17:46tzt quits [*.net *.split]
13:17:46itachi1706 quits [*.net *.split]
13:17:46driib quits [*.net *.split]
13:17:46ell7 quits [*.net *.split]
13:17:46@Fusl quits [*.net *.split]
13:17:46lunik112 is now known as lunik11
13:17:46lukash989 is now known as lukash98
13:17:46driib9 is now known as driib
13:17:46G4te_Keep3r34924156 is now known as G4te_Keep3r3492415
13:18:11midou joins
13:18:21DopefishJustin joins
13:19:54Kenshin joins
13:19:55Gadelhas562873 joins
13:19:59ell7 (ell) joins
13:20:01imer (imer) joins
13:20:01@ChanServ sets mode: +o imer
13:20:03ThetaDev joins
13:20:25night joins
13:20:50itachi1706 (itachi1706) joins
13:21:07tzt (tzt) joins
13:21:21Doomaholic (Doomaholic) joins
13:21:28Stagnant_ (Stagnant) joins
13:21:56bilboed0 joins
13:22:08loug8318142 joins
13:22:14adamus1red (adamus1red) joins
13:30:42Radzig joins
14:49:36Fusl (Fusl) joins
14:49:36@ChanServ sets mode: +o Fusl
14:54:02igloo22225 (igloo22225) joins
14:54:07kaz (Kaz) joins
14:54:07@ChanServ sets mode: +o kaz
15:04:00cow_2001 quits [Quit: ✡]
15:21:46<eggdrop>[remind] szczot3k: wikipedia template docs
15:25:59Gadelhas5628731 joins
15:30:07Gadelhas562873 quits [Ping timeout: 260 seconds]
15:30:07Gadelhas5628731 is now known as Gadelhas562873
16:34:43JaffaCakes118 (JaffaCakes118) joins
16:36:41<JaffaCakes118>Hello could someone please archive https://autofather.com with archivebot please (not much coverage)
16:42:26Alex41313 joins
16:43:32<Alex41313>Hi! How do I get a new site added to the Deathwatch page?
16:43:32<Alex41313>This UK based cycling forum which is very popular and has decades worth of posts is planning on shutting down (will be deleted completely) on 16th March 2025 due to the new UK Online Safety Act legislation passing the day after.
16:43:32<Alex41313>https://www.lfgss.com/conversations/401475/
17:10:57Alex41313 quits [Client Quit]
17:33:22<eggdrop>[remind] szczot3k: wikitemplate
17:39:37<h2ibot>That lurker edited Deathwatch (+196, add lfgss…): https://wiki.archiveteam.org/?diff=54053&oldid=54050
17:40:38<h2ibot>That lurker edited Deathwatch (+1, fix typo...): https://wiki.archiveteam.org/?diff=54054&oldid=54053
17:54:36lukash986 joins
17:55:00lukash989 joins
17:55:40DogsRNice joins
17:55:57lukash98 quits [Ping timeout: 260 seconds]
17:55:57lukash989 is now known as lukash98
17:59:07lukash986 quits [Ping timeout: 252 seconds]
18:54:52lennier2 joins
18:55:27lennier2_ quits [Ping timeout: 260 seconds]
18:57:55cow_2001 joins
19:57:32Wohlstand (Wohlstand) joins
21:03:30<nicolas17>rrpa
21:03:41<nicolas17>"We are back online! Thank you for your patience as we had to recover for a major hardware failure. Good news is that we are running on newer and better servers which should be faster and more reliable. The bad news is that due to the nature of the hardware failure (physical hard drive crashes), we did suffer some data loss. We needed to use a specialized company to help recover some of the data, but even then some of it was corrupt due to the
21:03:43<nicolas17>hard drive crash."
21:04:06<nicolas17>no backups ._.
21:04:28<nicolas17>http://rrpicturearchives.net/
21:12:19<datechnoman>Sounds like a perfect job for AB
21:12:26<datechnoman>Before more data is lost :(
21:12:46<anarcat>surely people here heard that openstreetmap is down?
21:14:38<szczot3k>anarcat it claims it's R/O for maint?
21:14:54<@JAA>> The OpenStreetMap database is currently in read-only mode while essential database maintenance work is carried out.
21:17:22<nicolas17>network outage at primary server
21:18:25<nicolas17>they switched to secondary server, but its database is missing the last writes so it would be unsafe to make it writable
21:18:38<nicolas17>and they decided it wasn't worth it to try to replicate the last missing writes manually over 4G
21:19:19<szczot3k>Sounds like a sensible thing to do
21:19:57<szczot3k>So down as in, people maintaining it will have a bad day, but not down as in we lost all data
21:20:07<nicolas17>"We are hoping to have services fully restore on Wednesday 18 December 2024 based on our expectation of when our ISP will have restored services. We have chosen to wait for the ISP restore rather than activating our higher risk disaster recover plan."
21:21:19<szczot3k>So it's not yet time for #openstreetdown
21:22:15<nicolas17>datechnoman: last time I tried crawling rrpa, if you do more than N requests in a day, your IP is blocked for the rest of the day
21:22:46<nicolas17>iirc images are fine, webpages are the problem
21:27:00<datechnoman>nicolas17 - ahhh gotcha. So it's a slow DPoS project if we want to grab it by the sounds
21:43:16BlueMaxima joins
21:53:39<nicolas17>datechnoman: also it seems non-existent albums redirect to an error page that returns 200 OK
21:53:45<nicolas17>http://rrpicturearchives.net/archivethumbs.aspx?id=151767
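A rough sketch of how a crawler might flag the behaviour nicolas17 describes, where a missing album redirects to an error page that answers 200 OK. The heuristic and the name `looks_like_soft_404` are assumptions, not something the site or any existing tool is known to use, and the URLs in the usage note are placeholders because the real error page address is not given here.

```python
from urllib.parse import urlsplit


def looks_like_soft_404(requested_url, final_url, status):
    """Heuristic: a 200 response that ended up on a different path or query.

    Scheme and host changes (for example http to https) are ignored,
    since those are usually legitimate redirects.
    """
    if status != 200:
        return False
    requested = urlsplit(requested_url)
    final = urlsplit(final_url)
    return (requested.path.lower(), requested.query) != (
        final.path.lower(),
        final.query,
    )
```

For example, `looks_like_soft_404("http://example.invalid/archivethumbs.aspx?id=1", "http://example.invalid/error.aspx", 200)` is true, while the same URL served directly with 200 is not flagged. Comparing the response body against a known error page would be a stricter check.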
21:57:10<nicolas17>I'll wget albums until I get banned (:
21:58:15balrog_ is now known as balrog
22:00:54JaffaCakes118_2 (JaffaCakes118) joins
22:01:27ThetaDev quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]
22:01:35JaffaCakes118 quits [Remote host closed the connection]
22:01:53ThetaDev joins
22:02:08<datechnoman>Oh just delightful. Can't have things being too easy. Reminds me of telegram with their fake 200s
22:03:34<nicolas17>most images are in S3 so we can hit those without limit
22:20:29<nicolas17>http://rrpicturearchives.net/archivethumbs.aspx?id=146979 this might be related to the outage / corruption
22:34:12katocala quits [Ping timeout: 260 seconds]
22:34:24katocala joins
22:54:17katocala quits [Ping timeout: 252 seconds]
22:54:50katocala joins
23:11:53etnguyen03 (etnguyen03) joins
23:35:40Ketchup901 quits [Ping timeout: 276 seconds]
23:35:53Ketchup901 (Ketchup901) joins
23:42:53etnguyen03 quits [Client Quit]
23:50:29runxiyu (runxiyu) joins
23:58:41etnguyen03 (etnguyen03) joins