00:02:04 | <qwertyasdfuiopghjkl> | Ryz: some more htmlplanet.com subdomains (from a few other subdomain finder sites), deduped against the list t.hat_lurker sent: https://transfer.archivete.am/inline/LImB2/more-htmlplanet.com-subdomains.txt |
00:09:10 | | Guest77 quits [Client Quit] |
00:11:11 | | lennier2 quits [Read error: Connection reset by peer] |
00:11:30 | | lennier2 joins |
00:25:36 | <h2ibot> | BornOn420 edited URLTeam (+164, Added b2n.ir): https://wiki.archiveteam.org/?diff=52123&oldid=52104 |
00:25:37 | <h2ibot> | BornOn420 edited Telegram (+449, Added a paragraph on channel discovery based…): https://wiki.archiveteam.org/?diff=52124&oldid=51730 |
00:25:38 | <h2ibot> | BornOn420 edited ArchiveTeam Warrior (+1730, Added a paragraph on running Orbstack, a Docker…): https://wiki.archiveteam.org/?diff=52125&oldid=52044 |
00:25:39 | <h2ibot> | Boofdev edited Pomf.se/Clones (-79, updated yapc file stats): https://wiki.archiveteam.org/?diff=52126&oldid=51795 |
00:37:16 | | Mateon2 joins |
00:39:14 | | Mateon1 quits [Ping timeout: 265 seconds] |
00:39:14 | | Mateon2 is now known as Mateon1 |
00:44:41 | | Perk joins |
00:44:42 | | Perk1 joins |
00:44:51 | | Perk quits [Remote host closed the connection] |
00:44:51 | | Perk1 is now known as Perk |
00:51:14 | | Carnildo quits [Read error: Connection reset by peer] |
00:51:28 | | Carnildo joins |
00:51:56 | | Unholy236192464 (Unholy2361) joins |
00:54:13 | | Unholy23619246 quits [Ping timeout: 265 seconds] |
00:54:13 | | Unholy236192464 is now known as Unholy23619246 |
01:11:55 | | systwi_ quits [Quit: systwi_] |
01:11:55 | | nothere quits [Quit: Leaving] |
01:11:59 | | etnguyen03 (etnguyen03) joins |
01:16:35 | | useretail__ joins |
01:18:58 | | useretail_ quits [Ping timeout: 255 seconds] |
01:19:25 | | Barto quits [Ping timeout: 255 seconds] |
01:22:15 | | Barto (Barto) joins |
01:27:23 | | nothere joins |
01:35:52 | | BlueMaxima joins |
01:40:07 | | Carnildo quits [Ping timeout: 255 seconds] |
02:12:30 | <TheTechRobo> | !tell Guest77 https://replayweb.page is a convenient site for viewing WARCs. |
02:12:31 | <eggdrop> | [tell] ok, I'll tell Guest77 when they join next |
02:18:31 | | Island joins |
03:09:31 | | etnguyen03 quits [Client Quit] |
03:10:30 | | etnguyen03 (etnguyen03) joins |
03:11:58 | | Island quits [Ping timeout: 265 seconds] |
03:19:57 | <thuban> | !tell Guest77 if you want to extract files en masse rather than browse them, you can use `unar`: https://theunarchiver.com/command-line |
03:19:58 | <eggdrop> | [tell] ok, I'll tell Guest77 when they join next |
03:24:23 | | Notrealname1234 (Notrealname1234) joins |
03:26:02 | <thuban> | !tell Guest77 note that because warc records are not necessarily one-to-one with uris (there may be multiple requests for the same location at different times, with different data, etc), simply extracting the whole thing like it's a zip may be 'lossy'; i don't know specifically how unar handles this |
03:26:02 | <eggdrop> | [tell] ok, I'll tell Guest77 when they join next |
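[editor's note] thuban's caveat about 'lossy' extraction can be illustrated with a minimal sketch. The tuples below are hypothetical, simplified stand-ins for WARC response records (real records carry much more, and this says nothing about how unar actually behaves); the point is only that keying output by URI discards repeat captures.

```python
from collections import defaultdict

# Hypothetical stand-ins for WARC response records:
# (target URI, capture timestamp, payload). Not a real WARC parser.
records = [
    ("http://example.com/", "2024-05-01T00:00:00Z", b"first capture"),
    ("http://example.com/a.png", "2024-05-01T00:00:05Z", b"image"),
    ("http://example.com/", "2024-05-02T00:00:00Z", b"second capture"),
]

def extract_naive(records):
    """Zip-style extraction: one output per URI, later records overwrite earlier ones."""
    out = {}
    for uri, _ts, payload in records:
        out[uri] = payload
    return out

def extract_all(records):
    """Keep every capture, grouped by URI and ordered by timestamp."""
    out = defaultdict(list)
    for uri, ts, payload in sorted(records, key=lambda r: r[1]):
        out[uri].append((ts, payload))
    return dict(out)

naive = extract_naive(records)
full = extract_all(records)
# Captures that exist in the archive but vanish from the naive output.
lost = sum(len(v) for v in full.values()) - len(naive)
print(f"{len(records)} records, {len(naive)} files after naive extraction, {lost} lost")
```

Grouping by URI and timestamp, as in `extract_all`, is one way to avoid the overwrite; a replay tool such as the one TheTechRobo linked sidesteps the problem by not flattening the archive at all.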
03:31:20 | | Notrealname1234 quits [Client Quit] |
03:33:12 | <Ryz> | Heya folks, thanks for searching for htmlplanet.com subdomains - I have another one, customer.netspace.net.au, that I need subdomains mined from; https://www.subdomain.center/ didn't fetch a lot, just 150 results I think :C |
03:33:32 | <pokechu22> | hmm, I feel like I might have done that in the past? |
03:35:06 | <pokechu22> | Yeah, I'm pretty sure that one's been done before: https://archive.fart.website/archivebot/viewer/?q=netspace.net.au - or at least partially done? flashfire42 did a lot of work with those |
03:36:23 | <pokechu22> | ... and I did an !a < list on https://transfer.archivete.am/G7mC1/netspace.net.au_subdomains_v2.txt at https://archive.fart.website/archivebot/viewer/job/202307102021426cske |
03:36:24 | <eggdrop> | inline (for browser viewing): https://transfer.archivete.am/inline/G7mC1/netspace.net.au_subdomains_v2.txt |
03:45:38 | | Shjosan quits [Quit: Am sleepy (-, – )…zzzZZZ] |
03:46:18 | | Shjosan (Shjosan) joins |
03:58:59 | | BlueMaxima quits [Read error: Connection reset by peer] |
04:10:51 | | nertzy_ quits [Remote host closed the connection] |
04:15:28 | | Carnildo joins |
04:27:05 | | Carnildo quits [Read error: Connection reset by peer] |
04:27:08 | | Carnildo joins |
04:38:32 | | Carnildo quits [Remote host closed the connection] |
04:38:34 | | Carnildo joins |
04:41:38 | | Carnildo quits [Read error: Connection reset by peer] |
04:41:49 | | Carnildo joins |
04:43:50 | | nickel joins |
04:44:52 | | nickel quits [Client Quit] |
04:50:29 | | etnguyen03 quits [Client Quit] |
04:51:36 | | etnguyen03 (etnguyen03) joins |
04:57:13 | <nicolas17> | does 7zip for windows support warc? |
04:59:37 | <pokechu22> | You can open .warc.gz to get .warc but it doesn't do anything with .warc |
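[editor's note] The .warc.gz to .warc step pokechu22 describes does not need 7-Zip; a minimal sketch with Python's standard library follows. The function name and paths are illustrative, not taken from any tool mentioned in the channel.

```python
import gzip
import shutil

def gunzip_warc(src_path, dst_path):
    """Decompress a .warc.gz file into a plain .warc file."""
    # gzip.open reads concatenated gzip members transparently, so a WARC
    # compressed record by record still comes out as one continuous .warc.
    with gzip.open(src_path, "rb") as src, open(dst_path, "wb") as dst:
        shutil.copyfileobj(src, dst)
```

Like 7-Zip, this only removes the compression layer; reading the records inside the resulting .warc still needs a WARC-aware tool.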
05:00:49 | | etnguyen03 quits [Remote host closed the connection] |
05:03:30 | | Carnildo quits [Read error: Connection reset by peer] |
05:03:49 | | Carnildo joins |
05:08:21 | | Carnildo quits [Remote host closed the connection] |
05:08:28 | | Carnildo joins |
05:30:47 | | Carnildo quits [Read error: Connection reset by peer] |
05:30:52 | | Carnildo joins |
07:05:02 | | Unholy23619246 quits [Remote host closed the connection] |
07:06:10 | | Unholy236192464 (Unholy2361) joins |
08:01:10 | | Carnildo quits [Remote host closed the connection] |
08:01:24 | | Carnildo joins |
08:08:40 | | Carnildo quits [Remote host closed the connection] |
08:08:52 | | Carnildo joins |
08:20:11 | | Carnildo quits [Read error: Connection reset by peer] |
08:20:22 | | Carnildo joins |
08:55:37 | | Carnildo_again joins |
08:55:59 | | Carnildo quits [Read error: Connection reset by peer] |
09:00:02 | | Bleo18260072 quits [Client Quit] |
09:01:18 | | Bleo18260072 joins |
09:06:41 | | Arcorann_ joins |
09:10:34 | | Carnildo joins |
09:11:23 | | Carnildo_again quits [Read error: Connection reset by peer] |
09:23:30 | | pedantic-darwin6 joins |
09:23:30 | <@arkiver> | seems like a lot has happened around subtitle sites the last few years |
09:23:37 | | pedantic-darwin quits [Ping timeout: 255 seconds] |
09:23:37 | | pedantic-darwin6 is now known as pedantic-darwin |
09:23:51 | <@arkiver> | opensubtitles enshittified further |
09:23:54 | <@arkiver> | subscene.com shutting down |
09:24:21 | <@arkiver> | which are currently the major subtitle sites? subdl.com ? any other significant ones? |
09:28:09 | | lflare quits [Ping timeout: 272 seconds] |
09:28:47 | <@arkiver> | so subscene.com closing in only a few hours? |
09:30:24 | | nulldata quits [Quit: So long and thanks for all the fish!] |
09:31:04 | | nulldata (nulldata) joins |
09:31:15 | | driib quits [Quit: The Lounge - https://thelounge.chat] |
09:31:34 | | driib (driib) joins |
09:31:35 | <katia> | allegedly |
09:32:54 | | pedantic-darwin9 joins |
09:33:58 | | pedantic-darwin quits [Ping timeout: 255 seconds] |
09:33:58 | | pedantic-darwin9 is now known as pedantic-darwin |
09:38:37 | <@arkiver> | has anyone downloaded the subscene.com dump already and knows what the data looks like? |
09:41:18 | <@arkiver> | if someone has already downloaded it, can you please give me a listing of the files inside it? |
09:58:29 | | f_ (funderscore) joins |
10:00:55 | <c3manu> | arkiver: the subscene.com dump from then has been uploaded to subdl.com, but i don't remember the exact URL right now. it is linked in the reddit post from earlier |
10:02:16 | <katia> | https://subdl.org/subscene/ |
10:02:22 | <katia> | this? |
10:02:34 | <c3manu> | ah, subdl.org was it. yeah |
10:03:45 | <c3manu> | apart from that, i don't know how independent subdl.org is. when checking the download links on https://subdl.org/subtitles/2380/nobody-knows-2020/ for example, they point to URLs like https://subtitle.faylab.com/subdownload.php?dl=/subtitle/1046332-2156610.zip |
10:03:46 | | lflare (lflare) joins |
10:04:09 | <c3manu> | in fact, you can also use https://subtitle.faylab.com/ for browsing |
10:04:18 | <c3manu> | faylab seems to be some AI startup |
10:06:23 | <c3manu> | it would be possible the page disappears as well when faylab goes down eventually. but who knows. |
10:15:14 | | sec^nd quits [Remote host closed the connection] |
10:15:57 | | Perk quits [Read error: Connection reset by peer] |
10:15:58 | | sec^nd (second) joins |
10:16:38 | | solitonmedic joins |
10:22:35 | | sec^nd quits [Ping timeout: 250 seconds] |
10:26:55 | | benjinsm joins |
10:28:03 | | benjins2__ joins |
10:28:13 | | sec^nd (second) joins |
10:29:44 | | f_ quits [Client Quit] |
10:30:13 | | benjins2_ quits [Ping timeout: 255 seconds] |
10:30:13 | | benjinsmi quits [Ping timeout: 255 seconds] |
10:48:12 | | sd quits [Client Quit] |
11:03:58 | | tertu quits [Ping timeout: 255 seconds] |
11:04:01 | | tertu2 (tertu) joins |
11:08:07 | | Carnildo quits [Remote host closed the connection] |
11:08:22 | | Carnildo joins |
11:41:21 | <@arkiver> | c3manu: hah, nice little investigation, yeah! |
11:41:44 | | Perk joins |
11:42:53 | | Carnildo quits [Remote host closed the connection] |
11:42:56 | | Carnildo_again joins |
11:46:13 | <@arkiver> | rewby: i have absolutely no idea if i can get a project running in time, but i want to try to get a copy of subscene.com |
11:46:53 | <@arkiver> | so can we get a target already set up perhaps? |
11:47:20 | <@arkiver> | it would be |
11:47:26 | <@arkiver> | archiveteam_subscene_ |
11:47:27 | <@arkiver> | subscene_ |
11:47:31 | <@arkiver> | Archive Team Subscene: |
11:48:07 | <@arkiver> | let's hope the site stays on for a tiny bit longer |
11:50:14 | <@arkiver> | if someone has a good name for a subscene channel, let's make one |
11:50:22 | <@arkiver> | else we will use #archiveteam-bs |
11:52:27 | <@arkiver> | JAA: just in case, can you already trigger drone on subscene-grab please? |
11:53:08 | <that_lurker> | #subpar maybe. |
11:53:49 | <that_lurker> | nevermind katia owns it already :P |
11:54:32 | <angenieux> | #submarined maybe |
11:55:59 | <that_lurker> | There is also #submissive :P |
11:56:16 | <that_lurker> | or a variant #submissing :D |
11:56:29 | <katia> | 🍆 |
11:56:37 | <katia> | i like #submissing |
11:57:40 | <@arkiver> | ohhh |
11:57:48 | <@arkiver> | #submissing sounds awesome |
11:58:03 | <kpcyrd> | #submissing is funny lol |
11:58:11 | <angenieux> | its good |
12:02:06 | <@arkiver> | anyone know exactly how many hours we have left? |
12:02:19 | <@arkiver> | hmm 24 hours already passed? |
12:02:26 | <@arkiver> | let's hope it stays online |
12:04:30 | | Carnildo_again quits [Read error: Connection reset by peer] |
12:04:32 | | Carnildo_again joins |
12:21:18 | | Carnildo_again quits [Remote host closed the connection] |
12:21:28 | | Carnildo_again joins |
12:26:21 | | Perk quits [Read error: Connection reset by peer] |
12:27:01 | | Perk joins |
12:27:02 | | Perk6 joins |
12:27:10 | | Perk6 quits [Remote host closed the connection] |
12:42:09 | <h2ibot> | Nulldata uploaded File:Subscene-logo.gif: https://wiki.archiveteam.org/?title=File%3ASubscene-logo.gif |
12:42:38 | <@arkiver> | thanks nulldata |
12:42:46 | <@arkiver> | nulldata: are you able to get a good quality version of their icon? |
12:44:09 | <h2ibot> | Nulldata uploaded File:Subscene-screenshot.png: https://wiki.archiveteam.org/?title=File%3ASubscene-screenshot.png |
12:44:39 | | ell quits [Client Quit] |
12:44:53 | | ell (ell) joins |
12:46:32 | | Perk quits [Read error: Connection reset by peer] |
12:50:52 | | Wohlstand (Wohlstand) joins |
12:59:34 | | etnguyen03 (etnguyen03) joins |
13:01:21 | | Perk joins |
13:02:12 | <h2ibot> | Yzqzss edited 半次元 (-3, All(?) Web APIs are down): https://wiki.archiveteam.org/?diff=52129&oldid=50231 |
13:03:24 | | lflare quits [Client Quit] |
13:06:13 | <h2ibot> | Nulldata created Subscene (+820, Created page): https://wiki.archiveteam.org/?title=Subscene |
13:08:37 | <nulldata> | arkiver - the only slightly better one I've found was from a Reddit post - however the text seems off. https://lounge.nulldata.foo/uploads/ad21d617075dd9b3/subscene-logo2.png |
13:11:34 | | lennier2_ joins |
13:11:53 | | Dango360_ (Dango360) joins |
13:11:54 | | parfait_ joins |
13:12:18 | | superkuh_ joins |
13:12:33 | | useretail_ joins |
13:12:34 | | Still_Carnildo joins |
13:12:38 | | Jake2 (Jake) joins |
13:12:40 | | lunik11 joins |
13:12:47 | | s-crypt8 (s-crypt) joins |
13:12:49 | | Perk1 joins |
13:12:53 | | Unholy2361924640 (Unholy2361) joins |
13:12:57 | | linuxgemini1 (linuxgemini) joins |
13:13:02 | | tertu (tertu) joins |
13:13:08 | | Ruthalas593 (Ruthalas) joins |
13:13:19 | | nulldata4 (nulldata) joins |
13:13:23 | | midou_ joins |
13:13:26 | | TastyWiener953 (TastyWiener95) joins |
13:13:31 | | Webuser536 quits [Client Quit] |
13:13:31 | | qwertyasdfuiopghjkl quits [Client Quit] |
13:13:31 | | midou quits [Read error: Connection reset by peer] |
13:13:31 | | Guest quits [Quit: Connection closed] |
13:13:31 | | nulldata quits [Read error: Connection reset by peer] |
13:13:31 | | Carnildo_again quits [Read error: Connection reset by peer] |
13:13:31 | | Perk quits [Read error: Connection reset by peer] |
13:13:31 | | Jake quits [Read error: Connection reset by peer] |
13:13:31 | | Mateon1 quits [Read error: Connection reset by peer] |
13:13:31 | | linuxgemini quits [Read error: Connection reset by peer] |
13:13:31 | | skyrocket quits [Read error: Connection reset by peer] |
13:13:31 | | TastyWiener95 quits [Read error: Connection reset by peer] |
13:13:31 | | Unholy236192464 quits [Read error: Connection reset by peer] |
13:13:31 | | linuxgemini1 is now known as linuxgemini |
13:13:31 | | Unholy2361924640 is now known as Unholy236192464 |
13:13:31 | | Jake2 is now known as Jake |
13:13:32 | | Perk1 is now known as Perk |
13:13:32 | | nulldata4 is now known as nulldata |
13:13:32 | | midou_ is now known as midou |
13:13:32 | | TastyWiener953 is now known as TastyWiener95 |
13:13:35 | | knecht40 joins |
13:13:37 | | skyrocket joins |
13:14:12 | | tertu2 quits [Ping timeout: 265 seconds] |
13:14:37 | <@arkiver> | thanks for checking nulldata |
13:14:52 | <@arkiver> | i'll have the project ready in a few minutes i think |
13:14:55 | <@arkiver> | but we have no target yet |
13:15:15 | <h2ibot> | Switchnode edited ArchiveTeam Warrior (-1209, /* Advanced usage (container only) */…): https://wiki.archiveteam.org/?diff=52131&oldid=52125 |
13:15:26 | | useretail__ quits [Ping timeout: 265 seconds] |
13:15:27 | | Ruthalas59 quits [Ping timeout: 265 seconds] |
13:15:27 | | gaz quits [Ping timeout: 265 seconds] |
13:15:27 | | Dango360 quits [Ping timeout: 265 seconds] |
13:15:27 | | bladem quits [Ping timeout: 265 seconds] |
13:15:27 | | Ruthalas593 is now known as Ruthalas59 |
13:15:39 | | parfait quits [Ping timeout: 265 seconds] |
13:16:38 | | bladem (bladem) joins |
13:17:39 | | Arcorann__ joins |
13:18:04 | | Arcorann_ quits [Ping timeout: 265 seconds] |
13:18:04 | | lennier2 quits [Ping timeout: 265 seconds] |
13:18:04 | | lunik1 quits [Ping timeout: 265 seconds] |
13:18:04 | | knecht4 quits [Ping timeout: 265 seconds] |
13:18:04 | | s-crypt quits [Ping timeout: 265 seconds] |
13:18:04 | | superkuh quits [Ping timeout: 265 seconds] |
13:18:04 | | lunik11 is now known as lunik1 |
13:18:04 | | knecht40 is now known as knecht4 |
13:18:05 | | s-crypt8 is now known as s-crypt |
13:25:13 | | tapos joins |
13:31:59 | <thuban> | yzqzss / yts98 / Misty|m: is there a summary of the results of stwp's banciyuan project (maybe on your wiki?) that the archiveteam wiki can link to? i understand you downloaded videos (and images?) that we did not |
13:40:23 | | vics|m joins |
13:44:58 | <thuban> | JAA: i don't like the current behavior of the datetime template. if a time is specified without a zone, it renders it as "UTC", but if a time is specified as utc, it has to be rendered as "UTC (UTC+0)"; i think that the former should either be left ambiguous or emit a warning (lest it introduce false precision) and the latter should not be redundant (so that zone can be made |
13:45:00 | <thuban> | explicit in the source without looking weird). |
13:46:21 | <thuban> | you introduced the current zone handling; would you object to my revising it along those lines? (assuming i don't break it lmao, mediawiki template syntax makes my eyeballs hurt) |
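[editor's note] The zone handling thuban proposes can be sketched outside MediaWiki. This is not the Datetime template's code; the function name, parameters, and the "[zone missing]" marker are hypothetical and only mirror the two rules stated above: a time without a zone is flagged rather than assumed to be UTC, and explicit UTC is rendered once.

```python
def render_datetime(date, time=None, zone=None):
    """Render a date with an optional time; a time requires an explicit zone."""
    if time is None:
        # A bare date carries no zone information to get wrong.
        return date
    if zone is None:
        # No zone given: flag it visibly instead of silently assuming UTC.
        return f"{date} {time} [zone missing]"
    if zone.upper() in ("UTC", "UTC+0", "Z"):
        # Explicit UTC renders once, not as "UTC (UTC+0)".
        return f"{date} {time} UTC"
    return f"{date} {time} {zone}"

print(render_datetime("2024-05-03", "12:00", "UTC"))
print(render_datetime("2024-05-03", "12:00"))
```

In a wiki template the "flag" would more plausibly be a visible warning or a tracking category than inline text; the sketch only shows the branching.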
14:09:49 | | Arcorann__ quits [Ping timeout: 255 seconds] |
14:17:19 | | Carnildo joins |
14:17:19 | | Still_Carnildo quits [Read error: Connection reset by peer] |
14:26:49 | | lflare (lflare) joins |
14:39:35 | <h2ibot> | Nulldata edited Subscene (+54, DPoS started): https://wiki.archiveteam.org/?diff=52132&oldid=52130 |
14:47:23 | | ell quits [Client Quit] |
14:48:13 | | ell (ell) joins |
14:49:41 | | f_ (funderscore) joins |
14:52:08 | <thuban> | JAA: i have an edit mostly ready to go, but either it's very cleverly hidden in the documentation or base mediawiki templating can't actually require parameters or emit errors (?!) |
14:52:38 | <thuban> | there's https://www.mediawiki.org/wiki/Template:Error but idk how much trouble it would be to set up the lua module |
14:57:07 | <c3manu> | arkiver: also don't forget the forum.subscene.com. i thought it was running fine yesterday, apparently over night it had to be stalled and is still giving 403s right now |
14:59:21 | <nulldata> | c3manu - the forums use heavy Cloudflare unfortunately. |
15:00:39 | <h2ibot> | JAABot edited CurrentWarriorProject (+0): https://wiki.archiveteam.org/?diff=52133&oldid=52051 |
15:10:21 | <yzqzss> | <thuban> "yzqzss / yts98 / Misty: is there..." <- Most of the work on stwp's banciyuan archive project is done by Misty. |
15:10:22 | <yzqzss> | She's been busy with her thesis lately, and doesn't have any free time until May 25th :( |
15:10:22 | <yzqzss> | So, there is no project summary on our wiki at the moment, and we haven't done the final stage of metadata cleaning yet. |
15:13:16 | <thuban> | ok, just checking! ping me whenever, i'll update the at wiki |
15:13:56 | <thuban> | Misty|m: much success with your thesis! |
15:15:27 | <@arkiver> | good luck Misty|m :) |
15:17:07 | | csacscs joins |
15:17:15 | <yzqzss> | BTW, yts98 is not in STWP. |
15:17:15 | <yzqzss> | yts98 doesn't seem to have uploaded Niconico Game_Atsumaru's data to IA yet. i PM'd him on Feb 1st, no reply. He hasn't been on the thread for a long time. |
15:17:17 | | Carnildo quits [Read error: Connection reset by peer] |
15:17:22 | | Carnildo joins |
15:17:55 | | csacscs quits [Client Quit] |
15:20:15 | <yzqzss> | thread->channel |
15:23:57 | <thuban> | ah, my mistake; i had that impression from the #wuciyuan logs |
15:27:02 | <thuban> | i messaged him on 10 feb and also got no reply. i still have my portion of the game atsumaru data (~270G of warcs) but no idea where the rest is... |
15:28:49 | <yzqzss> | I have deleted them all... |
15:29:06 | <thuban> | :( |
15:41:27 | <@arkiver> | so lost data? |
15:41:57 | | f_ quits [Ping timeout: 250 seconds] |
15:42:12 | <thuban> | unless yts98 turns up again, looks like it |
15:42:44 | | f_ (funderscore) joins |
15:43:17 | <thuban> | i sniffed around a bit and couldn't find any trace after october of last year. hope he's ok |
15:45:45 | <thuban> | (might (still) be on qq? https://www.hw179.com/space-uid-37513.html) |
15:46:24 | <Terbium> | #submissing reminds me of the Titan sub incident lol |
15:46:53 | <katia> | such a good name |
15:46:58 | <katia> | that_lurker++ |
15:46:58 | <eggdrop> | [karma] 'that_lurker' now has 6 karma! |
15:47:27 | <@arkiver> | that_lurker++ |
15:47:27 | <eggdrop> | [karma] 'that_lurker' now has 7 karma! |
15:48:50 | <thuban> | ah, here's the pad we were using: https://pad.notkiska.pw/p/game-atsumaru |
15:50:16 | <yzqzss> | yts98 said there was 1.4TB warc in total... |
15:51:49 | <thuban> | that was total data, i believe, so more like half that in warc |
15:53:39 | | etnguyen03 quits [Client Quit] |
15:54:21 | | etnguyen03 (etnguyen03) joins |
15:56:21 | | ell quits [Client Quit] |
15:57:00 | | ell (ell) joins |
15:57:40 | <thuban> | threedeeitguy wasn't here long and disappeared last july, while matatabi doesn't appear in my logs at all... |
16:02:50 | <thuban> | arkiver: if i sent you my single giant zst, could you get it onto ia in the manner you suggested in https://hackint.logs.kiska.pw/archiveteam-bs/20230628#c355785 ? |
16:03:30 | <thuban> | sorry, i know you're busy, but i've had issues with batch uploads in the past |
16:04:07 | | etnguyen03 quits [Client Quit] |
16:04:11 | <yzqzss> | <thuban> "(might (still) be on qq? https:/..." <- This is probably not him, he is Taiwanese, IIRC. https://github.com/yth98 may be another account of his, but also no activity in 2024. |
16:05:07 | <@arkiver> | thuban: hmm sure |
16:05:25 | <@arkiver> | you have metadata etc. ready for each item that needs to be created? |
16:05:47 | | sec^nd quits [Ping timeout: 250 seconds] |
16:07:31 | <thuban> | yzqzss: ah, i wondered about that. you're probably right |
16:09:59 | | sec^nd (second) joins |
16:11:22 | | Wohlstand quits [Client Quit] |
16:16:25 | <thuban> | arkiver: no :/ all i have are game files, warcs, and a handful of json files used in the download process, which might or might not have anything useful |
16:16:36 | | Wohlstand (Wohlstand) joins |
16:17:10 | <@arkiver> | thuban: how big is the entire zst? |
16:17:57 | <thuban> | arkiver: 221G |
16:18:21 | <@arkiver> | we can also just upload the entire file to a single IA item for now |
16:19:28 | <@arkiver> | as a mediatype=data item |
16:22:06 | <thuban> | an upload that large seems likely to be flaky, but i'll give it a go if you think that's best |
16:24:13 | | Wohlstand quits [Client Quit] |
16:25:32 | | systwi_ joins |
16:25:57 | <thuban> | that said it will probably have to wait a week or two. data is on a raid enclosure that's going bad which i'm currently backing up; that particular file is safe already but i want to get the rest done before i touch the array for anything else |
16:26:43 | <thuban> | !remindme 1week upload game atsumaru data |
16:26:44 | <eggdrop> | [remind] ok, i'll remind you at 2024-05-10T00:00:00Z |
16:27:53 | <@arkiver> | thuban: no problem at all! |
16:28:47 | <thuban> | thanks for helping <3 |
16:28:57 | <@arkiver> | of course! |
16:29:14 | <@arkiver> | most important is getting the dump safely uploaded, after that we can look into separate items |
16:32:43 | <@rewby> | arkiver: You have a target |
16:32:45 | <@rewby> | Good luck |
16:32:55 | <@arkiver> | rewby: wooh :) |
16:33:05 | <@arkiver> | i'll see about moving the other WARCs over later |
16:33:15 | <@arkiver> | thanks rewby |
16:34:00 | <@arkiver> | rewby: is it correct the imgur target is still enabled on subscene too? |
16:34:08 | <@arkiver> | oh just removed |
16:34:12 | <@arkiver> | nvm |
16:42:58 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
16:47:48 | <mgrandi> | https://www.bfbcpa.us/ this company was suspended from doing SEC work, don't know if they are going under as a result |
16:49:32 | <katia> | mgrandi, added to archivebot |
16:50:22 | | Carnildo quits [Read error: Connection reset by peer] |
16:50:28 | | Carnildo joins |
16:50:58 | <mgrandi> | Ty |
16:51:49 | | treora quits [Remote host closed the connection] |
16:51:50 | | treora joins |
16:52:05 | | treora quits [Remote host closed the connection] |
16:52:06 | | treora joins |
16:52:20 | | treora quits [Remote host closed the connection] |
16:52:21 | | treora joins |
16:52:36 | | treora quits [Remote host closed the connection] |
16:52:37 | | treora joins |
16:52:51 | | treora quits [Remote host closed the connection] |
16:52:53 | | treora joins |
16:53:07 | | treora quits [Remote host closed the connection] |
16:53:09 | | treora joins |
16:53:18 | <@JAA> | thuban: I agree with handling the 'UTC (UTC+0)' case, but I don't think we should allow ambiguous times like that. Also, some pages already depend on UTC being the default. Perhaps we should have a separate {{ambiguous datetime}} template. Indeed, you can't have fatal errors from templates as far as I know. That's why {{URL}} adds a category to pages with broken arguments, for example. |
16:53:23 | | treora quits [Remote host closed the connection] |
16:54:11 | | treora joins |
16:54:26 | | treora quits [Remote host closed the connection] |
16:54:27 | | treora joins |
16:54:41 | | treora quits [Remote host closed the connection] |
16:54:42 | | treora joins |
16:58:37 | <thuban> | JAA: i also would prefer not to allow ambiguous times. (i don't think a separate template is necessary.) i am willing to manually edit existing usages of the template (since it was """required""" before being made to default to utc, so any use of the default should be intentional). |
16:59:13 | <@arkiver> | so post.news |
16:59:20 | <thuban> | JAA: what about that lua module? |
16:59:46 | <@arkiver> | #past-news it is for post.news |
17:04:38 | | Notrealname1234 (Notrealname1234) joins |
17:04:53 | <@JAA> | thuban: What I'm trying to say I guess is that the default time zone for everything here should be UTC. We already do that in IRC logs etc. without explicitly mentioning UTC because this is an international environment and UTC is the only sensible time zone there. |
17:05:12 | <@JAA> | No idea re Lua. |
17:07:21 | <thuban> | JAA: i agree that it _should_ be the default. but sooner or later some jackass like me is going to be updating the wiki, and copy something from their local logs with a local timestamp, and not notice that they've failed to convert it without a big red error to warn them, and then it will be wrong forever. and we should have guardrails against that. |
17:07:55 | <@JAA> | Mhm |
17:08:35 | <Notrealname1234> | Wow, first "JAA" message without a dot? |
17:09:03 | <katia> | No. |
17:09:03 | <h2ibot> | Nulldata edited Post News (+13, Channel decided): https://wiki.archiveteam.org/?diff=52134&oldid=52070 |
17:09:23 | <Notrealname1234> | katia: Yes, also i was just joking lol |
17:11:17 | | Notrealname1234 quits [Client Quit] |
17:11:25 | <thuban> | (i did think of abusing the parser by, say, invoking #time with a malformed argument so that it prints "Invalid time". but i think that would be the Wrong Thing :P) |
17:11:30 | | Notrealname1234 (Notrealname1234) joins |
17:15:23 | | Island joins |
17:15:37 | | Notrealname1234 quits [Client Quit] |
17:23:08 | <thuban> | JAA: ah, i think it would require installing an extension, which iirc you don't have access for (who does?) https://www.mediawiki.org/wiki/Extension:Scribunto#Installation |
17:23:33 | <@JAA> | thuban: I do, but I still have no idea. :-) |
17:25:39 | <@JAA> | I'm not sure we have anything Lua-based on the wiki currently, in particular. |
17:25:59 | | Carnildo quits [Read error: Connection reset by peer] |
17:26:18 | | Carnildo joins |
17:26:34 | <thuban> | looks like not https://wiki.archiveteam.org/index.php/Special:Version |
17:27:20 | <@JAA> | Yeah, so getting that to work might be more pain than it's worth. |
17:30:06 | <@JAA> | #formatdate doesn't seem to produce any error when passing an invalid time. Fun. |
17:30:16 | <thuban> | the docs make it look pretty straightforward. but i have no idea what the at wiki is like behind the scenes, so idk, let me know |
17:30:33 | <@JAA> | I.e. we wouldn't catch that error anyway. |
17:31:22 | <@JAA> | I think we can go with the same approach as on {{URL}}, i.e. add a category to pages with incorrect usage so that they can be found and fixed. |
17:34:01 | <thuban> | hm, i guess if it's colored red that's good enough |
17:40:10 | <h2ibot> | Switchnode edited Template:Datetime (+717, require zones for times): https://wiki.archiveteam.org/?diff=52135&oldid=51013 |
17:40:25 | <thuban> | look ok? |
17:40:56 | <thuban> | other than the category being added, i thought that wasn't supposed to happen inside <includeonly>... |
17:42:21 | <thuban> | oh lol, it's because i gave negative examples. nvm |
17:44:57 | | Carnildo quits [Read error: Connection reset by peer] |
17:44:59 | | Carnildo joins |
17:46:55 | <@JAA> | Yeah, that looks fine apart from the category. |
17:47:23 | <@JAA> | Maybe we can just remove those negative examples. |
17:47:51 | <thuban> | meh, i think it's fine to leave it |
17:48:04 | <thuban> | will update usages now |
17:52:12 | <h2ibot> | Switchnode edited Deathwatch (+20, explicitly mark utc datetimes (due to template…): https://wiki.archiveteam.org/?diff=52136&oldid=52107 |
17:53:12 | <h2ibot> | Switchnode edited Elections/2019 Swiss federal election/Candidates/Luzern (+4, explicitly mark utc datetimes (due to template…): https://wiki.archiveteam.org/?diff=52137&oldid=51299 |
17:55:13 | <h2ibot> | Switchnode edited List of websites excluded from the Wayback Machine/Former exclusions (+4, explicitly mark utc datetimes (due to template…): https://wiki.archiveteam.org/?diff=52138&oldid=52038 |
17:56:13 | <h2ibot> | Switchnode edited MediaFire (+4, /* Projects */ explicitly mark utc datetimes…): https://wiki.archiveteam.org/?diff=52139&oldid=51305 |
17:57:14 | <h2ibot> | Switchnode edited Roblox (+5, /* Marketplace Comments removal (April 2024) */…): https://wiki.archiveteam.org/?diff=52140&oldid=52115 |
17:58:14 | <h2ibot> | Switchnode edited Sola.ai (+4, /* Shutdown notice */ explicitly mark utc…): https://wiki.archiveteam.org/?diff=52141&oldid=51317 |
18:08:49 | | Carnildo quits [Read error: Connection reset by peer] |
18:08:53 | | Carnildo joins |
18:10:16 | <h2ibot> | Switchnode edited Template:Datetime (+9, fix spacing): https://wiki.archiveteam.org/?diff=52142&oldid=52135 |
18:11:16 | <h2ibot> | Switchnode edited South Park Forums (+4, explicitly mark utc datetimes (due to template…): https://wiki.archiveteam.org/?diff=52143&oldid=51486 |
18:12:16 | <h2ibot> | Switchnode edited TJ (+4, explicitly mark utc datetimes (due to template…): https://wiki.archiveteam.org/?diff=52144&oldid=52055 |
18:13:16 | <h2ibot> | Switchnode edited We-TeVe (+4, explicitly mark utc datetimes (due to template…): https://wiki.archiveteam.org/?diff=52145&oldid=51016 |
18:19:18 | <h2ibot> | Switchnode edited YouTube (+40, explicitly mark utc datetimes (due to template…): https://wiki.archiveteam.org/?diff=52146&oldid=51945 |
18:27:21 | <h2ibot> | Switchnode edited YouTube (+10, /* Video counter */ missed one): https://wiki.archiveteam.org/?diff=52147&oldid=52146 |
18:31:16 | <thuban> | fwiw, i was able to trace everything back to a cited source or earlier revision that was explicitly utc, except for three additions by JAA who i assume just got it right |
18:35:32 | | Carnildo quits [Read error: Connection reset by peer] |
18:35:46 | | Carnildo joins |
18:36:23 | <h2ibot> | Switchnode edited 半次元 (+37, update status; datetimeify): https://wiki.archiveteam.org/?diff=52148&oldid=52129 |
18:44:39 | <yzqzss> | STWP is going to start a new archive project: the Discuz! archive. ("Discuz!" is the most popular BBS|forum software in China) |
18:45:56 | <yzqzss> | We think this project can work with AT/AB: the detailed metadata grabbed through the site's API is uploaded to IA like wikiteam3/doku-dumper, and at the same time the discuz! archive tool outputs a urls.txt (each url is a thread), so ArchiveBot can do web archiving for them. |
18:46:10 | <yzqzss> | is this idea feasible? :) |
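[editor's note] The urls.txt half of yzqzss's proposal could look roughly like the sketch below. It is not STWP's tool: the function names are made up, and the `forum.php?mod=viewthread&tid=N` thread URL pattern is an assumption that would need checking against each site, since forums can rewrite their URLs.

```python
import io

def thread_urls(base_url, thread_ids):
    """Yield one URL per unique thread id, in ascending order."""
    base = base_url.rstrip("/")
    for tid in sorted(set(thread_ids)):
        # Assumed URL pattern; verify per site before using.
        yield f"{base}/forum.php?mod=viewthread&tid={tid}"

def write_url_list(base_url, thread_ids, out):
    """Write one thread URL per line to a file-like object; return the count."""
    count = 0
    for url in thread_urls(base_url, thread_ids):
        out.write(url + "\n")
        count += 1
    return count

# Example with a placeholder host and made-up thread ids.
buf = io.StringIO()
n = write_url_list("https://forum.example/", [3, 1, 3, 2], buf)
print(n, "URLs written")
```

A one-URL-per-line file like this has the same shape as the subdomain list pokechu22 ran with `!a <` earlier in the log.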
18:46:20 | | Unholy2361924645 (Unholy2361) joins |
18:46:48 | <yzqzss> | some "powered by Discuz!" examples: |
18:46:48 | <yzqzss> | https://www.cn-dos.net/forum/ |
18:46:48 | <yzqzss> | http://nesbbs.com/bbs/ |
18:46:48 | <yzqzss> | http://www.crystalradio.cn/ |
18:46:48 | <yzqzss> | https://discuz.dismall.com/ |
18:47:22 | <pokechu22> | Something like that probably would work, though if the goal is to just throw in a list of URLs without recursion I'm pretty sure there's also a distributed project for that? |
18:47:55 | | Unholy236192464 quits [Ping timeout: 255 seconds] |
18:47:55 | | Unholy2361924645 is now known as Unholy236192464 |
18:48:34 | <thuban> | pokechu22: #// would not be appropriate for this due to the ddos risk. |
18:50:04 | | treora quits [Read error: Connection reset by peer] |
18:50:05 | | treora joins |
18:52:17 | | etnguyen03 (etnguyen03) joins |
18:52:25 | <thuban> | i think archivebot would be, though! (just give the output files identifiable names) |
18:55:25 | <thuban> | arkiver: just so i don't forget, there's also https://archive.org/details/game_atsumaru_comments_scoreboard_api_20230628. i don't think there's much useful in it either, but |
18:57:30 | | Carnildo quits [Read error: Connection reset by peer] |
18:57:32 | | Carnildo joins |
19:08:51 | | Carnildo quits [Remote host closed the connection] |
19:08:53 | | Carnildo joins |
19:45:29 | | f_ quits [Ping timeout: 250 seconds] |
19:46:50 | | Notrealname1234 (Notrealname1234) joins |
19:48:36 | | Unholy236192464 quits [Ping timeout: 265 seconds] |
19:59:43 | <h2ibot> | Manu edited Mailman/2 (-7, /* eclipse.org lists done */): https://wiki.archiveteam.org/?diff=52149&oldid=52120 |
20:05:14 | <fireonlive> | c3manu++ |
20:05:14 | <eggdrop> | [karma] 'c3manu' now has 5 karma! |
20:06:23 | | Carnildo quits [Remote host closed the connection] |
20:06:41 | | Carnildo joins |
20:10:08 | <c3manu> | <3 |
20:11:50 | <h2ibot> | Manu edited Mailman/2 (+21, /* http://ips.gov.au/ done */): https://wiki.archiveteam.org/?diff=52150&oldid=52149 |
20:13:40 | | Notrealname1234 quits [Client Quit] |
20:14:21 | | Guest joins |
20:15:24 | <fireonlive> | <3 |
20:18:43 | | Carnildo quits [Read error: Connection reset by peer] |
20:18:53 | | Carnildo joins |
20:20:41 | | Notrealname1234 (Notrealname1234) joins |
20:20:52 | <h2ibot> | Manu edited Mailman/2 (+43, /* acl.bestbits.at lost */): https://wiki.archiveteam.org/?diff=52151&oldid=52150 |
20:21:13 | | Overlordz joins |
20:21:19 | | Carnildo quits [Remote host closed the connection] |
20:21:22 | | yano quits [Quit: WeeChat, the better IRC client, https://weechat.org/] |
20:21:28 | | Carnildo joins |
20:24:03 | | yano (yano) joins |
20:29:13 | | Notrealname1234 quits [Client Quit] |
20:53:14 | | Wohlstand (Wohlstand) joins |
20:57:59 | <h2ibot> | Manu edited Mailman/2 (+64, /* http://addictivecode.org/pipermail lost */): https://wiki.archiveteam.org/?diff=52152&oldid=52151 |
21:00:31 | | Carnildo quits [Read error: Connection reset by peer] |
21:00:44 | | Carnildo joins |
21:23:03 | <h2ibot> | Manu edited Mailman/2 (+28, /* http://agilees.net/mailman/listinfo "saved" */): https://wiki.archiveteam.org/?diff=52153&oldid=52152 |
21:32:11 | | wyatt8740 quits [Remote host closed the connection] |
21:35:20 | | wyatt8740 joins |
21:37:06 | <h2ibot> | Manu edited Mailman/2 (+69, /* https://alice.wu.ac.at/mailman/listinfo…): https://wiki.archiveteam.org/?diff=52154&oldid=52153 |
21:38:06 | <h2ibot> | Nulldata edited Subscene (+119, Has died. F.): https://wiki.archiveteam.org/?diff=52155&oldid=52132 |
21:41:06 | <h2ibot> | Nulldata edited Subscene (+14, Past tense.): https://wiki.archiveteam.org/?diff=52156&oldid=52155 |
21:42:06 | <h2ibot> | Nulldata edited Subscene (+1): https://wiki.archiveteam.org/?diff=52157&oldid=52156 |
21:42:34 | | Carnildo quits [Read error: Connection reset by peer] |
21:42:37 | | Carnildo joins |
21:44:07 | <h2ibot> | Manu edited Mailman/2 (+0, /* addendum */): https://wiki.archiveteam.org/?diff=52158&oldid=52154 |
21:47:07 | <h2ibot> | Manu edited Mailman/2 (+8, /* http://alioth-lists.debian.net/pipermail/…): https://wiki.archiveteam.org/?diff=52159&oldid=52158 |
21:47:34 | | sd (sd) joins |
21:47:42 | | Notrealname1234 (Notrealname1234) joins |
21:51:16 | | sd quits [Client Quit] |
21:52:27 | | sd (sd) joins |
21:57:09 | <h2ibot> | Manu edited Mailman/2 (+136, /* saved amsat.org lists */): https://wiki.archiveteam.org/?diff=52160&oldid=52159 |
21:59:47 | | Notrealname1234 quits [Client Quit] |
22:09:15 | | wyatt8740 quits [Ping timeout: 265 seconds] |
22:09:35 | | etnguyen03 quits [Client Quit] |
22:10:03 | | wyatt8740 joins |
22:10:16 | | etnguyen03 (etnguyen03) joins |
22:10:40 | | sd quits [Client Quit] |
22:10:50 | | Carnildo quits [Read error: Connection reset by peer] |
22:11:07 | | Carnildo joins |
22:11:12 | <h2ibot> | Manu edited Mailman/2 (+55, /* https://aquaticinfo.org/mailman/listinfo…): https://wiki.archiveteam.org/?diff=52161&oldid=52160 |
22:13:12 | <h2ibot> | Manu edited Mailman/2 (+43, /* http://arago-project.org/pipermail/ lost */): https://wiki.archiveteam.org/?diff=52162&oldid=52161 |
22:18:13 | <h2ibot> | Manu edited Mailman/2 (+57, /* https://lists.barton.de/ saved */): https://wiki.archiveteam.org/?diff=52163&oldid=52162 |
22:18:58 | | eightthree quits [Ping timeout: 255 seconds] |
22:19:13 | <h2ibot> | Manu edited Mailman/2 (-34, /* remove http://arthur.barton.de/pipermail…): https://wiki.archiveteam.org/?diff=52164&oldid=52163 |
22:20:03 | | etnguyen03 quits [Client Quit] |
22:21:00 | | eightthree joins |
22:22:57 | | Notrealname1234 (Notrealname1234) joins |
22:35:51 | | Carnildo quits [Remote host closed the connection] |
22:35:58 | | Carnildo joins |
22:45:40 | | wyatt8750 joins |
22:46:29 | | Carnildo quits [Read error: Connection reset by peer] |
22:46:43 | | Carnildo joins |
22:46:57 | | wyatt8740 quits [Ping timeout: 265 seconds] |
22:54:12 | | wyatt8750 quits [Ping timeout: 265 seconds] |
22:54:19 | | decky_e quits [Read error: Connection reset by peer] |
22:54:40 | | wyatt8740 joins |
22:54:43 | | decky_e joins |
23:00:21 | <h2ibot> | JAABot edited CurrentWarriorProject (+0): https://wiki.archiveteam.org/?diff=52165&oldid=52133 |
23:10:46 | | Carnildo quits [Remote host closed the connection] |
23:10:58 | | Carnildo joins |
23:15:56 | | etnguyen03 (etnguyen03) joins |
23:17:57 | | Notrealname1234 quits [Client Quit] |
23:23:25 | | Carnildo quits [Read error: Connection reset by peer] |
23:23:49 | | Carnildo joins |
23:26:06 | | wyatt8740 quits [Ping timeout: 265 seconds] |
23:28:46 | | wyatt8740 joins |
23:42:57 | | lunik1 quits [Client Quit] |
23:43:26 | | lunik11 joins |
23:45:08 | | ell quits [Client Quit] |
23:45:12 | | etnguyen03 quits [Client Quit] |
23:45:19 | | ell (ell) joins |
23:50:37 | | pedantic-darwin0 joins |
23:51:13 | | pedantic-darwin quits [Ping timeout: 255 seconds] |
23:51:13 | | pedantic-darwin0 is now known as pedantic-darwin |