00:01:40QuantumLTU quits [Client Quit]
00:02:27voskull joins
00:04:31Ivan226 quits [Ping timeout: 265 seconds]
00:14:46voskull quits [Remote host closed the connection]
00:15:43Ivan226 joins
00:26:28Arcorann (Arcorann) joins
00:45:55nicolas17 joins
01:04:59BlueMaxima quits [Read error: Connection reset by peer]
01:18:25andrew quits [Quit: ]
01:18:43andrew (andrew) joins
01:25:14wyatt8740 quits [Ping timeout: 265 seconds]
01:25:27wyatt8740 joins
01:27:59warriorprob joins
01:43:20<pabs>why isn't the job for the root of the domain listed on https://archive.fart.website/archivebot/viewer/domain/listor.tp-sv.se ?
01:43:52<pabs>the two subdir jobs are listed but not the main one
01:46:39<pokechu22>hash collision: https://archive.fart.website/archivebot/viewer/job/2dtad - that job was 2dtad3hqrs5vy2u11czl02uep, the other was something else that started with 2dtad, I guess
01:46:57<nicolas17>D:
01:50:25<pabs>aha. maybe it should accept the full job ID :)
01:54:31icedice quits [Client Quit]
02:00:02Ivan226 quits [Ping timeout: 265 seconds]
02:03:50Guest50 quits [Ping timeout: 252 seconds]
02:03:52<pokechu22>I imagine it works that way because it's probably parsing the filename (so both dieter-l-koch.de-shallow-20190606-171452-2dtad.json and listor.tp-sv.se-inf-20230509-004345-2dtad.json look the same to it). Oddly I don't think the job ID is in the JSON file - it's in the meta-warc, but that's annoying to find
02:04:07<pokechu22>er, not annoying to find, annoying to extract in bulk
02:04:53Guest50 joins
02:07:18Philipp_DE quits [Remote host closed the connection]
02:20:02vantec (vantec) joins
02:36:33Mateon2 joins
02:36:35<pabs>JAA: I think the opensource.com AB job is done with the site itself, still 70k links to go tho
02:38:17Mateon1 quits [Ping timeout: 252 seconds]
02:38:17Mateon2 is now known as Mateon1
02:43:21Ivan226 joins
03:30:32Ruthalas5 quits [Ping timeout: 252 seconds]
03:34:28Ruthalas5 (Ruthalas) joins
04:11:48birdjj quits [Read error: Connection reset by peer]
04:11:54birdjj joins
04:12:20Guest50 quits [Ping timeout: 252 seconds]
04:14:48Guest50 joins
04:18:45Ivan226 quits [Ping timeout: 265 seconds]
04:19:01hitgrr8 joins
04:25:32Guest50 quits [Ping timeout: 252 seconds]
04:34:18Guest50 joins
04:46:36BlueMaxima joins
05:43:21BigBrain quits [Ping timeout: 245 seconds]
05:44:32nicolas17 quits [Client Quit]
06:22:49BigBrain (bigbrain) joins
06:35:34datechnoman quits [Quit: The Lounge - https://thelounge.chat]
06:36:07datechnoman (datechnoman) joins
06:58:15dumbgoy__ quits [Ping timeout: 265 seconds]
07:00:13birdjj8 joins
07:00:15birdjj quits [Read error: Connection reset by peer]
07:00:15birdjj8 is now known as birdjj
07:04:11BlueMaxima quits [Read error: Connection reset by peer]
07:20:46ehmry quits [Client Quit]
07:53:12ehmry joins
08:16:18Island quits [Read error: Connection reset by peer]
08:16:26Ivan226 joins
08:21:34Philipp_DE joins
09:16:52icedice (icedice) joins
09:46:11NameUser77 quits [Remote host closed the connection]
10:21:15haltingstate quits [Ping timeout: 265 seconds]
10:56:36BearFortress quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]
11:09:01BearFortress joins
11:47:54Philipp_DE quits [Remote host closed the connection]
11:49:46HP_Archivist (HP_Archivist) joins
12:18:52Philipp_DE joins
13:12:49<h2ibot>Sanqui edited Deathwatch (+275, add The Silph Road): https://wiki.archiveteam.org/?diff=49779&oldid=49775
13:22:56icedice quits [Client Quit]
13:31:16Guest50 quits [Read error: Connection reset by peer]
13:37:01nostalgebraist joins
13:50:46Arcorann quits [Ping timeout: 252 seconds]
14:16:03icedice (icedice) joins
14:23:52Guest50 joins
14:34:20x-56k-modem quits [Quit: WeeChat 3.8]
14:43:11Guest50 quits [Ping timeout: 252 seconds]
14:48:57Philipp_DE quits [Remote host closed the connection]
14:56:56nighthnh099 joins
14:58:06<nighthnh099>I have a bunch of files mirrored from a couple of websites using wget, these pages either no longer exist or the website went down
14:58:32<nighthnh099>my stupid self forgot to archive them on wayback, I'm wondering if they could be merged to wayback somehow?
14:58:41<nighthnh099>they're not in warc so already they're missing some information
14:59:03<nighthnh099>they're also scattered across my pc so I won't be able to provide them immediately by the way
15:02:52x-56k-modem (x-56k-modem) joins
15:05:23<icedice>nighthnh099: It's not going to be accepted if it's not in warc
15:06:13<icedice>You could self-host the HTML files and archive that. Not ideal since nobody who comes from the original source would find it unless they specifically google for it and your site shows up
15:07:03<icedice>Neocities can host HTML files for free and is a genuinely good platform, like how the internet used to be, so maybe try putting it there?
15:07:37<nighthnh099>for some of the files, I already put them on archive.org anyway, so maybe I'll just do that for the rest
15:07:46<icedice>They've also been around for quite a while and the freemium model probably keeps it sustainable
15:08:11<icedice>https://neocities.org/
15:08:34<icedice>You could upload the HTML files to Internet Archive though
15:08:46<icedice>They'd be DDL only though, no Wayback Machine
15:17:10<h2ibot>Bzc6p edited News+C/hu (+19, Update): https://wiki.archiveteam.org/?diff=49781&oldid=49195
15:22:08Philipp_DE joins
15:24:56icedice quits [Client Quit]
15:36:36icedice (icedice) joins
15:54:20dumbgoy__ joins
16:00:56tzt quits [Ping timeout: 252 seconds]
16:02:27tzt (tzt) joins
16:04:56nighthnh099 quits [Remote host closed the connection]
16:11:24nicolas17 joins
16:40:33ZizzyDizzyMC quits [Remote host closed the connection]
16:50:39hader210 joins
16:51:01hader210 leaves
17:03:29jacksonchen666 (jacksonchen666) joins
17:15:11tech_exorcist (tech_exorcist) joins
17:27:43sonick quits [Client Quit]
17:43:26icedice quits [Client Quit]
17:57:53icedice (icedice) joins
18:31:29Minkafighter quits [Quit: The Lounge - https://thelounge.chat]
18:31:46Minkafighter joins
18:43:46myriad joins
18:45:09jacksonchen666 quits [Client Quit]
19:00:19myriad quits [Remote host closed the connection]
19:17:37myriad_ joins
19:20:32myriad_ quits [Remote host closed the connection]
19:23:13Megame (Megame) joins
19:33:03Craigle quits [Quit: The Lounge - https://thelounge.chat]
19:33:35Craigle (Craigle) joins
19:38:17myriad_ joins
19:48:41myriad_ quits [Ping timeout: 265 seconds]
19:56:47GNU_world joins
20:00:36Guest50 joins
20:02:10myriad_ joins
20:03:00myriad_ quits [Remote host closed the connection]
20:05:22myriad_ joins
20:07:39myriad_ quits [Remote host closed the connection]
20:08:55myriad_ joins
20:11:25myriad_ quits [Remote host closed the connection]
20:19:46<icedice>Is anyone here able to grab 429'd Imgur URLs from an ongoing archivation job and run them on an unblocked IP? JAA was supposed to do it, but I haven't seen him here today
20:20:49<@JAA>icedice: If by 'archivation job' you mean ArchiveBot, then no. And I'm here, just too many things to do at once, so I didn't get to you yet.
20:21:04<icedice>Ok
20:21:09<icedice>Ah, that makes sense
20:21:18<icedice>I was wondering why I didn't see you on D-Day
20:21:42<icedice>Take your time
20:28:02myriad_ joins
20:28:45nostalgebraist quits [Read error: Connection reset by peer]
20:30:28Guest50 quits [Client Quit]
20:32:02nostalgebraist joins
20:36:57Philipp_DE quits [Remote host closed the connection]
20:40:06nostalgebraist_ joins
20:41:48nostalgebraist quits [Ping timeout: 252 seconds]
20:42:49myriad_ quits [Ping timeout: 265 seconds]
20:43:11Shjosan quits [Quit: Am sleepy (-, – )…zzzZZZ]
20:44:02Shjosan (Shjosan) joins
20:44:15myriad_ joins
20:55:32myriad_ quits [Ping timeout: 252 seconds]
20:59:11hitgrr8 quits [Client Quit]
20:59:34Guest50 joins
21:07:23myriad_ joins
21:09:53myriad_ quits [Remote host closed the connection]
21:40:06tech_exorcist quits [Client Quit]
21:55:11Island joins
22:04:57Lord_Nightmare quits [Quit: ZNC - http://znc.in]
22:10:32Lord_Nightmare (Lord_Nightmare) joins
22:10:53WOANS joins
22:18:20WOANS quits [Remote host closed the connection]
22:31:14HP_Archivist quits [Client Quit]
22:33:16birdjj quits [Client Quit]
22:34:00birdjj joins
22:34:25atuser joins
22:39:49fredgido (fredgido) joins
22:54:43guest6767 joins
22:56:32<guest6767>i do have a sitemap for a isp, can i share it here? If not, where i can share it?
23:03:00guest6767 quits [Remote host closed the connection]
23:13:44Pannekoek joins
23:16:25nicolas17 quits [Client Quit]
23:37:41Pannekoek quits [Remote host closed the connection]
23:52:55random joins
23:58:14dumbgoy joins
23:59:48dumbgoy__ quits [Ping timeout: 252 seconds]