00:05:20etnguyen03 quits [Ping timeout: 252 seconds]
00:23:53qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
00:37:53ell quits [Client Quit]
00:54:32Wohlstand (Wohlstand) joins
01:29:51<pabs>fireonlive: add to Deathwatch?
01:30:18benjinsm joins
01:33:15benjins quits [Ping timeout: 265 seconds]
01:34:26Dango360 quits [Ping timeout: 252 seconds]
01:42:07bladem quits [Read error: Connection reset by peer]
01:49:50wickedplayer494 quits [Ping timeout: 252 seconds]
01:50:25wickedplayer494 joins
02:03:05etnguyen03 (etnguyen03) joins
02:31:50AnotherIki joins
02:35:36Iki1 quits [Ping timeout: 265 seconds]
02:35:37<h2ibot>PaulWise edited Site exploration (+104, add bing-scrape link (by JAA)): https://wiki.archiveteam.org/?diff=50868&oldid=50594
02:37:41ymgve quits [Ping timeout: 252 seconds]
03:08:13Rootliam leaves
03:13:14Wohlstand quits [Client Quit]
03:13:30Wohlstand (Wohlstand) joins
03:14:54Wohlstand quits [Client Quit]
03:15:10Wohlstand (Wohlstand) joins
03:55:50dumbgoy quits [Ping timeout: 265 seconds]
03:57:41DogsRNice quits [Read error: Connection reset by peer]
04:02:23etnguyen03 quits [Ping timeout: 252 seconds]
04:05:03etnguyen03 (etnguyen03) joins
04:15:53<h2ibot>Yts98 edited ZOWA (+50, Mark as in progress): https://wiki.archiveteam.org/?diff=50869&oldid=50844
04:36:56<h2ibot>Yts98 edited Current Projects (+0, move ZOWA to current): https://wiki.archiveteam.org/?diff=50870&oldid=50777
04:38:14etnguyen03 quits [Client Quit]
04:42:26ell (ell) joins
04:42:27Stiletto joins
05:16:02<h2ibot>PaulWise created ArchiveBot/Ignore/NonSequentialIntegers (+5986, add a way to ignore non-sequential integer…): https://wiki.archiveteam.org/?title=ArchiveBot/Ignore/NonSequentialIntegers
05:19:59Island quits [Read error: Connection reset by peer]
05:21:03<h2ibot>PaulWise edited MoinMoin (+157, link to the non-sequential integer ignores page): https://wiki.archiveteam.org/?diff=50872&oldid=50783
05:57:53project10 quits [Ping timeout: 252 seconds]
06:00:16project10 (project10) joins
06:09:55railen63 quits [Remote host closed the connection]
06:11:16project10 quits [Client Quit]
06:15:39project10 (project10) joins
06:17:14railen63 joins
06:24:50project10 quits [Ping timeout: 252 seconds]
06:26:39project10 (project10) joins
06:58:24jacobgkau quits [Remote host closed the connection]
07:00:08nfriedly quits [Remote host closed the connection]
07:03:25jacobgkau joins
07:06:45Unholy2361316618085159 (Unholy2361) joins
07:20:39jacobgkau quits [Client Quit]
07:25:07parfait (kdqep) joins
07:29:35efeafewa quits [Remote host closed the connection]
07:38:39BlueMaxima quits [Read error: Connection reset by peer]
07:40:35gfhh quits [Ping timeout: 265 seconds]
07:42:01Arcorann (Arcorann) joins
08:19:14ThreeHM quits [Ping timeout: 252 seconds]
08:19:49jacobgkau joins
08:22:59ymgve joins
08:32:30ThreeHM (ThreeHeadedMonkey) joins
09:03:03greg joins
09:16:07Naruyoko joins
09:18:13Naruyoko5 quits [Ping timeout: 265 seconds]
09:55:26jacobgkau quits [Ping timeout: 265 seconds]
10:00:01railen63 quits [Remote host closed the connection]
10:00:17railen63 joins
10:17:14nfriedly joins
12:24:22Matthww11 quits [Quit: Ping timeout (120 seconds)]
12:24:41Matthww11 joins
12:44:48kiryu quits [Remote host closed the connection]
12:45:58kiryu (kiryu) joins
12:46:32Chris5010 quits [Ping timeout: 252 seconds]
12:54:48Chris5010 (Chris5010) joins
12:54:56Chris5010 quits [Client Quit]
12:55:14Chris5010 (Chris5010) joins
13:17:00etnguyen03 (etnguyen03) joins
13:38:44Arcorann quits [Ping timeout: 265 seconds]
13:46:55Mateon2 joins
13:46:57magmaus3 quits [Ping timeout: 265 seconds]
13:47:35Mateon1 quits [Ping timeout: 252 seconds]
13:47:35Mateon2 is now known as Mateon1
13:58:37<h2ibot>JustAnotherArchivist edited Miraheze (+29, Datetimeify): https://wiki.archiveteam.org/?diff=50873&oldid=50859
14:01:11parfait_ joins
14:05:19parfait quits [Ping timeout: 265 seconds]
14:05:36ssss joins
14:06:22<ssss>How many warriors/containers should i run if i have a connection of 50mbits/10mbits?
14:07:30<imer>ssss: usually you're limited by the sites we're archiving, so it depends on that and what project you're running, some projects are more bandwidth hungry than others
14:08:56<qwertyasdfuiopghjkl>I disagree with the recent edits to https://wiki.archiveteam.org/index.php/Template:Rescued making it show only the earliest year of archival and hiding the years of later archival projects behind a "and more" link (that also just scrolls down to the list of categories, which seems a bit unintuitive). (see
14:08:56<qwertyasdfuiopghjkl>https://wiki.archiveteam.org/index.php/Memory_of_Mankind for an example) The date of the most recent archival would probably be more useful to show than the earliest one, and the infobox has enough room to show *all* the years, so collapsing the list isn't needed. (also, having a list of years without context is a bit ambiguous and this doesn't
14:08:57<qwertyasdfuiopghjkl>seem like something that really needs to be done in that/a template anyway?)
14:09:47<@JAA>No disagreement here. VoynichCR isn't currently here.
14:10:10<ssss>@imer i just go with "archiveteams choice". Is it in the magnitude of 1-5 oder 10-20 containers?
14:12:04<imer>ssss: if you're running each warrior with 6 concurrency (or whatever it's called in the ui), probably 1-2, might even have to go less than 6 for some projects (none I can think of at the moment, the upcoming orange one had pretty strict limits iirc)
14:12:48<ssss>nice to know, thanks
14:12:51<imer>just gut feeling of course, might have to tweak as things go :)
14:15:41<nstrom|m>generally the limit is not bandwidth but that the site being archived limits to a certain number of connections per IP address before blocking/throttling
14:16:17<nstrom|m>and that depends on the individual site/archiving project
14:31:43<h2ibot>VoynichCr edited Template:Rescued (-104, error): https://wiki.archiveteam.org/?diff=50874&oldid=50867
14:36:44<h2ibot>VoynichCr edited Memory of Mankind (+5, 2023): https://wiki.archiveteam.org/?diff=50875&oldid=50858
14:37:37VoynichCR (VoynichCR) joins
14:37:43<VoynichCR>hi
14:39:36<SketchCow>I see someone referred to me.
14:39:42<SketchCow>I .... am slow to respond here.
14:39:46<SketchCow>Just use jscott@archive.org
14:39:55<SketchCow>Or jesuschristmorearchiveteamcrap@textfiles.com
14:39:58<SketchCow>Both work equally.
14:42:04<Peroniko>Second one is much better sounding
14:46:09IDK (IDK) joins
14:53:10HP_Archivist quits [Ping timeout: 265 seconds]
14:59:27etnguyen03 quits [Ping timeout: 265 seconds]
15:05:21Island joins
15:05:59ssss quits [Remote host closed the connection]
15:21:00lukash9 quits [Quit: The Lounge - https://thelounge.chat]
15:34:17<fireonlive>🤨
15:40:35<anarcat>https://journalmetro.com/ is going bankrupt
15:40:50<anarcat>i'm going to try to salvage some of it
15:45:46<anarcat>oh dear
15:45:54<anarcat>https://journalmetro.com/sitemap.xml?yyyy=2015&mm=09&dd=26 looks pretty bad
15:48:45<@JAA>Did they wipe content, or are the old sitemaps just broken.
15:48:50<@JAA>s/\./?/
15:56:05<anarcat>i don't know
15:56:19<anarcat>that's partly why i was thinking of skipping those
15:56:46BearFortress quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]
15:57:04<@JAA>Right. Well, it went through them all by now, and it's good to have a record of this.
15:58:14<anarcat>yeah
15:58:51lukash9 joins
16:04:03<project10>hm, what is "Publi-sac"?
16:04:24<@JAA>VoynichCR:
16:04:24<@JAA>14:08:56 < qwertyasdfuiopghjkl> I disagree with the recent edits to https://wiki.archiveteam.org/index.php/Template:Rescued making it show only the earliest year of archival and hiding the years of later archival projects behind a "and more" link (that also just scrolls down to the list of categories, which seems a bit unintuitive). (see
16:04:29<@JAA>14:08:56 < qwertyasdfuiopghjkl> https://wiki.archiveteam.org/index.php/Memory_of_Mankind for an example) The date of the most recent archival would probably be more useful to show than the earliest one, and the infobox has enough room to show *all* the years, so collapsing the list isn't needed. (also, having a list of years without context is a bit ambiguous and this doesn't
16:04:35<@JAA>14:08:57 < qwertyasdfuiopghjkl> seem like something that really needs to be done in that/a template anyway?)
16:06:36<project10>oic, publisac = junk mail bomb
16:07:04<anarcat>yep
16:08:55<project10>"Je suis fier de notre longue histoire (+90 ans dans certains marchés)" -- damn, that's a shame
16:13:31<@JAA>So the Canucks forums... They have fairly tight rate limits and return AWS WAF captchas (HTTP 405) if you exceed them. There's no AAAA records on the forum.canucks.com. → canucks.ipsdns.com. → vancouver.nhl.invisionmanaged.net. CNAME chain, but it is in fact reachable over IPv6 since Invision's managed hosting supports IPv6.
16:18:49jacobgkau joins
16:26:54HP_Archivist (HP_Archivist) joins
16:40:36Dango360 (Dango360) joins
17:07:03gfhh joins
17:11:15magmaus3 (magmaus3) joins
17:15:41dumbgoy joins
17:23:18<@JAA>Looks like a lot of topics in those forums got wiped at some point and return a 'There are no posts to show' error now.
17:23:41<@JAA>See e.g. https://forum.canucks.com/forum/2-general-hockey-discussion/page/1018/
17:27:18shinji257 quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]
17:27:27shinji257 (shinji257) joins
17:28:00shinji257 quits [Remote host closed the connection]
17:30:22etnguyen03 (etnguyen03) joins
17:51:41Miki_57 joins
18:01:32nonplussed joins
18:02:46<nonplussed>anyone know what's going on with archive.today?
18:04:09<fireonlive>seems fine here; be sure you're not using 1.1.1.1 and family for your DNS resolver
18:05:27<nonplussed>i just use my ISP's default. would that also affect downforeveryoneorjustme.com? that site also says archive.today is down
18:05:47<@JAA>archive.{today,ph} seems fine.
18:07:55<nonplussed>okay and does this say it's up for you? https://downforeveryoneorjustme.com/archive.today
18:10:06<@JAA>No (and also that site is awful).
18:11:24<nonplussed>you have a better one? or one that says it's up?
18:13:08<@JAA>I tend to do my own checks, so no, don't know a better one.
18:13:30<nonplussed>what dns do you use, or recommend?
18:13:38<@JAA>I run my own recursive resolver.
18:14:07<@JAA>We use Quad9 for our projects. Specifically 9.9.9.10 and its other IPs.
18:14:48shinji257 (shinji257) joins
18:16:08<nonplussed>hmm, maybe i'll change to that then
18:16:57<nonplussed>seems odd to change though because of one nonworking site
18:25:20HP_Archivist quits [Ping timeout: 252 seconds]
18:35:19guest9234 joins
18:40:15guest9234 quits [Remote host closed the connection]
18:53:23nonplussed quits [Ping timeout: 265 seconds]
18:53:48nonplussed joins
19:11:13immibis quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]
19:15:19immibis joins
19:20:50nonplussed63 joins
19:21:30leo60228 quits [Quit: ZNC 1.8.2 - https://znc.in]
19:21:48leo60228 (leo60228) joins
19:24:19nonplussed quits [Ping timeout: 265 seconds]
19:34:01Lord_Nightmare quits [Quit: ZNC - http://znc.in]
19:37:42Lord_Nightmare (Lord_Nightmare) joins
19:47:17balrog quits [Quit: Bye]
19:47:29coolname5 joins
19:48:00magmaus3 quits [Ping timeout: 265 seconds]
19:48:28Webuser213 joins
19:48:38<@JAA>I'm running a qwarc retrieval of the Canucks forum topic pages. I randomly get banned every few minutes with no clear patterns, but it's working well otherwise and should finish in time.
19:50:24<@JAA>NB, I'm running this *slower* than AB, which isn't getting banned, so no idea...
19:51:24VoynichCR quits [Remote host closed the connection]
19:52:14Webuser213 leaves
19:55:44balrog (balrog) joins
20:20:10DogsRNice joins
20:23:46coolname5 quits [Read error: Connection reset by peer]
20:24:37coolname5 joins
20:27:45coolname5 quits [Remote host closed the connection]
20:27:57coolname5 joins
20:28:49coolname5 quits [Remote host closed the connection]
20:29:05etnguyen03 quits [Ping timeout: 252 seconds]
20:30:23coolname5 joins
20:33:47coolname5 quits [Remote host closed the connection]
20:34:48coolname5 joins
20:37:46coolname5 quits [Remote host closed the connection]
20:37:57coolname5 joins
20:41:47coolname5 quits [Remote host closed the connection]
20:41:59coolname5 joins
20:43:09coolname5 quits [Remote host closed the connection]
20:44:04coolname5 joins
20:48:09coolname5 quits [Remote host closed the connection]
20:48:17coolname5 joins
20:49:59aaa joins
20:51:39coolname5 quits [Remote host closed the connection]
20:52:54coolname5 joins
20:53:03aaa quits [Remote host closed the connection]
20:53:25aaa2 joins
20:53:35aaa2 leaves
20:56:33coolname5 quits [Remote host closed the connection]
20:56:41coolname5 joins
20:57:10aaa joins
20:59:48coolname5 quits [Remote host closed the connection]
20:59:58coolname5 joins
21:00:25ThetaDev quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]
21:00:34ThetaDev joins
21:01:25coolname5 quits [Remote host closed the connection]
21:01:35coolname5 joins
21:01:47aaa quits [Remote host closed the connection]
21:02:02bbb joins
21:02:22bbb quits [Remote host closed the connection]
21:02:29bb2 joins
21:07:35greg quits [Ping timeout: 252 seconds]
21:07:35coolname5 quits [Read error: Connection reset by peer]
21:07:44coolname5 joins
21:08:37bb2 leaves
21:08:48web joins
21:09:10web leaves
21:10:28coolname5 quits [Remote host closed the connection]
21:10:38coolname5 joins
21:12:16web joins
21:13:49coolname5 quits [Remote host closed the connection]
21:13:57coolname5 joins
21:14:28coolname5 quits [Read error: Connection reset by peer]
21:14:37coolname5 joins
21:14:54coolname5 quits [Read error: Connection reset by peer]
21:15:27greg joins
21:16:43coolname5 joins
21:17:11coolname5 quits [Read error: Connection reset by peer]
21:17:22coolname5 joins
21:17:34web leaves
21:20:19greg quits [Ping timeout: 265 seconds]
21:20:32coolname5 quits [Remote host closed the connection]
21:20:44coolname5 joins
21:20:57greg joins
21:23:26coolname5 quits [Remote host closed the connection]
21:23:37coolname5 joins
21:24:08aaa joins
21:26:28Peroniko quits [Client Quit]
21:28:07coolname5 quits [Remote host closed the connection]
21:28:19coolname5 joins
21:29:26coolname5 quits [Remote host closed the connection]
21:29:37coolname5 joins
21:29:49aaa leaves
21:29:58web joins
21:30:27web leaves
21:30:40web joins
21:30:50HP_Archivist (HP_Archivist) joins
21:31:26Peroniko joins
21:31:48coolname5 quits [Remote host closed the connection]
21:31:56coolname5 joins
21:32:10web quits [Remote host closed the connection]
21:32:15web2 joins
21:35:42coolname5 quits [Remote host closed the connection]
21:35:53coolname5 joins
21:35:55nonplussed63 is now known as nonplussed
21:37:03web2 leaves
21:37:18web joins
21:38:19<nonplussed>hey JAA, just curious, why did you say downforeveryoneorjustme.com is awful? have never had issues with it myself
21:40:00<fireonlive>personally, everything not directly 'can i access $site test' is all user-reported now which has a lot of false positives
21:40:07<fireonlive>e.g. can't reach facebook? comcast is down!
21:41:46<nonplussed>that site says it does it by "performing a server check from our servers"; i don't see anything about it being user-reported
21:42:40<fireonlive>https://downforeveryoneorjustme.com/crunchyroll for example: "A problem with Crunchyroll has been detected based on visitor reports"
21:42:50<fireonlive>but perhaps I was thinking about another site, as this UI looks a bit different
21:42:56coolname5 quits [Remote host closed the connection]
21:43:51coolname5 joins
21:45:32web quits [Remote host closed the connection]
21:45:45coolname5 quits [Remote host closed the connection]
21:47:32Barto quits [Read error: Connection reset by peer]
21:47:38Barto (Barto) joins
21:49:19<nonplussed>oh, weird, i guess it gives user reports when it's a major website. but for most websites i've checked, it's always done a live test
21:50:13etnguyen03 (etnguyen03) joins
22:06:19IRC2DC joins
22:12:02nonplussed quits [Ping timeout: 265 seconds]
22:14:16andrew quits [Quit: ]
22:16:53greg quits [Ping timeout: 252 seconds]
22:20:06magmaus3 (magmaus3) joins
22:20:53nonplussed joins
22:24:57magmaus3 quits [Client Quit]
22:31:21<@JAA>nonplussed: Apart from these user report inaccuracies, I see four different tracking/ads services' scripts, and the site uses JS for absolutely no reason. No thanks.
22:39:12<fireonlive>could be useful for us: https://twitter.com/iustinBB/status/1703785504670445780
22:39:12<eggdrop>nitter: https://nitter.net/iustinBB/status/1703785504670445780
22:39:23<fireonlive>also paypal has waaaaaaaaaaay too many domains. please stop.
22:39:59etnguyen03 quits [Ping timeout: 252 seconds]
22:40:28imer quits [Quit: Oh no]
22:41:07imer (imer) joins
22:43:01IRC2DC quits [Remote host closed the connection]
22:44:27IRC2DC joins
22:44:57IRC2DC quits [Remote host closed the connection]
22:45:04<@JAA>Neat
22:46:21<@JAA>Direct link for when Twitter dies: https://github.com/duckduckgo/tracker-radar 'Data set of top third party web domains with rich metadata about them'
22:46:28BlueMaxima joins
22:47:03<fireonlive>ah right.. RIP twitter :(
22:51:27IRC2DC joins
22:52:39<fireonlive>i wonder what 'DC' is in 'IRC2'
22:53:02magmaus37 (magmaus3) joins
22:55:10magmaus37 is now known as magmaus3
22:59:44magmaus3 quits [Client Quit]
23:00:05magmaus3 (magmaus3) joins
23:02:55andrew (andrew) joins
23:03:17efeafewa joins
23:05:56<nicolas17>https://twitter.com/mtruslowstorey/status/1703781076689121556
23:05:57<eggdrop>nitter: https://nitter.net/mtruslowstorey/status/1703781076689121556
23:08:06<fireonlive>microsoft access :(
23:09:29<@JAA>And they're throwing it into Google Data Studio, which will surely live on forever...
23:09:52<nicolas17>someone has to help her get an iso :/
23:10:08<@JAA>Yeah, that should be the first priority.
23:12:20<flashfire42>Its a pity there isnt a way to see how much space is available on targets or how many connections are being attempted at once to see how far down the queue I may be
23:12:37<@JAA>There is no queue.
23:12:40<fireonlive>i don't think the latter is possible
23:12:46etnguyen03 (etnguyen03) joins
23:13:56DogsRNice_ joins
23:14:37<fireonlive>it's a lovely thundering herd problem :3
23:14:48<fireonlive>or something.
23:16:48DogsRNice quits [Ping timeout: 265 seconds]
23:24:26xarph quits [Quit: ZNC 1.8.2+deb2build5 - https://znc.in]
23:24:36xarph joins
23:45:59etnguyen03 quits [Ping timeout: 252 seconds]
23:49:05Rootliam joins
23:52:12<Rootliam>I saw in the irc log jason scott said I should contact at him at "jscott⊙ao".... what exactly is ⊙ao
23:54:06<fireonlive>oh maybe your client messed it up?
23:54:37<fireonlive> jscott@archive.org
23:54:46<fireonlive>oh, maybe the logs site did
23:54:57<qwertyasdfuiopghjkl>https://hackint.logs.kiska.pw/archiveteam-bs/20230918#c381184 doesn't show the whole email address, try https://irclogs.archivete.am/archiveteam-bs/2023-09-18#l689f4802
23:55:23<fireonlive>the second email is....
23:55:26<fireonlive>not worth repeating
23:55:41nonplussed quits [Remote host closed the connection]
23:56:18<fireonlive>jason@textfiles.com exists on that domain, though.
23:57:07<fireonlive>ah yeah, or use qwertyasdfuiopghjkl's link