00:00:01 | <eggdrop> | [remind] thuban: run archivebot jobs for https://wiki.archiveteam.org/index.php/Political_parties/Georgia if they're not done yet |
00:00:42 | <pabs> | hmm, AB is rolling again?! |
00:04:31 | <thuban> | just a trickle, i understand |
00:05:47 | <thuban> | JAA: georgian elections are on the 26th; ok to start running relevant sites now or should i wait that "24-ish hours"? |
00:06:45 | <@JAA> | thuban: I think I'd wait. |
00:07:22 | <thuban> | ok! |
00:07:28 | <pabs> | ah, I read AB channel backlog, sorry for the noise |
00:07:30 | <@JAA> | Yes, AB is moving again, but most pipelines are still full or nearly full. Data is being moved away. |
00:08:11 | <@JAA> | Takes a bit to drain that backlog. |
00:08:20 | <@JAA> | Urgent things can be run though. |
00:08:22 | | seacow joins |
00:08:24 | <thuban> | are things starting to move on ia's end or did we just add an emergency target? |
00:08:26 | | etnguyen03 quits [Client Quit] |
00:08:51 | <@JAA> | The latter |
00:08:57 | <thuban> | ack |
00:09:01 | <thuban> | !remindme 24h run archivebot jobs for https://wiki.archiveteam.org/index.php/Political_parties/Georgia |
00:09:02 | <eggdrop> | [remind] ok, i'll remind you at 2024-10-24T00:09:01Z |
00:14:07 | | BlueMaxima quits [Read error: Connection reset by peer] |
00:33:35 | | bob joins |
00:34:46 | | bob quits [Client Quit] |
00:37:41 | | seacow quits [Client Quit] |
00:37:52 | | seacow joins |
00:51:06 | | etnguyen03 (etnguyen03) joins |
01:28:30 | | Hackerpcs quits [Quit: Hackerpcs] |
01:30:08 | | Hackerpcs (Hackerpcs) joins |
01:35:50 | | Hackerpcs quits [Ping timeout: 260 seconds] |
01:37:33 | | Hackerpcs (Hackerpcs) joins |
01:42:50 | | Hackerpcs quits [Ping timeout: 260 seconds] |
01:43:04 | | FartWithFury quits [Read error: Connection reset by peer] |
01:47:52 | | Hackerpcs (Hackerpcs) joins |
01:59:09 | | lennier2 joins |
02:02:05 | | lennier2_ quits [Ping timeout: 260 seconds] |
02:27:39 | | etnguyen03 quits [Client Quit] |
02:39:34 | | Island quits [Read error: Connection reset by peer] |
02:40:42 | | etnguyen03 (etnguyen03) joins |
02:42:32 | | Island joins |
02:50:29 | | etnguyen03 quits [Remote host closed the connection] |
03:37:49 | | wickedplayer494 quits [Read error: Connection reset by peer] |
03:39:13 | | wickedplayer494 joins |
03:39:27 | | wickedplayer494 is now authenticated as wickedplayer494 |
03:41:10 | | useretail quits [Quit: Leaving] |
03:56:46 | <pabs> | I think https://minecraft.wiki/ is using TLS fingerprinting, is anyone able to detect that? |
04:06:32 | <pokechu22> | If I use Firefox's "copy as curl" command and then run that it still works, so if they are doing TLS fingerprinting they don't exclude curl (compare that to some cloudflare sites where curl does fail). ... though even `curl https://minecraft.wiki/` works fine. And I think there are mods that directly interact with the wiki as well so it should be accessible from Java |
04:11:21 | <pabs> | thanks. huh, it isn't that |
04:15:17 | <pabs> | hmm, just UA, but with python requests the same UA fails |
04:23:00 | <@OrIdow6> | A straight get() with Requests works on my end |
04:23:04 | | DogsRNice joins |
04:23:50 | <@OrIdow6> | Could be a composite score |
04:24:06 | <steering> | or python/reqeusts versions |
04:24:08 | <steering> | but with speeling |
04:26:14 | <pabs> | hmm |
04:27:10 | <pabs> | huh, requests.get works, but with the browser UA it doesn't... |
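A minimal sketch of the comparison discussed above: send the same GET with different header sets and compare the status codes. It uses the standard library's `urllib.request` rather than Requests, so results may differ from what was reported in the channel. The names `fetch_status`, `probe`, and `VARIANTS`, and the example browser User-Agent string, are hypothetical. This only shows whether the User-Agent header changes the response; it does not detect TLS fingerprinting.

```python
import urllib.error
import urllib.request


def fetch_status(url, headers, timeout=10):
    """Return the HTTP status code for a GET sent with the given headers."""
    req = urllib.request.Request(url, headers=headers)
    try:
        with urllib.request.urlopen(req, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        # 4xx/5xx responses raise HTTPError; the code is still what we want.
        return err.code


def probe(url, variants, fetch=fetch_status):
    """Map each variant label to the status code its headers produce."""
    return {label: fetch(url, headers) for label, headers in variants.items()}


# Hypothetical header sets: urllib's default User-Agent vs. a browser-like one.
VARIANTS = {
    "default": {},
    "browser_ua": {
        "User-Agent": (
            "Mozilla/5.0 (X11; Linux x86_64; rv:131.0) "
            "Gecko/20100101 Firefox/131.0"
        )
    },
}
```

Calling `probe("https://minecraft.wiki/", VARIANTS)` would make live requests. If the two labels return different codes, the User-Agent is at least part of the filter. If the same User-Agent succeeds from one client and fails from another, something else (other headers, or the TLS handshake) is probably being scored too.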
04:51:00 | | useretail joins |
05:23:55 | | Sluggs quits [Ping timeout: 260 seconds] |
05:35:23 | | DogsRNice quits [Read error: Connection reset by peer] |
06:12:47 | | Snivy6 (Snivy) joins |
06:15:15 | | Snivy quits [Ping timeout: 260 seconds] |
06:15:15 | | Snivy6 is now known as Snivy |
06:31:33 | | sec^nd quits [Ping timeout: 240 seconds] |
06:38:01 | | sec^nd (second) joins |
06:41:25 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
06:55:48 | | pixel leaves [Error from remote client] |
06:56:48 | | loug831814 joins |
06:56:51 | | loug831814 quits [Remote host closed the connection] |
06:57:07 | | loug831814 joins |
07:03:07 | | JaffaCakes118_2 (JaffaCakes118) joins |
07:05:49 | | Unholy236192464537713 (Unholy2361) joins |
07:06:35 | | JaffaCakes118 quits [Ping timeout: 260 seconds] |
07:06:53 | <h2ibot> | Bzc6p edited TVN.hu (+577, three websites gone): https://wiki.archiveteam.org/?diff=53622&oldid=49355 |
07:20:59 | | @arkiver quits [Remote host closed the connection] |
07:21:24 | | arkiver (arkiver) joins |
07:21:24 | | @ChanServ sets mode: +o arkiver |
07:24:56 | <h2ibot> | Bzc6p created Template:Company/Central Médiacsoport (+752, Created page with "{| style="border: 1px solid…): https://wiki.archiveteam.org/?title=Template%3ACompany/Central%20M%C3%A9diacsoport |
07:24:57 | <h2ibot> | Bzc6p created Cafeblog.hu (+633, Created page with "{{Infobox project | title =…): https://wiki.archiveteam.org/?title=Cafeblog.hu |
07:34:58 | <h2ibot> | Bzc6p edited Atw.hu (+330, /* Site reconnaissance */ free hosting…): https://wiki.archiveteam.org/?diff=53625&oldid=45641 |
07:44:59 | <h2ibot> | Bzc6p edited Freeweb.hu (+463, /* Deletions */ update): https://wiki.archiveteam.org/?diff=53626&oldid=45632 |
08:06:13 | | @rewby quits [Ping timeout: 255 seconds] |
08:14:11 | | Sluggs joins |
08:18:05 | <h2ibot> | Bzc6p created Mindenkilapja.hu (+1644, Created page with "{{Infobox project | title =…): https://wiki.archiveteam.org/?title=Mindenkilapja.hu |
08:18:48 | | Radzig quits [Ping timeout: 258 seconds] |
08:30:07 | <h2ibot> | Bzc6p created SG Fórum (+547, Created page with "{{Infobox project | title =…): https://wiki.archiveteam.org/?title=SG%20F%C3%B3rum |
08:36:08 | <h2ibot> | Bzc6p edited Videok.hu (+64, Proclaimed dead.): https://wiki.archiveteam.org/?diff=53629&oldid=45763 |
08:41:56 | | vix5110_ joins |
08:47:09 | <h2ibot> | Bzc6p created Darkweb.hu (+562, Created page with "{{Infobox project | title =…): https://wiki.archiveteam.org/?title=Darkweb.hu |
08:47:12 | | Radzig joins |
08:57:11 | <h2ibot> | Bzc6p created Baratikor.com (+443, Created page with "{{Infobox project | title =…): https://wiki.archiveteam.org/?title=Baratikor.com |
09:08:13 | <h2ibot> | Bzc6p edited X3.hu (+32): https://wiki.archiveteam.org/?diff=53632&oldid=46204 |
09:08:14 | <h2ibot> | Bzc6p edited Tar.hu (+69): https://wiki.archiveteam.org/?diff=53633&oldid=45615 |
09:18:15 | <h2ibot> | Bzc6p created Napiszar.org (+407, Created page with "{{Infobox project | title =…): https://wiki.archiveteam.org/?title=Napiszar.org |
09:22:16 | <h2ibot> | Bzc6p edited Template:Hungarian websites (+63, update status of websites): https://wiki.archiveteam.org/?diff=53635&oldid=51935 |
09:32:14 | | bf_ joins |
09:40:54 | | rewby (rewby) joins |
09:40:54 | | @ChanServ sets mode: +o rewby |
09:41:45 | | katocala joins |
09:42:11 | | katocala is now authenticated as katocala |
10:16:21 | | Mist8kenGAS_ quits [Quit: Leaving] |
10:25:26 | | sralracer joins |
10:25:47 | | sralracer is now authenticated as sralracer |
10:37:25 | | Dango360_ (Dango360) joins |
10:40:40 | | _Dango360 quits [Ping timeout: 260 seconds] |
10:43:49 | | charlotte_ joins |
10:44:19 | | StarletCharlotte quits [Read error: Connection reset by peer] |
11:00:00 | | bf_ quits [Remote host closed the connection] |
11:00:02 | | Bleo18260072271962 quits [Quit: The Lounge - https://thelounge.chat] |
11:02:44 | | Bleo18260072271962 joins |
11:03:15 | | StarletCharlotte joins |
11:03:25 | | Unholy236192464537713 quits [Ping timeout: 260 seconds] |
11:04:39 | | miana quits [Quit: Connection closed for inactivity] |
11:05:45 | | charlotte_ quits [Ping timeout: 260 seconds] |
11:39:22 | | SkilledAlpaca418 quits [Remote host closed the connection] |
11:41:07 | | SkilledAlpaca418 joins |
12:31:39 | | vix5110_ quits [Client Quit] |
12:46:05 | | Stvkimension11 (Stvkimension11) joins |
12:48:11 | <hexa-> | yeah, it is not smart that way |
12:48:25 | <hexa-> | wrong room :D |
12:58:16 | | Stvkimension11 quits [Ping timeout: 255 seconds] |
13:30:02 | <h2ibot> | Bzc6p edited News+C/hu (-190, Cloudflare protection gone, we go on crunching): https://wiki.archiveteam.org/?diff=53639&oldid=53492 |
13:43:34 | | nstrom joins |
13:53:44 | | StarletCharlotte quits [Remote host closed the connection] |
14:00:48 | | JaffaCakes118_2 quits [Remote host closed the connection] |
14:01:03 | | JaffaCakes118_2 (JaffaCakes118) joins |
14:02:04 | | loug8318142 joins |
14:02:26 | | loug831814 quits [Read error: Connection reset by peer] |
14:02:26 | | loug8318142 is now known as loug831814 |
14:25:15 | | Commander001 quits [Ping timeout: 260 seconds] |
14:25:49 | | Commander001 joins |
14:31:05 | | midou quits [Ping timeout: 260 seconds] |
14:41:21 | | midou joins |
15:46:25 | | Commander001 quits [Read error: Connection reset by peer] |
15:46:37 | | Commander001 joins |
15:53:43 | | FartWithFury (FartWithFury) joins |
15:54:19 | | FartWithFury quits [Read error: Connection reset by peer] |
15:58:40 | | Juesto (Juest) joins |
16:01:51 | | Juest quits [Ping timeout: 258 seconds] |
16:01:52 | | Juesto is now known as Juest |
16:17:59 | | vix5110_ joins |
16:38:41 | | JaffaCakes118_2 quits [Remote host closed the connection] |
16:39:40 | | vix5110_ quits [Client Quit] |
16:43:16 | | Sluggs quits [Ping timeout: 258 seconds] |
16:43:17 | | JaffaCakes118 (JaffaCakes118) joins |
16:48:39 | | Sluggs joins |
17:04:05 | | Unholy236192464537713 (Unholy2361) joins |
17:21:10 | <angenieux> | What is the difference (in purpose?) between urlteam and urlteamwasright channel? |
17:22:49 | <imer> | angenieux: #urlteam is the urlteam project, #urlteamwasright is an upcoming separate project to archive goo.gl ones |
17:23:08 | <nicolas17> | is there room on telegram/urls targets again? I have stuff pending upload for 12 days |
17:23:44 | <imer> | #// has been slugging along, unsure about telegram |
17:24:05 | <nstrom> | urls has been working, albeit a bit choppily |
17:24:07 | <nstrom> | telegram still paused |
17:27:25 | <nicolas17> | I was trying to upload, not get new tasks |
17:27:31 | <nicolas17> | looks like telegram works too yay |
17:27:35 | <nicolas17> | I can finally get rid of these paused workers |
17:51:49 | <angenieux> | thanks imer, does that mean for the urlteamwasright project I need to run a different warrior container than the urlteam one? |
17:52:37 | <@JAA> | Yes, once the project is running, anyway. |
17:54:54 | <angenieux> | I see |
17:56:05 | <TheTechRobo> | Specifically, #urlteamwasright will be WARC, so it will work in the Wayback Machine, and it's a targeted crawl, not a brute-force. |
17:56:38 | <@JAA> | Well, still brute force, but yeah, not the long slog that is URLTeam. |
18:16:38 | | Commander001 quits [Remote host closed the connection] |
18:31:15 | | Commander001 joins |
18:40:10 | | Unholy236192464537713 quits [Ping timeout: 260 seconds] |
18:43:04 | | loug831814 quits [Quit: The Lounge - https://thelounge.chat] |
18:45:15 | | loug8318142 joins |
19:04:48 | | nstrom quits [Client Quit] |
19:07:15 | <nicolas17> | what's the default warrior project? |
19:07:50 | <@JAA> | https://wiki.archiveteam.org/index.php/CurrentWarriorProject |
19:23:06 | | loug8318142 quits [Read error: Connection reset by peer] |
19:23:07 | | loug8318142 joins |
19:30:13 | | hogchips quits [Ping timeout: 255 seconds] |
19:43:22 | | rappet quits [Quit: https://quassel-irc.org - Komfortabler Chat. Überall.] |
19:45:08 | | eightthree_ joins |
19:45:30 | | eightthree quits [Ping timeout: 260 seconds] |
19:46:27 | | rappet (rappet) joins |
19:50:04 | | eightthree_ is now known as eightthree |
20:06:02 | | ThreeHM quits [Ping timeout: 258 seconds] |
20:07:53 | | ThreeHM (ThreeHeadedMonkey) joins |
20:12:33 | | ThreeHM quits [Ping timeout: 258 seconds] |
20:14:22 | | ThreeHM (ThreeHeadedMonkey) joins |
20:27:21 | | knecht quits [Quit: knecht] |
20:34:15 | | knecht (knecht) joins |
20:38:38 | | ThreeHM quits [Ping timeout: 258 seconds] |
20:40:31 | | ThreeHM (ThreeHeadedMonkey) joins |
20:50:19 | | etnguyen03 (etnguyen03) joins |
20:51:44 | | BlueMaxima joins |
20:53:02 | | lennier2_ joins |
20:55:52 | | lennier2 quits [Ping timeout: 258 seconds] |
21:04:19 | | ThreeHM quits [Ping timeout: 258 seconds] |
21:12:01 | | knecht quits [Remote host closed the connection] |
21:12:10 | | knecht (knecht) joins |
21:39:56 | <TheTechRobo> | https://www.nytimes.com/2024/10/23/technology/characterai-lawsuit-teen-suicide.html , anything to archive? |
21:40:32 | <TheTechRobo> | really sad article :( |
21:41:27 | <TheTechRobo> | er, here is a non-login-walled version: https://archive.is/ZX3dI |
22:03:58 | | loug8318142 quits [Client Quit] |
22:27:40 | | magmaus3 quits [Ping timeout: 260 seconds] |
22:33:11 | | magmaus3 (magmaus3) joins |
22:56:26 | | sralracer quits [Client Quit] |
23:37:59 | | etnguyen03 quits [Client Quit] |
23:58:31 | | etnguyen03 (etnguyen03) joins |