00:01:01 | <h2ibot> | JAABot edited CurrentWarriorProject (+5): https://wiki.archiveteam.org/?diff=50996&oldid=50995 |
00:05:10 | | Wohlstand quits [Client Quit] |
00:09:02 | | etnguyen03 quits [Ping timeout: 252 seconds] |
00:21:22 | | etnguyen03 (etnguyen03) joins |
00:37:31 | | flashfire42 quits [Client Quit] |
00:41:13 | | icedice quits [Client Quit] |
00:47:06 | | Ryz263 (Ryz) joins |
00:47:07 | | s-crypt2 (s-crypt) joins |
00:47:16 | | flashfire42 joins |
00:47:20 | | kiska (kiska) joins |
00:48:18 | | etnguyen03 quits [Ping timeout: 265 seconds] |
01:05:33 | | etnguyen03 (etnguyen03) joins |
01:18:46 | <fireonlive> | Rite Aid files for bankruptcy amid opioid-related lawsuits and falling sales: https://www.cbsnews.com/news/rite-aid-bankruptcy-opioids-lawsuits/ |
01:23:50 | | etnguyen03 quits [Ping timeout: 252 seconds] |
01:26:00 | | DLoader quits [Ping timeout: 265 seconds] |
01:26:32 | | Megame quits [Client Quit] |
01:31:00 | | DLoader joins |
01:34:26 | | etnguyen03 (etnguyen03) joins |
02:11:29 | <project10> | fireonlive: yep I posted that one last night and pabs did the AB 🪄 |
02:12:05 | | project10 misses fuckedcompany.com |
02:22:40 | <fireonlive> | ah :D |
03:07:01 | | lennier2 quits [Ping timeout: 265 seconds] |
03:07:33 | | lennier2 joins |
03:21:57 | | nic9 quits [Quit: The Lounge - https://thelounge.chat] |
03:22:45 | | nic9 (nic) joins |
04:10:01 | | sec^nd quits [Ping timeout: 245 seconds] |
04:12:29 | | sec^nd (second) joins |
04:14:53 | | dumbgoy quits [Ping timeout: 252 seconds] |
04:34:41 | | AntiLiberal quits [Ping timeout: 252 seconds] |
04:41:01 | <audrooku|m> | How should I submit ~250 blog post urls to be archived for the WBM? should I use a script to submit them to the spn2 api? or is there a way to queue them for archiveteam? |
04:43:34 | <pokechu22> | They can be run via archivebot - archivebot doesn't evaluate javascript but that should be fine for blog posts |
04:43:49 | <pokechu22> | if you upload a list of URLs to https://transfer.archivete.am/ I can run it |
04:43:58 | <audrooku|m> | will do in a few hours, thanks |
04:44:01 | <pokechu22> | (or if they're all on the same site, I can just do a recursive archivebot job over the whole site) |
04:56:56 | | etnguyen03 quits [Client Quit] |
05:00:49 | <pabs> | audrooku|m: re SPN2, I find the email endpoint is better than the web one for sending lots of links, since the web API has relatively low limits. |
05:01:06 | <pabs> | but archivebot indeed is the best option |
05:02:59 | <audrooku|m> | I'm not familiar with this email endpoint |
05:04:07 | <pabs> | mail HTML or plain text to savepagenow@archive.org and it goes through SPN2, you will get one or more mails in response, batches of 100 URLs IIRC |
05:04:25 | <audrooku|m> | interesting |
05:04:38 | <pabs> | and if some of the URLs tempfail you can copy those into a new mail |
05:05:13 | <pabs> | I'd only use SPN2 for things that really need JS or otherwise don't work in archivebot tho |
05:05:27 | <pabs> | linkedin for eg does not like AB |
05:05:38 | <pabs> | but sometimes works in SPN2 |
05:06:41 | <audrooku|m> | noted |
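A minimal sketch of preparing the mails pabs describes above, assuming plain-text bodies with one URL per line. The address savepagenow@archive.org comes from the discussion; splitting the outgoing list into groups of 100 is an assumption made here (pabs only recalls the replies arriving in batches of 100), and the function names, subject line, and sender address are hypothetical. The code only builds the messages and does not send them.

```python
from email.message import EmailMessage

SPN_ADDRESS = "savepagenow@archive.org"  # address quoted in the chat
BATCH_SIZE = 100  # assumption: mirror the reply batch size pabs recalls


def chunk(urls, size=BATCH_SIZE):
    """Split a list of URLs into consecutive groups of at most `size`."""
    return [urls[i:i + size] for i in range(0, len(urls), size)]


def build_messages(urls, sender):
    """Build one plain-text mail per batch, one URL per line (hypothetical helper)."""
    messages = []
    for batch in chunk(urls):
        msg = EmailMessage()
        msg["From"] = sender
        msg["To"] = SPN_ADDRESS
        msg["Subject"] = "URLs to save"  # placeholder subject
        msg.set_content("\n".join(batch))
        messages.append(msg)
    return messages
```

URLs that tempfail in a reply could be collected into a new list and passed through `build_messages` again, matching the retry approach mentioned above.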
05:11:01 | | Island quits [Read error: Connection reset by peer] |
05:28:51 | <Ryz> | Oof, apparently Epic Games (former owner) laid off 50% of the staff of Bandcamp before being sold off: https://twitter.com/ethangach/status/1713970488257413600 |
05:28:52 | <eggdrop> | nitter: https://nitter.net/ethangach/status/1713970488257413600 |
05:30:11 | <audrooku|m> | 6mo severance tho |
05:39:46 | | DogsRNice quits [Read error: Connection reset by peer] |
06:38:23 | | datechnoman quits [Quit: The Lounge - https://thelounge.chat] |
06:38:58 | | datechnoman (datechnoman) joins |
06:54:24 | | Arcorann (Arcorann) joins |
06:59:59 | | treora quits [Ping timeout: 265 seconds] |
07:01:38 | | treora joins |
07:02:53 | | rawktucc quits [Ping timeout: 265 seconds] |
07:06:26 | | rktk (rktk) joins |
07:23:00 | <pabs> | https://www.lightbluetouchpaper.org/2023/10/16/hacktivism-in-ukraine-and-gaza/ |
07:23:14 | <pabs> | hmm, meant for -ot channel, oops |
07:25:28 | | xkey quits [Quit: WeeChat 3.8] |
07:33:01 | | xkey (xkey) joins |
07:33:05 | | xkey quits [Client Quit] |
07:33:44 | | xkey (xkey) joins |
07:38:06 | | BlueMaxima quits [Read error: Connection reset by peer] |
07:55:31 | <pabs> | https://blog.zarfhome.com/2023/10/microsoft-consumes-activision |
08:06:22 | | tttt quits [Remote host closed the connection] |
08:20:33 | | treora quits [Remote host closed the connection] |
08:20:44 | | treora joins |
08:20:50 | | xkey quits [Client Quit] |
08:21:51 | | xkey (xkey) joins |
08:21:55 | | sss joins |
08:56:09 | | neggles quits [Quit: bye friends - ZNC - https://znc.in] |
09:02:23 | | icedice (icedice) joins |
09:12:54 | | parfait quits [Ping timeout: 265 seconds] |
09:29:59 | <pabs> | https://www.nature.com/articles/d41586-023-03191-3 - Argentina to shut down their national science org |
09:31:11 | <pabs> | er, one candidate wants to |
10:18:49 | | sec^nd quits [Remote host closed the connection] |
10:19:11 | | sec^nd (second) joins |
10:37:29 | | yasom1 quits [Ping timeout: 265 seconds] |
10:39:08 | | yasomi (yasomi) joins |
10:41:19 | | neggles (neggles) joins |
11:24:10 | | bf_ joins |
11:41:59 | | VerifiedJ quits [Quit: The Lounge - https://thelounge.chat] |
11:42:27 | | VerifiedJ (VerifiedJ) joins |
11:53:39 | | BearFortress quits [Client Quit] |
12:19:00 | | shinji257 quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.] |
12:19:08 | | shinji257 (shinji257) joins |
12:19:17 | | BearFortress joins |
12:35:39 | | etnguyen03 (etnguyen03) joins |
12:37:50 | | Arcorann quits [Ping timeout: 265 seconds] |
12:49:08 | | etnguyen03 quits [Ping timeout: 252 seconds] |
13:21:56 | | eroc1990 quits [Quit: The Lounge - https://thelounge.chat] |
13:22:23 | | eroc1990 (eroc1990) joins |
13:41:38 | | treora quits [Remote host closed the connection] |
13:41:40 | | treora joins |
13:54:12 | | etnguyen03 (etnguyen03) joins |
15:09:31 | | Island joins |
15:23:44 | | DLoader_ joins |
15:26:02 | | DLoader quits [Ping timeout: 265 seconds] |
15:26:05 | | DLoader_ is now known as DLoader |
15:39:21 | | dumbgoy joins |
15:45:41 | | etnguyen03 quits [Ping timeout: 252 seconds] |
15:45:53 | | xkey quits [Client Quit] |
15:47:36 | | xkey (xkey) joins |
16:14:51 | | DogsRNice joins |
16:22:09 | | Megame (Megame) joins |
16:31:58 | <h2ibot> | Megame edited Deathwatch (+145, /* 2023 */ tensorboard.dev): https://wiki.archiveteam.org/?diff=50997&oldid=50991 |
16:31:59 | <h2ibot> | JustAnotherArchivist changed the user rights of User:Megame |
16:38:17 | <DogsRNice> | https://knockout.chat/thread/32805 |
16:38:24 | <DogsRNice> | a bunch of source engine map archives |
16:38:48 | <DogsRNice> | some of them are on archive.org but one person said they had issues with uploading them |
16:48:02 | | LeGoupil joins |
16:51:22 | | etnguyen03 (etnguyen03) joins |
17:01:35 | | etnguyen03 quits [Ping timeout: 252 seconds] |
17:08:01 | | pabs quits [Ping timeout: 265 seconds] |
17:21:18 | | pabs (pabs) joins |
17:36:38 | | etnguyen03 (etnguyen03) joins |
17:40:38 | | wessel1512 quits [Ping timeout: 252 seconds] |
17:40:41 | | wessel1512 joins |
17:45:44 | <pokechu22> | DogsRNice: I've started an archivebot job on https://ar.mevl2.duckdns.org/ which I believe is the main archive (https://maps.mevl2.duckdns.org/ links to it) |
18:01:59 | | Larsenv quits [Quit: The Lounge - https://thelounge.chat] |
18:03:23 | | Larsenv (Larsenv) joins |
18:05:13 | | Larsenv quits [Client Quit] |
18:06:24 | | Larsenv (Larsenv) joins |
18:06:41 | | Larsenv quits [Client Quit] |
18:10:19 | | Larsenv (Larsenv) joins |
18:10:39 | <Barto> | yeah, bandcamp is still one of those golden places of the internet |
18:12:24 | <Barto> | question, i know that !ao < https://transfer.archivete.am/... exists, does !a < https://transfer.archivete.am/... exist too? What are the quirks? Could it be used to save the main domain + all subdomains at once? |
18:13:28 | <@JAA> | It exists, but you have to be very careful about what you throw into there because it can break recursion in entertaining ways. |
18:13:43 | <@JAA> | (That's also why it isn't documented.) |
18:14:11 | <@JAA> | It might recurse over things you don't want, or it might miss things you'd expect it to grab. |
18:14:45 | <@JAA> | That depends on the contents of the initial list as well as timing. |
18:14:49 | <pokechu22> | It exists, but each URL tracks what URL it came from, which means that the notion of something being on site or offsite gets messy (especially when sites link to other subdomains; if a page on a subdomain is found by a different domain first, it'll be treated as offsite and things from that page won't be recursed over). Things get worse when you do multiple URLs on the same site, as the no-parent rule means that example.com/a/ linking to example.com/b/subpage will entirely skip example.com/b/subpage, even if that same link is discovered via example.com/b/ later |
18:16:01 | | sss quits [Remote host closed the connection] |
18:16:02 | | kiryu_ joins |
18:16:24 | <@JAA> | Basically, the only *safe* way of using it is to have a list of URLs that are all on the same host and which are all identical up to the last slash. |
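The condition JAA states above can be checked before submitting a list. This is a rough sketch under stated assumptions, not part of archivebot: the function names are hypothetical, "identical up to the last slash" is interpreted as same scheme, same host, and same directory part of the path, and query strings are ignored.

```python
from urllib.parse import urlsplit


def prefix_key(url):
    """Return (scheme, host, directory) for a URL; the directory ends at the last slash of the path."""
    parts = urlsplit(url)
    directory = parts.path.rsplit("/", 1)[0] + "/"
    return (parts.scheme, parts.netloc.lower(), directory)


def is_safe_seed_list(urls):
    """True if every URL shares one scheme, host, and directory (assumed reading of the rule)."""
    return len({prefix_key(u) for u in urls}) == 1
```

For example, `https://example.com/blog/a` and `https://example.com/blog/b` pass the check, while `https://example.com/a/` and `https://example.com/b/` do not, which is the case where the no-parent rule described above would skip pages.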
18:19:04 | | kiryu quits [Ping timeout: 265 seconds] |
18:25:11 | | etnguyen03 quits [Ping timeout: 252 seconds] |
18:30:08 | | lukash9 quits [Ping timeout: 252 seconds] |
18:34:02 | | lukash9 joins |
18:34:23 | | etnguyen03 (etnguyen03) joins |
18:51:18 | <Barto> | ok, as i understand my example is not the recommended way to do it. Indeed there's some weird behavior with it |
18:51:31 | <Barto> | thanks for the explanation |
18:55:10 | <pokechu22> | Yeah. We've still done it in the past for things where interlinking is unlikely to happen (mainly for ISP hosting with thousands of users) but it's not a good idea in most cases |
18:56:18 | <Barto> | i was especially seeing it as a way to "group" archiving jobs, so no way it's a good idea :D |
19:09:43 | | LeGoupil quits [Client Quit] |
19:20:47 | | systwi_ quits [Quit: systwi_] |
19:20:47 | | nothere quits [Quit: Leaving] |
19:36:40 | | Wohlstand (Wohlstand) joins |
19:53:11 | | nothere joins |
20:03:01 | | threedeeitguy39 quits [Quit: The Lounge - https://thelounge.chat] |
20:05:04 | | threedeeitguy39 (threedeeitguy) joins |
20:19:37 | | treora quits [Read error: Connection reset by peer] |
20:19:38 | | treora joins |
20:20:56 | <h2ibot> | Flashfire42 edited List of websites excluded from the Wayback Machine (+33): https://wiki.archiveteam.org/?diff=50998&oldid=50994 |
20:28:59 | | Megame quits [Client Quit] |
20:40:35 | | flashfire42 is now authenticated as flashfire42 |
20:53:42 | | sss joins |
21:01:05 | <h2ibot> | JAABot edited List of websites excluded from the Wayback Machine (+0): https://wiki.archiveteam.org/?diff=50999&oldid=50998 |
21:22:03 | | parfait (kdqep) joins |
21:38:47 | | BlueMaxima joins |
21:45:27 | | SF quits [Ping timeout: 265 seconds] |
21:55:12 | | jtagcat quits [Quit: Ping timeout (120 seconds)] |
21:55:33 | | jtagcat (jtagcat) joins |
21:58:03 | | SF joins |
22:01:58 | | abirkill- (abirkill) joins |
22:04:18 | | abirkill quits [Ping timeout: 265 seconds] |
22:04:18 | | abirkill- is now known as abirkill |
22:04:19 | | treora quits [Read error: Connection reset by peer] |
22:04:24 | | treora joins |
22:29:28 | | icedice2 (icedice) joins |
22:29:36 | | decky joins |
22:32:49 | | icedice quits [Ping timeout: 265 seconds] |
22:32:49 | | decky_e_ quits [Ping timeout: 265 seconds] |
22:43:41 | | etnguyen03 quits [Ping timeout: 252 seconds] |
22:58:00 | | etnguyen03 (etnguyen03) joins |
22:58:03 | | Island_ joins |
22:59:05 | | Island quits [Ping timeout: 252 seconds] |
23:05:12 | | yawkat quits [Ping timeout: 265 seconds] |
23:11:40 | | yawkat (yawkat) joins |
23:29:20 | | ats_ quits [Ping timeout: 252 seconds] |
23:34:30 | | ats (ats) joins |
23:40:26 | | icedice2 quits [Client Quit] |
23:40:29 | | etnguyen03 quits [Ping timeout: 265 seconds] |