00:29:16 | | Overlordz quits [Client Quit] |
00:33:50 | | jasons quits [Ping timeout: 240 seconds] |
01:10:00 | | bf_ joins |
01:21:57 | | MetaNova quits [Remote host closed the connection] |
01:23:42 | | MetaNova (MetaNova) joins |
01:34:06 | <nulldata> | Can someone please throw https://www.toysforbob.com/ into AB? Future looks bleak. https://www.sfchronicle.com/tech/article/tech-layoffs-microsoft-activision-blizzard-18651552.php |
01:37:34 | | jasons (jasons) joins |
01:38:52 | <pokechu22> | Done (also https://careers.toysforbob.com/) |
01:40:15 | <nulldata> | Thanks! |
01:42:03 | | eroc19904 quits [Client Quit] |
01:42:29 | | eroc1990 (eroc1990) joins |
02:01:28 | | hh joins |
02:03:00 | | hh quits [Remote host closed the connection] |
02:34:03 | | jasons quits [Ping timeout: 272 seconds] |
02:41:04 | | xarph joins |
03:20:53 | | BlueMaxima quits [Read error: Connection reset by peer] |
03:24:55 | | icedice quits [Client Quit] |
03:36:22 | <fireonlive> | -+rss- John Walker, founder of Autodesk, dead at 75: https://scanalyst.fourmilab.ch/t/john-walker-1949-2024/4305 https://news.ycombinator.com/item?id=39297185 |
03:36:47 | | jasons (jasons) joins |
03:36:54 | | bf_ quits [Remote host closed the connection] |
04:03:59 | | fishingforsoup__ joins |
04:07:50 | | fishingforsoup_ quits [Ping timeout: 240 seconds] |
04:36:17 | | jasons quits [Ping timeout: 272 seconds] |
04:43:59 | <pabs> | TIL SPN2 does result in a useful archive for Facebook pages :/ |
04:44:06 | <pabs> | does *not* |
04:45:12 | | Sophira quits [Quit: leaving] |
04:48:12 | | eroc1990 quits [Client Quit] |
04:48:57 | <pabs> | AB does seem to get some actual content (photos etc), and some of the captures do work, some don't |
04:52:07 | | line_ quits [Ping timeout: 272 seconds] |
04:53:55 | | eroc1990 (eroc1990) joins |
05:15:04 | <HP_Archivist> | Sometimes, if they're public pages and you don't have to log in, you can grab the direct links (hardlinks) for photos and then into the WBM they go. |
05:15:18 | <HP_Archivist> | Problem is, nobody will ever find the mess of a URL that they often are |
05:15:36 | <nicolas17> | the links expire |
05:15:44 | <HP_Archivist> | Oh really? |
05:15:46 | <nicolas17> | which means they change |
05:15:53 | <HP_Archivist> | Ugh, I didn't know that |
05:16:29 | <HP_Archivist> | Oh well. At least it captures images in that moment |
05:16:38 | <nicolas17> | which means if you saved a Facebook image URL last week and the image has now been deleted, searching for it on WBM may not work, because it was archived last month with a different auth thingy in the URL |
05:16:44 | <h2ibot> | Pokechu22 edited Jira (+392, https://jira.mariadb.org/; {{inprogress}}): https://wiki.archiveteam.org/?diff=51683&oldid=51681 |
05:17:13 | <HP_Archivist> | nicolas17: That was my point. Nobody will ever find the original URL |
05:18:10 | <@JAA> | Well, the non-expiring URLs still exist, they're just virtually impossible to find. |
05:18:28 | <HP_Archivist> | ^^ |
05:18:34 | <@JAA> | Oh, you're talking about just the images, no, right. |
05:18:46 | <@JAA> | I'm talking about the screen-sized post IDs. |
05:19:35 | <@JAA> | I've seen those break before, although the shorter IDs still exist but weren't linked anywhere. |
05:19:40 | <HP_Archivist> | Yeah, I meant the hard links to the pics. Not that window-in-window resize that FB does to load images (screen-sized post IDs, I guess) |
05:19:47 | <@JAA> | But this might be outdated again already since Facebook seems to change their shit every other day. |
05:20:32 | <@JAA> | 'screen-sized' as in 'the post ID in the URL is so long it would fill your whole screen', which they introduced sometime in the not-too-distant past. |
05:21:04 | <@JAA> | As opposed to the short-ish numeric IDs that have been used in the past. |
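A minimal sketch of the lookup problem with the expiring image links discussed above: two captures of the same image can differ only in the signed part of the URL, so an exact-URL search misses. The host, path, and parameter names below are hypothetical, and the sketch assumes the path stays stable while only the query string changes, which the chat does not confirm.

```python
from urllib.parse import urlsplit, urlunsplit


def strip_query(url: str) -> str:
    """Return the URL without its query string or fragment."""
    parts = urlsplit(url)
    return urlunsplit((parts.scheme, parts.netloc, parts.path, "", ""))


# Hypothetical URLs: same image, different expiring auth parameters.
old = "https://cdn.example.com/photos/12345_n.jpg?token=aaa&expires=1707000000"
new = "https://cdn.example.com/photos/12345_n.jpg?token=bbb&expires=1707600000"

print(old == new)                            # exact match fails
print(strip_query(old) == strip_query(new))  # match on the stable part
```

Comparing on the stripped form only helps if you already hold one version of the URL; it does not recover a URL that was never recorded.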
05:22:19 | <HP_Archivist> | Saved or not, I have a short but growing list of public pages (not people) that I am slowly WARC'ing with Webrecorder. It captures a lot, obviously not everything. But a good amount of pics/videos/comments, etc. |
05:23:42 | <@JAA> | Bad WARCs though :-/ |
05:24:16 | | evan quits [Remote host closed the connection] |
05:24:16 | | shreyasminocha quits [Read error: Connection reset by peer] |
05:24:16 | | c3manu quits [Remote host closed the connection] |
05:24:16 | | thehedgeh0g quits [Read error: Connection reset by peer] |
05:24:19 | | evan joins |
05:24:22 | | thehedgeh0g (mrHedgehog0) joins |
05:24:22 | | c3manu (c3manu) joins |
05:24:22 | | shreyasminocha (shreyasminocha) joins |
05:24:27 | <HP_Archivist> | Ah, well, true. But they're at least filed to warczone, JAA |
05:25:50 | <HP_Archivist> | Actually, pretty much what you did with Chromebot. Though I suspect CB might've been grabbing more than Webrecorder does/can |
05:26:36 | <@JAA> | crocoite also had extra flaws on top of what the webrecorder folks do. |
05:26:57 | | systwi quits [Ping timeout: 272 seconds] |
05:34:00 | <HP_Archivist> | Unrelated, I have a very long list of color science sites - color management software, profiling and calibration tech, technical pages, companies, services, products, blogs, and so on. There may be a few here/there unrelated. There are some that I've already submitted to AB. Most of these aren't at-risk. One or two are, and I've already thrown them in. Sharing here in case others are curious |
05:34:08 | <HP_Archivist> | https://transfer.archivete.am/XpV9R/color-science-profiling-management-URLS.txt |
05:34:08 | <eggdrop> | inline (for browser viewing): https://transfer.archivete.am/inline/XpV9R/color-science-profiling-management-URLS.txt |
05:35:37 | <HP_Archivist> | For the past ~3 months, I've just been adding to this list one after the other, and I'm now slowly working through it to see what has been previously crawled. |
05:39:14 | | jasons (jasons) joins |
05:43:34 | <HP_Archivist> | Oh, right. And a ton of it is related to, or ties in with, scanning and photography. I remember dpreview.com was grabbed last year. Not sure to what extent their forums, though. I've found forums on all these niche sites to be a goldmine for technical knowledge. |
05:45:00 | <@JAA> | Nice, and yeah, there are a lot of forums out there for very niche topics with a wealth of information. |
05:47:29 | <HP_Archivist> | :) |
06:10:00 | | systwi (systwi) joins |
06:32:13 | | cdreimanu (c3manu) joins |
06:34:50 | | jasons quits [Ping timeout: 240 seconds] |
06:44:47 | | kiryu quits [Client Quit] |
06:51:38 | | Island quits [Read error: Connection reset by peer] |
07:12:19 | | Chris5010 (Chris5010) joins |
07:14:03 | | cdreimanu quits [Remote host closed the connection] |
07:28:55 | | line joins |
07:38:20 | | jasons (jasons) joins |
08:02:27 | | Gereon5 (Gereon) joins |
08:03:25 | | Arcorann (Arcorann) joins |
08:04:20 | | Gereon quits [Ping timeout: 240 seconds] |
08:04:20 | | Gereon5 is now known as Gereon |
08:11:18 | | thehedgeh0g quits [Remote host closed the connection] |
08:11:18 | | c3manu quits [Remote host closed the connection] |
08:11:19 | | shreyasminocha quits [Remote host closed the connection] |
08:11:19 | | evan quits [Remote host closed the connection] |
08:11:49 | | evan joins |
08:11:52 | | thehedgeh0g (mrHedgehog0) joins |
08:11:52 | | c3manu (c3manu) joins |
08:11:53 | | shreyasminocha (shreyasminocha) joins |
08:11:54 | | c3manu quits [Remote host closed the connection] |
08:11:54 | | shreyasminocha quits [Remote host closed the connection] |
08:11:54 | | thehedgeh0g quits [Remote host closed the connection] |
08:11:55 | | evan quits [Remote host closed the connection] |
08:12:31 | | evan joins |
08:12:33 | | shreyasminocha (shreyasminocha) joins |
08:12:34 | | thehedgeh0g (mrHedgehog0) joins |
08:12:34 | | c3manu (c3manu) joins |
08:13:17 | | thehedgeh0g quits [Remote host closed the connection] |
08:14:04 | | shreyasminocha quits [Remote host closed the connection] |
08:14:12 | | c3manu quits [Remote host closed the connection] |
08:14:13 | | evan quits [Remote host closed the connection] |
08:20:21 | | evan joins |
08:20:23 | | c3manu (c3manu) joins |
08:20:24 | | thehedgeh0g (mrHedgehog0) joins |
08:20:24 | | shreyasminocha (shreyasminocha) joins |
08:28:25 | | parfait quits [Client Quit] |
08:33:20 | | jasons quits [Ping timeout: 240 seconds] |
09:28:53 | | icedice (icedice) joins |
09:36:50 | | jasons (jasons) joins |
09:51:50 | | bf_ joins |
09:53:59 | | pseudorizer quits [Quit: ZNC 1.8.2 - https://znc.in] |
09:56:42 | | pseudorizer (pseudorizer) joins |
10:17:01 | | f_ quits [Ping timeout: 272 seconds] |
10:36:50 | | jasons quits [Ping timeout: 240 seconds] |
11:09:08 | | f_ (funderscore) joins |
11:17:37 | | Wohlstand (Wohlstand) joins |
11:18:21 | | bf_ quits [Remote host closed the connection] |
11:25:28 | | Raya joins |
11:27:27 | | bf_ joins |
11:27:52 | <Raya> | Hello, sorry for the silly question. I've been letting the warrior run on the telegram project for a while and noticed it downloads a huge quantity of data compared to the quantity it uploads (right now it's at 4.1 GB downloaded for 349 MB uploaded). Why does that happen? Am I doing anything wrong? |
11:28:24 | | bf_ quits [Remote host closed the connection] |
11:40:24 | | jasons (jasons) joins |
11:41:09 | <datechnoman> | Raya - The warrior heavily compresses the downloaded files before they are uploaded to the targets. This is expected and quite normal :) |
11:43:38 | <Raya> | Pretty amazing!! I didn't think we could compress data that much :) thank you |
11:48:22 | | bf_ joins |
12:30:20 | | Raya quits [Ping timeout: 240 seconds] |
12:39:31 | | jasons quits [Ping timeout: 272 seconds] |
12:46:30 | | Raya joins |
12:50:55 | | Arcorann quits [Ping timeout: 272 seconds] |
13:10:14 | | Hackerpcs quits [Client Quit] |
13:15:13 | | bf__ joins |
13:15:16 | <TheTechRobo> | zstd is magic :-) |
13:42:22 | | jasons (jasons) joins |
13:46:30 | | bf__ quits [Remote host closed the connection] |
13:46:30 | | bf_ quits [Read error: Connection reset by peer] |
14:00:00 | <yzqzss> | zstd -22 --ultra --long=31 can even compress many MediaWiki XML dumps to 1% |
14:00:26 | <yzqzss> | magic |
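A rough illustration of why repetitive markup shrinks so much (Raya's numbers above work out to roughly 8% uploaded versus downloaded). This sketch uses `lzma` from the Python standard library as a stand-in, not zstd, and the XML records are made up for the example; it is not the warrior's actual pipeline, and real dumps will compress differently.

```python
import lzma

# Hypothetical stand-in for a dump: many records sharing the same markup.
records = [
    f"<page><id>{i}</id><title>Example page</title>"
    f"<revision><text>Lorem ipsum dolor sit amet</text></revision></page>\n"
    for i in range(20000)
]
raw = "".join(records).encode("utf-8")

# Compress at the highest standard preset and compare sizes.
packed = lzma.compress(raw, preset=9)
ratio = len(packed) / len(raw)

print(f"{len(raw)} bytes -> {len(packed)} bytes ({ratio:.2%} of original)")
```

The more the input repeats itself, the smaller the ratio; already-compressed media such as JPEG or video gains very little.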
14:08:20 | | Felce joins |
14:11:50 | | Raya quits [Ping timeout: 240 seconds] |
14:20:57 | | Felce quits [Client Quit] |
14:23:57 | | bf_ joins |
14:33:46 | | bf_ quits [Remote host closed the connection] |
14:38:20 | | jasons quits [Ping timeout: 240 seconds] |
14:46:14 | | bf_ joins |
14:48:07 | | icedice quits [Client Quit] |
14:48:40 | | bf__ joins |
14:50:37 | | magmaus3 quits [Ping timeout: 272 seconds] |
15:03:22 | | bf__ quits [Remote host closed the connection] |
15:03:23 | | bf_ quits [Remote host closed the connection] |
15:06:08 | | bf_ joins |
15:11:29 | | icedice (icedice) joins |
15:20:27 | | riku quits [Quit: WeeChat 4.2.1] |
15:21:31 | | DogsRNice joins |
15:30:04 | | kiryu joins |
15:30:04 | | kiryu is now authenticated as kiryu |
15:30:04 | | kiryu quits [Changing host] |
15:30:04 | | kiryu (kiryu) joins |
15:30:26 | | bf_ quits [Client Quit] |
15:31:28 | | bf_ joins |
15:32:30 | | kiryu quits [Remote host closed the connection] |
15:33:21 | | magmaus3 (magmaus3) joins |
15:34:03 | | kiryu joins |
15:34:03 | | kiryu is now authenticated as kiryu |
15:34:03 | | kiryu quits [Changing host] |
15:34:03 | | kiryu (kiryu) joins |
15:39:52 | | bf_ quits [Remote host closed the connection] |
15:41:56 | | jasons (jasons) joins |
15:46:28 | | bf_ joins |
15:57:47 | | bf_ quits [Remote host closed the connection] |
16:10:26 | | bf_ joins |
16:27:14 | | riku (riku) joins |
16:28:16 | | riku quits [Client Quit] |
16:28:19 | | riku (riku) joins |
16:39:59 | | riku quits [Client Quit] |
16:40:20 | | jasons quits [Ping timeout: 240 seconds] |
16:49:26 | | bf_ quits [Remote host closed the connection] |
16:51:28 | | riku (riku) joins |
17:06:35 | | toss (toss) joins |
17:44:05 | | jasons (jasons) joins |
17:56:22 | | ThreeHM quits [Quit: WeeChat 4.1.1] |
17:59:03 | | ThreeHM (ThreeHeadedMonkey) joins |
18:03:50 | | datechnoman quits [Ping timeout: 240 seconds] |
18:06:06 | | datechnoman (datechnoman) joins |
18:25:41 | | Chris5010 quits [Client Quit] |
18:35:03 | | Chris5010 (Chris5010) joins |
18:40:56 | | Gooshka (Gooshka) joins |
18:44:19 | | jasons quits [Ping timeout: 272 seconds] |
18:47:06 | <Gooshka> | chat.ru is another website host ( http://visluga.chat.ru/ , http://fractals.chat.ru/ ); many of its websites are inactive ( http://visluga.chat.ru/ ), and some of them redirect to new sites ( http://rocich.chat.ru/ -> http://rocich.ru/ , http://heraldica.chat.ru/ -> https://geraldika.ru/ ) |
18:58:10 | <Gooshka> | There are some bad signs: http://www.chat.ru/ shows the words "Этот домен продаётся. Подробности здесь.", which mean "This domain is for sale. Details here.", and it has been inactive since 2011. So they probably want to close it. |
19:03:17 | | toss quits [Client Quit] |
19:04:19 | | Gooshka quits [Remote host closed the connection] |
19:36:26 | | Gooshka (Gooshka) joins |
19:42:59 | | Gooshka quits [Ping timeout: 265 seconds] |
19:47:03 | | jasons (jasons) joins |
19:48:50 | | Wohlstand quits [Client Quit] |
19:51:28 | | Island joins |
19:56:42 | | adamus1red quits [Quit: SigTerm] |
19:58:52 | | adamus1red (adamus1red) joins |
20:02:17 | | SootBector quits [Remote host closed the connection] |
20:06:26 | | SootBector (SootBector) joins |
20:20:48 | | lizardexile joins |
20:32:27 | <pokechu22> | I've found 56 more JIRA instances (though some may be dead). This is going to take a while :| |
20:45:17 | | jasons quits [Ping timeout: 272 seconds] |
20:51:11 | | BlueMaxima joins |
20:54:44 | <pokechu22> | since we're saving the DBs for those anyways I can probably merge multiple into the same job |
21:10:56 | | BlueMaxima quits [Read error: Connection reset by peer] |
21:45:07 | <flashfire42> | Um is VBOX7 project supposed to be empty? |
21:46:24 | <fireonlive> | flashfire42: yeah, not started yet |
21:46:31 | <fireonlive> | -> #vboxxy |
21:48:12 | | jasons (jasons) joins |
22:20:29 | | Darken2 (Darken) joins |
22:23:50 | | Darken quits [Ping timeout: 240 seconds] |
22:46:53 | | jasons quits [Ping timeout: 272 seconds] |
23:11:32 | | driib quits [Quit: Ping timeout (120 seconds)] |
23:11:35 | | DigitalDragons quits [Quit: Ping timeout (120 seconds)] |
23:11:46 | | emberquill080 quits [Quit: Ping timeout (120 seconds)] |
23:11:54 | | DigitalDragons (DigitalDragons) joins |
23:13:11 | | emberquill080 (emberquill) joins |
23:14:19 | | driib (driib) joins |
23:15:23 | | JTL quits [Ping timeout: 272 seconds] |
23:16:11 | | JTL (JTL) joins |
23:19:07 | | Starlyte joins |
23:19:16 | | Starlyte quits [Remote host closed the connection] |
23:50:22 | | jasons (jasons) joins |