00:23:58 | | flotwig quits [Ping timeout: 260 seconds] |
00:27:47 | | endrift quits [Quit: +++CARRIER LOST+++] |
00:33:36 | | endrift joins |
00:55:59 | | robin quits [Quit: Ooops, wrong browser tab.] |
01:14:13 | | sec^nd quits [Remote host closed the connection] |
01:14:35 | | sec^nd (second) joins |
01:19:51 | | BlueMaxima quits [Read error: Connection reset by peer] |
01:20:45 | | etnguyen03 (etnguyen03) joins |
01:42:15 | | xkey quits [Quit: WeeChat 4.4.3] |
01:43:16 | | xkey (xkey) joins |
01:49:06 | | Webuser294463 joins |
01:50:41 | | Webuser294463 quits [Client Quit] |
02:04:54 | | i_have_n0_idea quits [Remote host closed the connection] |
02:55:04 | | DogsRNice joins |
02:58:20 | | Shyy joins |
03:01:28 | | moth_ quits [Ping timeout: 260 seconds] |
03:06:17 | | sec^nd quits [Ping timeout: 276 seconds] |
03:11:56 | | sec^nd (second) joins |
03:23:07 | | moth_ joins |
03:23:16 | | flotwig joins |
04:06:06 | | etnguyen03 quits [Client Quit] |
04:07:14 | | etnguyen03 (etnguyen03) joins |
04:25:09 | | DogsRNice quits [Read error: Connection reset by peer] |
04:34:27 | | phosphenes quits [Quit: Leaving] |
04:38:33 | | etnguyen03 quits [Remote host closed the connection] |
05:01:32 | | HP_Archivist (HP_Archivist) joins |
05:01:32 | | AlsoHP_Archivist quits [Read error: Connection reset by peer] |
05:01:36 | | Sidpatchy quits [Quit: The Lounge - https://thelounge.chat] |
05:09:29 | <h2ibot> | Wickedplayer494 edited Tucows Downloads (+26, Add data field): https://wiki.archiveteam.org/?diff=54209&oldid=46229 |
05:17:30 | <h2ibot> | Wickedplayer494 edited HQ Trivia (+66, HQ is dead as a doorknob): https://wiki.archiveteam.org/?diff=54210&oldid=53413 |
05:22:31 | <h2ibot> | Wickedplayer494 edited Computer Chronicles (+39, Said third-party is well-known malware…): https://wiki.archiveteam.org/?diff=54211&oldid=47819 |
05:24:28 | | sec^nd quits [Remote host closed the connection] |
05:24:29 | | SootBector quits [Remote host closed the connection] |
05:24:49 | | sec^nd (second) joins |
05:27:55 | | Sidpatchy (Sidpatchy) joins |
05:30:33 | <h2ibot> | Wickedplayer494 edited YTMND (+0, Capitalization fixes): https://wiki.archiveteam.org/?diff=54212&oldid=44146 |
05:30:35 | | Sidpatchy quits [Client Quit] |
05:43:35 | <h2ibot> | Wickedplayer494 edited Google+ (+1230, Currents, the business version, is also dead): https://wiki.archiveteam.org/?diff=54215&oldid=50727 |
06:00:38 | <h2ibot> | Wickedplayer494 created Google Currents (+21, Redirected page to [[Google+]]): https://wiki.archiveteam.org/?title=Google%20Currents |
06:03:28 | | lflare quits [Ping timeout: 260 seconds] |
06:46:38 | | IRC2DC quits [Ping timeout: 260 seconds] |
06:58:33 | | lflare (lflare) joins |
07:09:32 | | flotwig_ joins |
07:11:08 | | flotwig quits [Ping timeout: 260 seconds] |
07:11:08 | | flotwig_ is now known as flotwig |
08:25:07 | | SootBector (SootBector) joins |
08:46:51 | | lennier2 joins |
08:49:34 | | lennier2_ quits [Ping timeout: 250 seconds] |
09:22:36 | | Island quits [Read error: Connection reset by peer] |
09:55:15 | | Webuser906473 joins |
09:55:51 | <Webuser906473> | I don't see much on the wiki about the UK's Online Safety Act, which is causing lots of forums to close on the 16th of March. You only have one mention of it in Deathwatch regarding LFGSS. Microcosm are opening up their robot files to allow archiving, but I bet many other fora will close silently |
09:55:59 | <Webuser906473> | Do you have a team in place to monitor the situation? |
09:56:11 | <Webuser906473> | See https://www.lfgss.com/conversations/401475/ for the issues a small forum provider is facing |
10:06:15 | <katia> | Webuser906473: be the change you want to see and monitor the situation, we can archive small sites with archivebot |
10:07:03 | | Webuser342695 joins |
10:07:06 | | Webuser342695 quits [Client Quit] |
10:10:27 | <Webuser906473> | I am only loosely connected with ArchiveTeam, running a warrior. I don't plan to get more involved, just wanted to ensure the problem had visibility. |
10:15:12 | | lennier2_ joins |
10:17:58 | | lennier2 quits [Ping timeout: 250 seconds] |
10:24:57 | <szczot3k> | Webuser906473 it's not a lot of work. You could start by sending links to sites (or preferably - their shutdown/changes announcement) here, or in #archivebot, and someone will archive it. |
10:25:36 | <szczot3k> | s/will archive/will try to archive/ |
10:26:36 | <szczot3k> | We are constantly running things on AB, but scouting 100% of the internet, for those notices, is impossible. |
10:26:37 | | Webuser906473 quits [Client Quit] |
10:26:43 | <szczot3k> | And they're gone :) |
10:31:08 | <katia> | on this cursed day we are all only loosely connected with ArchiveTeam |
10:39:39 | | Mist8kenGAS (Mist8kenGAS) joins |
10:40:32 | <h2ibot> | Manu edited Mailman/2 (-9, /* started lists.cert.at */): https://wiki.archiveteam.org/?diff=54218&oldid=54001 |
10:54:01 | <nulldata> | lol - that being said we probably should start to compile a list of popular UK based forums to proactively throw in AB as capacity permits. |
11:07:23 | <nulldata> | https://pad.notkiska.pw/p/archivebot-ukforums |
11:42:03 | | loug83181421 joins |
11:44:08 | | loug8318142 quits [Ping timeout: 260 seconds] |
11:44:08 | | loug83181421 is now known as loug8318142 |
12:00:03 | | Bleo1826007227196234 quits [Quit: The Lounge - https://thelounge.chat] |
12:02:30 | | MrMcNuggets (MrMcNuggets) joins |
12:02:54 | | Bleo1826007227196234 joins |
12:43:21 | | SkilledAlpaca418962 quits [Quit: SkilledAlpaca418962] |
12:45:21 | | SkilledAlpaca418962 joins |
12:49:49 | | Sluggs quits [Excess Flood] |
12:52:22 | | Sluggs joins |
13:20:00 | | HP_Archivist quits [Read error: Connection reset by peer] |
13:20:12 | | HP_Archivist (HP_Archivist) joins |
13:53:41 | | SootBector quits [Ping timeout: 276 seconds] |
13:55:15 | | SootBector (SootBector) joins |
14:11:28 | | flotwig quits [Excess Flood] |
14:11:54 | | flotwig joins |
14:14:57 | | flotwig quits [Excess Flood] |
14:16:13 | | flotwig joins |
14:29:06 | | etnguyen03 (etnguyen03) joins |
15:02:24 | <h2ibot> | Nir edited Arhivach (+23, /* External links */): https://wiki.archiveteam.org/?diff=54220&oldid=53532 |
15:03:53 | | etnguyen03 quits [Client Quit] |
15:15:04 | | etnguyen03 (etnguyen03) joins |
15:37:18 | | graham9 joins |
15:50:26 | | etnguyen03 quits [Client Quit] |
16:18:40 | | ThreeHM quits [Quit: WeeChat 4.4.3] |
16:37:59 | | ThreeHM (ThreeHeadedMonkey) joins |
16:52:22 | | kokos- joins |
16:54:16 | | beastbg8_ quits [Read error: Connection reset by peer] |
16:57:47 | | etnguyen03 (etnguyen03) joins |
17:02:04 | | lexikiq joins |
17:07:24 | | etnguyen03 quits [Client Quit] |
17:10:19 | | Webuser319842 joins |
17:10:20 | | Webuser319842 quits [Client Quit] |
17:14:50 | | yarrow (yarrow) joins |
17:15:16 | | MrMcNuggets quits [Quit: WeeChat 4.3.2] |
17:19:08 | | yarrow quits [Client Quit] |
17:19:46 | | yarrow joins |
17:20:10 | | yarrow quits [Remote host closed the connection] |
17:20:47 | | yarrow (yarrow) joins |
17:22:26 | | yarrow quits [Client Quit] |
17:35:26 | | etnguyen03 (etnguyen03) joins |
17:39:18 | <Dango360> | nulldata: added .uk domains from Discourse/uncategorized to your list |
17:45:15 | | etnguyen03 quits [Client Quit] |
17:50:15 | | DogsRNice joins |
17:51:29 | | xkey quits [Quit: WeeChat 4.4.3] |
17:51:45 | | xkey (xkey) joins |
17:55:19 | | notarobot10 joins |
17:56:53 | | notarobot1 quits [Ping timeout: 260 seconds] |
17:56:53 | | notarobot10 is now known as notarobot1 |
17:58:59 | <nstrom|m> | From reading that article it seems like it should be any forum w users in the UK not just UK based ones |
18:02:40 | | etnguyen03 (etnguyen03) joins |
18:09:59 | <h2ibot> | Monika edited The WARC Ecosystem (+145, Add Zeno crawler): https://wiki.archiveteam.org/?diff=54221&oldid=53840 |
18:47:45 | <moth_> | *Probably* UK based ones should be prioritized. Forums outside of the UK are less likely to have heard of it/care, so are less likely to be planning to shut down. |
18:47:46 | <nulldata> | nstrom|m - perhaps, but the chances are much smaller that someone who owns a forum outside of the UK would bother following UK law. |
18:54:11 | | phoenix joins |
18:58:04 | | phoenix quits [Client Quit] |
18:58:47 | <steering> | from what ive seen most people outside of UK who are planning to comply with it are planning to do so by just blocking UK users :p |
19:01:57 | <nulldata> | Basically GDPR lol |
19:02:59 | | etnguyen03 quits [Client Quit] |
19:11:38 | | etnguyen03 (etnguyen03) joins |
19:12:00 | <that_lurker> | Anyone from here used Zeno? Does it have the ability to add live ignores? |
19:25:28 | | etnguyen03 quits [Client Quit] |
19:32:51 | | beastbg8 (beastbg8) joins |
19:42:26 | | Argonaut joins |
19:43:38 | | flotwig quits [Ping timeout: 260 seconds] |
19:45:58 | | moth_ quits [Ping timeout: 260 seconds] |
19:48:15 | <Argonaut> | Quick question regarding viewing WARC files. I'm using grab-site and successfully created the WARC.gz files, however when opening them with ReplayWeb.page, I get "No Pages are defined in this archive". Is there something I should be doing differently to get pages to appear (and not just the resources)? |
19:51:27 | | flotwig joins |
20:01:02 | | etnguyen03 (etnguyen03) joins |
20:02:29 | | flotwig quits [Excess Flood] |
20:04:55 | | flotwig joins |
20:07:22 | <TheTechRobo> | I believe Replayweb.page uses a format called WACZ to define "pages". Most WARC tools (ones that aren't made by Webrecorder, basically) don't support writing those. You can use the "Search" dropdown in the Resources tab to filter by HTML, which should show all the actual webpages (as opposed to page requisites, like scripts, that you probably don't care about). |
20:09:12 | <TheTechRobo> | Argonaut: ^ |
20:17:21 | | Deewiant quits [Remote host closed the connection] |
20:18:32 | | Deewiant (Deewiant) joins |
20:24:42 | <Argonaut> | TheTechRobo that makes sense; is there a good way to index many warc files at once? I've got a large list of URLs I'd like to feed into grab-site and I'm trying to figure out how to handle them once it's done. They're all disconnected from each other I suppose and there isn't a way to browse from one to another, correct? |
20:31:30 | | etnguyen03 quits [Client Quit] |
20:36:06 | | etnguyen03 (etnguyen03) joins |
20:41:23 | | Island joins |
21:02:23 | | BlueMaxima joins |
21:02:29 | | etnguyen03 quits [Client Quit] |
21:05:06 | | etnguyen03 (etnguyen03) joins |
21:06:50 | <TheTechRobo> | Argonaut: If they are all valid warc.gz files, you can literally just concatenate them all together. But that'll make replayweb.page pretty slow if it gets too big. |
21:08:25 | <TheTechRobo> | Unfortunately replayweb.page doesn't seem to be able to use a CDX file for indexing into multiple WARCs at once, otherwise I'd suggest that. You might be able to create a WACZ file with all the WARCs. There is also https://github.com/webrecorder/pywb, although I've never used that so I'm not sure how well it works. |
21:08:50 | <katia> | i'd like to know how to put grab-site output into pywb too but last time i tried i failed |
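The concatenation TheTechRobo describes at 21:06:50 can be sketched as below. This is a minimal illustration, not a tool anyone in the channel mentioned: the function name `concat_warcs` and the file names in the usage note are hypothetical, and it assumes every input is a valid warc.gz file, as the original message requires. It works because a gzip stream may consist of several members back to back, so appending the files byte for byte yields another readable warc.gz.

```python
import shutil


def concat_warcs(sources, dest):
    """Append each .warc.gz in `sources` to `dest`, byte for byte.

    Hypothetical helper: assumes every source is already a valid
    warc.gz file. Nothing is decompressed or re-indexed here.
    """
    with open(dest, "wb") as out:
        for src in sources:
            with open(src, "rb") as inp:
                # Stream in chunks so large WARCs are not loaded into memory.
                shutil.copyfileobj(inp, out)
    return dest
```

Usage would look like `concat_warcs(["site-a.warc.gz", "site-b.warc.gz"], "combined.warc.gz")`. As noted in the same message, a very large combined file is likely to make replayweb.page slow.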
21:32:50 | | Mist8kenGAS_ joins |
21:35:38 | | Mist8kenGAS quits [Ping timeout: 260 seconds] |
21:52:36 | | yano quits [Quit: WeeChat, the better IRC client, https://weechat.org/] |
21:56:32 | | yano (yano) joins |
22:09:15 | | BornOn420 quits [Remote host closed the connection] |
22:09:19 | | lexikiq quits [Quit: Leaving] |
22:09:54 | | BornOn420 (BornOn420) joins |
22:22:34 | | beastbg8 quits [Read error: Connection reset by peer] |
22:22:36 | | graham9 quits [Quit: The Lounge - https://thelounge.chat] |
22:25:01 | | beastbg8 (beastbg8) joins |
22:26:44 | <h2ibot> | OrIdow6 edited Cohost (+1468, Site is shut down, some updates to the grab…): https://wiki.archiveteam.org/?diff=54222&oldid=54206 |
22:30:45 | <h2ibot> | OrIdow6 edited Deathwatch (+64, Cohost is dead, first entry of 2025!): https://wiki.archiveteam.org/?diff=54223&oldid=54174 |
22:50:54 | | legoktm quits [Quit: http://quassel-irc.org - Chat comfortably. Anywhere.] |
22:51:31 | | legoktm joins |
23:05:54 | | Webuser801002 joins |
23:07:22 | <Webuser801002> | hello. could someone here add daddyanity.com (not its redirect) to the excluded from WBM page list? Thanks in advance. |
23:09:28 | <that_lurker> | done |
23:09:51 | <h2ibot> | That lurker edited List of websites excluded from the Wayback Machine (+89, add daddyanity.com): https://wiki.archiveteam.org/?diff=54224&oldid=54149 |
23:25:18 | | le0n_ quits [Ping timeout: 260 seconds] |
23:53:50 | | loug8318142 quits [Quit: The Lounge - https://thelounge.chat] |
23:59:43 | | bladem quits [Ping timeout: 260 seconds] |