00:31:50etnguyen03 quits [Ping timeout: 265 seconds]
00:37:37<Ryz>Hmm, I'm pondering on doing more proactive archiving on anything Blogspot related, I stumbled upon https://fieldsofhether.blogspot.com/2016/04/we-spend-way-too-much-time-digging.html during internet searching, but it doesn't exist anymore,
00:37:58<Ryz>https://web.archive.org/web/20181226201600/http://fieldsofhether.blogspot.com/2016/04/we-spend-way-too-much-time-digging.html exists - I was initially wondering if the website got cybersquatted or something, but nope, the website still holds up more or less
00:38:15<Ryz>The big trouble that made me gaze my eye a bit wider is at the bottom of the page in the website in general:
00:38:26<Ryz>"404 Errors & Missing Links"
00:38:29<Ryz>Followed by,
00:38:44<Ryz>"Please note that some of my posts are currently being removed by blogger. They are then reviewed, and put back - so the post may be there one day, missing the next, and back a week later. I have no control over this unfortunately. For all the latest free svgs, be sure to check out the facebook group for this page - and if you have any suggestions
00:38:44<Ryz>on other website hosts, I am currently looking at options."
00:40:44<Ryz>...So yeah, I'm not sure if those removals were automated or what, and the person had to try and retrieve it
00:40:48<fireonlive>:|
00:41:15<fireonlive>i could see google throwing some ill advised ML at 'spam detection'
00:41:58<flashfire42>Ryz so can Blogspot be hit as hard as tumblr?
00:42:59<Ryz>flashfire42, eeeeeh, I'm not too sure, I think the individual posts are fine, but the pagination navigation will suffer if hit too harshly S:
00:43:44<Ryz>Also, there's some funkyness on archiving Blogger profiles, as they're quite strict with that giving 429s for too much checking too many times
00:43:48<fireonlive>https://krebsonsecurity.com/ used to be blogspot... but i think that changed during the big ddosing, can't find a source article anymore
00:43:50<Ryz>...And other stuff that I found out
00:45:16<pabs>Barto: ISTR you often do company acquisitions archiving
00:46:52<pabs>superkuh, JAA: re mastodon, either append /embed to the URL to get plain HTML of the single post, or use zygolophodon in a terminal to get the thread https://github.com/jwilk/zygolophodon
00:47:58<pabs>re attachment content disposition, there is a browser extension that lets you override any content type/disposition for any request and set your own
00:51:07<@JAA>fireonlive: If true, it must've been much longer ago I think. On the big DDoS a few years ago (the one that was a record at the time), it was Akamai dropping him. But I believe he was already using a self-hosted Wordpress blog for years prior to that.
00:51:25<@JAA>pabs: Good to know re /embed, shame that it's only the single post though.
00:51:47<@JAA>And yeah, there are several extensions like that.
00:52:08<fireonlive>ahh
00:52:10<pabs>yeah, I usually do /embed to determine if I want to read the thread, then go zygolophodon in a terminal to read it
00:52:13<fireonlive>kk
00:52:20<fireonlive>my krabs timelines are super fuzzy
00:52:41<@JAA>zygolophodon looks interesting, hadn't seen it before, thanks!
00:58:43andrew (andrew) joins
00:58:52etnguyen03 (etnguyen03) joins
01:01:28<h2ibot>Flashfire42 edited URLTeam/Warrior (+54, /* Warrior projects */): https://wiki.archiveteam.org/?diff=50265&oldid=48605
01:06:33andrew0 (andrew) joins
01:09:03andrew quits [Ping timeout: 265 seconds]
01:09:03andrew0 is now known as andrew
01:11:30<h2ibot>Flashfire42 edited URLTeam/Warrior (+67, /* Warrior projects */): https://wiki.archiveteam.org/?diff=50266&oldid=50265
01:16:47etnguyen03 quits [Ping timeout: 265 seconds]
01:19:56lennier1 quits [Quit: Going offline, see ya! (www.adiirc.com)]
01:29:57andrew quits [Ping timeout: 258 seconds]
01:35:42AlsoHP_Archivist quits [Client Quit]
01:36:03HP_Archivist (HP_Archivist) joins
01:37:35andrew (andrew) joins
01:38:20beario quits [Ping timeout: 252 seconds]
01:39:21lennier1 (lennier1) joins
01:40:14<pabs>JAA: IIRC it uses the same APIs used by the JS frontend, because they work without being logged in
02:02:32AmAnd0A quits [Ping timeout: 252 seconds]
02:03:15AmAnd0A joins
02:05:28etnguyen03 (etnguyen03) joins
02:12:57<superkuh>pabs, thanks for the /embed tip. I'll try that. zygolophodon is okay but quite a hassle to leave the browser.
02:24:08andrew0 (andrew) joins
02:25:32AmAnd0A quits [Read error: Connection reset by peer]
02:25:51AmAnd0A joins
02:26:11andrew quits [Ping timeout: 252 seconds]
02:26:11andrew0 is now known as andrew
02:36:05andrew8 (andrew) joins
02:37:25andrew quits [Ping timeout: 258 seconds]
02:37:25andrew8 is now known as andrew
02:38:45asda joins
02:41:21xkey quits [Quit: xkey]
02:42:39asda quits [Remote host closed the connection]
02:44:53andrew quits [Ping timeout: 252 seconds]
02:48:09andrew (andrew) joins
02:50:43qwe joins
02:59:44dumbgoy__ quits [Ping timeout: 265 seconds]
03:04:42andrew quits [Client Quit]
03:05:04andrew (andrew) joins
03:09:53andrew5 (andrew) joins
03:12:18andrew quits [Ping timeout: 265 seconds]
03:12:18andrew5 is now known as andrew
03:19:29nicolas17 quits [Client Quit]
03:26:06etnguyen03 quits [Ping timeout: 258 seconds]
03:31:38andrew quits [Ping timeout: 252 seconds]
03:32:05etnguyen03 (etnguyen03) joins
03:39:10andrew (andrew) joins
04:13:59andrew quits [Ping timeout: 252 seconds]
04:17:25andrew (andrew) joins
04:17:37etnguyen03 quits [Client Quit]
04:23:56Dango360_ (Dango360) joins
04:25:46andrew0 (andrew) joins
04:26:38Matthww1 quits [Ping timeout: 252 seconds]
04:27:44andrew quits [Ping timeout: 252 seconds]
04:27:44andrew0 is now known as andrew
04:27:49Dango360 quits [Ping timeout: 258 seconds]
04:28:08Matthww1 joins
04:36:37andrew8 (andrew) joins
04:38:44andrew quits [Ping timeout: 252 seconds]
04:38:44andrew8 is now known as andrew
05:03:27Island_ quits [Read error: Connection reset by peer]
05:14:27andrew1 (andrew) joins
05:16:41andrew quits [Ping timeout: 252 seconds]
05:16:41andrew1 is now known as andrew
05:18:25systwi quits [Read error: Connection reset by peer]
05:19:10eroc1990 quits [Client Quit]
05:19:11systwi (systwi) joins
05:19:35eroc1990 (eroc1990) joins
05:37:45qwe quits [Remote host closed the connection]
05:43:49andrew2 (andrew) joins
05:46:00andrew quits [Ping timeout: 265 seconds]
05:49:17andrew (andrew) joins
05:49:51andrew2 quits [Ping timeout: 258 seconds]
06:04:11jacksonchen666 quits [Ping timeout: 245 seconds]
06:15:10andrew9 (andrew) joins
06:16:56andrew quits [Ping timeout: 265 seconds]
06:16:56andrew9 is now known as andrew
06:26:16andrew quits [Ping timeout: 258 seconds]
06:33:58andrew (andrew) joins
06:38:24<h2ibot>Entartet edited List of websites excluded from the Wayback Machine (+24, Added zainamro.com.): https://wiki.archiveteam.org/?diff=50267&oldid=50192
06:43:53hitgrr8 joins
06:44:00andrew3 (andrew) joins
06:46:25andrew quits [Ping timeout: 265 seconds]
06:46:25andrew3 is now known as andrew
06:50:00andrew quits [Client Quit]
06:50:24andrew (andrew) joins
06:59:10BlueMaxima quits [Read error: Connection reset by peer]
06:59:12andrew5 (andrew) joins
07:01:11andrew quits [Ping timeout: 252 seconds]
07:01:11andrew5 is now known as andrew
07:04:10xkey (xkey) joins
07:05:03Unholy236131 quits [Remote host closed the connection]
07:07:01Unholy236131 (Unholy2361) joins
07:28:49Arcorann (Arcorann) joins
08:00:36jacksonchen666 (jacksonchen666) joins
08:05:01jacksonchen666 quits [Ping timeout: 245 seconds]
08:10:21jacksonchen666 (jacksonchen666) joins
08:58:09Naruyoko5 quits [Read error: Connection reset by peer]
08:58:46Naruyoko joins
09:08:47Naruyoko quits [Ping timeout: 252 seconds]
09:12:38geezabiscuit quits [Client Quit]
09:26:13beario joins
09:42:37imer quits [Quit: Oh no]
09:43:12imer (imer) joins
09:45:38PredatorIWD quits [Read error: Connection reset by peer]
09:48:06AmAnd0A quits [Remote host closed the connection]
09:48:19AmAnd0A joins
09:50:24PredatorIWD joins
09:57:05beario quits [Remote host closed the connection]
09:57:05qwertyasdfuiopghjkl quits [Remote host closed the connection]
09:57:19beario joins
09:59:35qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
10:00:01railen69 quits [Remote host closed the connection]
10:00:17railen69 joins
10:02:00igloo22225 quits [Quit: The Lounge - https://thelounge.chat]
10:03:18igloo22225 (igloo22225) joins
10:05:30Shjosan quits [Quit: Am sleepy (-, – )…zzzZZZ]
10:06:18Shjosan (Shjosan) joins
10:54:51AmAnd0A quits [Ping timeout: 265 seconds]
10:57:49AmAnd0A joins
11:00:29AmAnd0A quits [Read error: Connection reset by peer]
11:01:40AmAnd0A joins
11:24:38Ryz quits [Ping timeout: 252 seconds]
11:24:38yts98 leaves
11:24:46yts98 joins
11:24:49IDK_ quits [Ping timeout: 265 seconds]
11:40:31AmAnd0A quits [Read error: Connection reset by peer]
11:41:34AmAnd0A joins
11:53:55IDK_ joins
11:54:20Ryz (Ryz) joins
12:04:52jacksonchen666 quits [Client Quit]
12:07:07Naruyoko joins
12:26:14monoxane quits [Ping timeout: 252 seconds]
12:29:29AmAnd0A quits [Read error: Connection reset by peer]
12:29:40AmAnd0A joins
12:35:48Ryz quits [Ping timeout: 258 seconds]
12:36:21IDK_ quits [Ping timeout: 265 seconds]
12:38:51IDK_ joins
12:39:12Ryz (Ryz) joins
12:45:30dumbgoy__ joins
12:48:12etnguyen03 (etnguyen03) joins
12:57:39AmAnd0A quits [Ping timeout: 258 seconds]
12:57:48AmAnd0A joins
13:00:19<pabs>is there a Vimeo project? manu|m said on #archivebot this person died https://vimeo.com/channels/suemarxfilms
13:16:13railen69 quits [Remote host closed the connection]
13:20:04railen63 joins
13:26:14geezabiscuit (geezabiscuit) joins
13:41:58Mattehari joins
13:50:51monoxane (monoxane) joins
13:54:47Arcorann quits [Ping timeout: 252 seconds]
13:54:48eroc1990 quits [Client Quit]
13:55:13knewt joins
13:58:16railen69 joins
13:58:18railen63 quits [Remote host closed the connection]
13:58:18beario quits [Remote host closed the connection]
13:58:18knewt quits [Remote host closed the connection]
13:58:23beario joins
14:03:02ShakespeareFan00 joins
14:03:12<ShakespeareFan00>Hi.
14:03:40<ShakespeareFan00>Has there been any progress on this :- https://wiki.archiveteam.org/index.php/Usenet ?
14:03:54<ShakespeareFan00>Google Groups is for most groups effectively unusable
14:04:31<ShakespeareFan00>And Google certainly has removed entire groups like uk.railway and comp.lang.oberon rather than actually clean out spam.
14:06:11<nstrom|m>that's so sad
14:10:39<ShakespeareFan00>I'm especially annoyed in respect of uk.railway, because I was flagging spam in that group in the hope that someone reasonable would actually tackle the issue.
14:10:57<ShakespeareFan00>(But this is the wrong forum for calling out Google's behaviour.)
14:11:33<ShakespeareFan00>Running an NTTP server to collate current postings to various NNTP groups is a technical feasibility.
14:11:49<ShakespeareFan00>That way 'new' content to those groups will not be lost.
14:12:25<ShakespeareFan00>However, there is the issue of archives Google and other servers clearly hold, but which can;t be accessed.
14:13:10Naruyoko quits [Ping timeout: 258 seconds]
14:13:21<ShakespeareFan00>BTW Is there an effort to 'archive' sites like Discord?
14:13:53<ShakespeareFan00>(Aside: I can retrive material I posted to Discord, but I can't retrieve the replies from others.)
14:15:03<nstrom|m>discord is very archival unfriendly, they seem to be trying really hard to keep their stuff a walled garden
14:18:50<ShakespeareFan00>Well , some of it's almost certainly GDPR...
14:19:02<ShakespeareFan00>Or equivalent...
14:20:16<ShakespeareFan00>One other suggestion I had for future archival work , is archiving the responses from Bing/OpenAI etc...
14:20:52<ShakespeareFan00>(Yes those responses are effectively random fictions right now, but it helps to have evidence of the mistakes they are making)
14:21:03<ShakespeareFan00>Not sure how you'd archive them though,
14:22:06<ShakespeareFan00>BTW I can't currently suggest things like Discord archival on the Wiki, as I had a disagreement with a wiki mod several years ago.
14:25:26BigBrain quits [Ping timeout: 245 seconds]
14:27:25BigBrain (bigbrain) joins
14:35:45ShakespeareFan00 quits [Remote host closed the connection]
15:27:16IDK (IDK) joins
15:37:16nicolas17 joins
15:46:10Dango360 (Dango360) joins
15:50:10Naruyoko joins
16:09:42lennier1 quits [Ping timeout: 258 seconds]
16:11:28lennier1 (lennier1) joins
16:25:42<albertlarsan68>FWIW, I am currently building an Edge extension to collect all interesting links (Imgur, Mediafire). Let me know if you want more URLs or IDs.
16:32:21eroc1990 (eroc1990) joins
16:38:01Mattehari quits [Ping timeout: 265 seconds]
16:47:07<albertlarsan68>I plan on submitting its findings, is there any problems with that?
16:48:06<@rewby>We have IRC bots you can use to submit imgur and mediafire links I think
16:55:30thenes (thenes) joins
16:58:38<fireonlive>we do
16:58:47<fireonlive>and submit away they take anything
16:59:13<fireonlive>only limits i’m aware of is #down-the-tube where’s there’s criteria (explained in the wiki, exceptions granted on case by case basis)
17:00:45<fireonlive>(dtt is for youtube)
17:26:32rageear quits [Ping timeout: 252 seconds]
17:32:35etnguyen03 quits [Ping timeout: 252 seconds]
17:33:42dumbgoy joins
17:33:42fuzzy8021 quits [Read error: Connection reset by peer]
17:33:57Icyelut (Icyelut) joins
17:34:15Icyelut|2 quits [Read error: Connection reset by peer]
17:35:50fuzzy8021 (fuzzy8021) joins
17:36:30dumbgoy__ quits [Ping timeout: 263 seconds]
18:07:01etnguyen03 (etnguyen03) joins
18:09:58<Barto>pabs: ah, that is true
18:10:17<Barto>gotta do it then :)
18:13:26<fireonlive>pabs: https://wiki.archiveteam.org/index.php/Vimeo saeems no
18:36:34<albertlarsan68>Great! Once I have around 100 matches, I will dump them. i use the (very wide) regexes in the wiki pages, so there may be many false positives. Hope it works, and that I will be useful!
18:38:42<fireonlive>:) no worries about fps, bot can filter those out
18:44:22VickoSaviour joins
18:44:49<VickoSaviour>did we collected all of the Wysp data?
18:51:54Megame (Megame) joins
18:52:51VickoSaviour29 joins
18:54:16VickoSaviour29 leaves
18:54:19VickoSaviour quits [Ping timeout: 265 seconds]
18:54:33<fireonlive>lol
19:20:56Dango360_ quits [Read error: Connection reset by peer]
19:21:02Dango360 quits [Client Quit]
19:21:14Dango360 (Dango360) joins
19:45:32AmAnd0A quits [Read error: Connection reset by peer]
19:45:46AmAnd0A joins
20:27:24rageear joins
20:35:11iCaotix quits [Read error: Connection reset by peer]
20:35:49iCaotix joins
20:41:50Island joins
21:17:20Dango360 quits [Read error: Connection reset by peer]
21:20:31Dango360 (Dango360) joins
21:28:10hitgrr8 quits [Client Quit]
21:28:10W7RFa6AbNFz quits [Read error: Connection reset by peer]
21:28:36W7RFa6AbNFz joins
21:54:56yasomi quits [Ping timeout: 252 seconds]
21:56:46yasomi (yasomi) joins
22:05:21ITMan joins
22:05:50ITMan leaves
22:18:12Megame quits [Client Quit]
22:27:23etnguyen03 quits [Ping timeout: 252 seconds]
23:12:08BlueMaxima joins
23:18:54etnguyen03 (etnguyen03) joins
23:32:43geezabiscuit quits [Read error: Connection reset by peer]
23:32:59geezabiscuit (geezabiscuit) joins
23:54:41AmAnd0A quits [Ping timeout: 258 seconds]
23:54:47AmAnd0A joins