00:17:51SM joins
00:26:03Arcorann (Arcorann) joins
00:44:35bonga quits [Ping timeout: 265 seconds]
00:45:10bonga joins
00:55:34Arcorann quits [Ping timeout: 265 seconds]
00:55:34bonga quits [Read error: Connection reset by peer]
00:55:41bonga joins
00:59:53chrismeller (chrismeller) joins
01:02:18Jake (Jake) joins
01:03:57wyatt8740 joins
01:06:07Megame quits [Client Quit]
01:16:26<TheTechRobo>Nevermind, I was an idiot lol
01:16:46<TheTechRobo>I forgot to go through the rule agreement of the server, so I couldn't click the role-giving buttons...lol
01:38:03pabs quits [Read error: Connection reset by peer]
01:41:04mrfooooo quits [Quit: Ping timeout (120 seconds)]
01:45:18TheTechRobo quits [Remote host closed the connection]
01:45:38TheTechRobo joins
01:57:06mrfooooo joins
02:02:36dm4v_ joins
02:04:20dm4v quits [Ping timeout: 265 seconds]
02:04:20dm4v_ is now known as dm4v
02:04:21dm4v quits [Changing host]
02:04:21dm4v (dm4v) joins
02:04:53<TheTechRobo>http://digitize.archiveteam.org/ is broken, just says "Hello archive team.org!"
02:07:23pabs (pabs) joins
02:08:40Arcorann (Arcorann) joins
02:11:05<LegitSi>We do a good job of archiving, so good that we forget to update stuff. At least, I think.
02:18:56<@JAA>Just started looking into that MediaFire folder for Vanced. Looks like all the file links are dead, unless I'm doing something wrong somehow.
02:19:09<@JAA>https://m.mediafire.com/773e97cz2ezx1
02:19:40<TheTechRobo>Yeah, same :/
02:31:36<TheTechRobo>Wait, hang on
02:31:56<TheTechRobo>Isn't there that GitHub API events archival project going on by another organisation?
02:32:23<TheTechRobo>If that's still around, isn't it likely that they would have gotten most if not all of the issues/comments/PRs?
02:32:42<TheTechRobo>(for vanced's repos that is)
02:32:51<TheTechRobo>Obviously not in WBM format, but is the data still saved?
02:33:12Arcorann quits [Ping timeout: 265 seconds]
02:33:30<@JAA>You're thinking of https://www.gharchive.org/ and the answer is a definite maybe.
02:34:11<TheTechRobo>I don't really want to parse though those hourly dumps though ^^"
02:34:25<@JAA>I was going to run the still-existing repos through #gitgud, but arkiver needs to fix something first there.
02:34:26<TheTechRobo>Although, it is available on BigQuery
02:35:41<TheTechRobo>Out of curiosity, what is socialscraper's new method of archiving the tweets you mentioned? (The one that prompted you to make the notweets igset.)
02:35:50<TheTechRobo>*socialbot
03:01:50daxxy (daxxy) joins
03:04:12Arcorann (Arcorann) joins
03:04:57bonga quits [Read error: Connection reset by peer]
03:05:09bonga joins
03:06:04daxxy_ quits [Ping timeout: 265 seconds]
03:09:35SM quits [Ping timeout: 265 seconds]
03:23:31march_happy quits [Remote host closed the connection]
03:23:40bonga quits [Read error: Connection reset by peer]
03:24:02bonga joins
03:25:58<@arkiver>JAA: fixed
03:29:35march_happy (march_happy) joins
03:32:53<@JAA>:-)
03:46:22march_happy quits [Remote host closed the connection]
03:52:01march_happy (march_happy) joins
04:03:41HackMii quits [Ping timeout: 252 seconds]
04:06:43HackMii (hacktheplanet) joins
04:30:44atphoenix_ (atphoenix) joins
04:31:06lennier2 joins
04:34:02lennier1 quits [Ping timeout: 265 seconds]
04:34:02atphoenix quits [Ping timeout: 265 seconds]
04:34:08lennier2 is now known as lennier1
04:36:16sonick quits [Client Quit]
05:04:57fuzzy8021 quits [Read error: Connection reset by peer]
05:07:49fuzzy8021 (fuzzy8021) joins
05:14:34fuzzy8021 quits [Killed (NickServ (GHOST command used by fuzzy802!~fuzzy8021@173-224-26-244.ptcnet.net))]
05:14:40fuzzy8021 (fuzzy8021) joins
05:20:41omni quits [Ping timeout: 252 seconds]
05:23:45omni joins
05:46:05Eighty quits [Quit: leaving]
05:54:15atphoenix_ is now known as atphoenix
06:18:46Barto quits [Ping timeout: 240 seconds]
06:19:01Barto (Barto) joins
06:22:32Ajay_m joins
06:22:36Ajay_m quits [Client Quit]
06:42:50bonga quits [Remote host closed the connection]
06:43:02bonga joins
07:16:55BPCZ quits [Ping timeout: 252 seconds]
09:44:39march_happy quits [Remote host closed the connection]
09:50:33march_happy (march_happy) joins
10:11:37lunik1 quits [Quit: Ping timeout (120 seconds)]
10:11:46lunik1 joins
10:20:28Webuser805 joins
10:24:06march_happy quits [Read error: Connection reset by peer]
10:24:21march_happy (march_happy) joins
10:30:54useretail__ joins
10:30:54useretail_ quits [Read error: Connection reset by peer]
10:40:03useretail__ quits [Client Quit]
10:50:12Niklink quits [Ping timeout: 265 seconds]
12:04:29Niklink joins
12:13:39Megame (Megame) joins
12:14:57qwertyasdfuiopghjkl is now known as qwertyasdfuiopghjkl_
12:15:08qwertyasdfuiopghjkl joins
12:16:01qwertyasdfuiopghjkl_ quits [Client Quit]
12:18:15sonick (sonick) joins
12:37:49lunik1 quits [Client Quit]
12:37:57lunik1 joins
13:12:39Arcorann quits [Ping timeout: 265 seconds]
13:26:21march_happy quits [Read error: Connection reset by peer]
13:27:02march_happy (march_happy) joins
13:37:36<sonick>Did anyone recently mention WEBCROW and "Star Server Free", a Japanese web hosting service that will be shutting down at the end of the month?
13:44:57<TheTechRobo>It's on Deathwatch, I think
13:45:12<TheTechRobo>sonick: http://wiki.archiveteam.org/index.php/Deathwatch
13:46:34TheTechRobo quits [Remote host closed the connection]
13:47:13TheTechRobo joins
14:13:48lunik1 quits [Client Quit]
14:13:55lunik1 joins
14:51:52<sonick>I see.
14:56:21<sonick>When will we start capturing the data? It may be difficult to process a large number of requests in a short time, because these services consist of CGI, such as Wordpress.
14:58:48<@arkiver>users have collected some lists of sites i believe
14:59:03<@arkiver>if it's not a huge amount it can probably be done with archivebot
15:01:53bonga quits [Ping timeout: 265 seconds]
15:02:33bonga joins
15:03:07lunik1 quits [Read error: Connection reset by peer]
15:03:09<sonick>I understood. Thanks.
15:21:42bonga quits [Ping timeout: 265 seconds]
15:21:57Minkafighter quits [Remote host closed the connection]
15:22:29bonga joins
15:22:30Minkafighter joins
15:26:12lunik1 joins
15:59:03chrismeller quits [Ping timeout: 265 seconds]
16:08:45march_happy quits [Read error: Connection reset by peer]
16:08:57march_happy (march_happy) joins
16:41:06qwertyasdfuiopghjkl quits [Remote host closed the connection]
16:43:48coderobe quits [Remote host closed the connection]
16:45:22LeGoupil joins
17:07:12march_happy quits [Ping timeout: 265 seconds]
17:14:54immibis_ joins
17:17:42immibis quits [Ping timeout: 265 seconds]
17:20:44Webuser805 quits [Ping timeout: 265 seconds]
17:36:46cm quits [Ping timeout: 240 seconds]
17:36:47cm joins
18:01:43immibis_ quits [Remote host closed the connection]
18:01:53immibis_ joins
18:02:12yano quits [Quit: WeeChat, the better IRC client, https://weechat.org/]
18:03:46coderobe (coderobe) joins
18:06:06yano (yano) joins
18:17:45<h2ibot>Malibo edited Vendor-hosted forums (+102, typo; archive link): https://wiki.archiveteam.org/?diff=48361&oldid=48330
18:17:46<h2ibot>Themadprogramer edited Discourse (+59, Added Cloudflare Community): https://wiki.archiveteam.org/?diff=48362&oldid=48235
18:17:47<h2ibot>Pcr created Russia (+160, Create Russia): https://wiki.archiveteam.org/?title=Russia
18:17:48<h2ibot>Pcr created Belarus (+164, Create Belarus): https://wiki.archiveteam.org/?title=Belarus
18:17:49<h2ibot>Usernam edited Megalodon.jp (+536): https://wiki.archiveteam.org/?diff=48365&oldid=48248
18:24:52immibis_ quits [Remote host closed the connection]
18:24:56immibis_ joins
18:29:42<tech234a>https://www.theverge.com/2022/3/15/22979126/vimeo-patreon-creators-price-increase
18:32:48<h2ibot>Arkiver uploaded File:Telegram-icon.png: https://wiki.archiveteam.org/?title=File%3ATelegram-icon.png
18:35:10immibis_ quits [Remote host closed the connection]
18:35:41immibis_ joins
18:38:04bonga quits [Ping timeout: 265 seconds]
18:39:22<TheTechRobo>Out of curiosity, what is socialbot's new method of archiving the tweets you mentioned? (The one that prompted you to make the notweets igset.)
18:52:13immibis_ quits [Remote host closed the connection]
19:16:15Niklink quits [Ping timeout: 265 seconds]
19:24:20Barto quits [Remote host closed the connection]
19:25:12Barto (Barto) joins
19:25:32user90100 joins
19:28:55binzyboi joins
19:33:37<binzyboi>While the site hasn't stated they're closing, Buzzly.art recently had a poll that's having a significant amount of users either leaving, or debating on leaving
19:34:56Niklink joins
19:34:58BPCZ (BPCZ) joins
19:35:00<binzyboi>https://buzzly.art/feed the feed has just about entirely been people making blog posts about the poll and announcements of leaving or debating leaving
19:37:28<appledash>Wow, I can't believe it's 2022 and people still get mad about what people get off to
19:37:58<binzyboi>I don't understand that portion of it either
19:52:25<user90100>i can do an emergency scrape if you're not doing so already
19:53:24<user90100>time is of the essence; if the artists there believe that the owner will steal their art for NFTs, they'll likely wipe their art from the site before that can happen
19:54:46<user90100>nvm, registration's closed :/ you could still do it but i'm not sure if that site has content that requires registration to view
19:58:41<binzyboi>Some content requires registration, I have a registered account
19:59:01<binzyboi>Basically it's mostly stuff that's 18+ in rating that's not viewable without an account afaik
19:59:42<appledash>Yeah, please do scrape if possible
20:00:33<binzyboi>How would I do so? This would likely be my first scrape I've done
20:01:16<user90100>Install wget on Windows, then follow the instructions here: https://wiki.archiveteam.org/index.php/Wget
20:01:35<binzyboi>I use Ubuntu
20:01:42<user90100>perfect, it's already installed
20:01:47<binzyboi>Wait really?
20:01:50<binzyboi>That's dope
20:02:08<@JAA>I don't think this site can be crawled that easily. JS, GraphQL, etc.
20:02:49<appledash>Ugh, yeah, it won't work without JS
20:03:39<binzyboi>What makes those get in the way of scraping?
20:07:53<appledash>wget and most other command line downloaders don't execute JavaScript; the page itself is likely built using JS, and the actual URLs for images are likely generated or fetched by the JS as well
20:08:15<appledash>So it becomes a more complicated problem because you have to figure out how the JS is getting that data and then write your own code to do the same thing
20:09:09<binzyboi>Fun, just another reason to learn coding then.
20:15:34<@OrIdow6>Is there much on this website that doesn't require an account?
20:16:40<binzyboi>Most of the content doesn't require an account, is just any of the rated content that requires one.
20:16:55<binzyboi>Which for this site, doesn't seem to be a whole lot from what I've seen
20:18:55<@OrIdow6>Do you have any numbers on how many people are leaving?
20:19:00<@OrIdow6>What proportion
20:20:38<binzyboi>Uhhhh, I can try to get list going of users either leaving or debating leaving.
20:21:38Eighty (Eighty) joins
20:22:00<@JAA>Just a rough guess is sufficient for this. Is it a small loud minority or a significant fraction of the whole user base complaining?
20:23:39<binzyboi>Seems to be a significant fraction from what I see. Could just be my neck of the woods on the site though tbf. I will say that the last 25 pages of the feed are either people mentioning their distaste of said poll, mentioning leaving or debating leaving, or at the very least sharing their socials
20:24:29<binzyboi>And the feed is global, not from accounts you follow either
20:27:08<@OrIdow6>How are you seeing that significant fraction?
20:27:32nyuuzyou (nyuuzyou) joins
20:28:01<binzyboi>https://buzzly.art/feed/ plus my notifications from accounts I follow
20:29:43user90100 quits [Remote host closed the connection]
20:29:56name10010 joins
20:30:20<name10010>i'm gonna try headless chrome, and i'll report back if i can get something up and running. As a last resort i'll run octoparse on my machine
20:30:26<nyuuzyou>Hi all. A large Russian short video service Coub (Alexa rating 7650) is closing on April 1, I asked about it in #// and was told that it is better to write here.
20:30:30<nyuuzyou>I looked on main archiveteam wiki page and found no mention about it. The notice of closure is on the website + https://dtf.ru/life/1119397-servis-korotkih-video-coub-zakroetsya-1-aprelya-2022-goda
20:30:52<nyuuzyou>Are there any plans for this?
20:32:06<@JAA>I enjoy the irony that one of the last posts on their blog is 'Save the Vine community with Coub'.
20:36:11<h2ibot>JustAnotherArchivist edited Deathwatch (+42, /* 2022 */ Add Coub): https://wiki.archiveteam.org/?diff=48367&oldid=48356
20:37:33name10010 quits [Remote host closed the connection]
20:40:36immibis_ joins
20:44:23Ruthalas1 (Ruthalas) joins
20:46:01Ruthalas quits [Ping timeout: 265 seconds]
20:46:01Ruthalas1 is now known as Ruthalas
20:59:56immibis_ is now known as immibis
21:07:01bonga joins
21:11:46bonga quits [Ping timeout: 265 seconds]
21:38:14immibis quits [Remote host closed the connection]
21:38:31immibis (immibis) joins
21:40:24<@JAA>Does anyone recognise what kind of listing server this is and how its pagination works? https://coub-anubis-a.akamaized.net/coub_storage/
21:40:34<@JAA>It's not S3 or Azure.
21:42:23<appledash>I mean, it appears to be Akamai?
21:43:13<@JAA>I mostly know Akamai as a CDN, but yeah, appears they have a cloud storage offer as well.
21:46:45wyatt8740 quits [Remote host closed the connection]
21:47:55nimaje quits [Quit: WeeChat 3.4]
21:49:41<@JAA>Oh boy, the HTTP headers there indicate that the container has 793.6 TB of data in 473M objects. Fun.
21:49:43nimaje joins
21:50:08immibis quits [Read error: Connection reset by peer]
21:50:12immibis_ (immibis) joins
21:50:28wyatt8740 joins
21:51:10march_happy (march_happy) joins
21:52:20<@arkiver>nyuuzyou: thank you, looking into it
21:53:12<@arkiver>nyuuzyou: will you be sticking around in case of questions? (are you a russian speaker?)
21:53:38<@arkiver>JAA: certainly fun hah
21:54:00<@JAA>FWIW, Coub is entirely available in English and based in NYC these days.
21:55:03<nyuuzyou>arkiver: I have not used this service, but yes, I speak Russian
21:55:17<@arkiver>nyuuzyou: i assumed it was a russian service
21:55:23<@arkiver>so they have sequential identifiers
21:55:28<@arkiver>discovering everything is not a problem
21:55:55<@JAA>Not entirely sequential, but close enough, yeah. There are numeric and alphanumeric IDs for each 'coub'.
21:56:06<@arkiver>hmm yeah /view/
21:56:09<@arkiver>still seems somewhat sequential
21:56:21<@JAA>I didn't manage to figure out the mapping between those two. It's not a simple alphabet change it seems.
21:56:25BlueMaxima joins
21:56:44<@JAA>E.g. 183891963 = 315b8o
21:57:26<@arkiver>they have different sizes videos
21:57:43<@arkiver>so that'll help reduce the ~800 TB
21:57:46<@JAA>Yes, and separate files for video and audio.
21:58:28<@JAA>At least that's what it looks like in the API, didn't actually try it.
21:58:51<@JAA>The API is entirely open: https://coub.com/api/v2/coubs/183891963 = https://coub.com/api/v2/coubs/315b8o
21:59:05<@arkiver>yep
21:59:11<@arkiver>this'll be fun
21:59:22<@arkiver>16 days left :P
21:59:26<@arkiver>lets make a channel!
21:59:40<@JAA>coup?
21:59:58<@arkiver>yes
22:00:02<@arkiver>#coup
22:03:44<Barto>couperet? :p
22:05:27<Barto>damn, it's a b in fact. well coup it is
22:06:00<@arkiver>Barto: you're late to the naming party :P
22:06:06<Barto>;)
22:10:16immibis_ quits [Remote host closed the connection]
22:10:25immibis joins
22:12:28<h2ibot>JustAnotherArchivist created Coub (+212, Created page with "{{Infobox project | URL =…): https://wiki.archiveteam.org/?title=Coub
22:12:29<h2ibot>JustAnotherArchivist edited Deathwatch (-22, /* 2022 */ Link to Coub page): https://wiki.archiveteam.org/?diff=48369&oldid=48367
22:15:17immibis quits [Read error: Connection reset by peer]
22:15:22immibis (immibis) joins
22:15:55immibis quits [Remote host closed the connection]
22:17:12immibis (immibis) joins
22:17:29<@arkiver>anyone have ideas for a vkontakte channel?
22:17:34LeGoupil quits [Client Quit]
22:17:36<@arkiver>Barto: ^
22:17:38<@arkiver>:)
22:19:02Barto thinks... slowly
22:19:14<Barto>lossofkontakt ?
22:19:43<@JAA>lastkontakt
22:20:12<Barto>kontaktless (payment joke)
22:21:36<datechnoman>Sounds like they want all of their data copied :P
22:21:44<@JAA>(For the unaware, I'm referring to this of course: https://en.wikipedia.org/wiki/First_contact_(anthropology) )
22:23:44<Barto>(2010)theyearwemadekontakt?
22:24:01<Barto>make or made, i don't know
22:29:07bonga joins
22:31:01<immibis>JAA: it randomly switches between XML and text/plain probably due to some caching issue - fun
22:31:56<@JAA>Huh, what does?
22:33:46<immibis>JAA: https://coub-anubis-a.akamaized.net/coub_storage/
22:34:06<@JAA>Hmm, I've only seen XML responses.
22:34:56<tech234a>Should anything be done about Vimeo? Apparently they’ve been starting to ask creators in the top 1% of bandwidth usage to pay thousands per year to continue using the platform.
22:35:26<@arkiver>tech234a: do we know when videos may be deleted?
22:35:40<immibis>JAA: try adding some random query parameters, it seems to be text by default
22:35:45<@arkiver>is there a "pay up or else" date?
22:36:02IDK quits [Quit: Connection closed for inactivity]
22:36:16<tech234a>Not sure
22:36:26<@JAA>immibis: Well, I did just that when I was trying to figure out the pagination. Only got XML, every time.
22:36:45<tech234a>Some info was posted last month but it didn't make news until today: https://vimeo.com/blog/post/understanding-bandwidth-on-vimeo/
22:37:29<tech234a>Article from today: https://www.theverge.com/2022/3/15/22979126/vimeo-patreon-creators-price-increase
22:37:46<tech234a>it seems they are gradually emailing users given the variety of dates quoted in the article
22:38:51<immibis>JAA: my guess is they have nginx reverse proxy in front of openstack swift, and these pagination parameters are not getting passed through: https://docs.openstack.org/swift/latest/api/object_api_v1_overview.html
22:39:18<immibis>alternatively, they are running something different but for some reason it pretends to be openstack swift
22:39:48<@arkiver>in that case results (while different for different params) should still be consistent for same params right?
22:40:57<@JAA>Ah, thanks.
22:41:16<tech234a>arkiver: sample Vimeo email, though it looks like different people are getting affected at different times: https://www.patreon.com/posts/vimeo-is-holding-61514364
22:41:31<immibis>arkiver: it may depend on something like which cache node you happen to hit
22:41:39<immibis>and which format was last requested through it
22:41:49<@JAA>Let's move this to #coup.
22:44:42<tech234a>The Vimeo thing does seem to have ties to Patreon videos so I'm not sure how much public content is affected here but I assume that there is some public content that exceeds their bandwidth limits.
22:44:43bonga quits [Read error: Connection reset by peer]
22:46:06bonga joins
22:52:21user01010 joins
22:54:36<h2ibot>Tech234a edited Vimeo (+551, Add note about high-bandwidth users): https://wiki.archiveteam.org/?diff=48370&oldid=46649
22:55:28nyuuzyou quits [Remote host closed the connection]
22:55:36<h2ibot>Tech234a created Dw (+24, Create redirect from dw to deathwatch): https://wiki.archiveteam.org/?title=Dw
22:57:29bonga quits [Ping timeout: 265 seconds]
22:57:36<h2ibot>Tech234a edited Deathwatch (+175, /* 2022 */ Add Vimeo high bandwidth users): https://wiki.archiveteam.org/?diff=48372&oldid=48369
22:58:36<h2ibot>Tech234a edited Vimeo (+100, Add additional reference): https://wiki.archiveteam.org/?diff=48373&oldid=48370
22:59:31bonga joins
23:01:07Arcorann (Arcorann) joins
23:13:26TheTechRobo quits [Ping timeout: 265 seconds]
23:18:29TheTechRobo joins
23:20:44TheTechRobo quits [Remote host closed the connection]
23:21:14TheTechRobo joins
23:30:58march_happy quits [Ping timeout: 265 seconds]
23:31:10Ruthalas quits [Read error: Connection reset by peer]
23:31:24Ruthalas (Ruthalas) joins
23:31:42march_happy (march_happy) joins
23:31:58Ruthalas quits [Client Quit]
23:33:03BlueMaxima quits [Client Quit]
23:45:43<user01010>@binzyboi i think i've found a crawler that should work on Buzzly, but i'm still installing it https://github.com/internetarchive/brozzler/
23:57:16<appledash>IRC is not Twitter, @ means nothing
23:58:06<@OrIdow6>appledash: I suspect there are some clients (or maybe bridges?) that add that
23:58:23<@OrIdow6>binzyboi: Is there a way to list all users on this site?
23:58:42<@OrIdow6>I am giving myself half an hour or so to look at this