00:00:44dm4v quits [Read error: Connection reset by peer]
00:01:08dm4v joins
00:01:10dm4v quits [Changing host]
00:01:10dm4v (dm4v) joins
00:22:06BlueMaxima joins
00:44:08<@JAA>Seems like my wiki bot isn't functioning correctly, at least on the WBM exclusion page. Looking into that.
01:02:45dm4v quits [Ping timeout: 244 seconds]
01:04:49dm4v joins
01:04:51dm4v quits [Changing host]
01:04:51dm4v (dm4v) joins
01:09:36Arcorann (Arcorann) joins
01:37:08pabs quits [Ping timeout: 250 seconds]
01:39:49qwertyasdfuiopghjkl quits [Client Quit]
02:29:03Iki joins
02:44:03pabs (pabs) joins
03:07:17fuzzy8021 quits [Killed (NickServ (GHOST command used by fuzzy802!~fuzzy8021@173-224-26-244.ptcnet.net))]
03:07:23fuzzy8021 (fuzzy8021) joins
03:08:28lukash7 joins
03:15:57<@JAA>ArchiveBot just completed its 300000th job.
03:16:05<Ryz>300,000 JOBS O:
03:16:36<@JAA>We also recently reached 2 PiB of AB WARCs uploaded to IA, approximately on 2021-07-11.
03:17:17<@JAA>(2 PB were reached back in early February.)
03:18:53<@JAA>We are now at some 21.5 billion HTTP responses from ArchiveBot in the WBM.
03:23:05<Craigle>WOOOOOHOOOOO!!!!!
03:23:28<Craigle>That's awesome!
03:26:36fuzzy8021 quits [Read error: Connection reset by peer]
03:28:19fuzzy8021 (fuzzy8021) joins
03:44:54nertzy_ joins
03:48:37qw3rty__ joins
03:52:13qw3rty_ quits [Ping timeout: 244 seconds]
03:57:54nertzy_ quits [Client Quit]
04:00:00treora quits [Client Quit]
04:01:16treora joins
04:22:11DogsRNice quits [Read error: Connection reset by peer]
04:32:10<@JAA>Wiki bot should be fixed (next run at the full hour). It was most likely broken for just over a month due to changes on the wiki side. :-|
04:49:39BlueMaxima quits [Read error: Connection reset by peer]
04:50:50nicolas17 quits [Ping timeout: 250 seconds]
05:01:02<h2ibot>JAABot edited List of websites excluded from the Wayback Machine (+0): https://wiki.archiveteam.org/?diff=47081&oldid=47080
05:53:25nertzy_ joins
06:09:53nertzy_ quits [Client Quit]
07:59:20tzt quits [Ping timeout: 250 seconds]
08:34:26nico_32 quits [Ping timeout: 250 seconds]
08:46:40nico_32 (nico) joins
09:46:08wyatt8740 quits [Ping timeout: 244 seconds]
09:52:06wyatt8740 joins
11:20:22Megame quits [Client Quit]
11:28:57Iki quits [Ping timeout: 244 seconds]
11:59:27AK quits [Quit: AK]
12:01:04AK (AK) joins
12:06:56AK quits [Client Quit]
12:09:07AK (AK) joins
13:21:04lunik1 quits [Ping timeout: 244 seconds]
14:15:11IDK (IDK) joins
15:01:44tzt joins
15:12:21lunik1 joins
15:25:35Arcorann quits [Ping timeout: 244 seconds]
15:36:21<@OrIdow6>arkiver: I think I should have Google Drive folders finished within an hour or so
15:42:42<duce1337>is there a way for wget to disable js when website is fully downloaded like archive.today?
16:13:41<@OrIdow6>duce1337: Presumably you could just manually strip out scripts, but I suspect that you're actually asking how to get a rendered page without Javascript, which is not really something wget can do
16:18:17benjins quits [Ping timeout: 244 seconds]
16:56:04<h2ibot>OrIdow6 uploaded File:Google drive logo.png: https://wiki.archiveteam.org/?title=File%3AGoogle%20drive%20logo.png
16:58:04<h2ibot>OrIdow6 created Google Drive (+1963, Created page with "{{Infobox project | title =…): https://wiki.archiveteam.org/?title=Google%20Drive
17:04:44<duce1337>OrIdow6: i mean it can download js, but later remove the js files just like archive.today
17:10:02<@OrIdow6>duce1337: First, Javascript isn't just in separate files; to practically disable it you'd have to modify the page. Second, many sites depend heavily on Javascript, an to replicate the behavior of archive.questionable you'd have to render the page before taking the scripts out, which is something beyond the scope of what wget can do.
17:10:10<@OrIdow6>*and
17:10:59<duce1337>ok
17:12:07<@OrIdow6>Anyone ever had experience of -hitting a limit for how often a Google Drive folder can be viewed or -getting downloader-side limited on Google Drive, and having something happen to you other than being sent to google.com/sorry?
17:15:01<@OrIdow6>arkiver: https://github.com/OrIdow6/google-drive-grab - needs multiitem tweaked (including removing from item_names_to_submit), and needs handling for rate limiting, but other than that I think folders are finished
17:20:16wyatt8740 quits [Ping timeout: 252 seconds]
17:20:32<@OrIdow6>Also do_debug is on, so if you run this from a real queue it will finish items without sending anything to backfeed
17:20:40<@OrIdow6>I do not have an item list yet
17:55:46<tech234a>I think the YouTube Attributions removal mostly affects videos from before 9/20/2017 when the online editor was discontinued... I think the online editor was the only way to populate the attributions page but I'm not sure https://support.google.com/youtube/forum/AAAAiuErobUdx5xKn6v6rM/
18:04:14hexa- quits [Quit: WeeChat 3.1]
18:04:30hexa- (hexa-) joins
18:08:28nicolas17 joins
18:24:56IDK quits [Client Quit]
18:30:32<tech234a>Found an example to the contrary of the 9/20/2017 cut-off: https://www.youtube.com/watch?v=2cF7u1jyDKw
18:55:17IDK (IDK) joins
18:55:23lennier2 joins
18:57:56lennier1 quits [Ping timeout: 244 seconds]
18:58:04lennier2 is now known as lennier1
18:59:16sembiance (sembiance) joins
19:07:03wyatt8740 joins
19:24:20<IDK>https://onlytech.com/community/threads/google-bookmarks-is-shutting-down-on-30-september-2021.47584/
19:24:25<IDK>is there anything ti do here
19:26:07qwertyasdfuiopghjkl joins
19:26:26<Jake>seems to be all private stuff, saved into lists? nothing I think is public?
19:26:58<IDK>Ok
19:41:53wyatt8740 quits [Remote host closed the connection]
19:43:51wyatt8740 joins
19:53:12benjins joins
19:54:30benjinsmith joins
19:55:12wyatt8740 quits [Ping timeout: 250 seconds]
19:57:52benjins quits [Ping timeout: 244 seconds]
19:57:58wyatt8740 joins
19:58:22benjinsmith is now known as benjins
20:34:42DogsRNice (Webuser299) joins
20:54:34Hackerpcs quits [Quit: Hackerpcs]
20:57:07Hackerpcs (Hackerpcs) joins
21:03:48<h2ibot>JustAnotherArchivist edited ISP Hosting (+42, Add alternative Tiscali Italy domain…): https://wiki.archiveteam.org/?diff=47084&oldid=47057
21:05:33qwertyasdfuiopghjkl quits [Ping timeout: 244 seconds]
21:06:38Hackerpcs quits [Client Quit]
21:07:46Hackerpcs (Hackerpcs) joins
21:38:52qwertyasdfuiopghjkl joins
21:49:57<h2ibot>JustAnotherArchivist edited ISP Hosting (+1086, Add several Italian ISPs from…): https://wiki.archiveteam.org/?diff=47085&oldid=47084
21:54:58<h2ibot>JustAnotherArchivist edited ISP Hosting (+149, Add InWind): https://wiki.archiveteam.org/?diff=47086&oldid=47085
22:24:36lukash7 quits [Ping timeout: 244 seconds]
22:32:17driib3 (driib) joins
22:35:58driib quits [Ping timeout: 252 seconds]
22:35:58driib3 is now known as driib
22:43:58Megame (Megame) joins
23:12:57neon joins
23:34:10<neon>i saw that you guys archived some content i'm currently looking for when you archived "the artist union" before it shutdown, but I have absolutely no clue how to browse everything that has been archived. do i have to individually download all of the warc files and parse them to find the files that i want or am i missing something? thanks.
23:48:12nuroten joins
23:49:17BlueMaxima joins
23:57:41<nuroten>hi, I'd like to suggest another site for the HK media list https://collection.news/ it's an archive of Apple Daily articles. there's likely some overlap with what AT were able to grab from the official site, but it could be a good supplement