| 00:00:44 | | dm4v quits [Read error: Connection reset by peer] |
| 00:01:08 | | dm4v joins |
| 00:01:10 | | dm4v is now authenticated as dm4v |
| 00:01:10 | | dm4v quits [Changing host] |
| 00:01:10 | | dm4v (dm4v) joins |
| 00:22:06 | | BlueMaxima joins |
| 00:44:08 | <@JAA> | Seems like my wiki bot isn't functioning correctly, at least on the WBM exclusion page. Looking into that. |
| 01:02:45 | | dm4v quits [Ping timeout: 244 seconds] |
| 01:04:49 | | dm4v joins |
| 01:04:51 | | dm4v is now authenticated as dm4v |
| 01:04:51 | | dm4v quits [Changing host] |
| 01:04:51 | | dm4v (dm4v) joins |
| 01:09:36 | | Arcorann (Arcorann) joins |
| 01:37:08 | | pabs quits [Ping timeout: 250 seconds] |
| 01:39:49 | | qwertyasdfuiopghjkl quits [Client Quit] |
| 02:29:03 | | Iki joins |
| 02:44:03 | | pabs (pabs) joins |
| 03:07:17 | | fuzzy8021 quits [Killed (NickServ (GHOST command used by fuzzy802!~fuzzy8021@173-224-26-244.ptcnet.net))] |
| 03:07:23 | | fuzzy8021 (fuzzy8021) joins |
| 03:08:28 | | lukash7 joins |
| 03:15:57 | <@JAA> | ArchiveBot just completed its 300000th job. |
| 03:16:05 | <Ryz> | 300,000 JOBS O: |
| 03:16:36 | <@JAA> | We also recently reached 2 PiB of AB WARCs uploaded to IA, approximately on 2021-07-11. |
| 03:17:17 | <@JAA> | (2 PB were reached back in early February.) |
| 03:18:53 | <@JAA> | We are now at some 21.5 billion HTTP responses from ArchiveBot in the WBM. |
| 03:23:05 | <Craigle> | WOOOOOHOOOOO!!!!! |
| 03:23:28 | <Craigle> | That's awesome! |
| 03:26:36 | | fuzzy8021 quits [Read error: Connection reset by peer] |
| 03:28:19 | | fuzzy8021 (fuzzy8021) joins |
| 03:44:54 | | nertzy_ joins |
| 03:48:37 | | qw3rty__ joins |
| 03:52:13 | | qw3rty_ quits [Ping timeout: 244 seconds] |
| 03:57:54 | | nertzy_ quits [Client Quit] |
| 04:00:00 | | treora quits [Client Quit] |
| 04:01:16 | | treora joins |
| 04:22:11 | | DogsRNice quits [Read error: Connection reset by peer] |
| 04:32:10 | <@JAA> | Wiki bot should be fixed (next run at the full hour). It was most likely broken for just over a month due to changes on the wiki side. :-| |
| 04:49:39 | | BlueMaxima quits [Read error: Connection reset by peer] |
| 04:50:50 | | nicolas17 quits [Ping timeout: 250 seconds] |
| 05:01:02 | <h2ibot> | JAABot edited List of websites excluded from the Wayback Machine (+0): https://wiki.archiveteam.org/?diff=47081&oldid=47080 |
| 05:53:25 | | nertzy_ joins |
| 06:09:53 | | nertzy_ quits [Client Quit] |
| 07:59:20 | | tzt quits [Ping timeout: 250 seconds] |
| 08:34:26 | | nico_32 quits [Ping timeout: 250 seconds] |
| 08:46:40 | | nico_32 (nico) joins |
| 09:46:08 | | wyatt8740 quits [Ping timeout: 244 seconds] |
| 09:52:06 | | wyatt8740 joins |
| 11:20:22 | | Megame quits [Client Quit] |
| 11:28:57 | | Iki quits [Ping timeout: 244 seconds] |
| 11:59:27 | | AK quits [Quit: AK] |
| 12:01:04 | | AK (AK) joins |
| 12:06:56 | | AK quits [Client Quit] |
| 12:09:07 | | AK (AK) joins |
| 13:21:04 | | lunik1 quits [Ping timeout: 244 seconds] |
| 14:15:11 | | IDK (IDK) joins |
| 15:01:44 | | tzt joins |
| 15:12:21 | | lunik1 joins |
| 15:25:35 | | Arcorann quits [Ping timeout: 244 seconds] |
| 15:36:21 | <@OrIdow6> | arkiver: I think I should have Google Drive folders finished within an hour or so |
| 15:42:42 | <duce1337> | is there a way for wget to disable js when website is fully downloaded like archive.today? |
| 16:13:41 | <@OrIdow6> | duce1337: Presumably you could just manually strip out scripts, but I suspect that you're actually asking how to get a rendered page without Javascript, which is not really something wget can do |
| 16:18:17 | | benjins quits [Ping timeout: 244 seconds] |
| 16:56:04 | <h2ibot> | OrIdow6 uploaded File:Google drive logo.png: https://wiki.archiveteam.org/?title=File%3AGoogle%20drive%20logo.png |
| 16:58:04 | <h2ibot> | OrIdow6 created Google Drive (+1963, Created page with "{{Infobox project | title =…): https://wiki.archiveteam.org/?title=Google%20Drive |
| 17:04:44 | <duce1337> | OrIdow6: i mean it can download js, but later remove the js files just like archive.today |
| 17:10:02 | <@OrIdow6> | duce1337: First, Javascript isn't just in separate files; to practically disable it you'd have to modify the page. Second, many sites depend heavily on Javascript, an to replicate the behavior of archive.questionable you'd have to render the page before taking the scripts out, which is something beyond the scope of what wget can do. |
| 17:10:10 | <@OrIdow6> | *and |
| 17:10:59 | <duce1337> | ok |
| 17:12:07 | <@OrIdow6> | Anyone ever had experience of -hitting a limit for how often a Google Drive folder can be viewed or -getting downloader-side limited on Google Drive, and having something happen to you other than being sent to google.com/sorry? |
| 17:15:01 | <@OrIdow6> | arkiver: https://github.com/OrIdow6/google-drive-grab - needs multiitem tweaked (including removing from item_names_to_submit), and needs handling for rate limiting, but other than that I think folders are finished |
| 17:20:16 | | wyatt8740 quits [Ping timeout: 252 seconds] |
| 17:20:32 | <@OrIdow6> | Also do_debug is on, so if you run this from a real queue it will finish items without sending anything to backfeed |
| 17:20:40 | <@OrIdow6> | I do not have an item list yet |
| 17:55:46 | <tech234a> | I think the YouTube Attributions removal mostly affects videos from before 9/20/2017 when the online editor was discontinued... I think the online editor was the only way to populate the attributions page but I'm not sure https://support.google.com/youtube/forum/AAAAiuErobUdx5xKn6v6rM/ |
| 18:04:14 | | hexa- quits [Quit: WeeChat 3.1] |
| 18:04:30 | | hexa- (hexa-) joins |
| 18:08:28 | | nicolas17 joins |
| 18:24:56 | | IDK quits [Client Quit] |
| 18:30:32 | <tech234a> | Found an example to the contrary of the 9/20/2017 cut-off: https://www.youtube.com/watch?v=2cF7u1jyDKw |
| 18:55:17 | | IDK (IDK) joins |
| 18:55:23 | | lennier2 joins |
| 18:57:56 | | lennier1 quits [Ping timeout: 244 seconds] |
| 18:58:04 | | lennier2 is now known as lennier1 |
| 18:59:16 | | sembiance (sembiance) joins |
| 19:07:03 | | wyatt8740 joins |
| 19:24:20 | <IDK> | https://onlytech.com/community/threads/google-bookmarks-is-shutting-down-on-30-september-2021.47584/ |
| 19:24:25 | <IDK> | is there anything ti do here |
| 19:26:07 | | qwertyasdfuiopghjkl joins |
| 19:26:26 | <Jake> | seems to be all private stuff, saved into lists? nothing I think is public? |
| 19:26:58 | <IDK> | Ok |
| 19:41:53 | | wyatt8740 quits [Remote host closed the connection] |
| 19:43:51 | | wyatt8740 joins |
| 19:53:12 | | benjins joins |
| 19:54:30 | | benjinsmith joins |
| 19:55:12 | | wyatt8740 quits [Ping timeout: 250 seconds] |
| 19:57:52 | | benjins quits [Ping timeout: 244 seconds] |
| 19:57:58 | | wyatt8740 joins |
| 19:58:22 | | benjinsmith is now known as benjins |
| 19:58:24 | | benjins is now authenticated as benjins |
| 20:34:42 | | DogsRNice (Webuser299) joins |
| 20:54:34 | | Hackerpcs quits [Quit: Hackerpcs] |
| 20:57:07 | | Hackerpcs (Hackerpcs) joins |
| 21:03:48 | <h2ibot> | JustAnotherArchivist edited ISP Hosting (+42, Add alternative Tiscali Italy domain…): https://wiki.archiveteam.org/?diff=47084&oldid=47057 |
| 21:05:33 | | qwertyasdfuiopghjkl quits [Ping timeout: 244 seconds] |
| 21:06:38 | | Hackerpcs quits [Client Quit] |
| 21:07:46 | | Hackerpcs (Hackerpcs) joins |
| 21:38:52 | | qwertyasdfuiopghjkl joins |
| 21:49:57 | <h2ibot> | JustAnotherArchivist edited ISP Hosting (+1086, Add several Italian ISPs from…): https://wiki.archiveteam.org/?diff=47085&oldid=47084 |
| 21:54:58 | <h2ibot> | JustAnotherArchivist edited ISP Hosting (+149, Add InWind): https://wiki.archiveteam.org/?diff=47086&oldid=47085 |
| 22:24:36 | | lukash7 quits [Ping timeout: 244 seconds] |
| 22:32:17 | | driib3 (driib) joins |
| 22:35:58 | | driib quits [Ping timeout: 252 seconds] |
| 22:35:58 | | driib3 is now known as driib |
| 22:43:58 | | Megame (Megame) joins |
| 23:12:57 | | neon joins |
| 23:34:10 | <neon> | i saw that you guys archived some content i'm currently looking for when you archived "the artist union" before it shutdown, but I have absolutely no clue how to browse everything that has been archived. do i have to individually download all of the warc files and parse them to find the files that i want or am i missing something? thanks. |
| 23:48:12 | | nuroten joins |
| 23:49:17 | | BlueMaxima joins |
| 23:57:41 | <nuroten> | hi, I'd like to suggest another site for the HK media list https://collection.news/ it's an archive of Apple Daily articles. there's likely some overlap with what AT were able to grab from the official site, but it could be a good supplement |