00:07:38HP_Archivist quits [Read error: Connection reset by peer]
00:41:12qwertyasdfuiopghjkl joins
00:45:49<@JAA>Since there was no movement on this, I'm archiving Channel 9 videos now. 320k URLs, expected total size 20-30 TiB. Curious to see how well qwarc can handle this. (Famous last words...)
00:48:12<@arkiver>thanks TheTechRobo !
00:48:23<@arkiver>IDK: how much data is it?
00:48:43<@arkiver>if it's not a very large amount, you can also compress and send the text to me and i'll extract URLs
01:00:01dm4v quits [Client Quit]
01:05:06dm4v joins
01:05:08dm4v quits [Changing host]
01:05:08dm4v (dm4v) joins
01:14:01Megame (Megame) joins
01:45:56tzt (tzt) joins
02:02:32dm4v_ joins
02:03:14dm4v quits [Ping timeout: 258 seconds]
02:03:14dm4v_ is now known as dm4v
02:03:14dm4v quits [Changing host]
02:03:14dm4v (dm4v) joins
02:10:11Mateon2 joins
02:12:03Mateon1 quits [Ping timeout: 258 seconds]
02:12:03Mateon2 is now known as Mateon1
02:33:10Megame quits [Client Quit]
02:55:09todrobbins joins
03:00:46driib798 (driib) joins
03:04:10driib79 quits [Ping timeout: 265 seconds]
03:04:10driib798 is now known as driib79
03:23:07todrobbins quits [Client Quit]
03:23:22ragu joins
03:30:18sonick quits [Client Quit]
04:00:41Atom-- joins
04:04:35Atom quits [Ping timeout: 265 seconds]
05:02:15qw3rty__ quits [Ping timeout: 258 seconds]
05:19:53HackMii_ quits [Remote host closed the connection]
05:20:24HackMii_ (hacktheplanet) joins
05:46:20sonick (sonick) joins
06:08:52qwertyasdfuiopghjkl quits [Client Quit]
06:13:25qwertyasdfuiopghjkl joins
07:22:05BlueMaxima quits [Client Quit]
08:00:40pabs quits [Remote host closed the connection]
09:21:46<IDK>Arkiver: I think its around 35mb
09:21:58<IDK>txt file
09:22:08<IDK>Im currently doing the dht archive
09:24:00<IDK>Arkiver: Here is the data: https://transfer.archivete.am/SSH1/discordscrape.txt
09:28:34revi quits [Ping timeout: 252 seconds]
09:28:34murmur quits [Ping timeout: 252 seconds]
09:28:54IDK quits [Ping timeout: 265 seconds]
09:28:54justcool393 quits [Ping timeout: 265 seconds]
09:28:54Ctrl-S quits [Ping timeout: 265 seconds]
09:29:07mgrandi quits [Ping timeout: 252 seconds]
09:29:40sonick quits [Ping timeout: 252 seconds]
09:29:40NotEggplant quits [Ping timeout: 252 seconds]
09:29:40tech234a quits [Ping timeout: 252 seconds]
09:29:40JSharp quits [Ping timeout: 252 seconds]
09:29:40Dragnog quits [Ping timeout: 252 seconds]
09:29:40Dallas quits [Ping timeout: 252 seconds]
09:30:13mgrandi (mgrandi) joins
09:30:15justcool393 (justcool393) joins
09:30:22murmur joins
09:30:25jrwr_ (jrwr) joins
09:30:25@ChanServ sets mode: +o jrwr_
09:30:30Dragnog joins
09:30:30Fusl__ (Fusl) joins
09:30:30@ChanServ sets mode: +o Fusl__
09:30:54NotEggplant joins
09:30:58ghuntley quits [Ping timeout: 622 seconds]
09:30:58@Fusl_ quits [Ping timeout: 622 seconds]
09:30:58@HCross quits [Ping timeout: 622 seconds]
09:30:58@Fusl__ is now known as @Fusl_
09:31:03themadpro_ (themadpro) joins
09:31:43revi (revi) joins
09:31:44jonty quits [Ping timeout: 622 seconds]
09:32:09tech234a (tech234a) joins
09:32:21HCross (HCross) joins
09:32:21@ChanServ sets mode: +o HCross
09:32:30devsnek quits [Ping timeout: 622 seconds]
09:32:53aarchi quits [Ping timeout: 622 seconds]
09:33:32russss_ (russss) joins
09:33:33aarchi (aarchi) joins
09:34:04JSharp (JSharp) joins
09:34:13devsnek (devsnek) joins
09:34:30jonty (jonty) joins
09:34:40sonick (sonick) joins
09:34:48russss quits [Ping timeout: 622 seconds]
09:34:48themadpro quits [Ping timeout: 622 seconds]
09:34:48@jrwr quits [Ping timeout: 622 seconds]
09:34:48russss_ is now known as russss
09:34:48themadpro_ is now known as themadpro
09:34:48@jrwr_ is now known as @jrwr
09:35:41IDK (IDK) joins
09:35:47Dallas (Dallas) joins
09:35:57@hook54321 quits [Ping timeout: 622 seconds]
09:36:45hook54321 (hook54321) joins
09:36:45@ChanServ sets mode: +o hook54321
09:39:02@HCross quits [Max SendQ exceeded]
09:40:08HCross (HCross) joins
09:40:08@ChanServ sets mode: +o HCross
09:40:42Ctrl-S joins
09:53:44ghuntley joins
11:03:04pabs (pabs) joins
12:24:37mutantmnky quits [Ping timeout: 258 seconds]
12:37:45mutantmnky (mutantmonkey) joins
12:45:34mutantmnky quits [Remote host closed the connection]
12:45:50mutantmnky (mutantmonkey) joins
13:07:11<TheTechRobo>arkiver: Did you use the ones in #// or the ones here????
13:07:20<TheTechRobo>i.e. my transfer link
13:07:27<TheTechRobo>There was a newer one in #//
13:07:35<TheTechRobo>Which has more urls
14:20:23Arcorann_ quits [Ping timeout: 258 seconds]
14:28:59qw3rty joins
14:47:20pawbs joins
14:51:46gazorpazorp quits [Read error: Connection reset by peer]
14:51:48user_ (gazorpazorp) joins
14:56:14fuzzy8021 quits [Killed (NickServ (GHOST command used by fuzzy802!~fuzzy8021@173-224-26-244.ptcnet.net))]
14:56:20fuzzy8021 (fuzzy8021) joins
15:08:18Myself (myself) joins
16:30:58qwertyasdfuiopghjkl quits [Remote host closed the connection]
16:32:43qwertyasdfuiopghjkl joins
16:43:39xit7 quits [Quit: The Lounge - https://thelounge.chat]
16:43:39air joins
16:45:45xit7 joins
16:47:31air quits [Remote host closed the connection]
16:50:05xit7 quits [Client Quit]
16:53:05xit7 joins
17:44:01<h2ibot>Jake edited Tvtag (+61, Add link to AB jobs. (The social media pages…): https://wiki.archiveteam.org/?diff=47897&oldid=33971
17:46:01<h2ibot>Jake edited Gfycat (+46, Add link to collection.): https://wiki.archiveteam.org/?diff=47898&oldid=47573
17:46:16spirit joins
17:57:03<h2ibot>Jake edited Gigaom (-394, Add link to main site AB job. (fix up…): https://wiki.archiveteam.org/?diff=47899&oldid=31808
17:59:04<h2ibot>Jake edited GitHost (+97, Add link to AB search): https://wiki.archiveteam.org/?diff=47900&oldid=47472
17:59:12<Jake>https://jakel.rocks/up/d63169a474383ee8/image.png Seen this quite a few times, is this something we can fix, or should we just downscale the pics or?
18:02:04<h2ibot>Jake edited GitHub Downloads (-16, Move links to items.): https://wiki.archiveteam.org/?diff=47901&oldid=46640
18:16:28todrobbins joins
18:41:32<@JAA>jrwr: ^
19:00:43user_ is now known as gazorpazorp
19:03:39LeGoupil joins
19:10:34LeGoupil quits [Ping timeout: 258 seconds]
19:12:52LeGoupil joins
19:33:20<Jake>thanks! :)
19:35:09<@jrwr>Downscale them
19:37:37<Jake>alright
19:43:24<h2ibot>Jake uploaded File:GitHub 1303511667338.png (Get the main front page to allow it to display…): https://wiki.archiveteam.org/?title=File%3AGitHub%201303511667338.png
19:43:51<Jake>https://wiki.archiveteam.org/index.php/Gitorious (this says saved, but I can't seem to find anything on IA?)
19:46:44<@JAA>Oh right, I wanted to poke astrid about that one.
19:59:01<Jake>glad I could remind you :)
20:00:06<@JAA>:-)
20:00:28<h2ibot>Jake edited Svalbard Global Seed Vault (+23, Add link to AB job (lol)): https://wiki.archiveteam.org/?diff=47903&oldid=45231
20:40:47<ghuntley>How much of channel 9 did we manage to archive?
20:40:50<ghuntley>https://twitter.com/geoffreyhuntley/status/1462883577218035712?s=21
20:42:46<Jake>I believe J AA is currently in-progress on Channel 9, around 20-30 TiB.
20:45:40<@JAA>The site is partially archived by ArchiveBot. I have a copy of the API data that I believe to be complete. I'm currently downloading the videos, thumbnails, ZIPs, and slides. Those are still online for the time being.
20:47:36qwertyasdfuiopghjkl quits [Remote host closed the connection]
20:47:39<@JAA>The ArchiveBot jobs also downloaded some of the videos. No idea how complete that is.
20:50:38<h2ibot>Jake edited Gmane (+1, Update status, doesn't seem to be any public…): https://wiki.archiveteam.org/?diff=47904&oldid=47474
20:51:06<AK>Looks like it's still downloading some videos at the moment from the queue
20:54:39<h2ibot>Jake edited Gna! (+50, Update to IA item): https://wiki.archiveteam.org/?diff=47905&oldid=47524
20:56:40<h2ibot>Jake edited Google+ (+50, Add link to collection.): https://wiki.archiveteam.org/?diff=47906&oldid=46698
21:00:40<h2ibot>Jake edited Google Answers (+44, Add link to IA item): https://wiki.archiveteam.org/?diff=47907&oldid=27600
21:06:41<h2ibot>Jake edited Google Baraza (+118, Add link to collection and AB jobs.): https://wiki.archiveteam.org/?diff=47908&oldid=47457
21:08:03<ghuntley>Legends!
21:11:29<@JAA>Looks like my download is going to take a while. Only 900 GiB done after ~21 hours. Also the IA uploads are going slow as usual, and I don't have space to buffer the entire thing.
21:13:43<h2ibot>Jake edited Google Books Ngram (+106, Add link to IA collection as well as update…): https://wiki.archiveteam.org/?diff=47909&oldid=28607
21:17:29<Jake>(Ngram seems to have had an update in 2020 that isn't on IA yet, not sure if it's worth uploading?)
21:31:23<@JAA>I wonder why those items are restricted.
21:31:47<@JAA>Maybe just too many people were downloading from IA instead of going to Google.
21:32:01<@JAA>In any case, I don't see the 2012 version on IA either.
21:32:36<@JAA>And yeah, I think these would likely be worth uploading.
21:34:58<Jake>Oh yeah, this was uploaded in 2011, so they don't have 2012 or 2020.
21:38:33katocala quits [Read error: Connection reset by peer]
21:38:38<Jake>https://wiki.archiveteam.org/index.php/Google_Business_Sitebuilder doesn't seem to have been uploaded to IA? Can't find a collection or individual items, the 'items' don't seem to have been run on AB (as well as the 'items' not playing back on WBM, ironic)
21:38:55katocala joins
21:44:06<Jake>https://wiki.archiveteam.org/index.php/Google_Fusion_Tables seems to be archived exclusively on this one GitHub repo? https://github.com/paulwalko/fuslontable
21:47:49<h2ibot>Jake edited Google Groups Files (+52, Add link to IA collection): https://wiki.archiveteam.org/?diff=47910&oldid=29386
21:50:31<@JAA>Yeah, that seems right re Fusion Tables.
22:01:32LeGoupil quits [Client Quit]
22:02:58spirit quits [Client Quit]
22:04:53Arcorann_ joins
22:35:43Megame (Megame) joins
22:37:58<h2ibot>Jake edited Google Helpouts (+56, Add link to IA item.): https://wiki.archiveteam.org/?diff=47911&oldid=47456
22:39:39todrobbins quits [Client Quit]
22:49:07BlueMaxima joins
23:00:02<h2ibot>Jake edited Knol (+49, There doesn't seem to have been a project? (as…): https://wiki.archiveteam.org/?diff=47912&oldid=47358
23:00:05<Jake>(Someone has been here before... http://archive.fart.website/bin/irclogger_log/archiveteam?date=2014-10-01,Wed&sel=60#l60 )
23:04:03<h2ibot>Jake edited Google Plus Comments on Blogspot (+45, Add link to IA item): https://wiki.archiveteam.org/?diff=47913&oldid=40921
23:06:31<Jake>https://archive.org/download/Blogspot_GPlus_Comments_Index/index.html another .html searcher broken, maybe we could get it moved to another collection to render?
23:09:03<Jake>(The other one I found http://archive.org/download/test-memac-index-test/tabblo.html )
23:13:04<h2ibot>Jake edited Google Reader (+107, Add link to IA collection.): https://wiki.archiveteam.org/?diff=47914&oldid=47380
23:14:23<@JAA>arkiver: ^
23:18:01thelounge31 joins
23:23:46<Jake>Thanks! :) https://wiki.archiveteam.org/index.php/Google_Video_(Archive) seems to be restricted from editing, but I'm also somewhat unsure what data was gotten for that project, I see a metadata item from Jason 'google-video-metadata-dumpage' as well as a IA access restricted collection 'googlevideo2011'
23:32:08<h2ibot>Jake edited Halo (+91, Add link to IA collection): https://wiki.archiveteam.org/?diff=47915&oldid=47696
23:32:36<@JAA>Guess the protection stuff doesn't make it into the announcements, but you can edit that page now.
23:34:08<h2ibot>TheTechRobo edited 4shared (+445, Add stuff that I just found out through…): https://wiki.archiveteam.org/?diff=47917&oldid=47682
23:34:44<Jake>thanks! :)
23:35:08<h2ibot>Jake edited Hardware Canucks (+42, Add link to AB jobs): https://wiki.archiveteam.org/?diff=47918&oldid=47537
23:37:09<h2ibot>Jake edited Google Video (Archive) (+110, Add link to collection and item.): https://wiki.archiveteam.org/?diff=47919&oldid=47916
23:39:09thelounge31 quits [Client Quit]
23:40:59Megame quits [Client Quit]
23:58:04yawkat quits [Ping timeout: 258 seconds]
23:58:13<h2ibot>JustAnotherArchivist created Channel 9 (+1240, Created page with "{{Infobox project |…): https://wiki.archiveteam.org/?title=Channel%209
23:58:14<h2ibot>JustAnotherArchivist edited Microsoft Developer Network (-27, Link to [[Channel 9]]): https://wiki.archiveteam.org/?diff=47921&oldid=45151
23:59:02yawkat (yawkat) joins
23:59:13<h2ibot>JustAnotherArchivist edited Channel 9 (+43, Add URL): https://wiki.archiveteam.org/?diff=47922&oldid=47920
23:59:14<h2ibot>JustAnotherArchivist edited Deathwatch (-21, /* 2021 */ Link to [[Channel 9]]): https://wiki.archiveteam.org/?diff=47923&oldid=47859