01:14:17jamesp quits [Remote host closed the connection]
01:14:48upintheairsheep joins
01:36:43BlueMaxima joins
01:41:17<h2ibot>Wickedplayer494 edited Current Projects (+191, Bump imgur up in upcoming, restore Zippyshare…): https://wiki.archiveteam.org/?diff=49688&oldid=49676
02:02:31andrew quits [Client Quit]
02:03:13andrew (andrew) joins
02:07:25tbc1887 (tbc1887) joins
02:08:54upintheairsheep quits [Remote host closed the connection]
02:09:29<atphoenix>OrIdow6, there may be a second chance for Shutterfly per https://old.reddit.com/r/Archiveteam/comments/11s1gjh/shutterfly_is_shutting_down_share_sites_also_new/jh2bvl5/
02:10:12<atphoenix>and https://www.change.org/p/tell-shutterfly-to-respond-to-people-who-lost-all-their-memories/u/31512720
02:23:02BlueMaxima quits [Read error: Connection reset by peer]
02:23:16BlueMaxima joins
02:23:27<@OrIdow6>Thanks atphoenix!
02:23:35<@OrIdow6>Hopefully they do put them back
02:24:43<atphoenix>just note that the Shutterfly VP's note said May 10-30, so a limited window
02:25:09<atphoenix>maybe before May 10, but they communicated by May 10
02:26:24Doran is now known as Doranwen
02:26:26<atphoenix>I myself found I had some stuff on their site from so long ago that the photos were only about 200 kB each or less.
02:26:45<@OrIdow6>I checked one of the test sites I was using and it's still down atm
02:26:53<@OrIdow6>If it does come back the script is basically complete
02:27:00<atphoenix>and they're not my only copies (unless my local copies have bitrotted since my last check)
02:28:53<atphoenix>every time someone things depending on cloud for important personal stuff or photos on only needs to look at the long list of companies that have done purges. Yahoo Photos and many others came before Shutterfly.
02:29:13<atphoenix>r/things/thinks
02:29:29<atphoenix>s/r/s
02:29:31<atphoenix>:D
02:46:39<@OrIdow6>Reminds me I have some work to do on my own personal backups...
02:50:51Niklink quits [Ping timeout: 265 seconds]
02:56:30<h2ibot>JustAnotherArchivist created Template:CTA URL lists (+1455, Created page with "<includeonly>== How to help…): https://wiki.archiveteam.org/?title=Template%3ACTA%20URL%20lists
02:58:30<h2ibot>JustAnotherArchivist edited Imgur (+73, Add URL lists CTA): https://wiki.archiveteam.org/?diff=49690&oldid=49684
03:04:31<h2ibot>JustAnotherArchivist edited Imgur (+423, Clarify sitemaps and add note about SPN): https://wiki.archiveteam.org/?diff=49691&oldid=49690
03:15:04Shjosan quits [Quit: Am sleepy (-, – )…zzzZZZ]
03:15:41Shjosan (Shjosan) joins
03:22:37pikablu quits [Read error: Connection reset by peer]
03:27:38nexusxe (nexusxe) joins
03:27:49xenonhelix (nexusxe) joins
03:33:32xenonhelix quits [Client Quit]
03:33:32nexusxe quits [Client Quit]
03:35:50Niklink joins
03:42:59Guest50 quits [Client Quit]
03:59:41<h2ibot>Tech234a edited Template:Instant messengers (+7, Google Talk no longer exists): https://wiki.archiveteam.org/?diff=49692&oldid=45858
04:00:01treora quits [Quit: blub blub.]
04:01:19treora joins
04:05:50tbc1887 quits [Read error: Connection reset by peer]
04:47:17tbc1887 (tbc1887) joins
04:57:21nicolas17 quits [Client Quit]
05:07:06Island quits [Read error: Connection reset by peer]
05:10:32lennier2 joins
05:12:13lennier1 quits [Ping timeout: 252 seconds]
05:12:23lennier2 is now known as lennier1
05:20:55umgr036 quits [Read error: Connection reset by peer]
05:21:19umgr036 joins
05:47:05superkuh quits [Remote host closed the connection]
05:48:52superkuh joins
05:56:13dumbgoy quits [Ping timeout: 252 seconds]
06:01:27igloo22225 quits [Client Quit]
06:01:37igloo22225 (igloo22225) joins
06:02:16nepeat quits [Ping timeout: 252 seconds]
06:03:44ave quits [Ping timeout: 252 seconds]
06:04:04igloo22225 quits [Client Quit]
06:04:11lun4 quits [Ping timeout: 265 seconds]
06:08:52igloo22225 (igloo22225) joins
06:13:21jacksonchen666 quits [Ping timeout: 245 seconds]
06:14:58ave (ave) joins
06:14:59lun4 (lun4) joins
06:15:01nepeat (nepeat) joins
06:18:41DiscantX quits [Ping timeout: 265 seconds]
06:22:25DiscantX joins
06:26:50BlueMaxima quits [Read error: Connection reset by peer]
06:32:13jacksonchen666 (jacksonchen666) joins
06:46:52lexikiq quits [Read error: Connection reset by peer]
06:48:58jacksonchen666 quits [Client Quit]
07:40:47<Thibaultmol>Why are the Warriors limited to 6 concurrent? That seems like a small number (I get that it's to avoid overloading the services and causing bans... but still, 6?)
08:03:08dvd__ joins
08:03:42dvd_ quits [Remote host closed the connection]
08:07:57adamus1red quits [Client Quit]
08:09:36adamus1red (adamus1red) joins
08:29:40Niklink quits [Ping timeout: 265 seconds]
08:45:53<masterX244>Potential bug territory. Powerusers usually load projects with docker directly
08:46:18<masterX244>warrior is fire+forget and newcomer-friendly
09:12:50<schwarzkatz|m>Btw Thibaultmol : people who are not using matrix probably cannot see your reactions. The majority uses the irc
09:13:16<Thibaultmol>true yeah, forgot
09:13:37<Thibaultmol>👍️ unicode emotes do work
09:13:59<Thibaultmol>,right?
09:14:55<@OrIdow6>The thumbs-up appears for me
09:41:05Minkafighter7225 quits [Quit: The Lounge - https://thelounge.chat]
09:41:22Minkafighter7225 joins
10:00:52JTL quits [Quit: WeeChat 2.9]
10:30:09tbc1887 quits [Read error: Connection reset by peer]
10:56:01Adrmcr (Adrmcr) joins
10:56:58<Adrmcr>I've been slightly looking into attempting to archive google play apps, but every website that claims to be able to download them feels like they'll give me a million viruses by just being on them :/
10:58:09Adrmcr quits [Remote host closed the connection]
10:58:22Adrmcr (Adrmcr) joins
10:59:16Guest50 joins
11:13:58<voltagex|m>there's at least one uni and one security research firm that had a complete copy of the Google Play Store
11:14:16<voltagex|m>and you'd want to be pulling directly from the Play Store if your goal was to archive (current) apps.
11:15:20<voltagex|m>>Status: Downloaded newer image for atdr.meo.ws/archiveteam/warrior-dockerfile:latest
11:15:20<voltagex|m>WARNING: The requested image's platform (linux/amd64) does not match the detected host platform (linux/arm64/v8) and no specific platform was requested
11:15:23<voltagex|m>sigh
11:17:31<@Sanqui>there's no arm warrior yet
11:19:52<voltagex|m>and the stage images in the dockerfile are arch specific...
11:20:56<@Sanqui>my understanding is we compile our own wget fork with lua and porting that has been problematic. #archiveteam-dev if you're interested in working on this though
11:21:29<@Sanqui|m>(that's #archiveteam-dev:hackint.org )
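[Editor's note] The warning quoted above appears because the pulled image targets linux/amd64 while the host reports linux/arm64/v8. The comparison can be sketched roughly as follows, assuming platform strings of the form os/arch[/variant]. The function names are hypothetical and this is not Docker's actual matching logic, only an illustration of why the two strings disagree.

```python
def parse_platform(spec: str) -> tuple[str, str, str]:
    """Split 'os/arch[/variant]' into its parts; variant defaults to ''."""
    parts = spec.strip().split("/")
    if len(parts) not in (2, 3):
        raise ValueError(f"unexpected platform string: {spec!r}")
    os_name, arch = parts[0], parts[1]
    variant = parts[2] if len(parts) == 3 else ""
    return os_name, arch, variant


def platforms_match(image: str, host: str) -> bool:
    """True when OS and architecture agree (the variant is ignored in this sketch)."""
    return parse_platform(image)[:2] == parse_platform(host)[:2]


if __name__ == "__main__":
    # Values taken from the warning quoted in the log.
    image, host = "linux/amd64", "linux/arm64/v8"
    if not platforms_match(image, host):
        print(f"image platform ({image}) does not match host platform ({host})")
```

Ignoring the variant is a simplification chosen for this sketch; a stricter check could compare all three fields.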
11:23:09<@Sanqui|m>as an aside, regular reminder that I maintain an archive team "space" on matrix. check out https://matrix.to/#/#archive-team:matrix.org
11:54:14Guest50 quits [Client Quit]
12:11:16icedice joins
12:23:23masterx244|m joins
12:31:56umgr036 quits [Remote host closed the connection]
12:37:22umgr036 joins
12:38:04umgr036 quits [Read error: Connection reset by peer]
12:38:26umgr036 joins
13:02:57icedice quits [Client Quit]
13:04:55icedice joins
13:06:28dumbgoy joins
14:12:52Arcorann quits [Ping timeout: 252 seconds]
14:20:30xkey quits [Quit: xkey]
14:25:06xkey (xkey) joins
14:32:15a joins
14:32:28a quits [Remote host closed the connection]
14:48:30hitgrr8 joins
14:53:06Guest50 joins
15:29:24Island joins
15:44:05dan_a quits [Quit: [weboootz]]
15:45:39dan_a (dan_a) joins
16:21:54dumbgoy quits [Client Quit]
16:22:41dumbgoy joins
16:41:22dumbgoy quits [Ping timeout: 252 seconds]
16:43:57dumbgoy joins
17:05:22nicolas17 joins
17:28:00en3r0 joins
17:28:07<en3r0>Greetz.
17:28:09<en3r0>I recently shut down Legit Torrents and was told to ask here about getting some of it backed up to the Internet Archive? HN discussion here: https://news.ycombinator.com/item?id=35639370
17:39:04Guest50 quits [Read error: Connection reset by peer]
17:47:22lexikiq joins
17:58:52jacksonchen666 (jacksonchen666) joins
18:07:56JTL (jtl) joins
18:08:12<Terbium>Hi en3r0 if you have a database dump with PII removed it can probably be uploaded to the Internet Archive
18:09:36<en3r0>Hi Terbium, I can certainly get one of those, but I am curious how would that end up looking / being displayed / what you would do with it?
18:11:21<masterX244>The rawest data possible allows converting it to usable formats as needed.
18:13:26icedice quits [Client Quit]
18:16:27<Terbium>most likely it will end up being just a DB dump available for download. probably export to a single file db like SQLite would be ideal.
18:16:27<Terbium>To convert it to something visible on the Wayback Machine (e.g. as a web page) would require sometime to render out the individual pages and convert them to WARCs that can be ingested into WBM
18:19:17<tech234a>arkiver or JAA might know what's best to do here ^
18:20:23<en3r0>Terbium makes sense. You would probably want a tar of the torrent files themselves also ya?
18:20:48<nicolas17>(inb4 "they're blobs in the database")
18:21:42<en3r0>Well, I am not so sure in this case. I would have to check nicolas17.
18:41:01<@JAA>'Conversion' to WARCs isn't a thing. It would have to be served by an actual HTTP server.
18:41:49<@JAA>A DB dump and a tar of the torrents sounds reasonable, yeah. Not sure what other data there might be as I'm not familiar with the site.
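[Editor's note] The "DB dump with PII removed, plus a tar of the torrents" idea discussed above could look something like the sketch below. It assumes the data has already been exported to a single SQLite file, as Terbium suggested; the site's real schema is not known here, so the table and column names passed in are purely hypothetical, as are both function names. Column types and constraints are not carried over, which may or may not matter for an archival copy.

```python
import sqlite3
import tarfile
from pathlib import Path


def export_without_pii(src_db: str, dst_db: str, pii_columns: dict[str, set[str]]) -> None:
    """Copy every table from src_db into dst_db, skipping the listed PII columns.

    pii_columns maps a table name to the column names to leave out (hypothetical names).
    """
    src = sqlite3.connect(src_db)
    dst = sqlite3.connect(dst_db)
    try:
        tables = [row[0] for row in src.execute(
            "SELECT name FROM sqlite_master "
            "WHERE type = 'table' AND name NOT LIKE 'sqlite_%'")]
        for table in tables:
            cols = [row[1] for row in src.execute(f'PRAGMA table_info("{table}")')]
            keep = [c for c in cols if c not in pii_columns.get(table, set())]
            if not keep:
                continue  # every column was flagged as PII, so skip the table
            col_list = ", ".join(f'"{c}"' for c in keep)
            # Untyped columns: this sketch does not preserve declared types.
            dst.execute(f'CREATE TABLE "{table}" ({col_list})')
            placeholders = ", ".join("?" for _ in keep)
            rows = src.execute(f'SELECT {col_list} FROM "{table}"')
            dst.executemany(f'INSERT INTO "{table}" VALUES ({placeholders})', rows)
        dst.commit()
    finally:
        src.close()
        dst.close()


def tar_torrents(torrent_dir: str, out_path: str) -> int:
    """Pack every *.torrent file under torrent_dir into an uncompressed tar."""
    count = 0
    with tarfile.open(out_path, "w") as tar:
        for path in sorted(Path(torrent_dir).rglob("*.torrent")):
            tar.add(path, arcname=str(path.relative_to(torrent_dir)))
            count += 1
    return count
```

A call such as `export_without_pii("site.db", "site_public.db", {"users": {"email", "ip"}})` followed by `tar_torrents("torrents/", "torrents.tar")` would then produce the two files to upload; those file, table, and column names are placeholders.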
18:58:04<@arkiver>en3r0: are you perhaps able to get the site online again for enough time for us to make a copy?
18:58:14<@arkiver>we'd crawl it, and it'd show up in the Wayback Machine
19:09:37umgr036 quits [Read error: Connection reset by peer]
19:09:59umgr036 joins
19:20:53<@arkiver>but if that is not possible, what JAA said is best - DB dump and tar of torrents
19:28:52<en3r0>arkiver I will have to look, I might be able to get it online without the tracking component..
19:29:48<@JAA>That would be great! :-)
19:32:21<en3r0>If I get it online this weekend, what is the best way to get it crawled quickly?
19:34:41<@arkiver>en3r0: pinging us as soon as it's up
19:34:53<@arkiver>your site does not contain difficult fancy script stuff I hope?
19:35:11<en3r0>Nope, old school layouts with tables =D
19:35:17<@arkiver>if not, if it is just straightforward HTML pages we should be able to make a good copy with ArchiveBot
19:35:18<@arkiver>perfect :)
19:35:42<pokechu22>It'd probably be good to also upload the database to archive.org directly so that people don't need to scrape web.archive.org if they want the data (but having it on web.archive.org would also be good)
19:35:51<@arkiver>yeah, both would be good!
19:37:18<en3r0>Ok cool
19:40:28<@JAA>A few thousand torrents, a couple ten thousand user profiles, and a handful of forum posts. Yeah, this should be a matter of hours with ArchiveBot. :-)
19:48:16qwertyasdfuiopghjkl quits [Ping timeout: 265 seconds]
19:51:36qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
20:01:03<en3r0>I don't think user profiles were ever public facing, so really it should just be the torrents and forum posts, not much for an archive effort!
20:01:24<en3r0>I participated in the G+ archive effort a long time back, that was pretty fun.
20:02:00<@JAA>Ah, only looked at the numbers at the bottom. Yeah, probably under an hour then assuming decent throughput.
20:42:16dvd__ quits [Ping timeout: 252 seconds]
20:50:48Ruthalas5 (Ruthalas) joins
20:54:07en3r0 quits [Remote host closed the connection]
21:48:22Guest50 joins
21:57:04Guest50 quits [Ping timeout: 252 seconds]
22:15:46Guest50 joins
22:27:17Guest50 quits [Ping timeout: 265 seconds]
22:31:12hitgrr8 quits [Client Quit]
22:34:07Adrmcr quits [Remote host closed the connection]
22:42:23Guest50 joins
22:48:02umgr036 quits [Read error: Connection reset by peer]
22:48:24umgr036 joins
22:52:00TheTechRobo quits [Read error: Connection reset by peer]
22:52:09fishingforsoup_ joins
22:52:56TheTechRobo (TheTechRobo) joins
22:55:44fishingforsoup quits [Ping timeout: 252 seconds]
23:08:17benjinsm joins
23:11:19benjins quits [Ping timeout: 252 seconds]
23:12:49benjinsm is now known as benjins
23:30:07Guest50 quits [Ping timeout: 265 seconds]