| 01:14:17 | | jamesp quits [Remote host closed the connection] |
| 01:14:48 | | upintheairsheep joins |
| 01:36:43 | | BlueMaxima joins |
| 01:41:17 | <h2ibot> | Wickedplayer494 edited Current Projects (+191, Bump imgur up in upcoming, restore Zippyshare…): https://wiki.archiveteam.org/?diff=49688&oldid=49676 |
| 02:02:31 | | andrew quits [Client Quit] |
| 02:03:13 | | andrew (andrew) joins |
| 02:07:25 | | tbc1887 (tbc1887) joins |
| 02:08:54 | | upintheairsheep quits [Remote host closed the connection] |
| 02:09:29 | <atphoenix> | OrIdow6, there may be a second chance for Shutterfly per https://old.reddit.com/r/Archiveteam/comments/11s1gjh/shutterfly_is_shutting_down_share_sites_also_new/jh2bvl5/ |
| 02:10:12 | <atphoenix> | and https://www.change.org/p/tell-shutterfly-to-respond-to-people-who-lost-all-their-memories/u/31512720 |
| 02:23:02 | | BlueMaxima quits [Read error: Connection reset by peer] |
| 02:23:16 | | BlueMaxima joins |
| 02:23:27 | <@OrIdow6> | Thanks atphoenix! |
| 02:23:35 | <@OrIdow6> | Hopefully they do put them back |
| 02:24:43 | <atphoenix> | just note that the Shutterfly VP's note said May 10-30, so a limited window |
| 02:25:09 | <atphoenix> | maybe before May 10, but they communicated by May 10 |
| 02:26:24 | | Doran is now known as Doranwen |
| 02:26:26 | <atphoenix> | I myself found I had some stuff on their site from so long ago that the photos were only about 200 kB each or less. |
| 02:26:45 | <@OrIdow6> | I checked one of the test sites I was using and it's still down atm |
| 02:26:53 | <@OrIdow6> | If it does come back the script is basically complete |
| 02:27:00 | <atphoenix> | and they're not my only copies (unless my local copies have bitrotted since my last check) |
| 02:28:53 | <atphoenix> | anyone who things depending on the cloud alone for important personal stuff or photos is fine only needs to look at the long list of companies that have done purges. Yahoo Photos and many others came before Shutterfly. |
| 02:29:13 | <atphoenix> | r/things/thinks |
| 02:29:29 | <atphoenix> | s/r/s |
| 02:29:31 | <atphoenix> | :D |
| 02:46:39 | <@OrIdow6> | Reminds me I have some work to do on my own personal backups... |
| 02:50:51 | | Niklink quits [Ping timeout: 265 seconds] |
| 02:56:30 | <h2ibot> | JustAnotherArchivist created Template:CTA URL lists (+1455, Created page with "<includeonly>== How to help…): https://wiki.archiveteam.org/?title=Template%3ACTA%20URL%20lists |
| 02:58:30 | <h2ibot> | JustAnotherArchivist edited Imgur (+73, Add URL lists CTA): https://wiki.archiveteam.org/?diff=49690&oldid=49684 |
| 03:04:31 | <h2ibot> | JustAnotherArchivist edited Imgur (+423, Clarify sitemaps and add note about SPN): https://wiki.archiveteam.org/?diff=49691&oldid=49690 |
| 03:15:04 | | Shjosan quits [Quit: Am sleepy (-, – )…zzzZZZ] |
| 03:15:41 | | Shjosan (Shjosan) joins |
| 03:22:37 | | pikablu quits [Read error: Connection reset by peer] |
| 03:27:38 | | nexusxe (nexusxe) joins |
| 03:27:49 | | xenonhelix (nexusxe) joins |
| 03:33:32 | | xenonhelix quits [Client Quit] |
| 03:33:32 | | nexusxe quits [Client Quit] |
| 03:35:50 | | Niklink joins |
| 03:42:59 | | Guest50 quits [Client Quit] |
| 03:59:41 | <h2ibot> | Tech234a edited Template:Instant messengers (+7, Google Talk no longer exists): https://wiki.archiveteam.org/?diff=49692&oldid=45858 |
| 04:00:01 | | treora quits [Quit: blub blub.] |
| 04:01:19 | | treora joins |
| 04:05:50 | | tbc1887 quits [Read error: Connection reset by peer] |
| 04:47:17 | | tbc1887 (tbc1887) joins |
| 04:57:21 | | nicolas17 quits [Client Quit] |
| 05:07:06 | | Island quits [Read error: Connection reset by peer] |
| 05:10:32 | | lennier2 joins |
| 05:12:13 | | lennier1 quits [Ping timeout: 252 seconds] |
| 05:12:23 | | lennier2 is now known as lennier1 |
| 05:20:55 | | umgr036 quits [Read error: Connection reset by peer] |
| 05:21:19 | | umgr036 joins |
| 05:47:05 | | superkuh quits [Remote host closed the connection] |
| 05:48:52 | | superkuh joins |
| 05:56:13 | | dumbgoy quits [Ping timeout: 252 seconds] |
| 06:01:27 | | igloo22225 quits [Client Quit] |
| 06:01:37 | | igloo22225 (igloo22225) joins |
| 06:02:16 | | nepeat quits [Ping timeout: 252 seconds] |
| 06:03:44 | | ave quits [Ping timeout: 252 seconds] |
| 06:04:04 | | igloo22225 quits [Client Quit] |
| 06:04:11 | | lun4 quits [Ping timeout: 265 seconds] |
| 06:08:52 | | igloo22225 (igloo22225) joins |
| 06:13:21 | | jacksonchen666 quits [Ping timeout: 245 seconds] |
| 06:14:58 | | ave (ave) joins |
| 06:14:59 | | lun4 (lun4) joins |
| 06:15:01 | | nepeat (nepeat) joins |
| 06:18:41 | | DiscantX quits [Ping timeout: 265 seconds] |
| 06:22:25 | | DiscantX joins |
| 06:26:50 | | BlueMaxima quits [Read error: Connection reset by peer] |
| 06:32:13 | | jacksonchen666 (jacksonchen666) joins |
| 06:46:52 | | lexikiq quits [Read error: Connection reset by peer] |
| 06:48:58 | | jacksonchen666 quits [Client Quit] |
| 07:40:47 | <Thibaultmol> | Why are the Warriors limited to 6 concurrent items? That seems like a small number (I get that it's to avoid overloading the services and causing bans... but still, 6?) |
| 08:03:08 | | dvd__ joins |
| 08:03:42 | | dvd_ quits [Remote host closed the connection] |
| 08:07:57 | | adamus1red quits [Client Quit] |
| 08:09:36 | | adamus1red (adamus1red) joins |
| 08:29:40 | | Niklink quits [Ping timeout: 265 seconds] |
| 08:45:53 | <masterX244> | Potential bug territory. Power users usually load projects with docker directly |
| 08:46:18 | <masterX244> | warrior is fire+forget and newcomer-friendly |
| 09:12:50 | <schwarzkatz|m> | Btw Thibaultmol: people who are not using Matrix probably cannot see your reactions. The majority uses IRC |
| 09:13:16 | <Thibaultmol> | true yeah, forgot |
| 09:13:37 | <Thibaultmol> | 👍️ Unicode emotes do work |
| 09:13:59 | <Thibaultmol> | , right? |
| 09:14:55 | <@OrIdow6> | The thumbs-up appears for me |
| 09:41:05 | | Minkafighter7225 quits [Quit: The Lounge - https://thelounge.chat] |
| 09:41:22 | | Minkafighter7225 joins |
| 10:00:52 | | JTL quits [Quit: WeeChat 2.9] |
| 10:30:09 | | tbc1887 quits [Read error: Connection reset by peer] |
| 10:56:01 | | Adrmcr (Adrmcr) joins |
| 10:56:58 | <Adrmcr> | I've been slightly looking into attempting to archive google play apps, but every website that claims to be able to download them feels like they'll give me a million viruses by just being on them :/ |
| 10:58:09 | | Adrmcr quits [Remote host closed the connection] |
| 10:58:22 | | Adrmcr (Adrmcr) joins |
| 10:59:16 | | Guest50 joins |
| 11:13:58 | <voltagex|m> | there's at least one uni and one security research firm that had a complete copy of the Google Play Store |
| 11:14:16 | <voltagex|m> | and you'd want to be pulling directly from the Play Store if your goal was to archive (current) apps. |
| 11:15:20 | <voltagex|m> | >Status: Downloaded newer image for atdr.meo.ws/archiveteam/warrior-dockerfile:latest |
| 11:15:20 | <voltagex|m> | WARNING: The requested image's platform (linux/amd64) does not match the detected host platform (linux/arm64/v8) and no specific platform was requested |
| 11:15:23 | <voltagex|m> | sigh |
| 11:17:31 | <@Sanqui> | there's no arm warrior yet |
| 11:19:52 | <voltagex|m> | and the stage images in the dockerfile are arch specific... |
| 11:20:56 | <@Sanqui> | my understanding is we compile our own wget fork with lua and porting that has been problematic. #archiveteam-dev if you're interested in working on this though |
| 11:21:29 | <@Sanqui|m> | (that's #archiveteam-dev:hackint.org ) |
| 11:23:09 | <@Sanqui|m> | as an aside, regular reminder that I maintain an archive team "space" on matrix. check out https://matrix.to/#/#archive-team:matrix.org |
| 11:54:14 | | Guest50 quits [Client Quit] |
| 12:11:16 | | icedice joins |
| 12:23:23 | | masterx244|m joins |
| 12:31:56 | | umgr036 quits [Remote host closed the connection] |
| 12:37:22 | | umgr036 joins |
| 12:38:04 | | umgr036 quits [Read error: Connection reset by peer] |
| 12:38:26 | | umgr036 joins |
| 13:02:57 | | icedice quits [Client Quit] |
| 13:04:55 | | icedice joins |
| 13:06:28 | | dumbgoy joins |
| 14:12:52 | | Arcorann quits [Ping timeout: 252 seconds] |
| 14:20:30 | | xkey quits [Quit: xkey] |
| 14:25:06 | | xkey (xkey) joins |
| 14:32:15 | | a joins |
| 14:32:28 | | a quits [Remote host closed the connection] |
| 14:48:30 | | hitgrr8 joins |
| 14:53:06 | | Guest50 joins |
| 15:01:38 | | fishingforsoup is now authenticated as fishingforsoup |
| 15:29:24 | | Island joins |
| 15:44:05 | | dan_a quits [Quit: [weboootz]] |
| 15:45:39 | | dan_a (dan_a) joins |
| 16:21:54 | | dumbgoy quits [Client Quit] |
| 16:22:41 | | dumbgoy joins |
| 16:41:22 | | dumbgoy quits [Ping timeout: 252 seconds] |
| 16:43:57 | | dumbgoy joins |
| 17:05:22 | | nicolas17 joins |
| 17:28:00 | | en3r0 joins |
| 17:28:07 | <en3r0> | Greetz. |
| 17:28:09 | <en3r0> | I recently shut down Legit Torrents and was told to get in touch here about getting some of it backed up to the Internet Archive. HN discussion here: https://news.ycombinator.com/item?id=35639370 |
| 17:39:04 | | Guest50 quits [Read error: Connection reset by peer] |
| 17:47:22 | | lexikiq joins |
| 17:58:52 | | jacksonchen666 (jacksonchen666) joins |
| 18:07:56 | | JTL (jtl) joins |
| 18:08:12 | <Terbium> | Hi en3r0 if you have a database dump with PII removed it can probably be uploaded to the Internet Archive |
| 18:09:36 | <en3r0> | Hi Terbium, I can certainly get one of those, but I am curious how that would end up looking / being displayed / what you would do with it? |
| 18:11:21 | <masterX244> | the rawest data possible allows converting it to usable formats as needed. |
| 18:13:26 | | icedice quits [Client Quit] |
| 18:16:27 | <Terbium> | most likely it will end up being just a DB dump available for download. exporting to a single-file DB like SQLite would probably be ideal. |
| 18:16:27 | <Terbium> | To convert it to something visible on the Wayback Machine (e.g. as a web page) would require some time to render out the individual pages and convert them to WARCs that can be ingested into WBM |
| 18:19:17 | <tech234a> | arkiver or JAA might know what's best to do here ^ |
| 18:20:23 | <en3r0> | Terbium makes sense. You would probably want a tar of the torrent files themselves also ya? |
| 18:20:48 | <nicolas17> | (inb4 "they're blobs in the database") |
| 18:21:42 | <en3r0> | Well, I am not so sure in this case. I would have to check nicolas17. |
| 18:41:01 | <@JAA> | 'Conversion' to WARCs isn't a thing. It would have to be served by an actual HTTP server. |
| 18:41:49 | <@JAA> | A DB dump and a tar of the torrents sounds reasonable, yeah. Not sure what other data there might be as I'm not familiar with the site. |
| 18:58:04 | <@arkiver> | en3r0: are you perhaps able to get the site online again for enough time for us to make a copy? |
| 18:58:14 | <@arkiver> | we'd crawl it, and it'd show up in the Wayback Machine |
| 19:09:37 | | umgr036 quits [Read error: Connection reset by peer] |
| 19:09:59 | | umgr036 joins |
| 19:20:53 | <@arkiver> | but if that is not possible, what JAA said is best - DB dump and tar of torrents |
| 19:28:52 | <en3r0> | arkiver I will have to look, I might be able to get it online without the tracking component.. |
| 19:29:48 | <@JAA> | That would be great! :-) |
| 19:32:21 | <en3r0> | If I get it online this weekend, what is the best way to get it crawled quickly? |
| 19:34:41 | <@arkiver> | en3r0: pinging us as soon as it's up |
| 19:34:53 | <@arkiver> | your site does not contain difficult fancy script stuff I hope? |
| 19:35:11 | <en3r0> | Nope, old school layouts with tables =D |
| 19:35:17 | <@arkiver> | if not, and it is just straightforward HTML pages, we should be able to make a good copy with ArchiveBot |
| 19:35:18 | <@arkiver> | perfect :) |
| 19:35:42 | <pokechu22> | It'd probably be good to also upload the database to archive.org directly so that people don't need to scrape web.archive.org if they want the data (but having it on web.archive.org would also be good) |
| 19:35:51 | <@arkiver> | yeah, both would be good! |
| 19:37:18 | <en3r0> | Ok cool |
| 19:40:28 | <@JAA> | A few thousand torrents, a couple ten thousand user profiles, and a handful of forum posts. Yeah, this should be a matter of hours with ArchiveBot. :-) |
| 19:48:16 | | qwertyasdfuiopghjkl quits [Ping timeout: 265 seconds] |
| 19:51:36 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
| 20:01:03 | <en3r0> | I don't think user profiles were ever public facing, so really it should just be the torrents and forum posts, not much for an archive effort! |
| 20:01:24 | <en3r0> | I participated in the G+ archive effort a long time back, that was pretty fun. |
| 20:02:00 | <@JAA> | Ah, only looked at the numbers at the bottom. Yeah, probably under an hour then assuming decent throughput. |
| 20:42:16 | | dvd__ quits [Ping timeout: 252 seconds] |
| 20:50:48 | | Ruthalas5 (Ruthalas) joins |
| 20:54:07 | | en3r0 quits [Remote host closed the connection] |
| 21:48:22 | | Guest50 joins |
| 21:57:04 | | Guest50 quits [Ping timeout: 252 seconds] |
| 22:15:46 | | Guest50 joins |
| 22:27:17 | | Guest50 quits [Ping timeout: 265 seconds] |
| 22:31:12 | | hitgrr8 quits [Client Quit] |
| 22:34:07 | | Adrmcr quits [Remote host closed the connection] |
| 22:42:23 | | Guest50 joins |
| 22:48:02 | | umgr036 quits [Read error: Connection reset by peer] |
| 22:48:24 | | umgr036 joins |
| 22:52:00 | | TheTechRobo quits [Read error: Connection reset by peer] |
| 22:52:09 | | fishingforsoup_ joins |
| 22:52:56 | | TheTechRobo (TheTechRobo) joins |
| 22:55:44 | | fishingforsoup quits [Ping timeout: 252 seconds] |
| 23:08:17 | | benjinsm joins |
| 23:11:19 | | benjins quits [Ping timeout: 252 seconds] |
| 23:12:49 | | benjinsm is now known as benjins |
| 23:12:50 | | benjins is now authenticated as benjins |
| 23:30:07 | | Guest50 quits [Ping timeout: 265 seconds] |