| 00:31:43 | | HP_Archivist (HP_Archivist) joins |
| 00:50:15 | | march_happy quits [Ping timeout: 252 seconds] |
| 00:50:53 | | march_happy (march_happy) joins |
| 00:55:06 | <Jake> | I assume there's no easy script out there to pull out specific records from ZSTD warcs? :) |
| 00:58:23 | <@JAA> | Not yet, but soon™. (Don't hold your breath though...) |
| 01:02:34 | | dm4v_ joins |
| 01:03:18 | | dm4v quits [Ping timeout: 265 seconds] |
| 01:03:18 | | dm4v_ is now known as dm4v |
| 01:03:19 | | dm4v is now authenticated as dm4v |
| 01:03:19 | | dm4v quits [Changing host] |
| 01:03:19 | | dm4v (dm4v) joins |
| 01:10:48 | | qwertyasdfuiopghjkl joins |
| 01:18:45 | | wyatt8740 joins |
| 01:19:57 | | wyatt8750 quits [Ping timeout: 252 seconds] |
| 01:39:30 | | eroc1990 quits [Client Quit] |
| 01:39:50 | | eroc1990 (eroc1990) joins |
| 01:43:25 | | Stiletto quits [Ping timeout: 265 seconds] |
| 01:54:19 | <Terbium> | I've tinkered around with modding WARCIO to support zstd, has anyone else worked on something similar? |
| 02:05:08 | <@JAA> | I've been working on a new Python package for WARC in general. It's been dormant for a while, but I've made progress again recently and hope to get it to a usable state in the not-too-distant future. |
| 02:13:15 | | HackMii quits [Remote host closed the connection] |
| 02:14:31 | | HackMii (hacktheplanet) joins |
| 02:15:15 | | AlsoHP_Archivist joins |
| 02:18:07 | | hackbug quits [Quit: Lost terminal] |
| 02:19:21 | | HP_Archivist quits [Ping timeout: 252 seconds] |
| 02:27:11 | | AlsoHP_Archivist quits [Client Quit] |
| 03:01:01 | | Arcorann (Arcorann) joins |
| 03:16:04 | | hackbug (hackbug) joins |
| 03:32:40 | | HackMii quits [Remote host closed the connection] |
| 03:33:53 | | HackMii (hacktheplanet) joins |
| 03:36:07 | | hackbug quits [Client Quit] |
| 03:48:15 | | hackbug (hackbug) joins |
| 03:54:38 | | hackbug quits [Remote host closed the connection] |
| 04:00:56 | | qwertyasdfuiopghjkl quits [Client Quit] |
| 04:17:17 | | qwertyasdfuiopghjkl joins |
| 04:55:24 | | BlueMaxima quits [Read error: Connection reset by peer] |
| 04:56:31 | | BlueMaxima joins |
| 05:53:47 | | march_happy quits [Ping timeout: 265 seconds] |
| 05:54:28 | | march_happy (march_happy) joins |
| 06:02:28 | | DiscantX joins |
| 06:04:01 | | HackMii quits [Remote host closed the connection] |
| 06:34:05 | | Mateon1 quits [Remote host closed the connection] |
| 06:35:02 | | Mateon1 joins |
| 06:39:28 | | HackMii (hacktheplanet) joins |
| 08:05:43 | | BlueMaxima quits [Client Quit] |
| 08:43:35 | | pabs quits [Client Quit] |
| 08:45:51 | | pabs (pabs) joins |
| 08:53:38 | | twitcharchive joins |
| 08:54:07 | <twitcharchive> | https://m.twitch.tv/videos/1425034743 |
| 08:54:18 | <twitcharchive> | https://m.twitch.tv/videos/1467094023 |
| 08:54:34 | <twitcharchive> | https://m.twitch.tv/videos/1414207243 |
| 08:55:12 | <twitcharchive> | https://m.twitch.tv/videos/1466980574 |
| 08:55:52 | | twitcharchive quits [Remote host closed the connection] |
| 09:08:39 | <spirit> | ^ spam, dont bother clicking |
| 09:13:33 | | r joins |
| 09:14:00 | <r> | Archive the twitch streams, as they may be useful for FNF related datamining |
| 09:14:09 | <r> | like facts about the character |
| 09:14:10 | <r> | s |
| 09:14:27 | | r leaves |
| 09:14:47 | <systwi> | FNF? |
| 09:14:59 | <systwi> | Thanks for the warning, spirit. |
| 09:48:31 | | T31M quits [Quit: ZNC - https://znc.in] |
| 09:48:44 | <spirit> | some game |
| 09:49:19 | | T31M joins |
| 09:49:21 | | T31M is now authenticated as T31M |
| 10:09:03 | | march_happy quits [Ping timeout: 252 seconds] |
| 10:10:27 | | march_happy (march_happy) joins |
| 10:32:21 | | march_happy quits [Remote host closed the connection] |
| 10:41:25 | | march_happy (march_happy) joins |
| 11:22:54 | | rellu quits [Quit: ZNC - https://znc.in] |
| 11:23:00 | | rellu joins |
| 11:27:09 | | nimaje quits [Ping timeout: 265 seconds] |
| 11:28:26 | | nimaje joins |
| 11:46:58 | | wyatt8740 quits [Ping timeout: 265 seconds] |
| 11:47:04 | | wyatt8750 joins |
| 12:20:40 | <TheTechRobo> | systwi: by FNF they might be referring to Five Nights at Freddie's...? not sure |
| 12:21:24 | <audrooku|m> | Friday night funkin |
| 12:42:43 | <TheTechRobo> | oh, that makes more sense |
| 13:27:25 | | HP_Archivist (HP_Archivist) joins |
| 13:43:33 | | sec^nd quits [Ping timeout: 252 seconds] |
| 13:44:33 | <h2ibot> | Usernam edited List of websites excluded from the Wayback Machine (+82, booru.allthefallen.moe was not excluded like 1…): https://wiki.archiveteam.org/?diff=48513&oldid=48508 |
| 13:49:54 | | sec^nd (second) joins |
| 13:50:42 | | Iki1 quits [Ping timeout: 252 seconds] |
| 13:51:29 | | AlsoHP_Archivist joins |
| 13:53:44 | | HP_Archivist quits [Ping timeout: 265 seconds] |
| 14:04:51 | | DiscantX quits [Ping timeout: 265 seconds] |
| 14:23:09 | | wyatt8750 quits [Ping timeout: 252 seconds] |
| 14:24:55 | | wyatt8740 joins |
| 14:29:27 | | Harzilein joins |
| 14:46:27 | | wyatt8750 joins |
| 14:46:49 | | wyatt8740 quits [Ping timeout: 252 seconds] |
| 14:46:49 | | Arcorann quits [Ping timeout: 252 seconds] |
| 15:08:48 | | march_happy quits [Ping timeout: 252 seconds] |
| 15:09:19 | | march_happy (march_happy) joins |
| 15:18:16 | | eroc19906 (eroc1990) joins |
| 15:20:36 | | eroc1990 quits [Ping timeout: 265 seconds] |
| 15:20:36 | | eroc19906 is now known as eroc1990 |
| 15:34:34 | | Iki joins |
| 16:02:07 | | AlsoHP_Archivist quits [Client Quit] |
| 16:02:16 | | onetruth quits [Read error: Connection reset by peer] |
| 16:11:42 | | spirit quits [Quit: Leaving] |
| 16:19:13 | | march_happy quits [Ping timeout: 265 seconds] |
| 16:19:29 | | march_happy (march_happy) joins |
| 16:28:46 | | hackbug (hackbug) joins |
| 16:39:06 | <systwi> | TheTechRobo, audrooku|m: Five Nights a Freddie's was my first thought, too, but yeah, Friday Night Fuckin makes more sense. Thanks. |
| 17:03:38 | | spirit joins |
| 17:31:43 | | Chris5010 quits [Ping timeout: 265 seconds] |
| 17:53:54 | | immibis quits [Remote host closed the connection] |
| 17:53:54 | | apache2 quits [Remote host closed the connection] |
| 17:53:54 | | superkuh_ quits [Remote host closed the connection] |
| 17:53:54 | | apache2 joins |
| 17:53:56 | | jtagcat6 quits [Quit: Bye!] |
| 17:54:05 | | superkuh_ joins |
| 17:55:00 | | immibis (immibis) joins |
| 17:55:18 | | immibis quits [Remote host closed the connection] |
| 17:56:30 | | immibis (immibis) joins |
| 17:56:48 | | immibis quits [Remote host closed the connection] |
| 17:57:14 | | immibis (immibis) joins |
| 17:58:53 | <cptcobalt> | hey, I'm struggling a tiny bit to open a particular archive team archived warc, curious if anybody has tips: I used `wget` and `gzip -d` to grab and download this file: `http://archive.org/download/archiveteam-mobileme-hero-2607x/archiveteam-mobileme-hero-2607x-26071.tar/.%2Fa%2Fan%2Fand%2Fandrey_kaplunenko%2Fpublic.me.com%2Fpublic.me.com-andrey_kaplunenko.warc.gz` and getting unexpected end of file |
| 17:59:02 | <cptcobalt> | (a friend and I are spelunking the MobileMe archive to see if we can find any old apple ads or marketing collateral that aren't *generally* available on the internet anymore, we've found *some* poking around here) |
| 17:59:10 | <cptcobalt> | any tips for this file though? |
| 18:04:06 | | jtagcat6 (jtagcat) joins |
| 18:07:49 | <systwi> | cptcobalt: I can't view the link on my end, but I'm assuming the archive is quite large. Did you download it via BitTorrent? I know IA has had problems with their torrent tracker for years when it comes to large amounts of files. |
| 18:08:03 | <cptcobalt> | it caps out at a 1GB download |
| 18:08:06 | | tzt quits [Ping timeout: 252 seconds] |
| 18:08:26 | <cptcobalt> | and I used wget for the dl (but also it fails in browser/etc) |
| 18:08:32 | <systwi> | Did you verify the sizes match, down to the byte? |
| 18:09:30 | <cptcobalt> | checkuing |
| 18:09:43 | | tzt (tzt) joins |
| 18:10:39 | <systwi> | It looks to be 6418465256 bytes |
| 18:10:47 | <systwi> | 6,418,465,256 |
| 18:11:32 | <cptcobalt> | yeah okay, I'm only getting `1074528256` |
| 18:11:54 | <systwi> | Hmm, could your destination be running out of space? |
| 18:12:27 | <cptcobalt> | nope |
| 18:13:00 | <systwi> | Does wget return errors of any kind? |
| 18:13:12 | <systwi> | Can you paste the log in a pastebin? |
| 18:13:16 | <systwi> | *into |
| 18:14:33 | <cptcobalt> | no, and sure, and I'm going to try a new clean download again with maybe a few extra flags to see if something went wrong with my download |
| 18:25:04 | <@JAA> | It could be that the IA thing that lets you download from within .tar files has a size limit. |
| 18:28:01 | <@JAA> | So you might need to download the entire .tar instead and extract the desired file locally. In theory, a range request would also work, but not sure if there is any way to get the offset within the file. |
| 18:28:41 | <cptcobalt> | https://www.irccloud.com/pastebin/TxwrLTgb/ |
| 18:29:12 | <cptcobalt> | yeah, that seems somewhat plausible, that might be the next step |
| 18:29:40 | <systwi> | Hmm, yeah, really strange. I don't like that it fails silently like that. |
| 18:29:58 | <systwi> | I think downloading the .tar might be the more reliable next step. |
| 18:30:19 | <systwi> | Thankfully it's not a multi-TB tarball. |
| 18:30:53 | <cptcobalt> | indeed |
| 18:33:43 | <@JAA> | Yeah, at least the WBM truncation is indicated in the HTTP headers. Don't see anything here. |
| 18:54:14 | | eroc1990 quits [Ping timeout: 265 seconds] |
| 19:06:29 | <cptcobalt> | well, full tar download in progress, wish us luck |
| 19:08:41 | | Stiletto joins |
| 19:40:34 | | spirit quits [Client Quit] |
| 19:45:31 | | DiscantX joins |
| 20:00:56 | | tzt quits [Ping timeout: 265 seconds] |
| 20:02:23 | | lennier1 quits [Ping timeout: 265 seconds] |
| 20:02:54 | | lennier1 (lennier1) joins |
| 20:23:54 | <TheTechRobo> | http://digitize.archiveteam.org/ is still down ("Hello archive team.org!") |
| 20:27:10 | | DiscantX quits [Ping timeout: 265 seconds] |
| 20:29:53 | | wyatt8740 joins |
| 20:29:56 | | wyatt8750 quits [Ping timeout: 265 seconds] |
| 20:47:52 | | onetruth joins |
| 21:12:52 | <cptcobalt> | followup: the full tar download worked, file extracted successfully |
| 21:12:57 | <cptcobalt> | ty all |
| 22:27:34 | | HackMii quits [Remote host closed the connection] |
| 22:28:57 | | HackMii (hacktheplanet) joins |
| 22:31:56 | | eroc1990 (eroc1990) joins |
| 22:50:57 | <h2ibot> | OrIdow6 edited Deathwatch (-4, Switter is dead): https://wiki.archiveteam.org/?diff=48514&oldid=48499 |
| 22:51:57 | <h2ibot> | OrIdow6 edited Deathwatch (-4, excite friends is dead): https://wiki.archiveteam.org/?diff=48515&oldid=48514 |
| 22:52:57 | <h2ibot> | OrIdow6 edited Deathwatch (+0, StackOverflow Jobs is dead): https://wiki.archiveteam.org/?diff=48516&oldid=48515 |
| 22:53:57 | <h2ibot> | OrIdow6 edited Deathwatch (-4, Webcrow is dead): https://wiki.archiveteam.org/?diff=48517&oldid=48516 |
| 22:55:57 | <h2ibot> | OrIdow6 edited Deathwatch (+0, Duolingo forums are dead): https://wiki.archiveteam.org/?diff=48518&oldid=48517 |
| 22:56:57 | <h2ibot> | OrIdow6 edited Deathwatch (+0, Feneas stuff is dead): https://wiki.archiveteam.org/?diff=48519&oldid=48518 |
| 22:58:58 | <h2ibot> | OrIdow6 edited Deathwatch (+0, EDUCAUSE listserv is dead): https://wiki.archiveteam.org/?diff=48520&oldid=48519 |
| 23:00:17 | | leo60228 quits [Quit: ZNC 1.8.1 - https://znc.in] |
| 23:00:36 | <@OrIdow6> | That's not all of it but at least 2022 fits on my screen now |
| 23:00:53 | <@OrIdow6> | For log-searchers, AFAICT we didn't get excite friends |
| 23:01:07 | | leo60228 (leo60228) joins |
| 23:06:09 | | onetruth quits [Remote host closed the connection] |
| 23:06:09 | | @Fusl quits [Excess Flood] |
| 23:06:09 | | apache2 quits [Remote host closed the connection] |
| 23:06:09 | | immibis quits [Remote host closed the connection] |
| 23:06:09 | | gazorpazorp quits [Remote host closed the connection] |
| 23:06:10 | | onetruth joins |
| 23:06:14 | | user_ (gazorpazorp) joins |
| 23:06:15 | | immibis (immibis) joins |
| 23:06:40 | | apache2 joins |
| 23:06:51 | | Fusl (Fusl) joins |
| 23:06:51 | | @ChanServ sets mode: +o Fusl |
| 23:34:53 | | HackMii quits [Remote host closed the connection] |
| 23:35:49 | | hackbug quits [Remote host closed the connection] |
| 23:36:53 | | HackMii (hacktheplanet) joins |
| 23:37:32 | | hackbug (hackbug) joins |
| 23:40:01 | | march_happy quits [Ping timeout: 265 seconds] |
| 23:41:27 | | march_happy (march_happy) joins |
| 23:42:26 | | hackbug quits [Ping timeout: 265 seconds] |
| 23:52:48 | | tzt (tzt) joins |
| 23:55:19 | | hackbug (hackbug) joins |