00:38:27<h2ibot>JustAnotherArchivist edited Deathwatch (+339, /* 2023 */ Add Imgur's Tumblr impression): https://wiki.archiveteam.org/?diff=49680&oldid=49673
00:38:28<h2ibot>John123521 edited Parler (+136): https://wiki.archiveteam.org/?diff=49681&oldid=47733
00:39:27<h2ibot>Usernam edited Imgur (+764): https://wiki.archiveteam.org/?diff=49682&oldid=49328
00:49:28<@OrIdow6>*Sounds like a good guess tech234a
00:49:33<@OrIdow6> not sure how distracted I had to be for that to end up as just your name
00:50:00<tech234a>Yeah I was a little confused there :)
00:57:47sonick (sonick) joins
01:02:28Mateon2 joins
01:03:43Mateon1 quits [Ping timeout: 252 seconds]
01:03:43Mateon2 is now known as Mateon1
02:54:18<Terbium>datechnoman: i work with cdx files a lot for my archive. I generally use a short Python script to load CDXs into SQLite or SQL DB. There's a bunch of tools (especially from the data science world) that can easily import CSVs, TSVs, and other arbitrary text tables into SQLite. I used them as well and set "space" as the delimiter without issue for importing.
02:56:14<Terbium>i ingest multi-GB CDXes into SQL databases on a job schedule for querying. As long as the CDX is properly formatted, it's relatively easy to import to a DB
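A minimal sketch of the kind of short Python script Terbium describes, not their actual code. It assumes the classic space-delimited, 11-field CDX layout (header ` CDX N b a m s k r M S V g`); the column names, the `cdx` table name, and the `parse_cdx_lines` / `load_cdx` helpers are hypothetical and only illustrate the approach.

```python
import sqlite3

# Assumed 11-field layout; the names are illustrative, adjust them to your CDX header.
CDX_COLUMNS = [
    "urlkey", "timestamp", "original_url", "mimetype", "statuscode", "digest",
    "redirect", "metatags", "record_length", "record_offset", "warc_filename",
]


def parse_cdx_lines(lines):
    """Yield one tuple per well-formed record, skipping the header and malformed lines."""
    for line in lines:
        line = line.rstrip("\r\n")
        if not line or line.lstrip().startswith("CDX "):
            continue  # blank line or the " CDX N b a ..." header
        fields = line.split(" ")  # space is the delimiter, as in the chat
        if len(fields) != len(CDX_COLUMNS):
            continue  # not "properly formatted": wrong field count
        yield tuple(fields)


def load_cdx(conn, lines, table="cdx"):
    """Create the table if needed, insert all records, and return the row count."""
    column_defs = ", ".join(f"{name} TEXT" for name in CDX_COLUMNS)
    conn.execute(f"CREATE TABLE IF NOT EXISTS {table} ({column_defs})")
    placeholders = ", ".join("?" for _ in CDX_COLUMNS)
    with conn:  # one transaction keeps large imports fast
        conn.executemany(
            f"INSERT INTO {table} VALUES ({placeholders})", parse_cdx_lines(lines)
        )
    return conn.execute(f"SELECT COUNT(*) FROM {table}").fetchone()[0]
```

For a real file, something like `load_cdx(sqlite3.connect("archive.db"), open("example.cdx", encoding="utf-8"))` streams the file line by line, so a multi-GB CDX never has to fit in memory. Both file names here are placeholders.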
03:16:16Pingerfowder quits [Ping timeout: 252 seconds]
03:18:19Pingerfowder (Pingerfowder) joins
03:46:35dumbgoy__ quits [Ping timeout: 265 seconds]
03:58:48zhongfu_ quits [Ping timeout: 252 seconds]
04:00:00aGerman quits [Quit: The Lounge - https://thelounge.chat]
04:00:49fishingforsoup_ quits [Read error: Connection reset by peer]
04:03:00aGerman (aGerman) joins
04:03:01Aoede quits [Quit: ZNC - https://znc.in]
04:03:01@dxrt quits [Quit: ZNC - http://znc.sourceforge.net]
04:03:20Aoede (Aoede) joins
04:13:13dumbgoy__ joins
04:15:07fishingforsoup joins
04:16:46zhongfu (zhongfu) joins
04:38:55dxrt joins
04:38:57dxrt quits [Changing host]
04:38:57dxrt (dxrt) joins
04:38:57@ChanServ sets mode: +o dxrt
04:47:15@dxrt quits [Client Quit]
04:47:19fishingforsoup quits [Read error: Connection reset by peer]
04:48:55fishingforsoup joins
04:50:47fishingforsoup quits [Read error: Connection reset by peer]
04:51:35dxrt joins
04:51:37dxrt quits [Changing host]
04:51:37dxrt (dxrt) joins
04:51:37@ChanServ sets mode: +o dxrt
04:55:09fishingforsoup joins
04:57:28lukash799 quits [Ping timeout: 252 seconds]
05:22:03Guest50 joins
05:23:28Guest50 quits [Remote host closed the connection]
05:24:07Guest50 joins
05:25:04dvd__ joins
05:25:38dvd_ quits [Remote host closed the connection]
05:39:29Niklink joins
06:16:52Island quits [Read error: Connection reset by peer]
06:21:37BlueMaxima quits [Read error: Connection reset by peer]
06:22:42dvd__ quits [Remote host closed the connection]
06:23:02dvd__ joins
06:37:59lun4 (lun4) joins
06:38:40igloo22225 quits [Ping timeout: 252 seconds]
06:38:40lun41 quits [Ping timeout: 252 seconds]
06:40:19nepeat quits [Ping timeout: 252 seconds]
06:40:25ave5 (ave) joins
06:41:13nepeat (nepeat) joins
06:41:25ave quits [Ping timeout: 252 seconds]
06:41:25ave5 is now known as ave
06:55:10Niklink50 joins
06:57:30Niklink quits [Ping timeout: 265 seconds]
07:07:48<schwarzkatz|m>have there been some endeavors in archiving pomf.cat already? it looks like only the uploading is disabled, but the files are still online
07:08:34<schwarzkatz|m>I'm not that active around here and couldn't find anything in the logs, so sorry for being that late
07:08:55<schwarzkatz|m>I'll try contacting the owner, maybe we can repeat an uploadir situation
07:19:48<pabs>a few jobs in here https://archive.fart.website/archivebot/viewer/?q=pomf.cat
07:19:58<pabs>how many subdomains does it have?
07:20:38Arcorann (Arcorann) joins
07:20:47<schwarzkatz|m>ah thx
07:20:47<schwarzkatz|m>a. is for the hosted files, I don't think there are more
08:04:36dvd_ joins
08:04:42dvd__ quits [Remote host closed the connection]
08:44:18umgr036 quits [Read error: Connection reset by peer]
08:44:40umgr036 joins
08:47:10<immibis>btw what's the reasoning for the default warrior project being telegram?
08:47:29<immibis>just wondering because reddit is going to implode and i wonder if it should be set to reddit now
08:55:41umgr036 quits [Read error: Connection reset by peer]
09:00:12umgr036 joins
09:00:52umgr036 quits [Read error: Connection reset by peer]
09:01:10umgr036 joins
09:16:56<masterX244>Res
09:17:07<masterX244>Reddit bugged in warrior atm
09:38:10sonick quits [Client Quit]
09:45:19<h2ibot>Entartet edited Deathwatch (+1, Spelling.): https://wiki.archiveteam.org/?diff=49683&oldid=49680
10:46:27dumbgoy joins
10:50:34dumbgoy__ quits [Ping timeout: 252 seconds]
11:34:26icedice joins
11:54:20qwertyasdfuiopghjkl quits [Client Quit]
11:59:53qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
12:15:49Nulo quits [Ping timeout: 252 seconds]
12:16:10Nulo joins
12:28:15Megame (Megame) joins
12:34:48insubstantial joins
12:37:12insubstantial quits [Remote host closed the connection]
13:00:58Niklink50 quits [Ping timeout: 265 seconds]
13:17:42Niklink joins
13:48:01<h2ibot>Entartet edited Imgur (+4, Added an internal link to [[Reddit]].): https://wiki.archiveteam.org/?diff=49684&oldid=49682
13:51:14Barto quits [Ping timeout: 265 seconds]
14:00:57Megame quits [Client Quit]
14:12:31igloo22225 (igloo22225) joins
14:26:35andrew (andrew) joins
14:51:02Guest50 quits [Client Quit]
14:52:34Arcorann quits [Ping timeout: 252 seconds]
14:52:39Guest50 joins
15:01:57Barto (Barto) joins
15:07:15<h2ibot>Yts98 edited ISP Hosting (+129, Added HiNet): https://wiki.archiveteam.org/?diff=49685&oldid=49213
15:07:16<h2ibot>Yts98 edited Xuite (+13377, Added domains, API & extend site structure): https://wiki.archiveteam.org/?diff=49686&oldid=49667
15:08:54<pabs>this site shut down: https://www.legittorrents.info/ https://news.ycombinator.com/item?id=35639370
15:16:03hitgrr8 joins
15:17:50jacksonchen666 (jacksonchen666) joins
15:31:18Island joins
15:34:44<pabs>https://www.thedailybeast.com/buzzfeed-news-is-shutting-down https://news.ycombinator.com/item?id=35641448
15:47:21nepeat quits [Client Quit]
15:50:09nepeat (nepeat) joins
16:04:39<AK>Damn
16:04:51<AK>Buzzfeed news (at least used to be) was actually pretty good
16:05:28<Guest50>yeah, the News part produced some good content
16:05:39<Guest50>not sure what's going to happen to the site
16:06:31<AK>Just thrown it into AB, we do seem to be getting content so hopefully we can make sure it's all saved
16:07:55MrRadar_ is now known as MrRadar
16:13:27<@JAA>We probably also got a decent amount of it through #//.
16:19:28<@arkiver>JAA: is AB enough for buzzfeed news?
16:20:55<@JAA>arkiver: They seem to have decent sitemaps, so assuming no/sane rate limits and no blocks, yeah, should be fine.
16:21:05<@arkiver>alright
16:21:09<@arkiver>and no fancy scripts :P
16:21:18<@arkiver>and assuming no fancy scripts
16:21:49<@JAA>There are some stupid scripts for image resizing and the like, but yeah, the site is pretty usable without JS.
16:21:58<@JAA>Not sure if they have videos anywhere.
16:52:00payeco joins
16:53:02payeco quits [Remote host closed the connection]
16:54:11jacksonchen666 quits [Ping timeout: 245 seconds]
16:58:30jacksonchen666 (jacksonchen666) joins
17:03:36Niklink quits [Ping timeout: 265 seconds]
17:19:03jacksonchen666 quits [Client Quit]
17:20:39Wingy (Wingy) joins
17:38:12Niklink joins
17:38:28jacksonchen666 (jacksonchen666) joins
17:50:52nicolas17 joins
17:56:38Guest50 quits [Client Quit]
17:58:54Guest50 joins
18:07:48Guest50 quits [Client Quit]
18:11:51Guest50 joins
18:49:41empress joins
18:53:05adamus1red quits [Quit: SigTerm]
18:55:46adamus1red (adamus1red) joins
19:05:46Guest50 quits [Client Quit]
19:06:14Guest50 joins
19:10:56umgr036 quits [Read error: Connection reset by peer]
19:11:18umgr036 joins
19:12:31jacksonchen666 quits [Ping timeout: 245 seconds]
19:13:17jacksonchen666 (jacksonchen666) joins
20:09:50jacksonchen666 quits [Remote host closed the connection]
20:10:37jacksonchen666 (jacksonchen666) joins
20:15:00Niklink quits [Ping timeout: 265 seconds]
20:15:23umgr036 quits [Read error: Connection reset by peer]
20:15:45umgr036 joins
21:06:26michaelblob quits [Read error: Connection reset by peer]
21:07:34lexikiq (lexikiq) joins
21:14:02michaelblob (michaelblob) joins
21:16:16DLoader quits [Remote host closed the connection]
21:17:32Niklink joins
21:20:25itter joins
21:21:21DLoader joins
21:22:03<itter>Hello guys. I feel like I might be in the wrong place, but I'm trying to find the result of the Google Video archive effort in 2011.
21:22:19<itter>https://wiki.archiveteam.org/index.php/Google_Video_(Archive)
21:23:33<itter>It seems like archive.org has the captures, but I can't figure out how to search for the docid of an old video on archive.org
21:38:01<pokechu22>itter: this is probably the right place, but I don't have an answer for you; hopefully someone else here does
21:44:10Guest50 quits [Client Quit]
21:44:32hitgrr8 quits [Client Quit]
21:44:46Guest50 joins
22:00:22Niklink quits [Ping timeout: 265 seconds]
22:12:24jamesp (jamesp) joins
22:14:26Niklink joins
22:17:51itter quits [Remote host closed the connection]
22:49:42<h2ibot>JustAnotherArchivist edited DPReview (+435, Add shutdown't, details on AB job, warrior →…): https://wiki.archiveteam.org/?diff=49687&oldid=49641
22:51:00icedice quits [Client Quit]
23:22:54pikablu joins
23:25:43cat joins
23:53:16Arcorann (Arcorann) joins