00:02:36etnguyen03 (etnguyen03) joins
00:11:53tekulvw (tekulvw) joins
00:14:07spark3 joins
00:14:21spark74568 joins
00:15:38spark74568 leaves
00:15:43spark3 leaves
00:16:07nepeat quits [Ping timeout: 272 seconds]
00:16:45tekulvw quits [Ping timeout: 272 seconds]
00:18:50nepeat (nepeat) joins
00:38:36Wohlstand quits [Quit: Wohlstand]
00:51:07etnguyen03 quits [Client Quit]
01:11:13mls quits [Ping timeout: 272 seconds]
01:13:37etnguyen03 (etnguyen03) joins
01:15:14ljcool2006 joins
01:15:19tekulvw (tekulvw) joins
01:20:05tekulvw quits [Ping timeout: 272 seconds]
01:23:51roverinexile joins
01:26:37tekulvw (tekulvw) joins
01:27:03rover quits [Ping timeout: 272 seconds]
01:31:26tekulvw quits [Ping timeout: 268 seconds]
01:32:42mls (mls) joins
01:37:49tekulvw (tekulvw) joins
01:42:53tekulvw quits [Ping timeout: 272 seconds]
02:17:49tekulvw (tekulvw) joins
02:26:19tekulvw quits [Ping timeout: 268 seconds]
02:42:25ducky quits [Ping timeout: 272 seconds]
02:43:17ducky (ducky) joins
02:43:18tekulvw (tekulvw) joins
02:46:04hackbug quits [Remote host closed the connection]
02:52:13tekulvw quits [Ping timeout: 268 seconds]
02:52:55hackbug (hackbug) joins
02:57:33tekulvw (tekulvw) joins
03:04:42<Cupping1285>nicolas17 you can use the internet archive extension to archive all of it.
03:06:33fireatseaparks (fireatseaparks) joins
03:08:23tekulvw quits [Ping timeout: 272 seconds]
03:12:27tekulvw (tekulvw) joins
03:13:04fireatseaparks quits [Client Quit]
03:16:17fireatseaparks (fireatseaparks) joins
03:17:15tekulvw quits [Ping timeout: 272 seconds]
03:22:31tekulvw (tekulvw) joins
03:30:33tekulvw quits [Ping timeout: 272 seconds]
03:36:15Juest quits [Ping timeout: 272 seconds]
03:41:36<Yakov>#archiveteam RE: https://tirespy.ca/ is shutting down: Looks like the search functionality is all clientsided and all the items are already preloaded before search at https://storage.googleapis.com/winged-record-376000.appspot.com/json/en_CA/index.json and
03:41:36<Yakov>https://storage.googleapis.com/winged-record-376000.appspot.com/json/en_CA/categories_flattened.json
03:42:26Juest (Juest) joins
03:44:25<Yakov>and for preserving all product images, you just need to generate a list of https://storage.googleapis.com/winged-record-376000.appspot.com/images/products/[LOWERCASEDPRODUCTIDHERE].png from the index.json
03:44:52<Yakov>I'll get a list going now.
03:54:37<Yakov>Waiting for list to be queued by an OP in #archivebot :)
03:58:58etnguyen03 quits [Remote host closed the connection]
04:01:17Juest quits [Ping timeout: 268 seconds]
04:10:31Juest (Juest) joins
04:18:41datechnoman quits [Ping timeout: 272 seconds]
04:19:11Wohlstand (Wohlstand) joins
04:37:37Island quits [Read error: Connection reset by peer]
04:43:02<h2ibot>Zen edited List of websites excluded from the Wayback Machine (+25, add https://www.2chan.net/): https://wiki.archiveteam.org/?diff=60543&oldid=60511
04:43:56Wohlstand quits [Client Quit]
04:45:43DogsRNice_ quits [Read error: Connection reset by peer]
04:52:16nicolas17 quits [Quit: Konversation terminated!]
04:55:05tekulvw (tekulvw) joins
04:59:52tekulvw quits [Ping timeout: 268 seconds]
05:04:48n9nes quits [Ping timeout: 268 seconds]
05:05:22n9nes joins
05:06:35beastbg8_ joins
05:09:21beastbg8 quits [Ping timeout: 272 seconds]
06:03:31datechnoman (datechnoman) joins
06:16:45<hexagonwin>OrIdow6 thanks for the suggestion, but it's a custom python based scraper with automated chrome browser
06:17:37nexussfan quits [Quit: Konversation terminated!]
06:17:56tekulvw (tekulvw) joins
06:19:51<hexagonwin>for now i'll just do it the semi-manual way..
06:22:49tekulvw quits [Ping timeout: 272 seconds]
06:45:55tekulvw (tekulvw) joins
06:50:41tekulvw quits [Ping timeout: 272 seconds]
07:14:54pokechu22 quits [Quit: System maintenance]
07:30:03ljcool2006_ joins
07:31:34ljcool2006 quits [Ping timeout: 268 seconds]
07:48:02pokechu22 (pokechu22) joins
07:55:17tekulvw (tekulvw) joins
07:57:17ducky_ (ducky) joins
07:59:19ducky quits [Ping timeout: 268 seconds]
07:59:20ducky_ is now known as ducky
08:11:02emphie quits [Ping timeout: 268 seconds]
08:34:22emphie joins
08:53:14APOLLO03 joins
08:54:12CraftByte quits [Quit: Ping timeout (120 seconds)]
08:54:39CraftByte (DragonSec|CraftByte) joins
09:09:51APOLLO03 quits [Client Quit]
09:10:06APOLLO03 joins
09:23:19tekulvw quits [Ping timeout: 272 seconds]
09:35:42ZoeB joins
09:38:46<ZoeB>Hi! There's a website I'd like to nominate for (slow, careful) archiving, but it's tricky, as it involves getting sequentially numbered pages that aren't linked to (which curl is generally best at, to my understanding) then getting their contents (which seems more of a wget thing). Does anyone know how to do this?
09:40:59<ZoeB>I guess I can automatically generate an --input-file for wget with the necessary thousands of sequentialy numbered pages..?
09:42:56APOLLO03 quits [Client Quit]
09:50:08APOLLO03 joins
09:59:56TheEnbyperor_ quits [Remote host closed the connection]
09:59:56TheEnbyperor quits [Remote host closed the connection]
10:01:12<pabs>ZoeB: ArchiveBot can do that, and the results will go to web.archive.org. whats the site?
10:02:34<ZoeB>http://spheremusic.com , an auction site for synthesiser gear that's sold off equipment by many notable musicians. Most of the pages aren't linked anywhere, but starts at https://spheremusic.com/Bargaindtl.asp?Item=1 , with pictures starting at https://spheremusic.com/Bargaindtl.asp?Item=3495 and http://spheremusic.com/userimages/Img3495.jpg .
10:03:11<ZoeB>I don't want to DDOS the poor site, but it's worth noting there's talk of their next batch of auctions possibly being the last one as the original founder's retiring.
10:04:19<ZoeB>That's in April.
10:05:30<pabs>should we wait until after that perhaps?
10:09:12<pabs>do you know what the largest item number is?
10:09:30tekulvw (tekulvw) joins
10:10:11petrichor (petrichor) joins
10:14:18TheEnbyperor joins
10:14:37tekulvw quits [Ping timeout: 272 seconds]
10:16:11TheEnbyperor_ (TheEnbyperor) joins
10:25:14APOLLO03 quits [Client Quit]
10:25:59APOLLO03 joins
10:51:36tekulvw (tekulvw) joins
10:55:09@imer quits [Quit: Oh no]
10:56:25tekulvw quits [Ping timeout: 272 seconds]
11:00:34APOLLO03 quits [Client Quit]
11:01:26APOLLO03 joins
11:15:52<h2ibot>Manu edited Discourse/archived (+96, Queued devforum.play.date): https://wiki.archiveteam.org/?diff=60544&oldid=60542
11:20:08<ZoeB>I'm not sure how much longer the site's going to stick around after that... Certainly it might be worth waiting until it starts.
11:21:34APOLLO03 quits [Client Quit]
11:21:47APOLLO03 joins
11:22:29<ZoeB>I'm not sure how high it goes... The highest I looked at during their last auction was https://spheremusic.com/Bargaindtl.asp?Item=30407
11:23:27<ZoeB>This is spanning 25 years of seasonal auctions.
11:35:32<pabs>and the item numbers are sequential?
11:35:48<ZoeB>As far as I know, it certainly looks that way very much.
11:36:13<ZoeB>I don't think they've changed the code since the millennium.
11:37:24<pabs>so /userimages/ is linked from each auction page, so will be auto-discovered
11:38:25<ZoeB>It should be, yes. The only tricky part as far as regular wget is concerned is finding the pages themselves, which although sequential don't seem to be linked to from anywhere anymore once the season ends... even though other websites link to noteworthy pages.
11:38:59<ZoeB>As you've got that covered, it should hopefully be pretty straightforward, I'd imagine.
11:39:06<pabs>that should be easy, just generate the full list and add it to the ArchiveBot URL list
11:39:09APOLLO03 quits [Client Quit]
11:39:41APOLLO03 joins
11:39:46<pabs>fun, the img src URLs use the wrong slashes, but AB seems to cope with that
11:40:54<pabs>do we have a date for adding it to https://wiki.archiveteam.org/index.php/Deathwatch
11:41:14TheEnbyperor_ quits [Read error: Connection reset by peer]
11:42:10linuxgemini7 (linuxgemini) joins
11:43:10linuxgemini quits [Ping timeout: 268 seconds]
11:43:11linuxgemini7 is now known as linuxgemini
11:45:42TheEnbyperor_ (TheEnbyperor) joins
11:58:02APOLLO03 quits [Client Quit]
12:00:01Bleo1826007227196234552220 quits [Quit: The Lounge - https://thelounge.chat]
12:02:35APOLLO03 joins
12:02:43Bleo1826007227196234552220 joins
12:03:37imer (imer) joins
12:03:37@ChanServ sets mode: +o imer
12:05:35APOLLO03 quits [Client Quit]
12:09:22APOLLO03 joins
12:13:08SootBect1 quits [Remote host closed the connection]
12:14:18SootBector (SootBector) joins
12:16:39@imer quits [Client Quit]
12:16:43APOLLO03a joins
12:16:51APOLLO03 quits [Ping timeout: 272 seconds]
12:17:05imer (imer) joins
12:17:06@ChanServ sets mode: +o imer
12:18:50<ZoeB>Let's see... the final auctions start on April 4 and end on April 11.
12:19:28<ZoeB>I have no idea if they'll pull the plug right there and then, or a month later, or a year.
12:20:31<ZoeB>They might add more items halfway through that active week, though I'm mostly concerned about the 25 years of history.
12:22:20<ZoeB>e.g. if it's possible to spread out the first 30,000 items to, say, 1,000 a day before then, that might be better..?
12:28:00<ZoeB>Thank you for your help, by the way! I'd have done a worse job of it by myself, I'm sure.
12:28:07<ZoeB>It's nice to know it'll be done right.
12:31:51APOLLO03a quits [Client Quit]
12:32:34APOLLO03 joins
12:47:56@imer quits [Client Quit]
12:48:25imer (imer) joins
12:48:25@ChanServ sets mode: +o imer
12:53:06SootBector quits [Remote host closed the connection]
12:54:15SootBector (SootBector) joins
12:56:15notSokar quits [Remote host closed the connection]
12:57:33Sokar joins
13:10:44Arcorann__ quits [Ping timeout: 268 seconds]
13:21:27roverinexile quits [Ping timeout: 272 seconds]
13:21:46rover joins
13:50:41<pabs>ZoeB: maybe its best to do the whole thing now, and then do the few missing pages after the last auctions
13:50:55<pabs>I'll look at it tomorrow if I get a chance
13:52:13<ZoeB>That sounds good. Thank you!
14:13:46sec^nd quits [Remote host closed the connection]
14:14:04sec^nd (second) joins
14:22:16<h2ibot>Sanqui edited Deathwatch (+299, Add Deutsch für dich): https://wiki.archiveteam.org/?diff=60545&oldid=60537
14:22:22SootBector quits [Remote host closed the connection]
14:23:16<h2ibot>Sanqui edited Deathwatch (-3, typo): https://wiki.archiveteam.org/?diff=60546&oldid=60545
14:23:34SootBector (SootBector) joins
14:23:49tekulvw (tekulvw) joins
14:28:35tekulvw quits [Ping timeout: 272 seconds]
14:30:23APOLLO03 quits [Client Quit]
14:31:36APOLLO03 joins