| 00:02:36 | | etnguyen03 (etnguyen03) joins |
| 00:11:53 | | tekulvw (tekulvw) joins |
| 00:14:07 | | spark3 joins |
| 00:14:21 | | spark74568 joins |
| 00:15:38 | | spark74568 leaves |
| 00:15:43 | | spark3 leaves |
| 00:16:07 | | nepeat quits [Ping timeout: 272 seconds] |
| 00:16:45 | | tekulvw quits [Ping timeout: 272 seconds] |
| 00:18:50 | | nepeat (nepeat) joins |
| 00:38:36 | | Wohlstand quits [Quit: Wohlstand] |
| 00:51:07 | | etnguyen03 quits [Client Quit] |
| 01:11:13 | | mls quits [Ping timeout: 272 seconds] |
| 01:13:37 | | etnguyen03 (etnguyen03) joins |
| 01:15:14 | | ljcool2006 joins |
| 01:15:19 | | tekulvw (tekulvw) joins |
| 01:20:05 | | tekulvw quits [Ping timeout: 272 seconds] |
| 01:23:51 | | roverinexile joins |
| 01:26:37 | | tekulvw (tekulvw) joins |
| 01:27:03 | | rover quits [Ping timeout: 272 seconds] |
| 01:31:26 | | tekulvw quits [Ping timeout: 268 seconds] |
| 01:32:42 | | mls (mls) joins |
| 01:37:49 | | tekulvw (tekulvw) joins |
| 01:42:53 | | tekulvw quits [Ping timeout: 272 seconds] |
| 02:17:49 | | tekulvw (tekulvw) joins |
| 02:26:19 | | tekulvw quits [Ping timeout: 268 seconds] |
| 02:42:25 | | ducky quits [Ping timeout: 272 seconds] |
| 02:43:17 | | ducky (ducky) joins |
| 02:43:18 | | tekulvw (tekulvw) joins |
| 02:46:04 | | hackbug quits [Remote host closed the connection] |
| 02:52:13 | | tekulvw quits [Ping timeout: 268 seconds] |
| 02:52:55 | | hackbug (hackbug) joins |
| 02:57:33 | | tekulvw (tekulvw) joins |
| 03:04:42 | <Cupping1285> | nicolas17 you can use the internet archive extension to archive all of it. |
| 03:06:33 | | fireatseaparks (fireatseaparks) joins |
| 03:08:23 | | tekulvw quits [Ping timeout: 272 seconds] |
| 03:12:27 | | tekulvw (tekulvw) joins |
| 03:13:04 | | fireatseaparks quits [Client Quit] |
| 03:16:17 | | fireatseaparks (fireatseaparks) joins |
| 03:17:15 | | tekulvw quits [Ping timeout: 272 seconds] |
| 03:22:31 | | tekulvw (tekulvw) joins |
| 03:30:33 | | tekulvw quits [Ping timeout: 272 seconds] |
| 03:36:15 | | Juest quits [Ping timeout: 272 seconds] |
| 03:41:36 | <Yakov> | #archiveteam RE: https://tirespy.ca/ is shutting down: Looks like the search functionality is entirely client-side, and all the items are already preloaded before search at https://storage.googleapis.com/winged-record-376000.appspot.com/json/en_CA/index.json and |
| 03:41:36 | <Yakov> | https://storage.googleapis.com/winged-record-376000.appspot.com/json/en_CA/categories_flattened.json |
| 03:42:26 | | Juest (Juest) joins |
| 03:44:25 | <Yakov> | and for preserving all product images, you just need to generate a list of https://storage.googleapis.com/winged-record-376000.appspot.com/images/products/[LOWERCASEDPRODUCTIDHERE].png from the index.json |
| 03:44:52 | <Yakov> | I'll get a list going now. |
| 03:54:37 | <Yakov> | Waiting for list to be queued by an OP in #archivebot :) |
| 03:58:58 | | etnguyen03 quits [Remote host closed the connection] |
| 04:01:17 | | Juest quits [Ping timeout: 268 seconds] |
| 04:10:31 | | Juest (Juest) joins |
| 04:18:41 | | datechnoman quits [Ping timeout: 272 seconds] |
| 04:19:11 | | Wohlstand (Wohlstand) joins |
| 04:37:37 | | Island quits [Read error: Connection reset by peer] |
| 04:43:02 | <h2ibot> | Zen edited List of websites excluded from the Wayback Machine (+25, add https://www.2chan.net/): https://wiki.archiveteam.org/?diff=60543&oldid=60511 |
| 04:43:56 | | Wohlstand quits [Client Quit] |
| 04:45:43 | | DogsRNice_ quits [Read error: Connection reset by peer] |
| 04:52:16 | | nicolas17 quits [Quit: Konversation terminated!] |
| 04:55:05 | | tekulvw (tekulvw) joins |
| 04:59:52 | | tekulvw quits [Ping timeout: 268 seconds] |
| 05:04:48 | | n9nes quits [Ping timeout: 268 seconds] |
| 05:05:22 | | n9nes joins |
| 05:06:35 | | beastbg8_ joins |
| 05:09:21 | | beastbg8 quits [Ping timeout: 272 seconds] |
| 06:03:31 | | datechnoman (datechnoman) joins |
| 06:16:45 | <hexagonwin> | OrIdow6 thanks for the suggestion, but it's a custom Python-based scraper with an automated Chrome browser |
| 06:17:37 | | nexussfan quits [Quit: Konversation terminated!] |
| 06:17:56 | | tekulvw (tekulvw) joins |
| 06:19:51 | <hexagonwin> | for now i'll just do it the semi-manual way.. |
| 06:22:49 | | tekulvw quits [Ping timeout: 272 seconds] |
| 06:45:55 | | tekulvw (tekulvw) joins |
| 06:50:41 | | tekulvw quits [Ping timeout: 272 seconds] |
| 07:14:54 | | pokechu22 quits [Quit: System maintenance] |
| 07:30:03 | | ljcool2006_ joins |
| 07:31:34 | | ljcool2006 quits [Ping timeout: 268 seconds] |
| 07:48:02 | | pokechu22 (pokechu22) joins |
| 07:55:17 | | tekulvw (tekulvw) joins |
| 07:57:17 | | ducky_ (ducky) joins |
| 07:59:19 | | ducky quits [Ping timeout: 268 seconds] |
| 07:59:20 | | ducky_ is now known as ducky |
| 08:11:02 | | emphie quits [Ping timeout: 268 seconds] |
| 08:34:22 | | emphie joins |
| 08:53:14 | | APOLLO03 joins |
| 08:54:12 | | CraftByte quits [Quit: Ping timeout (120 seconds)] |
| 08:54:39 | | CraftByte (DragonSec|CraftByte) joins |
| 09:09:51 | | APOLLO03 quits [Client Quit] |
| 09:10:06 | | APOLLO03 joins |
| 09:23:19 | | tekulvw quits [Ping timeout: 272 seconds] |
| 09:35:42 | | ZoeB joins |
| 09:38:46 | <ZoeB> | Hi! There's a website I'd like to nominate for (slow, careful) archiving, but it's tricky, as it involves getting sequentially numbered pages that aren't linked to (which curl is generally best at, to my understanding) then getting their contents (which seems more of a wget thing). Does anyone know how to do this? |
| 09:40:59 | <ZoeB> | I guess I can automatically generate an --input-file for wget with the necessary thousands of sequentially numbered pages..? |
| 09:42:56 | | APOLLO03 quits [Client Quit] |
| 09:50:08 | | APOLLO03 joins |
| 09:59:56 | | TheEnbyperor_ quits [Remote host closed the connection] |
| 09:59:56 | | TheEnbyperor quits [Remote host closed the connection] |
| 10:01:12 | <pabs> | ZoeB: ArchiveBot can do that, and the results will go to web.archive.org. What's the site? |
| 10:02:34 | <ZoeB> | http://spheremusic.com , an auction site for synthesiser gear that's sold off equipment by many notable musicians. Most of the pages aren't linked anywhere, but they start at https://spheremusic.com/Bargaindtl.asp?Item=1 , with pictures starting at https://spheremusic.com/Bargaindtl.asp?Item=3495 and http://spheremusic.com/userimages/Img3495.jpg . |
| 10:03:11 | <ZoeB> | I don't want to DDOS the poor site, but it's worth noting there's talk of their next batch of auctions possibly being the last one as the original founder's retiring. |
| 10:04:19 | <ZoeB> | That's in April. |
| 10:05:30 | <pabs> | should we wait until after that perhaps? |
| 10:09:12 | <pabs> | do you know what the largest item number is? |
| 10:09:30 | | tekulvw (tekulvw) joins |
| 10:10:11 | | petrichor (petrichor) joins |
| 10:14:18 | | TheEnbyperor joins |
| 10:14:37 | | tekulvw quits [Ping timeout: 272 seconds] |
| 10:16:11 | | TheEnbyperor_ (TheEnbyperor) joins |
| 10:25:14 | | APOLLO03 quits [Client Quit] |
| 10:25:59 | | APOLLO03 joins |
| 10:51:36 | | tekulvw (tekulvw) joins |
| 10:55:09 | | @imer quits [Quit: Oh no] |
| 10:56:25 | | tekulvw quits [Ping timeout: 272 seconds] |
| 11:00:34 | | APOLLO03 quits [Client Quit] |
| 11:01:26 | | APOLLO03 joins |
| 11:15:52 | <h2ibot> | Manu edited Discourse/archived (+96, Queued devforum.play.date): https://wiki.archiveteam.org/?diff=60544&oldid=60542 |
| 11:20:08 | <ZoeB> | I'm not sure how much longer the site's going to stick around after that... Certainly it might be worth waiting until it starts. |
| 11:21:34 | | APOLLO03 quits [Client Quit] |
| 11:21:47 | | APOLLO03 joins |
| 11:22:29 | <ZoeB> | I'm not sure how high it goes... The highest I looked at during their last auction was https://spheremusic.com/Bargaindtl.asp?Item=30407 |
| 11:23:27 | <ZoeB> | This is spanning 25 years of seasonal auctions. |
| 11:35:32 | <pabs> | and the item numbers are sequential? |
| 11:35:48 | <ZoeB> | As far as I know, yes; it certainly looks that way. |
| 11:36:13 | <ZoeB> | I don't think they've changed the code since the millennium. |
| 11:37:24 | <pabs> | so /userimages/ is linked from each auction page, so will be auto-discovered |
| 11:38:25 | <ZoeB> | It should be, yes. The only tricky part as far as regular wget is concerned is finding the pages themselves, which although sequential don't seem to be linked to from anywhere anymore once the season ends... even though other websites link to noteworthy pages. |
| 11:38:59 | <ZoeB> | As you've got that covered, it should hopefully be pretty straightforward, I'd imagine. |
| 11:39:06 | <pabs> | that should be easy, just generate the full list and add it to the ArchiveBot URL list |
| 11:39:09 | | APOLLO03 quits [Client Quit] |
| 11:39:41 | | APOLLO03 joins |
| 11:39:46 | <pabs> | fun, the img src URLs use the wrong slashes, but AB seems to cope with that |
| 11:40:54 | <pabs> | do we have a date for adding it to https://wiki.archiveteam.org/index.php/Deathwatch |
| 11:41:14 | | TheEnbyperor_ quits [Read error: Connection reset by peer] |
| 11:42:10 | | linuxgemini7 (linuxgemini) joins |
| 11:43:10 | | linuxgemini quits [Ping timeout: 268 seconds] |
| 11:43:11 | | linuxgemini7 is now known as linuxgemini |
| 11:45:42 | | TheEnbyperor_ (TheEnbyperor) joins |
| 11:58:02 | | APOLLO03 quits [Client Quit] |
| 12:00:01 | | Bleo1826007227196234552220 quits [Quit: The Lounge - https://thelounge.chat] |
| 12:02:35 | | APOLLO03 joins |
| 12:02:43 | | Bleo1826007227196234552220 joins |
| 12:03:37 | | imer (imer) joins |
| 12:03:37 | | @ChanServ sets mode: +o imer |
| 12:05:35 | | APOLLO03 quits [Client Quit] |
| 12:09:22 | | APOLLO03 joins |
| 12:13:08 | | SootBect1 quits [Remote host closed the connection] |
| 12:14:18 | | SootBector (SootBector) joins |
| 12:16:39 | | @imer quits [Client Quit] |
| 12:16:43 | | APOLLO03a joins |
| 12:16:51 | | APOLLO03 quits [Ping timeout: 272 seconds] |
| 12:17:05 | | imer (imer) joins |
| 12:17:06 | | @ChanServ sets mode: +o imer |
| 12:18:50 | <ZoeB> | Let's see... the final auctions start on April 4 and end on April 11. |
| 12:19:28 | <ZoeB> | I have no idea if they'll pull the plug right there and then, or a month later, or a year. |
| 12:20:31 | <ZoeB> | They might add more items halfway through that active week, though I'm mostly concerned about the 25 years of history. |
| 12:22:20 | <ZoeB> | e.g. if it's possible to spread out the first 30,000 items to, say, 1,000 a day before then, that might be better..? |
| 12:28:00 | <ZoeB> | Thank you for your help, by the way! I'd have done a worse job of it by myself, I'm sure. |
| 12:28:07 | <ZoeB> | It's nice to know it'll be done right. |
| 12:31:51 | | APOLLO03a quits [Client Quit] |
| 12:32:34 | | APOLLO03 joins |
| 12:47:56 | | @imer quits [Client Quit] |
| 12:48:25 | | imer (imer) joins |
| 12:48:25 | | @ChanServ sets mode: +o imer |
| 12:53:06 | | SootBector quits [Remote host closed the connection] |
| 12:54:15 | | SootBector (SootBector) joins |
| 12:56:15 | | notSokar quits [Remote host closed the connection] |
| 12:57:33 | | Sokar joins |
| 13:10:44 | | Arcorann__ quits [Ping timeout: 268 seconds] |
| 13:21:27 | | roverinexile quits [Ping timeout: 272 seconds] |
| 13:21:46 | | rover joins |
| 13:50:41 | <pabs> | ZoeB: maybe it's best to do the whole thing now, and then do the few missing pages after the last auctions |
| 13:50:55 | <pabs> | I'll look at it tomorrow if I get a chance |
| 13:52:13 | <ZoeB> | That sounds good. Thank you! |
| 14:13:46 | | sec^nd quits [Remote host closed the connection] |
| 14:14:04 | | sec^nd (second) joins |
| 14:22:16 | <h2ibot> | Sanqui edited Deathwatch (+299, Add Deutsch für dich): https://wiki.archiveteam.org/?diff=60545&oldid=60537 |
| 14:22:22 | | SootBector quits [Remote host closed the connection] |
| 14:23:16 | <h2ibot> | Sanqui edited Deathwatch (-3, typo): https://wiki.archiveteam.org/?diff=60546&oldid=60545 |
| 14:23:34 | | SootBector (SootBector) joins |
| 14:23:49 | | tekulvw (tekulvw) joins |
| 14:28:35 | | tekulvw quits [Ping timeout: 272 seconds] |
| 14:30:23 | | APOLLO03 quits [Client Quit] |
| 14:31:36 | | APOLLO03 joins |
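
Two URL lists are discussed in the log above: the sequentially numbered spheremusic.com item pages, and the tirespy.ca product images keyed by lowercased product ID. A minimal sketch of how such lists might be generated follows. The URL patterns are copied from the log; the function names, the output file name, and the idea of passing product IDs in as plain strings are assumptions for illustration (the log does not describe the layout of index.json, so extracting the IDs from it is left out).

```python
from typing import Iterable, Iterator

# URL patterns copied from the log; everything else here is hypothetical.
ITEM_URL = "https://spheremusic.com/Bargaindtl.asp?Item={}"
IMAGE_URL = (
    "https://storage.googleapis.com/winged-record-376000.appspot.com"
    "/images/products/{}.png"
)


def item_urls(first: int, last: int) -> Iterator[str]:
    """Yield one auction page URL per item number, first..last inclusive."""
    for number in range(first, last + 1):
        yield ITEM_URL.format(number)


def image_urls(product_ids: Iterable[str]) -> Iterator[str]:
    """Yield one image URL per product ID, lowercased as described in the log."""
    for product_id in product_ids:
        yield IMAGE_URL.format(product_id.lower())


def write_list(urls: Iterable[str], path: str) -> int:
    """Write one URL per line to path and return how many were written."""
    count = 0
    with open(path, "w", encoding="utf-8") as handle:
        for url in urls:
            handle.write(url + "\n")
            count += 1
    return count
```

For example, `write_list(item_urls(1, 30407), "items.txt")` would produce a file that could be handed to ArchiveBot as a URL list or to `wget --input-file=items.txt`. Note that 30407 is only the highest item number mentioned in the log, not a confirmed maximum, so the upper bound would need checking before a real run.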