00:42:30hfuller (hfuller) joins
01:17:24Jake (Jake) joins
01:58:10Jake quits [Client Quit]
02:01:49Jake (Jake) joins
02:01:49Jake quits [Client Quit]
02:03:20Jake (Jake) joins
02:28:35hfuller quits [Ping timeout: 252 seconds]
02:35:13hfuller (hfuller) joins
05:39:30hfuller quits [Client Quit]
05:39:41hfuller (hfuller) joins
06:02:43qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
06:35:05datechnoman quits [Quit: The Lounge - https://thelounge.chat]
06:36:53datechnoman (datechnoman) joins
09:42:00qwertyasdfuiopghjkl quits [Client Quit]
09:44:03qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
09:46:02qwertyasdfuiopghjkl quits [Client Quit]
09:56:44qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
10:15:58pie joins
10:38:50pie quits [Remote host closed the connection]
10:38:55pie joins
12:11:51Ryz quits [Ping timeout: 258 seconds]
13:05:16SrainUser joins
13:06:43qwertyasdfuiopghjkl quits [Client Quit]
13:06:43pie quits [Remote host closed the connection]
13:54:37qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
15:57:51Ryz (Ryz) joins
17:13:30<SrainUser>hello, I made a dump of the Orange website directory, so I have (probably) all the URLs of the hosted websites with their metadata. I'm a nub at archiving, so what should I do next?
17:34:40<pokechu22>If you've got a list of urls, you could upload it to https://transfer.archivete.am/ and we can take a look at it
17:38:20<SrainUser>ok, are you also interested in the metadata? (it's in a sqlite3 db)
17:42:02<SrainUser>the urls are here: https://transfer.archivete.am/Y5Qsp/orange_isp_hosting_urls.txt
17:42:43<SrainUser>maybe some are dead, I didn't check, because some are listed as "deleted" in the directory but work anyway, so I just put everything in this file
17:54:34<pokechu22>Metadata could also be useful - there's not really any situation where more info is harmful at least :)
17:57:17<pokechu22>I'm not sure if that list is complete, since you have e.g. assoultz.monsite-orange.fr but not assoultz.monsite-orange.fr/competitionsunss/index.html (similarly, you have 4 listed for majasau.pagesperso-orange.fr but there seem to be more pages there, although it apparently uses JS links)
18:04:36saveallthejunk joins
19:04:13<SrainUser>I only listed what is in the directory (https://annuaire-pp.orange.fr/), so just the homepages; this is not a crawl of every HTML page.
19:04:44<pokechu22>Ah, ok
19:06:13<SrainUser>so in order to save every page you need to crawl the links, I guess?
19:08:01<pokechu22>Yeah, but that's something archivebot can do fairly easily for the most part (though sites like majasau.pagesperso-orange.fr might be a problem)
19:48:36DigitalDragons quits [Quit: Ping timeout (120 seconds)]
19:49:37DigitalDragons (DigitalDragons) joins
20:13:36saveallthejunk quits [Client Quit]
21:17:44SrainUser quits [Ping timeout: 252 seconds]
21:30:48colona (colona) joins