| 00:27:32 | <@arkiver> | processing your lists |
| 00:37:22 | <pokechu22> | It's also worth adding https://transfer.archivete.am/bE5jI/orangefr_raw.txt.zst and/or https://transfer.archivete.am/SB82D/orangefr_scrubbed.txt.zst from thuban (those are also in the 7z file, along with files that I derived from them that probably don't matter too much) |
| 00:38:11 | | kiryu (kiryu) joins |
| 00:41:29 | <@arkiver> | thank you! |
| 00:41:41 | <@arkiver> | it's not wise to start this now, but i think this is processed and ready |
| 00:41:44 | <@arkiver> | starting when i get up |
| 00:42:10 | <@arkiver> | we should be able to make it, plenty of second left to end of this month, and we should be able to get plenty of IPs on this too |
| 00:43:02 | <pokechu22> | How will links from JS be handled? Do you have the same aggressive extraction archivebot has? |
| 00:48:15 | <pokechu22> | (probably fine to not do anything since most of these are old sites, but still something to wonder about. There's also flash which is always a mess - I can take a look at any instances of it using ruffle if I get a list afterwards. I guess looking at .jar/.class files is also sometimes useful) |
| 00:49:35 | <@arkiver> | pokechu22: do you have some example of web pages with JS links? |
| 00:49:51 | <@arkiver> | i was not planning to extract them due to these being old sites |
| 00:49:56 | <pokechu22> | I don't on hand |
| 00:49:59 | <@arkiver> | but i could still look into aggressive extraction |
| 00:50:05 | <@arkiver> | i'll add it tomorrow |
| 00:50:10 | <pokechu22> | I do remember seeing some interesting-looking multimediay stuff though, one sec |
| 00:50:16 | <pokechu22> | should be in the logs for this channel |
| 00:50:42 | <fireonlive> | have a good sleep arkiver :3 |
| 00:51:45 | <pokechu22> | Right, https://majasau.pagesperso-orange.fr/ - you've got links like javascript:na_open_window('MEDIA2', 'INDEX22A.htm', 0, 0, 1920, 1080, 0, 0, 0, 0, 0) which resolves to https://majasau.pagesperso-orange.fr/INDEX22A.htm |
| 00:54:40 | <pokechu22> | that said, there's no limit to the weird things you can do with JS, and just flagging those sites for later investigation might help a fair bit too; archivebot's aggressive extraction doesn't always find everything useful (particularly with relative links) and also produces a lot of junk |
| 00:56:33 | <@arkiver> | thanks fireonlive |
| 00:56:42 | <@arkiver> | pokechu22: ouch those sucks... |
| 00:57:01 | <@arkiver> | yeah we will likely not extract those |
| 00:57:09 | <pokechu22> | and then there's whatever on earth this is: https://majasau.pagesperso-orange.fr/ECUREUIL/ECUR2.htm |
| 00:57:15 | <pokechu22> | probably knowing french would help some |
| 00:59:44 | <pokechu22> | sounds like that function's from something called "Namo WebEditor"; you *could* hardcode some of the relevant functions, but it probably would get complicated quickly :| |
| 01:00:22 | <@arkiver> | that will likely not happen |
| 01:00:33 | <fireonlive> | :) |
| 01:00:36 | <pokechu22> | view-source:https://majasau.pagesperso-orange.fr/pageDEFILmrt22.htm also has na_change_img_src to e.g. https://majasau.pagesperso-orange.fr/boutons/genie1.gif - yeah, not worth it right now |
| 01:00:49 | <pokechu22> | I'm just going to throw this one into archivebot and see what happens |
| 01:00:54 | <@arkiver> | alrigt! |
| 01:02:52 | <@JAA> | I also have no idea what's going on there, but that looks beautiful. Very 2000s. |
| 01:04:18 | <TheTechRobo> | looks like a game about acting like a squirrel? |
| 01:04:26 | <TheTechRobo> | my french knowledge is admittedly very basic |
| 01:07:55 | <@JAA> | Yeah, something like that, collecting hazelnuts I think. |
| 01:24:19 | | systwi__ (systwi) joins |
| 01:24:53 | | systwi quits [Ping timeout: 252 seconds] |
| 01:25:52 | <fireonlive> | gotta get that nut |
| 01:35:25 | | tzt quits [Read error: Connection reset by peer] |
| 01:36:49 | | tzt (tzt) joins |
| 01:42:07 | | Exorcism (exorcism) joins |
| 02:08:20 | | imer quits [Ping timeout: 252 seconds] |
| 03:05:23 | <project10> | I managed to get 20 nuts in the squirrel game before an eagle came and decimated my squirrel |
| 03:06:04 | | imer (imer) joins |
| 05:59:15 | | decky joins |
| 06:53:35 | | Matthww11 quits [Read error: Connection reset by peer] |
| 06:54:09 | | Matthww11 joins |
| 09:18:23 | <thuban> | thank you, arkiver! sorry i was afk for the discussion lol |
| 09:47:52 | | Peroniko joins |
| 09:48:14 | | Peroniko is now authenticated as Peroniko |
| 10:16:28 | | RetiredTurtle joins |
| 10:18:56 | | Peroniko quits [Ping timeout: 252 seconds] |
| 11:54:38 | | RetiredTurtle quits [Ping timeout: 252 seconds] |
| 17:29:35 | | imer quits [Ping timeout: 252 seconds] |
| 17:31:17 | | imer (imer) joins |
| 17:42:09 | | Maturion joins |
| 19:56:14 | | Chris5010 (Chris5010) joins |
| 22:46:02 | | decky_e_ joins |
| 22:49:26 | | decky quits [Ping timeout: 265 seconds] |
| 22:58:21 | | Maturion quits [Remote host closed the connection] |
| 23:22:18 | | Aoede quits [Ping timeout: 265 seconds] |
| 23:22:33 | | Aoede_ (Aoede) joins |