03:29:37 | | steering is now authenticated as * |
03:29:45 | | steering is now known as Erebus |
03:30:18 | | Erebus quits [Remote host closed the connection] |
03:30:37 | | steering (steering) joins |
07:41:08 | | qwertyasdfuiopghjkl69 joins |
07:44:10 | | qwertyasdfuiopghjkl quits [Ping timeout: 255 seconds] |
10:22:23 | | Xanthos joins |
10:22:23 | | Xanthon quits [Read error: Connection reset by peer] |
10:22:30 | | Xanthos is now known as Xanthon |
10:22:30 | | Xanthon is now authenticated as Xanthon |
10:22:30 | | Xanthon quits [Changing host] |
10:22:30 | | Xanthon (Xanthon) joins |
10:49:56 | | Xanthos joins |
10:49:56 | | Xanthon quits [Read error: Connection reset by peer] |
10:50:03 | | Xanthos is now known as Xanthon |
10:50:03 | | Xanthon is now authenticated as Xanthon |
10:50:03 | | Xanthon quits [Changing host] |
10:50:03 | | Xanthon (Xanthon) joins |
11:01:08 | | Matthww quits [Quit: The Lounge - https://thelounge.chat] |
11:03:05 | | benjins2 quits [Ping timeout: 258 seconds] |
11:03:27 | | benjins quits [Ping timeout: 258 seconds] |
11:20:29 | | Matthww joins |
11:34:28 | | Xanthos joins |
11:34:28 | | Xanthon quits [Read error: Connection reset by peer] |
11:34:35 | | Xanthos is now known as Xanthon |
11:34:35 | | Xanthon is now authenticated as Xanthon |
11:34:35 | | Xanthon quits [Changing host] |
11:34:35 | | Xanthon (Xanthon) joins |
14:03:39 | | benjins joins |
14:46:08 | | f_ quits [Remote host closed the connection] |
16:10:55 | | f_ (funderscore) joins |
16:11:14 | | f_ quits [Read error: Connection reset by peer] |
16:21:38 | | f_ (funderscore) joins |
16:29:09 | | wiki_guy joins |
16:32:22 | <wiki_guy> | hello!! sadly i mantain a url shortener for a kinda big mediawiki project and we have 10k+ urls. i have exported CSV backups to archive.org and in the past others have helped me create a "beacon file" but i don't know what it is or how to recreate it. what's the best way to preserve the links i have? is a csv enough ("id, url")? |
16:33:28 | <wiki_guy> | its s.wikicharlie.cl by the way, already on the urlteam wiki page. ive been keeping it online for like two years now. feel free to highlight me for ideas or whatevers |
16:46:54 | <TheTechRobo> | wiki_guy: I think that's a fine way to do it, although someone more knowledgeable than me might have some more guidance. |
17:06:51 | | qwertyasdfuiopghjkl33 joins |
17:08:55 | | qwertyasdfuiopghjkl69 quits [Ping timeout: 255 seconds] |
17:43:10 | | Xanthos joins |
17:45:12 | | Xanthon quits [Ping timeout: 258 seconds] |
17:45:12 | | Xanthos is now known as Xanthon |
17:45:14 | | Xanthon is now authenticated as Xanthon |
17:45:14 | | Xanthon quits [Changing host] |
17:45:14 | | Xanthon (Xanthon) joins |
18:30:03 | | Chris5010 quits [Quit: Ping timeout (120 seconds)] |
18:30:50 | | Chris5010 (Chris5010) joins |
19:19:37 | | JaffaCakes118 quits [Remote host closed the connection] |
19:21:48 | | JaffaCakes118 (JaffaCakes118) joins |
20:19:33 | | JaffaCakes118 quits [Remote host closed the connection] |
20:23:19 | <@JAA> | wiki_guy: Yeah, a CSV is fine. BEACON is similar but uses vertical bars as separators. |
20:23:41 | <@JAA> | There's also a header, and some special processing is needed if your targets contain vertical bars. |
20:23:47 | <@JAA> | Full spec is here: https://gbv.github.io/beaconspec/beacon.html |
20:25:50 | <@JAA> | If a service is shutting down, we can also do a complete archive as WARC so it is all available in the Wayback Machine afterwards. For that, we'd just need the short URL format and either a list of shortcodes or (if they're sequential) the alphabet and upper bound. We'd then retrieve each shortcode once from our side; this isn't something you can do because uploading into the WBM is restricted. |
21:48:55 | | JaffaCakes118 (JaffaCakes118) joins |
22:31:22 | <wiki_guy> | thanks guys, i'll do a csv and wait until wayback machine is back online. |
22:31:37 | <wiki_guy> | for the warc, do i run it and you upload it? or do i just provide a full list of short urls and you can create it? |
22:33:31 | <@JAA> | The latter |
23:13:57 | <wiki_guy> | well that was a pain. i removed 1434 links that were obviously spam, sql injection attempts or just phishing forms |
23:14:05 | <wiki_guy> | i don't like running a url shortener |
23:25:08 | | benjins2 joins |
23:34:50 | <wiki_guy> | https://s.wikicharlie.cl/database/ i've set up a page to store the dumps. ranges go from 1 to 12999 ("1" to "3nF"), sequentially, using the 0-9a-zA-Z character set. |
23:39:52 | <wiki_guy> | i updated the wiki too but got sent to a moderation queue :D |
23:57:52 | <pabs> | wiki_guy: how big is the wiki? we have #wikibot for archiving them btw https://wikicharlie.cl/w/P%A1gina_principal |