03:29:45steering is now known as Erebus
03:30:18Erebus quits [Remote host closed the connection]
03:30:37steering (steering) joins
07:41:08qwertyasdfuiopghjkl69 joins
07:44:10qwertyasdfuiopghjkl quits [Ping timeout: 255 seconds]
10:22:23Xanthos joins
10:22:23Xanthon quits [Read error: Connection reset by peer]
10:22:30Xanthos is now known as Xanthon
10:22:30Xanthon quits [Changing host]
10:22:30Xanthon (Xanthon) joins
10:49:56Xanthos joins
10:49:56Xanthon quits [Read error: Connection reset by peer]
10:50:03Xanthos is now known as Xanthon
10:50:03Xanthon quits [Changing host]
10:50:03Xanthon (Xanthon) joins
11:01:08Matthww quits [Quit: The Lounge - https://thelounge.chat]
11:03:05benjins2 quits [Ping timeout: 258 seconds]
11:03:27benjins quits [Ping timeout: 258 seconds]
11:20:29Matthww joins
11:34:28Xanthos joins
11:34:28Xanthon quits [Read error: Connection reset by peer]
11:34:35Xanthos is now known as Xanthon
11:34:35Xanthon quits [Changing host]
11:34:35Xanthon (Xanthon) joins
14:03:39benjins joins
14:46:08f_ quits [Remote host closed the connection]
16:10:55f_ (funderscore) joins
16:11:14f_ quits [Read error: Connection reset by peer]
16:21:38f_ (funderscore) joins
16:29:09wiki_guy joins
16:32:22<wiki_guy>hello!! sadly i mantain a url shortener for a kinda big mediawiki project and we have 10k+ urls. i have exported CSV backups to archive.org and in the past others have helped me create a "beacon file" but i don't know what it is or how to recreate it. what's the best way to preserve the links i have? is a csv enough ("id, url")?
16:33:28<wiki_guy>its s.wikicharlie.cl by the way, already on the urlteam wiki page. ive been keeping it online for like two years now. feel free to highlight me for ideas or whatevers
16:46:54<TheTechRobo>wiki_guy: I think that's a fine way to do it, although someone more knowledgeable than me might have some more guidance.
17:06:51qwertyasdfuiopghjkl33 joins
17:08:55qwertyasdfuiopghjkl69 quits [Ping timeout: 255 seconds]
17:43:10Xanthos joins
17:45:12Xanthon quits [Ping timeout: 258 seconds]
17:45:12Xanthos is now known as Xanthon
17:45:14Xanthon quits [Changing host]
17:45:14Xanthon (Xanthon) joins
18:30:03Chris5010 quits [Quit: Ping timeout (120 seconds)]
18:30:50Chris5010 (Chris5010) joins
19:19:37JaffaCakes118 quits [Remote host closed the connection]
19:21:48JaffaCakes118 (JaffaCakes118) joins
20:19:33JaffaCakes118 quits [Remote host closed the connection]
20:23:19<@JAA>wiki_guy: Yeah, a CSV is fine. BEACON is similar but uses vertical bars as separators.
20:23:41<@JAA>There's also a header, and some special processing is needed if your targets contain vertical bars.
20:23:47<@JAA>Full spec is here: https://gbv.github.io/beaconspec/beacon.html
20:25:50<@JAA>If a service is shutting down, we can also do a complete archive as WARC so it is all available in the Wayback Machine afterwards. For that, we'd just need the short URL format and either a list of shortcodes or (if they're sequential) the alphabet and upper bound. We'd then retrieve each shortcode once from our side; this isn't something you can do because uploading into the WBM is restricted.
21:48:55JaffaCakes118 (JaffaCakes118) joins
22:31:22<wiki_guy>thanks guys, i'll do a csv and wait until wayback machine is back online.
22:31:37<wiki_guy>for the warc, do i run it and you upload it? or do i just provide a full list of short urls and you can create it?
22:33:31<@JAA>The latter
23:13:57<wiki_guy>well that was a pain. i removed 1434 links that were obviously spam, sql injection attempts or just phishing forms
23:14:05<wiki_guy>i don't like running a url shortener
23:25:08benjins2 joins
23:34:50<wiki_guy>https://s.wikicharlie.cl/database/ i've set up a page to store the dumps. ranges go from 1 to 12999 ("1" to "3nF"), sequentially, using the 0-9a-zA-Z character set.
23:39:52<wiki_guy>i updated the wiki too but got sent to a moderation queue :D
23:57:52<pabs>wiki_guy: how big is the wiki? we have #wikibot for archiving them btw https://wikicharlie.cl/w/P%A1gina_principal