00:08:46 | | nepeat quits [Ping timeout: 255 seconds] |
00:12:00 | | nepeat (nepeat) joins |
00:20:09 | | Notrealname1234 (Notrealname1234) joins |
00:29:02 | | Notrealname1234 quits [Client Quit] |
00:35:11 | | nic8693 quits [Read error: Connection reset by peer] |
00:36:51 | | etnguyen03 quits [Client Quit] |
00:53:43 | | etnguyen03 (etnguyen03) joins |
01:15:22 | | sludge__ quits [Ping timeout: 255 seconds] |
01:29:13 | | kiryu__ quits [Read error: Connection reset by peer] |
01:36:08 | <pabs> | AK: yeah, I co-ordinated with the OSM admins on #osm-dev (OFTC) to get it saved, some discussion here too https://github.com/openstreetmap/operations/issues/149 |
01:39:18 | <nicolas17> | this is the second time I see someone ask about it, maybe we do need a wiki page |
01:48:53 | | sludge joins |
02:57:39 | | Arcorann (Arcorann) joins |
03:19:10 | | nic86931 (nic) joins |
03:31:24 | <nicolas17> | pabs: https://archive.org/details/stateofthemap2014-raw-recordings |
03:39:55 | <nicolas17> | let's see what the derivation looks like... |
03:52:04 | <fireonlive> | nicolas17++ |
03:52:04 | <eggdrop> | [karma] 'nicolas17' now has 6 karma! |
03:54:10 | | Island quits [Read error: Connection reset by peer] |
04:20:30 | <pabs> | https://gone.domains/ https://news.ycombinator.com/item?id=40412959 |
04:22:01 | <nicolas17> | dropcatching |
04:22:29 | <nicolas17> | fuck SEO |
04:23:54 | <pabs> | could be interesting for saving imminently dying sites |
04:25:53 | <fireonlive> | "Day 2, DVD 2 will not be uploaded as it contained the OSM Foundation general meeting." secrets |
04:41:49 | | Craigle quits [Quit: The Lounge - https://thelounge.chat] |
04:42:21 | | Craigle (Craigle) joins |
04:53:51 | <nicolas17> | fireonlive: I think that wasn't even supposed to be recorded but nobody told the video people that |
04:54:27 | <nicolas17> | oh derive is still running -_- |
04:55:29 | <fireonlive> | ahh |
05:12:46 | | etnguyen03 quits [Client Quit] |
05:21:10 | | etnguyen03 (etnguyen03) joins |
05:27:14 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
05:36:26 | | etnguyen03 quits [Remote host closed the connection] |
05:37:56 | | datechnoman quits [Quit: The Lounge - https://thelounge.chat] |
05:38:34 | | datechnoman (datechnoman) joins |
05:39:58 | | G4te_Keep3r34924 quits [Ping timeout: 255 seconds] |
06:46:24 | | grid joins |
06:51:37 | | Flashfire42 is now authenticated as flashfire42 |
07:05:02 | | Unholy236192464537 quits [Remote host closed the connection] |
07:06:07 | | Unholy236192464537 (Unholy2361) joins |
07:33:30 | | loug joins |
07:37:58 | | shgaqnyrjp (shgaqnyrjp) joins |
07:41:53 | | BlueMaxima quits [Read error: Connection reset by peer] |
08:17:33 | | Justin[home] quits [Remote host closed the connection] |
08:29:02 | | loug quits [Read error: Connection reset by peer] |
08:29:03 | | loug joins |
08:54:43 | | flotwig_ joins |
08:55:16 | | flotwig quits [Ping timeout: 255 seconds] |
08:55:16 | | flotwig_ is now known as flotwig |
08:57:19 | | loug4 joins |
08:57:22 | | loug quits [Read error: Connection reset by peer] |
08:58:04 | | loug joins |
09:00:02 | | Bleo182600722719 quits [Client Quit] |
09:01:21 | | Bleo182600722719 joins |
09:02:01 | | loug4 quits [Ping timeout: 255 seconds] |
09:06:18 | | grid quits [Client Quit] |
10:19:26 | | nulldata8 (nulldata) joins |
10:21:13 | | nulldata quits [Ping timeout: 255 seconds] |
10:21:13 | | nulldata8 is now known as nulldata |
11:06:20 | | Wohlstand (Wohlstand) joins |
11:15:29 | | Megame (Megame) joins |
13:01:48 | | xkey is now known as ruediger |
13:01:58 | | ruediger is now known as xkey |
13:09:55 | | grid joins |
13:21:21 | | etnguyen03 (etnguyen03) joins |
13:38:19 | | Arcorann quits [Ping timeout: 255 seconds] |
13:41:20 | | hexa- quits [Quit: WeeChat 4.1.1] |
13:42:38 | | hexa- (hexa-) joins |
13:49:52 | | JaffaCakes118_2 quits [Remote host closed the connection] |
13:50:16 | | JaffaCakes118_2 (JaffaCakes118) joins |
14:11:10 | | wyatt8750 quits [Ping timeout: 255 seconds] |
14:19:08 | | Wohlstand quits [Client Quit] |
14:19:50 | | wyatt8740 joins |
14:41:24 | | wyatt8740 quits [Ping timeout: 265 seconds] |
14:43:09 | | wyatt8740 joins |
15:13:04 | | Megame quits [Client Quit] |
15:39:46 | | grid quits [Client Quit] |
15:41:45 | | BearFortress_ quits [Client Quit] |
16:17:12 | | lflare quits [Quit: Ping timeout (120 seconds)] |
16:17:38 | | lflare (lflare) joins |
16:43:39 | | emtee quits [Quit: The Lounge - https://thelounge.chat] |
16:43:45 | <Harzilein> | JaffaCakes118_2: totally digging your nick, though the amount of numbers tends to spook me. |
16:43:55 | <Harzilein> | oh |
16:43:56 | <Harzilein> | argh |
16:44:05 | <Harzilein> | ot/bs mixup again |
16:45:17 | | BearFortress joins |
16:48:17 | <rktk> | Hmm. So a while ago I worked with nicolas17 to resolve long-document printing for unprintable Google Docs. adding /preview to the end of the URL + removing two stylesheet options, but that doesn't seem to be working anymore |
16:52:02 | <rktk> | easiest solution now is to use https://addons.mozilla.org/en-US/firefox/addon/single-file/ with the /preview workaround. No CSS editing needed |
16:52:15 | <rktk> | HTML is better than PDF I guess? |
17:12:05 | | Notrealname1234 (Notrealname1234) joins |
17:31:34 | | JaffaCakes118_2 quits [Remote host closed the connection] |
17:31:54 | | JaffaCakes118_2 (JaffaCakes118) joins |
17:38:52 | | loug quits [Read error: Connection reset by peer] |
17:39:29 | | loug4 joins |
17:45:06 | | Island joins |
17:46:46 | <pie_> | thuban: here is another one adn-cis.org/forum |
17:47:13 | | Notrealname1234 quits [Client Quit] |
17:49:13 | | etnguyen03 quits [Client Quit] |
18:04:03 | | etnguyen03 (etnguyen03) joins |
18:05:08 | | balrog_ quits [Client Quit] |
18:12:59 | | balrog (balrog) joins |
18:31:15 | <@JAA> | So via arkiver's AB job, I learned that https://defence.pk/ aka https://pdf.defence.pk/ aka Pakistan Defence (Forum) is back online. It had disappeared in January following an investigation by the Pakistan government (ft. family threats and other cool things) and started redirecting to quran.com at the time. It's been back up since sometime in April, it seems. Not known who operates it now, cf. |
18:31:21 | <@JAA> | discussion on the replacement board: https://defencepk.com/forums/threads/defence-pk-back-online.6359/ |
18:34:58 | <@JAA> | (Last post in that thread is by the previous admin of defence.pk.) |
18:44:00 | <@arkiver> | JAA: i had no idea you were aware of it |
18:44:05 | <@arkiver> | someone else made me aware of this site |
18:45:08 | <fireonlive> | i always thought the pdf was like the file |
18:46:31 | <Harzilein> | three letter acronym namespace is _tiny_, so collisions are to be expected |
18:46:52 | <@JAA> | arkiver: Yeah, I ran an AB job under the pdf subdomain in December. Didn't finish in time, of course. |
18:47:04 | <@JAA> | They serve the same content under both domains. |
18:47:29 | <@JAA> | fireonlive: Not the probability density function? |
18:47:55 | <fireonlive> | 🤔 |
18:47:59 | <@arkiver> | JAA: we're resuming the pdf job then i guess? |
18:48:12 | <@JAA> | No way to resume AB jobs. |
18:48:18 | <nyany> | i'll show you a density function JAA |
18:48:23 | <@arkiver> | hmm https://pdf.defence.pk/ just redirect to the main one |
18:48:25 | <nyany> | oh crap this isn't -ot? oops |
18:48:35 | <@JAA> | Oh yeah, it does redirect now. |
18:48:43 | <@JAA> | It just returned the same content at the time. |
18:49:07 | <@JAA> | But the subdomain seemed more commonly linked, so that's what I went with. |
18:49:44 | <@JAA> | I probably kept the database of the job, so we could run everything that wasn't retrieved there, but it'd be incomplete still due to pagination etc. |
18:50:39 | <@arkiver> | probably fine this way if we just grab a full copy of the main one now |
18:50:57 | <@JAA> | Well, depending on how long the site stays up, but yeah. |
18:52:07 | <@JAA> | I'll look into maybe qwarcing it, too. |
18:54:28 | <@arkiver> | alright :) |
18:54:35 | <@arkiver> | outlinks always welcome in #// ! |
18:58:27 | <fireonlive> | arkiver: is there a 'process these pdfs only' kinda thing for #//? |
18:58:42 | <fireonlive> | i.e. they've already been saved by IA via AB or whatever but not been ripped through |
19:07:47 | <@arkiver> | fireonlive: what do you mean? |
19:07:53 | <@arkiver> | i can always queue URLs directly |
19:07:57 | <@arkiver> | i'll be off to be now though |
19:08:17 | <fireonlive> | arkiver: ah so we don't save the PDF twice or more times |
19:08:31 | <fireonlive> | iirc if you !a a txt file with a bunch of PDFs it'll save the PDF again |
19:09:49 | <fireonlive> | ah nvm i can't think today |
19:14:20 | <nicolas17> | s/today// |
19:18:38 | <fireonlive> | true |
19:32:16 | | etnguyen03 quits [Client Quit] |
20:01:16 | | ^ quits [Remote host closed the connection] |
20:02:55 | | ^ (^) joins |
20:12:04 | | ^ quits [Remote host closed the connection] |
20:12:19 | | ^ (^) joins |
20:12:58 | | Unholy236192464537 quits [Ping timeout: 265 seconds] |
20:42:40 | | Lord_Nightmare quits [Ping timeout: 255 seconds] |
20:57:45 | | tmob joins |
21:00:57 | <tmob> | Hi all a video I'm searching for from google video was crawled in the project (https://web.archive.org/web/20110417171623/http://video.google.com/videoplay?docid=-7507963840058831210#) but I've been told it wasn't included for some reason and that it should have been saved. Just wondering if that's true that it could have been saved and if so where |
21:00:57 | <tmob> | might it exist? |
21:06:19 | | etnguyen03 (etnguyen03) joins |
21:09:08 | | Notrealname1234 (Notrealname1234) joins |
21:15:08 | | Wohlstand (Wohlstand) joins |
21:22:10 | | tablechair joins |
21:32:21 | <tablechair> | !archive https://www.boostmobile.com/ |
21:33:19 | <tablechair> | hello, I would like help to have the ArchiveBot archive the entire website https:\\boostmobile.com , they have been been treating their poorer customers who were using a government support program to help pay for phone service unfairly, I am trying to help a civil rights advocate in archiving the website so he can have all the webpages save, with |
21:33:19 | <tablechair> | their policies and updates so that he can show they have been unfairly treating poor, disadvantaged and disabled persons. |
21:34:26 | <tablechair> | > !archive https:\\boostmobile.com |
21:35:21 | | shgaqnyrjp quits [Remote host closed the connection] |
21:36:03 | <pokechu22> | tablechair: I've started an archivebot job for that |
21:36:08 | | shgaqnyrjp (shgaqnyrjp) joins |
21:36:20 | <tablechair> | thanks pokechu22 |
21:43:23 | | treora quits [Remote host closed the connection] |
21:43:25 | | treora joins |
21:43:32 | | treora quits [Remote host closed the connection] |
21:43:33 | | treora joins |
21:52:32 | | tablechair quits [Ping timeout: 265 seconds] |
21:53:53 | | shgaqnyrjp quits [Remote host closed the connection] |
21:54:21 | | shgaqnyrjp (shgaqnyrjp) joins |
21:57:28 | | ^ quits [Remote host closed the connection] |
21:58:34 | | ^ (^) joins |
22:10:47 | <Notrealname1234> | Can someone scrape https://duckduckgo.com/?q=site%3Ajamboard.google.com&t=h_&ia=web ? |
22:10:57 | <Notrealname1234> | https://9to5google.com/2023/09/28/google-jamboard/ |
22:41:39 | | loug4 quits [Client Quit] |
22:43:55 | | Lord_Nightmare (Lord_Nightmare) joins |
22:56:38 | | etnguyen03 quits [Client Quit] |
22:57:21 | | DopefishJustin joins |
22:57:21 | | DopefishJustin is now authenticated as DopefishJustin |
22:58:58 | | JaffaCakes118_2 quits [Remote host closed the connection] |
22:59:22 | | JaffaCakes118_2 (JaffaCakes118) joins |
23:03:39 | | Notrealname1234 quits [Client Quit] |
23:04:28 | | Notrealname1234 (Notrealname1234) joins |
23:15:44 | | etnguyen03 (etnguyen03) joins |
23:20:47 | | Notrealname1234 quits [Client Quit] |
23:24:09 | <nicolas17> | https://archive.org/details/stateofthemap2014-raw-recordings got a playable mp4 video derived :D |
23:27:29 | | Notrealname1234 (Notrealname1234) joins |
23:27:58 | | BlueMaxima joins |
23:35:10 | | Notrealname1234 quits [Client Quit] |