00:04:38nimaje1 joins
00:05:53nimaje quits [Ping timeout: 252 seconds]
00:55:23etnguyen03 quits [Ping timeout: 258 seconds]
01:10:33DogsRNice joins
01:11:20pabs quits [Quit: Don't rest until all the world is paved in moss and greenery.]
01:15:51pabs (pabs) joins
01:16:20pabs quits [Remote host closed the connection]
01:17:39etnguyen03 (etnguyen03) joins
01:19:18pabs (pabs) joins
01:39:05igloo22225 quits [Quit: Ping timeout (120 seconds)]
01:39:27ave quits [Read error: Connection reset by peer]
01:39:35igloo22225 (igloo22225) joins
01:39:40ave (ave) joins
01:40:02nepeat_ quits [Client Quit]
01:40:36nepeat (nepeat) joins
01:48:02ave quits [Client Quit]
01:48:22ave (ave) joins
02:16:19killsushi joins
02:18:11etnguyen03 quits [Ping timeout: 258 seconds]
02:30:01railen63 quits [Remote host closed the connection]
02:31:25fuzzy8021 quits [Read error: Connection reset by peer]
02:33:57fuzzy8021 (fuzzy8021) joins
02:34:50railen63 joins
02:38:04etnguyen03 (etnguyen03) joins
02:52:52fuzzy8021 quits [Remote host closed the connection]
02:55:11fuzzy8021 (fuzzy8021) joins
03:12:03superkuh_ joins
03:12:41igloo222253 joins
03:12:43driib4 (driib) joins
03:12:44ave4 (ave) joins
03:13:16ave quits [Client Quit]
03:13:16driib quits [Client Quit]
03:13:16igloo22225 quits [Client Quit]
03:13:16trainingdata quits [Client Quit]
03:13:16qwertyasdfuiopghjkl quits [Client Quit]
03:13:16superkuh quits [Remote host closed the connection]
03:13:16adamus1red quits [Client Quit]
03:13:16driib4 is now known as driib
03:13:16ave4 is now known as ave
03:13:17igloo222253 is now known as igloo22225
03:13:25adamus1red_ (adamus1red) joins
03:13:49adamus1red_ is now known as adamus1red
03:16:36hibhhdd joins
03:17:04hibhhdd quits [Remote host closed the connection]
03:18:59DogsRNice quits [Read error: Connection reset by peer]
03:34:36nicolas17 quits [Ping timeout: 265 seconds]
03:37:34nicolas17 joins
03:45:19qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
03:45:51<Ryz>Heya folks, what would be any recent Warrior projects that need help that isn't maybe Reddit or Imgur? Because otherwise I'll probably default to one of the two projects I mentioned while running a Warrior, since it's been some time since running one because of weather, ugh
03:46:44<fireonlive>i'd say telegram probably
03:47:11<nicolas17>Ryz: well if you use the docker containers the answer can be "all of them"
03:47:30<nicolas17>reddit and imgur don't have enough tasks or are rate limited -> run both
03:48:15<Ryz>Hmm, currently with Imgur, looks like rate-limited, lemme poke into #imgone
03:50:11<Ryz>Whoa, checking the to do, 211.87 million right now oO;
03:50:36<fireonlive>yee, running it slow
03:50:58<Ryz>Oops, that message was supposed to be in that channel
04:05:32Ketchup901 quits [Remote host closed the connection]
04:05:56Ketchup901 (Ketchup901) joins
04:17:47etnguyen03 quits [Ping timeout: 252 seconds]
04:23:14<flashfire42|m>So ryz I would say just run archiveteam choice if you are running the warrior VM. If you are using docker then configure to run all
04:24:08etnguyen03 (etnguyen03) joins
04:56:10etnguyen03 quits [Client Quit]
05:01:56kiryu leaves
05:02:29kiryu joins
05:05:19skyrocket quits [Ping timeout: 258 seconds]
05:25:59Letur quits [Ping timeout: 252 seconds]
05:27:06Letur joins
05:27:37kiryu quits [Client Quit]
05:27:49kiryu joins
06:05:56skyrocket joins
06:11:05siinus quits [Ping timeout: 252 seconds]
06:13:46jtagcat quits [Quit: Bye!]
06:14:23jtagcat (jtagcat) joins
06:22:29siinus (siinus) joins
06:24:57<h2ibot>FireonLive edited Periscope (+24, it's a DPoS): https://wiki.archiveteam.org/?diff=50291&oldid=47711
06:35:04datechnoman quits [Client Quit]
06:35:38datechnoman (datechnoman) joins
06:45:50Minkafighter52 quits [Quit: The Lounge - https://thelounge.chat]
06:46:23Minkafighter525 joins
07:05:15nicolas17 quits [Client Quit]
07:06:57Unholy2361313 (Unholy2361) joins
07:09:23Unholy236131 quits [Ping timeout: 252 seconds]
07:09:23Unholy2361313 is now known as Unholy236131
07:30:59trainingdata joins
07:31:08Arcorann (Arcorann) joins
07:57:39Naruyoko5 quits [Remote host closed the connection]
07:57:58Naruyoko5 joins
09:12:35@rewby quits [Ping timeout: 252 seconds]
10:00:01railen63 quits [Remote host closed the connection]
10:00:59railen63 joins
10:47:08rewby (rewby) joins
10:47:08@ChanServ sets mode: +o rewby
11:18:33bf_ quits [Client Quit]
11:18:36bf_ joins
11:37:55jackdanielsfan55 joins
11:53:38kiryu quits [Read error: Connection reset by peer]
11:53:48VerifiedJ quits [Quit: The Lounge - https://thelounge.chat]
11:54:19VerifiedJ (VerifiedJ) joins
11:55:07kiryu joins
12:00:20bf_ quits [Ping timeout: 252 seconds]
12:01:03bf_ joins
12:02:39TastyWiener953 (TastyWiener95) joins
12:03:42TastyWiener95 quits [Read error: Connection reset by peer]
12:03:43TastyWiener953 is now known as TastyWiener95
12:34:00AmAnd0A quits [Ping timeout: 265 seconds]
12:34:34AmAnd0A joins
12:52:35<h2ibot>Exorcism uploaded File:Bandcamp-logo.png: https://wiki.archiveteam.org/?title=File%3ABandcamp-logo.png
12:53:35<h2ibot>Exorcism uploaded File:Bandcamp-screenshot.png: https://wiki.archiveteam.org/?title=File%3ABandcamp-screenshot.png
12:53:36<h2ibot>Exorcism edited Bandcamp (+40): https://wiki.archiveteam.org/?diff=50294&oldid=48993
12:55:41Unholy2361310 (Unholy2361) joins
12:59:11Unholy236131 quits [Ping timeout: 252 seconds]
12:59:11Unholy2361310 is now known as Unholy236131
13:00:25railen64 joins
13:00:37etnguyen03 (etnguyen03) joins
13:03:20railen63 quits [Ping timeout: 258 seconds]
13:38:36TheTechRobo quits [Client Quit]
13:40:07AnotherTechRobo joins
13:44:50Arcorann quits [Ping timeout: 252 seconds]
13:50:45AnotherTechRobo quits [Remote host closed the connection]
14:01:53TheTechRobo (TheTechRobo) joins
14:15:24monoxane quits [Ping timeout: 258 seconds]
14:15:59AmAnd0A quits [Ping timeout: 265 seconds]
14:21:11<bleb>I want to experiment with writing a web archiving tool which runs a page's javascript in the browser, then stores a representation of the resulting DOM.
14:21:45<bleb>is there any standard or convention or precedent for this? it seems like archive.today might do something like this but the code is closed
14:22:39monoxane (monoxane) joins
14:23:17Megame (Megame) joins
14:24:57<bleb>it wouldn't be too hard to write a userscript which sends a json representation of the dom to a local web server to be logged to a file. then you could write a script that takes a URL and points a fresh browser instance at it, then gets the json via a local web server and stores it.
14:25:58<bleb>the result would be a lot of json; you could also convert it back to HTML but you might lose some information
14:26:37<bleb>I will start playing with this soon but if anyone knows of any precedent let me know
14:50:31<sknebel>bleb: there is a library/tool called freeze-dry which tries something similar
14:57:19<imer>https://github.com/internetarchive/brozzler could check what IA's browser based crawler does as well, might be a good starting point?
14:57:43<@JAA>crocoite did this, but its WARC output has many problems. SingleFile and SingleFileZ also come to mind.
14:58:03<@JAA>Doing it with brozzler would be nice, yeah.
15:02:49jacksonchen666 quits [Client Quit]
15:11:47superkuh joins
15:13:40superkuh_ quits [Ping timeout: 258 seconds]
15:46:02W7RFa6AbNFz_ quits [Read error: Connection reset by peer]
15:46:22W7RFa6AbNFz_ joins
15:50:24<fireonlive>does brozzler have the JAA 🦭 of approval?
16:00:12<@JAA>fireonlive: I never verified it, but it's IA's work, and the relevant people there seem to care about following standards.
16:00:35<fireonlive>sounds good to me :)
16:00:42<fireonlive>tks
16:05:17nicolas17 joins
16:07:11<@JAA>IA: presumed innocent until proven guilty
16:07:17<@JAA>webrecorder: presumed guilty until proven innocent
16:07:24<@JAA>For me personally, anyway. :-)
16:07:55<fireonlive>:D
16:08:12<fireonlive>given their track records…
16:08:23<TheTechRobo>Its not like its undeserved
16:09:28<@JAA>Yeah, I'm happy to change my stance if they address the issues and show that they care.
16:11:09<@JAA>And I realise it's open-source and they'd accept PRs, but they're getting paid for this, and they've implemented other things in the meantime, i.e. correctness is clearly not one of their priorities, which immediately disqualifies it.
16:15:05<fireonlive>yeah :/. that's a big blight if it's their actual job
16:18:53Ruthalas5 quits [Client Quit]
16:19:26Ruthalas5 (Ruthalas) joins
16:21:29bf__ joins
16:23:14bf_ quits [Ping timeout: 252 seconds]
16:28:02killsushi quits [Ping timeout: 258 seconds]
16:34:40TheTechRobo quits [Excess Flood]
16:34:40Naruyoko5 quits [Remote host closed the connection]
16:34:58Naruyoko5 joins
16:36:56W7RFa6AbNFz_ quits [Remote host closed the connection]
16:36:56superkuh quits [Remote host closed the connection]
16:36:57W7RFa6AbNFz_ joins
16:36:58DogsRNice joins
16:37:07superkuh joins
16:41:12TheTechRobo (TheTechRobo) joins
17:36:38bf_ joins
17:36:39bf__ quits [Ping timeout: 258 seconds]
17:43:40<bleb>brozzler works differently though
17:44:34<bleb>it records the network activity with a MITM proxy, so if content is obfuscated with javascript you still need to use a browser when viewing it
17:44:47<bleb>ty for all the suggestions :)
17:45:06<imer>yes, I was thinking it might be a good starting point since you "just" need to add in saving the page before it's closed
17:45:24<imer>all the other automation being in place already pretty much
17:46:14<bleb>ya maybe
17:46:29<bleb>I tried testing it out a week ago and couldn't get it to run
17:46:37<imer>ah
17:47:13<bleb>iirc, either a problem with vagrant or a python version issue depending on what I tried
17:48:25<bleb>I gotta brush up on perl and start using it instead of python
17:49:42<bleb>pip sucks so much and they don't maintain compatibility between minor versions. I asked #debian what would be the easiest way to run some python code written for an earlier minor version of python 3 and they said to use a container or vm
17:57:09<imer>bleb: JA_A recommended pyenv to me in a similar situation
17:57:39<@JAA>Yes, pyenv, and ignore any Python versions installed through the package manager.
18:00:41Megame quits [Client Quit]
18:55:11TheTechRobo quits [Client Quit]
19:32:55IDK quits [Client Quit]
19:38:53Minkafighter525 quits [Client Quit]
19:38:55jtagcat quits [Killed (ing.hackint.org (Nickname regained by services))]
19:38:56Ruthalas5 quits [Client Quit]
19:38:56TastyWiener95 quits [Client Quit]
19:38:56monoxane quits [Client Quit]
19:38:56W7RFa6AbNFz_ quits [Remote host closed the connection]
19:38:56DogsRNice quits [Remote host closed the connection]
19:38:56superkuh quits [Remote host closed the connection]
19:38:56W7RFa6AbNFz_ joins
19:38:57DogsRNice joins
19:38:57jtagcat (jtagcat) joins
19:39:00monoxane3 (monoxane) joins
19:39:01Minkafighter525 joins
19:39:01superkuh joins
19:39:06TastyWiener95 (TastyWiener95) joins
19:39:07Ruthalas5 (Ruthalas) joins
19:43:28TastyWiener955 (TastyWiener95) joins
19:43:47TastyWiener95 quits [Client Quit]
19:43:47DogsRNice quits [Remote host closed the connection]
19:43:47W7RFa6AbNFz_ quits [Remote host closed the connection]
19:43:47trainingdata quits [Client Quit]
19:43:47W7RFa6AbNFz_ joins
19:43:47DogsRNice joins
19:43:47TastyWiener955 is now known as TastyWiener95
20:06:09yts98 leaves
20:14:40cobertos quits [Remote host closed the connection]
20:19:02yts98 joins
20:32:23benjins2_ joins
20:32:25benjins2_ quits [Client Quit]
20:47:06nulldata quits [Quit: Ping timeout (120 seconds)]
20:47:40nulldata (nulldata) joins
21:28:11atphoenix_ quits [Ping timeout: 258 seconds]
21:28:55atphoenix_ (atphoenix) joins
21:37:49AmAnd0A joins
21:59:35that_lurker quits [Quit: Clowning around is not the same as fooling around...I am a clown, not a fool]
22:01:17that_lurker (that_lurker) joins
22:19:05etnguyen03 quits [Ping timeout: 252 seconds]
22:48:52etnguyen03 (etnguyen03) joins
22:50:13DogsRNice_ joins
22:51:11W7RFa6AbNFz joins
22:51:26DogsRNice quits [Remote host closed the connection]
22:51:26W7RFa6AbNFz_ quits [Remote host closed the connection]
22:55:00BlueMaxima joins
22:57:02Matthww1 quits [Ping timeout: 252 seconds]
22:57:20Matthww1 joins
22:57:56jackdanielsfan55 quits [Remote host closed the connection]
23:24:59TheTechRobo (TheTechRobo) joins