00:06:10collat quits [Ping timeout: 258 seconds]
00:06:30collat joins
00:21:07HP_Archivist quits [Ping timeout: 258 seconds]
00:23:55nepeat quits [Ping timeout: 260 seconds]
00:29:19nepeat (nepeat) joins
00:50:55nulldata quits [Client Quit]
00:51:45nulldata (nulldata) joins
00:56:40HP_Archivist joins
01:00:35Juest (Juest) joins
01:05:14Ryz2 quits [Quit: Ping timeout (120 seconds)]
01:05:20s-crypt2 (s-crypt) joins
01:05:23Ryz2 (Ryz) joins
01:05:38Flashfire424 joins
01:07:03linuxgemini (linuxgemini) joins
01:07:30collat quits [Ping timeout: 258 seconds]
01:07:31Flashfire42 quits [Ping timeout: 258 seconds]
01:07:31s-crypt quits [Ping timeout: 258 seconds]
01:07:31s-crypt2 is now known as s-crypt
01:07:31Flashfire424 is now known as Flashfire42
01:10:30Wohlstand (Wohlstand) joins
01:14:19HP_Archivist quits [Read error: Connection reset by peer]
01:14:45seacow31 joins
01:18:31seacow quits [Ping timeout: 255 seconds]
02:04:21SootBector quits [Ping timeout: 240 seconds]
02:05:01SootBector (SootBector) joins
02:38:06adryd2 (adryd) joins
02:38:40adryd quits [Ping timeout: 260 seconds]
02:38:40adryd2 is now known as adryd
02:58:15Wohlstand quits [Client Quit]
02:59:28etnguyen03 quits [Remote host closed the connection]
03:18:34pixel leaves [Error from remote client]
03:38:45th3z0l4 quits [Ping timeout: 260 seconds]
03:38:47xDEADBEEF joins
04:45:01<h2ibot>Switchnode edited Political parties/Georgia (-14, /* ქართული ოცნება – დემოკრატიული საქართველო…): https://wiki.archiveteam.org/?diff=53661&oldid=53646
05:01:38collat joins
05:06:20collat quits [Ping timeout: 258 seconds]
05:25:42BlueMaxima quits [Read error: Connection reset by peer]
06:05:30pixel (pixel) joins
06:09:39StarletCharlotte joins
06:10:44Snivy quits [Ping timeout: 258 seconds]
06:51:50Snivy (Snivy) joins
07:05:50Unholy236192464537713 (Unholy2361) joins
07:23:36<pabs>some of these might be worth archiving https://adstransparency.google.com/political?region=US&topic=political https://news.ycombinator.com/item?id=41937507
07:30:20collat joins
07:42:56le0n_ (le0n) joins
07:43:06le0n quits [Ping timeout: 258 seconds]
07:44:38collat quits [Ping timeout: 258 seconds]
07:58:20collat joins
08:11:33PredatorIWD26 joins
08:12:37PredatorIWD2 quits [Ping timeout: 258 seconds]
08:12:38PredatorIWD26 is now known as PredatorIWD2
08:12:55collat quits [Ping timeout: 260 seconds]
08:13:37Commander001 quits [Remote host closed the connection]
08:14:12Commander001 joins
08:16:21collat joins
08:43:34leo60228- quits [Read error: Connection reset by peer]
08:45:02leo60228 (leo60228) joins
08:52:35sralracer joins
09:42:56<h2ibot>Manu edited Discourse/archived (+190, pabs running forums in other contexts): https://wiki.archiveteam.org/?diff=53662&oldid=53660
10:14:16loug8318142 joins
10:34:41MrMcNuggets (MrMcNuggets) joins
10:51:28pixel leaves [Error from remote client]
11:00:01Bleo18260072271962 quits [Quit: The Lounge - https://thelounge.chat]
11:00:55le0n_ quits [Ping timeout: 260 seconds]
11:01:32pixel (pixel) joins
11:02:47Bleo18260072271962 joins
11:03:58loug8318142 quits [Ping timeout: 258 seconds]
11:04:47loug8318142 joins
11:15:46le0n (le0n) joins
11:38:03Wohlstand (Wohlstand) joins
11:38:31Wohlstand quits [Client Quit]
11:38:50monoxane quits [Ping timeout: 260 seconds]
11:39:35SkilledAlpaca418 quits [Quit: SkilledAlpaca418]
11:42:35SkilledAlpaca418 joins
11:43:32<c3manu>the adstransparency page is too scripty for AB to discover all the advertisers, removed ad info, etc. if someone has more experience with stuff like that, feel free to have a go at it ;)
11:43:55<c3manu>there's also an API for that, but i'm not sure whether that will be helpful with retrieving the URLs for that stuff
11:44:05<c3manu>fetching the advertiser's pages seems to work though
11:44:26<@arkiver>if they have youtube videos, we could queue them in #down-the-tube
11:44:28<c3manu>i might give it a shot again later, but i'm probably not able to do it without any help
11:45:51<tzt>There's a CSV export option
11:47:26<tzt>I'm not sure if that includes the video IDs
11:49:15<tzt>It does not unfortunately
11:50:20<@arkiver>that is unfortunate
11:51:02monoxane (monoxane) joins
11:51:24monoxane quits [Remote host closed the connection]
11:51:54Commander001 quits [Ping timeout: 258 seconds]
11:52:13Commander001 joins
11:52:20monoxane (monoxane) joins
12:02:25vix5110_ joins
12:10:51SootBector quits [Remote host closed the connection]
12:11:05SootBector (SootBector) joins
12:39:51Matthww quits [Quit: The Lounge - https://thelounge.chat]
12:42:55etnguyen03 (etnguyen03) joins
12:43:43loug8318142 quits [Client Quit]
12:57:52Commander001 quits [Read error: Connection reset by peer]
12:58:04Commander001 joins
13:07:49<pabs>hmm you can get video ids from the https://i.ytimg.com/ URLs that get loaded in the browser, like https://i.ytimg.com/vi/IFf0rRdDWzQ/hqdefault.jpg -> https://www.youtube.com/watch?v=IFf0rRdDWzQ
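[Editor's sketch, not part of the channel log] The thumbnail-to-video mapping pabs describes can be illustrated roughly as below. The function name `watch_url_from_thumbnail` is hypothetical, and the assumption that a video ID is always 11 characters from `[A-Za-z0-9_-]` comes from commonly observed IDs, not from anything stated in the log.

```python
import re
from urllib.parse import urlparse

# Assumption: thumbnail paths look like /vi/<video id>/<file>, as in the
# example pabs posted, and the ID is 11 URL-safe characters.
_THUMB_PATH = re.compile(r"^/vi/([A-Za-z0-9_-]{11})/")


def watch_url_from_thumbnail(url):
    """Return a YouTube watch URL for an i.ytimg.com thumbnail URL, or None."""
    parsed = urlparse(url)
    if parsed.hostname != "i.ytimg.com":
        return None
    match = _THUMB_PATH.match(parsed.path)
    if match is None:
        return None
    return "https://www.youtube.com/watch?v=" + match.group(1)


print(watch_url_from_thumbnail("https://i.ytimg.com/vi/IFf0rRdDWzQ/hqdefault.jpg"))
```

This recovers only the ID of whichever video the thumbnail belongs to. It cannot tell whether that video is the original upload or a retained copy.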
13:11:35collat quits [Ping timeout: 260 seconds]
13:14:06Matthww joins
13:15:44<pabs>ugh, the video ids are in JS code
13:17:53<pabs>and it looks like you need one request to the JS endpoint per ad
13:20:29<pabs>in Firefox devtools, save all as HAR, then you can open the resulting json file in a text editor and trace which data comes from where
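[Editor's sketch, not part of the channel log] The HAR tracing step can also be done with a short script instead of a text editor. This sketch relies only on the standard HAR layout (`log.entries[].request.url` and `log.entries[].response.content.text`). The helper names and the `example.invalid` URL in the sample are hypothetical, and a real export may omit response bodies entirely.

```python
import base64
import binascii
from collections import defaultdict
from urllib.parse import urlparse


def _body_text(entry):
    """Return the response body of a HAR entry as text ('' if absent)."""
    content = entry.get("response", {}).get("content", {})
    text = content.get("text") or ""
    if content.get("encoding") == "base64":
        try:
            text = base64.b64decode(text).decode("utf-8", "replace")
        except (binascii.Error, ValueError):
            text = ""
    return text


def urls_by_host(har):
    """Group every request URL in a parsed HAR document by hostname."""
    grouped = defaultdict(list)
    for entry in har.get("log", {}).get("entries", []):
        url = entry.get("request", {}).get("url")
        if url:
            grouped[urlparse(url).hostname].append(url)
    return dict(grouped)


def entries_containing(har, needle):
    """Return the URLs of requests whose response body contains `needle`."""
    return [
        entry["request"]["url"]
        for entry in har.get("log", {}).get("entries", [])
        if needle in _body_text(entry)
    ]


# Tiny hand-written sample standing in for json.load() of a saved .har file.
sample_har = {"log": {"entries": [
    {"request": {"url": "https://i.ytimg.com/vi/IFf0rRdDWzQ/hqdefault.jpg"},
     "response": {"content": {}}},
    {"request": {"url": "https://example.invalid/ad.js"},  # hypothetical URL
     "response": {"content": {"text": 'var v = "IFf0rRdDWzQ";'}}},
]}}

print(entries_containing(sample_har, "IFf0rRdDWzQ"))
```

Searching for a known video ID this way shows which response first delivered it, which is the "trace which data comes from where" step.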
13:21:01<pabs>hmm, the ad videos say they are copies of the originals
13:21:04<tzt>pabs: this is what i've been doing manually for the past few months, but since most of the videos are re-uploads on Google's own channel rather than the original ad video, i also look at each video to check whether it's a reupload. i don't think this could be automated
13:21:18<pabs>"This is a copy of an election ad that an advertiser ran on YouTube. It is being retained for display in Google's Political Advertising Transparency Report. For more information, please visit the Transparency Report at https://transparencyreport.google.com..."
13:21:47<pabs>Google--
13:21:47<eggdrop>[karma] 'Google' now has -5 karma!
13:21:50<pabs>ads--
13:21:51<eggdrop>[karma] 'ads' now has -1 karma!
13:21:58<pabs>they couldn't make this easy could they...
13:22:35<pabs>maybe the BigQuery export is the best option
13:22:50<pabs>ah, you need an account
13:23:00<tzt>if it's the same data as the CSV export, it will be useless
13:23:26<tzt>the CSV export only contained links to the advertiser pages
13:26:42<pabs>hmm "Finally, links to the actual political ad in the Google Transparency Report are provided"
13:27:27<tzt>there are two different types of videos on the ad pages: youtube embeds and IMA (Interactive Media Ads) embeds. only the IMA embeds have the original YouTube video on the website
13:28:08cas joins
13:28:30<cas>pabs ok awesome, thank you kindly
13:28:37cas quits [Client Quit]
13:45:31etnguyen03 quits [Client Quit]
13:56:18<immibis>Medowar: wouldn't that be a better donation to one of the projects that deals with things that are about to disappear?
13:58:47etnguyen03 (etnguyen03) joins
14:28:59JohnnyJ2 quits [Quit: Ping timeout (120 seconds)]
14:29:12JohnnyJ2 joins
14:33:50JohnnyJ2 quits [Ping timeout: 260 seconds]
14:43:59JohnnyJ2 joins
15:07:28<vix5110_>What does @ERROR : max connections (400) mean ?
15:09:24mossss joins
15:10:26<Xanthon>the number of people uploading to the server is maxed out. It will keep retrying until there's a slot
15:16:46etnguyen03 quits [Client Quit]
15:18:04etnguyen03 (etnguyen03) joins
15:25:14mossss quits [Client Quit]
15:41:23xkey quits [Quit: WeeChat 4.2.2]
15:43:28xkey (xkey) joins
15:47:14<vix5110_>Xanthon is it possible to have info about server load ?
15:47:24<vix5110_>A stats page or sum...
15:53:46pixel leaves [Error from remote client]
15:59:09etnguyen03 quits [Client Quit]
16:11:22pixel (pixel) joins
16:29:35MOSTech6502 joins
16:35:03MrMcNuggets quits [Quit: WeeChat 4.3.2]
16:41:35MOSTech6502 quits [Ping timeout: 260 seconds]
16:41:55StarletCharlotte quits [Read error: Connection reset by peer]
16:57:53etnguyen03 (etnguyen03) joins
17:08:31lennier2_ quits [Ping timeout: 258 seconds]
17:14:59icedice (icedice) joins
17:28:10etnguyen03 quits [Client Quit]
17:32:48collat joins
17:39:49useretail joins
17:49:52etnguyen03 (etnguyen03) joins
18:12:22<h2ibot>Manu edited Discourse/archived (+96, JAA grabbing https://community.raspberryshake.org/): https://wiki.archiveteam.org/?diff=53663&oldid=53662
18:40:35Jake quits [Quit: Leaving for a bit!]
18:41:01Jake (Jake) joins
19:12:33lennier2_ joins
19:29:49Juesto (Juest) joins
19:32:30Juest quits [Ping timeout: 260 seconds]
19:32:30Juesto is now known as Juest
19:33:42pixel leaves [Error from remote client]
19:39:20Radzig joins
19:53:44<h2ibot>Manu edited Webring/fediring.net (+371, /* Add more archived pages */): https://wiki.archiveteam.org/?diff=53664&oldid=53658
20:29:24Radzig quits [Remote host closed the connection]
20:29:59etnguyen03 quits [Client Quit]
20:34:57Radzig joins
20:38:00tbc1887 quits [Quit: Ping timeout (120 seconds)]
20:38:30tbc1887 (tbc1887) joins
20:43:23NF885 (NF885) joins
20:50:24<NF885>looks like this site should be run in ArchiveBothttps://britishcomics.wordpress.com/
20:50:29<NF885>oops
20:50:33<NF885>https://britishcomics.wordpress.com/
20:50:48<NF885>(from https://www.reddit.com/r/Archiveteam/comments/1gdj9pc/need_help_regarding_downloading_british_comics/)
20:52:28<Barto>thrown into archivebot NF885
20:52:36<NF885>Thanks
20:52:52<Barto>you're welcome
20:54:08BlueMaxima joins
20:54:41<pokechu22>It looks like actual files from that are hosted on a different site
20:54:58<immibis>Xanthon: normally the tracker rate limiting would be lower than what the staging server can handle, I suppose, but right now for veoh it is not
20:55:21<immibis>Why is the word "target" consistently used instead of "staging server"?
21:08:55vix5110_ quits [Client Quit]
21:21:49NF885 quits [Client Quit]
21:26:19etnguyen03 (etnguyen03) joins
21:36:06etnguyen03 quits [Client Quit]
21:37:26etnguyen03 (etnguyen03) joins
21:37:55icedice quits [Ping timeout: 260 seconds]
22:07:01SootBector quits [Remote host closed the connection]
22:07:18SootBector (SootBector) joins
22:17:12<TheTechRobo>Short for 'upload target' or 'rsync target'
22:20:05<Vokun>Just be glad i'm not saying tracker anymore...
22:24:04<immibis>it's such a narrow view that hides the fact the system has another half
22:42:58useretail quits [Quit: Leaving]
23:09:51hexa- quits [Quit: WeeChat 4.2.2]
23:11:39hexa- (hexa-) joins
23:28:56etnguyen03 quits [Client Quit]
23:55:24<@JAA>c3manu++
23:55:24<eggdrop>[karma] 'c3manu' now has 37 karma!