00:06:10 | | collat quits [Ping timeout: 258 seconds] |
00:06:30 | | collat joins |
00:21:07 | | HP_Archivist quits [Ping timeout: 258 seconds] |
00:23:55 | | nepeat quits [Ping timeout: 260 seconds] |
00:29:19 | | nepeat (nepeat) joins |
00:50:55 | | nulldata quits [Client Quit] |
00:51:45 | | nulldata (nulldata) joins |
00:56:40 | | HP_Archivist joins |
01:00:35 | | Juest (Juest) joins |
01:05:14 | | Ryz2 quits [Quit: Ping timeout (120 seconds)] |
01:05:20 | | s-crypt2 (s-crypt) joins |
01:05:23 | | Ryz2 (Ryz) joins |
01:05:38 | | Flashfire424 joins |
01:07:03 | | linuxgemini (linuxgemini) joins |
01:07:30 | | collat quits [Ping timeout: 258 seconds] |
01:07:31 | | Flashfire42 quits [Ping timeout: 258 seconds] |
01:07:31 | | s-crypt quits [Ping timeout: 258 seconds] |
01:07:31 | | s-crypt2 is now known as s-crypt |
01:07:31 | | Flashfire424 is now known as Flashfire42 |
01:07:46 | | Flashfire42 is now authenticated as flashfire42 |
01:10:30 | | Wohlstand (Wohlstand) joins |
01:14:19 | | HP_Archivist quits [Read error: Connection reset by peer] |
01:14:45 | | seacow31 joins |
01:18:31 | | seacow quits [Ping timeout: 255 seconds] |
02:04:21 | | SootBector quits [Ping timeout: 240 seconds] |
02:05:01 | | SootBector (SootBector) joins |
02:38:06 | | adryd2 (adryd) joins |
02:38:40 | | adryd quits [Ping timeout: 260 seconds] |
02:38:40 | | adryd2 is now known as adryd |
02:58:15 | | Wohlstand quits [Client Quit] |
02:59:28 | | etnguyen03 quits [Remote host closed the connection] |
03:18:34 | | pixel leaves [Error from remote client] |
03:38:45 | | th3z0l4 quits [Ping timeout: 260 seconds] |
03:38:47 | | xDEADBEEF joins |
04:45:01 | <h2ibot> | Switchnode edited Political parties/Georgia (-14, /* ქართული ოცნება – დემოკრატიული საქართველო…): https://wiki.archiveteam.org/?diff=53661&oldid=53646 |
05:01:38 | | collat joins |
05:06:20 | | collat quits [Ping timeout: 258 seconds] |
05:25:42 | | BlueMaxima quits [Read error: Connection reset by peer] |
06:05:30 | | pixel (pixel) joins |
06:09:39 | | StarletCharlotte joins |
06:10:44 | | Snivy quits [Ping timeout: 258 seconds] |
06:51:50 | | Snivy (Snivy) joins |
07:05:50 | | Unholy236192464537713 (Unholy2361) joins |
07:23:36 | <pabs> | some of these might be worth archiving https://adstransparency.google.com/political?region=US&topic=political https://news.ycombinator.com/item?id=41937507 |
07:30:20 | | collat joins |
07:42:56 | | le0n_ (le0n) joins |
07:43:06 | | le0n quits [Ping timeout: 258 seconds] |
07:44:38 | | collat quits [Ping timeout: 258 seconds] |
07:58:20 | | collat joins |
08:11:33 | | PredatorIWD26 joins |
08:12:37 | | PredatorIWD2 quits [Ping timeout: 258 seconds] |
08:12:38 | | PredatorIWD26 is now known as PredatorIWD2 |
08:12:55 | | collat quits [Ping timeout: 260 seconds] |
08:13:37 | | Commander001 quits [Remote host closed the connection] |
08:14:12 | | Commander001 joins |
08:16:21 | | collat joins |
08:43:34 | | leo60228- quits [Read error: Connection reset by peer] |
08:45:02 | | leo60228 (leo60228) joins |
08:52:35 | | sralracer joins |
08:52:54 | | sralracer is now authenticated as sralracer |
09:42:56 | <h2ibot> | Manu edited Discourse/archived (+190, pabs running forums in other contexts): https://wiki.archiveteam.org/?diff=53662&oldid=53660 |
10:14:16 | | loug8318142 joins |
10:34:41 | | MrMcNuggets (MrMcNuggets) joins |
10:51:28 | | pixel leaves [Error from remote client] |
11:00:01 | | Bleo18260072271962 quits [Quit: The Lounge - https://thelounge.chat] |
11:00:55 | | le0n_ quits [Ping timeout: 260 seconds] |
11:01:32 | | pixel (pixel) joins |
11:02:47 | | Bleo18260072271962 joins |
11:03:58 | | loug8318142 quits [Ping timeout: 258 seconds] |
11:04:47 | | loug8318142 joins |
11:15:46 | | le0n (le0n) joins |
11:38:03 | | Wohlstand (Wohlstand) joins |
11:38:31 | | Wohlstand quits [Client Quit] |
11:38:50 | | monoxane quits [Ping timeout: 260 seconds] |
11:39:35 | | SkilledAlpaca418 quits [Quit: SkilledAlpaca418] |
11:42:35 | | SkilledAlpaca418 joins |
11:43:32 | <c3manu> | the adstransparency page is too scripty for AB to discover all the advertisers, removed ad info, etc. if someone has more experience with stuff like that, feel free to have a go at it ;) |
11:43:55 | <c3manu> | there's also an API for that, but i'm not sure whether that will be helpful with retrieving the URLs for that stuff |
11:44:05 | <c3manu> | fetching the advertiser's pages seems to work though |
11:44:26 | <@arkiver> | if they have youtube videos, we could queue them in #down-the-tube |
11:44:28 | <c3manu> | i might give it a shot again later, but i'm probably not able to do it without any help |
11:45:51 | <tzt> | There's a CSV export option |
11:47:26 | <tzt> | I'm not sure if that includes the video IDs |
11:49:15 | <tzt> | It does not unfortunately |
11:50:20 | <@arkiver> | that is unfortunate |
11:51:02 | | monoxane (monoxane) joins |
11:51:24 | | monoxane quits [Remote host closed the connection] |
11:51:54 | | Commander001 quits [Ping timeout: 258 seconds] |
11:52:13 | | Commander001 joins |
11:52:20 | | monoxane (monoxane) joins |
12:02:25 | | vix5110_ joins |
12:10:51 | | SootBector quits [Remote host closed the connection] |
12:11:05 | | SootBector (SootBector) joins |
12:39:51 | | Matthww quits [Quit: The Lounge - https://thelounge.chat] |
12:42:55 | | etnguyen03 (etnguyen03) joins |
12:43:43 | | loug8318142 quits [Client Quit] |
12:57:52 | | Commander001 quits [Read error: Connection reset by peer] |
12:58:04 | | Commander001 joins |
13:07:49 | <pabs> | hmm you can get video ids from the https://i.ytimg.com/ URLs that get loaded in the browser, like https://i.ytimg.com/vi/IFf0rRdDWzQ/hqdefault.jpg -> https://www.youtube.com/watch?v=IFf0rRdDWzQ |
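The thumbnail-to-video mapping pabs describes can be sketched in a few lines of Python. This is only an illustration: the function name `watch_url_from_thumbnail` is made up here, and the pattern assumes thumbnail URLs of the form `https://i.ytimg.com/vi/<id>/...` with an 11-character ID, as in the example above.

```python
import re
from typing import Optional

# Assumption: thumbnails look like https://i.ytimg.com/vi/<id>/<name>.jpg
# and the video ID is 11 characters from [A-Za-z0-9_-].
_THUMB_RE = re.compile(r"https?://i\.ytimg\.com/vi/([A-Za-z0-9_-]{11})/")


def watch_url_from_thumbnail(url: str) -> Optional[str]:
    """Return the YouTube watch URL for a thumbnail URL, or None if it does not match."""
    match = _THUMB_RE.match(url)
    if match is None:
        return None
    return f"https://www.youtube.com/watch?v={match.group(1)}"


print(watch_url_from_thumbnail("https://i.ytimg.com/vi/IFf0rRdDWzQ/hqdefault.jpg"))
```

URLs that do not match the assumed pattern yield `None` rather than a guess.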
13:11:35 | | collat quits [Ping timeout: 260 seconds] |
13:14:06 | | Matthww joins |
13:15:44 | <pabs> | ugh, the video ids are in JS code |
13:17:53 | <pabs> | and it looks like you need one request to the JS endpoint per ad |
13:20:29 | <pabs> | in Firefox devtools, save all as HAR, then you can open the resulting json file in a text editor and trace which data comes from where |
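Instead of tracing the HAR file by hand in a text editor, the same lookup could be scripted. The sketch below assumes the standard HAR layout (`log.entries[]`, each with `request.url` and `response.content.text`, optionally base64-encoded); the function name `find_sources` and the file name in the usage comment are hypothetical.

```python
import base64
import json


def find_sources(har: dict, needle: str) -> list[str]:
    """Return request URLs whose URL or response body contains `needle`."""
    hits = []
    for entry in har.get("log", {}).get("entries", []):
        url = entry.get("request", {}).get("url", "")
        content = entry.get("response", {}).get("content", {})
        body = content.get("text") or ""
        # HAR may store bodies base64-encoded; decode those before searching.
        if content.get("encoding") == "base64":
            try:
                body = base64.b64decode(body).decode("utf-8", errors="replace")
            except ValueError:
                body = ""
        if needle in url or needle in body:
            hits.append(url)
    return hits


# Usage (hypothetical file name):
# with open("adstransparency.har", encoding="utf-8") as f:
#     har = json.load(f)
# print(find_sources(har, "IFf0rRdDWzQ"))
```

Searching for a known video ID this way should show which request first delivered it, for example one of the JS responses mentioned above.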
13:21:01 | <pabs> | hmm, the ad videos say they are copies of the originals |
13:21:04 | <tzt> | pabs: this is what i've been doing manually for the past few months, but since most of the videos are re-uploaded to Google's own channel and are not the original ad video, i also look at each video to check that it's not a reupload. i don't think this could be automated |
13:21:18 | <pabs> | "This is a copy of an election ad that an advertiser ran on YouTube. It is being retained for display in Google's Political Advertising Transparency Report. For more information, please visit the Transparency Report at https://transparencyreport.google.com..." |
13:21:47 | <pabs> | Google-- |
13:21:47 | <eggdrop> | [karma] 'Google' now has -5 karma! |
13:21:50 | <pabs> | ads-- |
13:21:51 | <eggdrop> | [karma] 'ads' now has -1 karma! |
13:21:58 | <pabs> | they couldn't make this easy could they... |
13:22:35 | <pabs> | maybe the BigQuery export is the best option |
13:22:50 | <pabs> | ah, you need an account |
13:23:00 | <tzt> | if it's the same data as the CSV export, it will be useless |
13:23:26 | <tzt> | the CSV export only contained links to the advertiser pages |
13:26:42 | <pabs> | hmm "Finally, links to the actual political ad in the Google Transparency Report are provided" |
13:27:27 | <tzt> | there are two different types of videos on the ad pages: youtube embeds and IMA (Interactive Media Ads) embeds. only the IMA embeds have the original YouTube video on the website |
13:28:08 | | cas joins |
13:28:30 | <cas> | pabs ok awesome, thank you kindly |
13:28:37 | | cas quits [Client Quit] |
13:45:31 | | etnguyen03 quits [Client Quit] |
13:56:18 | <immibis> | Medowar: wouldn't that be a better donation to one of the projects that deals with things that are about to disappear? |
13:58:47 | | etnguyen03 (etnguyen03) joins |
14:28:59 | | JohnnyJ2 quits [Quit: Ping timeout (120 seconds)] |
14:29:12 | | JohnnyJ2 joins |
14:33:50 | | JohnnyJ2 quits [Ping timeout: 260 seconds] |
14:43:59 | | JohnnyJ2 joins |
15:07:28 | <vix5110_> | What does @ERROR : max connections (400) mean ? |
15:09:24 | | mossss joins |
15:10:26 | <Xanthon> | the number of people uploading to the server is maxed out. It will keep retrying until there's a slot |
15:16:46 | | etnguyen03 quits [Client Quit] |
15:18:04 | | etnguyen03 (etnguyen03) joins |
15:25:14 | | mossss quits [Client Quit] |
15:41:23 | | xkey quits [Quit: WeeChat 4.2.2] |
15:43:28 | | xkey (xkey) joins |
15:47:14 | <vix5110_> | Xanthon is it possible to have info about server load ? |
15:47:24 | <vix5110_> | A stats page or sum... |
15:53:46 | | pixel leaves [Error from remote client] |
15:59:09 | | etnguyen03 quits [Client Quit] |
16:11:22 | | pixel (pixel) joins |
16:29:35 | | MOSTech6502 joins |
16:35:03 | | MrMcNuggets quits [Quit: WeeChat 4.3.2] |
16:41:35 | | MOSTech6502 quits [Ping timeout: 260 seconds] |
16:41:55 | | StarletCharlotte quits [Read error: Connection reset by peer] |
16:57:53 | | etnguyen03 (etnguyen03) joins |
17:08:31 | | lennier2_ quits [Ping timeout: 258 seconds] |
17:14:59 | | icedice (icedice) joins |
17:28:10 | | etnguyen03 quits [Client Quit] |
17:32:48 | | collat joins |
17:39:49 | | useretail joins |
17:49:52 | | etnguyen03 (etnguyen03) joins |
18:12:22 | <h2ibot> | Manu edited Discourse/archived (+96, JAA grabbing https://community.raspberryshake.org/): https://wiki.archiveteam.org/?diff=53663&oldid=53662 |
18:40:35 | | Jake quits [Quit: Leaving for a bit!] |
18:41:01 | | Jake (Jake) joins |
19:12:33 | | lennier2_ joins |
19:29:49 | | Juesto (Juest) joins |
19:32:30 | | Juest quits [Ping timeout: 260 seconds] |
19:32:30 | | Juesto is now known as Juest |
19:33:42 | | pixel leaves [Error from remote client] |
19:39:20 | | Radzig joins |
19:53:44 | <h2ibot> | Manu edited Webring/fediring.net (+371, /* Add more archived pages */): https://wiki.archiveteam.org/?diff=53664&oldid=53658 |
20:29:24 | | Radzig quits [Remote host closed the connection] |
20:29:59 | | etnguyen03 quits [Client Quit] |
20:34:57 | | Radzig joins |
20:38:00 | | tbc1887 quits [Quit: Ping timeout (120 seconds)] |
20:38:30 | | tbc1887 (tbc1887) joins |
20:43:23 | | NF885 (NF885) joins |
20:50:24 | <NF885> | looks like this site should be run in ArchiveBothttps://britishcomics.wordpress.com/ |
20:50:29 | <NF885> | oops |
20:50:33 | <NF885> | https://britishcomics.wordpress.com/ |
20:50:48 | <NF885> | (from https://www.reddit.com/r/Archiveteam/comments/1gdj9pc/need_help_regarding_downloading_british_comics/) |
20:52:28 | <Barto> | thrown into archivebot NF885 |
20:52:36 | <NF885> | Thanks |
20:52:52 | <Barto> | you're welcome |
20:54:08 | | BlueMaxima joins |
20:54:41 | <pokechu22> | It looks like actual files from that are hosted on a different site |
20:54:58 | <immibis> | Xanthon: normally the tracker rate limiting would be lower than what the staging server can handle, I suppose, but right now for veoh it is not |
20:55:21 | <immibis> | Why is the word "target" consistently used instead of "staging server"? |
21:08:55 | | vix5110_ quits [Client Quit] |
21:21:49 | | NF885 quits [Client Quit] |
21:26:19 | | etnguyen03 (etnguyen03) joins |
21:36:06 | | etnguyen03 quits [Client Quit] |
21:37:26 | | etnguyen03 (etnguyen03) joins |
21:37:55 | | icedice quits [Ping timeout: 260 seconds] |
22:07:01 | | SootBector quits [Remote host closed the connection] |
22:07:18 | | SootBector (SootBector) joins |
22:17:12 | <TheTechRobo> | Short for 'upload target' or 'rsync target' |
22:20:05 | <Vokun> | Just be glad i'm not saying tracker anymore... |
22:24:04 | <immibis> | it's such a narrow view that hides the fact the system has another half |
22:42:58 | | useretail quits [Quit: Leaving] |
23:09:51 | | hexa- quits [Quit: WeeChat 4.2.2] |
23:11:39 | | hexa- (hexa-) joins |
23:28:56 | | etnguyen03 quits [Client Quit] |
23:55:24 | <@JAA> | c3manu++ |
23:55:24 | <eggdrop> | [karma] 'c3manu' now has 37 karma! |