00:04:27 | | jacksonchen666 is now authenticated as * |
00:04:27 | | jacksonchen666 is now known as RJHacker18086 |
00:04:31 | | jacksonchen666 (jacksonchen666) joins |
00:06:51 | | RJHacker18086 quits [Ping timeout: 250 seconds] |
00:09:43 | <Naruyoko> | https://docs.google.com/spreadsheets/d/1BtUSgPffbcaW4bMuGClYi8FGvaYmYyc1p4SkfpNty-U/htmlview A list of Yandex Disk links, spreadsheet found from https://siivagunner.fandom.com/wiki/Ripping . I don't know about the archive status or the legality of these song stems files. |
00:55:40 | | Earendil7 (Earendil7) joins |
00:56:08 | <nicolas17> | there was a terrible storm in Bahia Blanca, Buenos Aires, https://www.lanueva.com/ has news and photos about it |
00:56:24 | <nicolas17> | should I make a list of relevant articles to do a non-recursive AB? |
01:01:26 | <pokechu22> | That sounds reasonable to me |
01:02:10 | <nicolas17> | (since a recursive AB on a news site seems like a bad idea) |
01:36:59 | | BlueMaxima quits [Read error: Connection reset by peer] |
02:25:59 | | systwi quits [Ping timeout: 272 seconds] |
02:35:02 | | systwi (systwi) joins |
02:43:44 | <nicolas17> | JAA: https://www.lanueva.com/nota/2023-12-16-20-48-0-fotos-y-videos-del-violento-temporal-que-provoco-destrozos-por-toda-bahia-blanca |
02:43:51 | <nicolas17> | has <amp-img src="https://pxcdn.lanueva.com/122023/1702770456117.jpeg"> |
02:43:57 | <nicolas17> | I guess AB can't follow that? |
02:44:13 | <nicolas17> | (on SPN I guess it would execute the Javascript that turns it into an <img>) |
02:44:26 | <pokechu22> | AB should extract that normally, since it likes extracting stuff from data attributes too |
02:44:42 | <nicolas17> | oh good |
02:48:02 | | qwertyasdfuiopghjkl quits [Remote host closed the connection] |
02:49:25 | <nicolas17> | https://transfer.archivete.am/QFklN/lanueva.com-20231216-storm.txt |
02:49:25 | <eggdrop> | inline (for browser viewing): https://transfer.archivete.am/inline/QFklN/lanueva.com-20231216-storm.txt |
02:54:59 | <pokechu22> | Yeah, it extracted that and also https://pxcdn.lanueva.com/122023/1702770456117.jpeg?cw=83&ch=46 and https://pxcdn.lanueva.com/122023/1702770456117.jpeg?cw=168&ch=94 |
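[Editor's note] The exchange above is about pulling an image URL out of an `<amp-img src="...">` tag without running JavaScript. The following is only a minimal sketch of that idea using Python's standard `html.parser`; it is not ArchiveBot's actual extraction code, and the names `AmpImgExtractor` and `extract_image_urls` are hypothetical.

```python
from html.parser import HTMLParser


class AmpImgExtractor(HTMLParser):
    """Collect the src attribute of <img> and <amp-img> tags (illustrative only)."""

    def __init__(self):
        super().__init__()
        self.urls = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag names, so "amp-img" matches as written.
        if tag in ("img", "amp-img"):
            for name, value in attrs:
                if name == "src" and value:
                    self.urls.append(value)


def extract_image_urls(html):
    """Return every src found on <img>/<amp-img> tags, in document order."""
    parser = AmpImgExtractor()
    parser.feed(html)
    parser.close()
    return parser.urls


sample = '<amp-img src="https://pxcdn.lanueva.com/122023/1702770456117.jpeg"></amp-img>'
urls = extract_image_urls(sample)
```

This only reads the static markup; the resized `?cw=...&ch=...` variants mentioned above come from other attributes or scripts and are not reconstructed here.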
04:12:12 | | nicolas17 quits [Client Quit] |
04:13:09 | | DogsRNice_ quits [Read error: Connection reset by peer] |
04:26:29 | | Fiszl (Kitty) joins |
04:27:19 | | Kitty quits [Quit: WeeChat 3.5] |
04:27:54 | | Fiszl quits [Client Quit] |
04:28:19 | | Kitty (Kitty) joins |
04:29:15 | | Kitty quits [Client Quit] |
04:39:39 | | Kitty (Kitty) joins |
04:39:47 | | Kitty quits [Client Quit] |
04:40:07 | | Kitty (Kitty) joins |
04:59:50 | | pabs quits [Ping timeout: 240 seconds] |
05:02:03 | | pabs (pabs) joins |
05:14:04 | | muu joins |
05:20:27 | | muu quits [Ping timeout: 265 seconds] |
05:37:03 | | benjins2_ quits [Read error: Connection reset by peer] |
05:55:39 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
06:02:04 | | nexusxe (nexusxe) joins |
06:10:23 | | muu joins |
06:27:06 | | qwertyasdfuiopghjkl quits [Remote host closed the connection] |
06:27:06 | | muu quits [Remote host closed the connection] |
06:29:05 | | AlsoHP_Archivist quits [Read error: Connection reset by peer] |
06:32:51 | | nexusxe quits [Client Quit] |
06:33:07 | | qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins |
06:35:36 | | pabs quits [Remote host closed the connection] |
06:37:18 | | pabs (pabs) joins |
07:21:57 | | muu joins |
07:54:55 | | hitgrr8 joins |
08:05:18 | | itachi1706 quits [Quit: Bye :P] |
08:05:45 | | muu quits [Ping timeout: 265 seconds] |
08:05:49 | | itachi1706 (itachi1706) joins |
08:40:39 | | c3manu (c3manu) joins |
08:40:44 | | c3manu quits [Max SendQ exceeded] |
08:40:53 | | Kitty quits [Client Quit] |
08:41:00 | | c3manu (c3manu) joins |
08:41:00 | | c3manu quits [Max SendQ exceeded] |
08:41:04 | | Kitty (Kitty) joins |
08:41:17 | | c3manu (c3manu) joins |
08:43:03 | | Kitty quits [Client Quit] |
08:43:10 | | Kitty (Kitty) joins |
08:48:27 | | muu joins |
08:48:47 | | muu quits [Remote host closed the connection] |
09:02:26 | | xkey quits [Quit: WeeChat 4.0.4] |
09:36:24 | | Island quits [Read error: Connection reset by peer] |
09:56:58 | | xkey (xkey) joins |
10:00:05 | | Bleo1826 quits [Client Quit] |
10:01:21 | | Bleo1826 joins |
10:10:47 | <h2ibot> | Exorcism uploaded File:Screenshot-moegirlpedia.png: https://wiki.archiveteam.org/?title=File%3AScreenshot-moegirlpedia.png |
10:11:47 | <h2ibot> | Exorcism edited Moegirlpedia (+38): https://wiki.archiveteam.org/?diff=51369&oldid=51335 |
10:14:47 | <h2ibot> | Exorcism uploaded File:Screenshot-wikiapiary.png: https://wiki.archiveteam.org/?title=File%3AScreenshot-wikiapiary.png |
10:15:47 | <h2ibot> | Exorcism uploaded File:Logo-wikiapiary.png: https://wiki.archiveteam.org/?title=File%3ALogo-wikiapiary.png |
10:15:48 | <h2ibot> | Exorcism edited WikiApiary (+44): https://wiki.archiveteam.org/?diff=51372&oldid=51328 |
10:19:53 | | icedice (icedice) joins |
10:20:49 | <h2ibot> | Exorcism uploaded File:Screenshot-xeno-canto.png: https://wiki.archiveteam.org/?title=File%3AScreenshot-xeno-canto.png |
10:21:49 | <h2ibot> | Exorcism edited Xeno-canto (+36): https://wiki.archiveteam.org/?diff=51374&oldid=51330 |
12:03:47 | | Megame (Megame) joins |
12:39:20 | | Arcorann quits [Ping timeout: 240 seconds] |
13:00:35 | | Megame quits [Client Quit] |
13:04:53 | | T31M quits [Quit: ZNC - https://znc.in] |
13:05:13 | | T31M joins |
13:06:27 | | T31M is now authenticated as T31M |
13:18:13 | | BearFortress quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.] |
13:33:50 | | nulldata quits [Ping timeout: 240 seconds] |
13:41:07 | | icedice quits [Ping timeout: 272 seconds] |
13:42:25 | | nulldata (nulldata) joins |
13:49:25 | | BearFortress joins |
13:58:21 | | riku quits [Client Quit] |
14:40:55 | | second (second) joins |
14:42:37 | | sec^nd quits [Ping timeout: 250 seconds] |
14:42:38 | | second is now known as sec^nd |
15:05:36 | | Jojo111 joins |
15:09:47 | | systwi quits [Ping timeout: 272 seconds] |
15:19:47 | <Jojo111> | Hello everyone. I am looking for an old deleted coursera course called "ECEN 5017 Power Electronics For Electric Drive Vehicles". I would be immensely grateful if someone can help me find it. Thanks. |
15:19:55 | | riku (riku) joins |
15:20:06 | | systwi (systwi) joins |
15:38:44 | | andrew7 is now known as andrew |
15:58:50 | | tertu quits [Ping timeout: 240 seconds] |
16:00:24 | | tertu (tertu) joins |
16:08:36 | | Jojo111 quits [Ping timeout: 265 seconds] |
16:20:10 | | Jojo111 joins |
16:31:30 | | DogsRNice joins |
16:33:08 | | Doranwen quits [Remote host closed the connection] |
16:33:41 | | Doranwen (Doranwen) joins |
16:37:07 | | Jojo111 quits [Ping timeout: 265 seconds] |
17:06:29 | | Jojo111 joins |
17:21:54 | | a joins |
17:22:15 | | a quits [Remote host closed the connection] |
17:29:34 | | nicolas17 joins |
17:29:39 | <nicolas17> | welp |
17:29:41 | <nicolas17> | the storm reached me |
17:29:47 | <nicolas17> | that was the strongest wind I've ever seen irl |
17:30:57 | <@JAA> | nicolas17: Stay safe, mate! |
17:32:32 | <nicolas17> | do we have anything functional for twitter archival? |
17:32:36 | <nicolas17> | https://twitter.com/sangarciacorre/status/1736391098140856741 |
17:32:37 | <eggdrop> | nitter: https://nitter.net/sangarciacorre/status/1736391098140856741 |
17:34:21 | <nicolas17> | https://twitter.com/pablezlo/status/1736299091942850669 |
17:34:21 | <eggdrop> | nitter: https://nitter.net/pablezlo/status/1736299091942850669 |
17:34:22 | <nicolas17> | https://twitter.com/martipueente/status/1736286981271720249 |
17:35:21 | <nicolas17> | https://twitter.com/TransitoAereoAr/status/1736397747249463450 |
17:35:21 | <eggdrop> | nitter: https://nitter.net/TransitoAereoAr/status/1736397747249463450 |
17:37:05 | | hexa- quits [Changing host] |
17:37:05 | | hexa- (hexa-) joins |
17:38:10 | | hexa- quits [Changing host] |
17:38:10 | | hexa- (hexa-) joins |
17:48:07 | <Barto> | nicolas17: twitter archival? I run my own nitter instance at nitter.vloup.ch and it works quite well with twitterminator tokens :-) It's not advertised in their public github wiki, and that's the spirit of it. Just be careful about the 429 ;-) |
17:51:33 | | Jojo111 quits [Ping timeout: 265 seconds] |
18:05:51 | | jacksonchen666 quits [Ping timeout: 250 seconds] |
18:06:41 | | jacksonchen666 (jacksonchen666) joins |
18:08:20 | | riku quits [Ping timeout: 240 seconds] |
18:15:58 | | icedice (icedice) joins |
18:53:28 | | Jojo111 joins |
18:54:47 | | Matthww119 quits [Quit: The Lounge - https://thelounge.chat] |
18:57:21 | <nicolas17> | Barto: well we could throw those nitter.net links into AB |
19:06:14 | | qwertyasdfuiopghjkl quits [Remote host closed the connection] |
19:15:27 | | riku (riku) joins |
19:26:59 | <nicolas17> | I walked around the neighborhood and took 1000+ photos for Mapillary, there wasn't much destruction in this area tho |
19:29:13 | <fireonlive> | raw dog nitter.net doesn’t work with AB |
19:29:27 | <fireonlive> | but Barto’s does |
19:29:45 | <nicolas17> | phrasing |
19:29:57 | <fireonlive> | :p |
19:36:10 | | Matthww119 joins |
19:40:19 | | c3manu quits [Remote host closed the connection] |
19:47:05 | | icedice quits [Client Quit] |
19:52:28 | | Jojo111 quits [Remote host closed the connection] |
19:57:59 | <Barto> | muahaha |
20:07:27 | <Pedrosso> | Would https://gamebanana.com/ be good to save? It's got lots of mods (such as for Portal 2 & HL2). It has very limited coverage but seems quite big. https://sitemap.gamebanana.com/index.xml (each sitemap in the index seems to refer to only 1 URL). If it's up to date, that's about 19k pages |
20:09:42 | <pokechu22> | 19k seems kinda small for something like that, hmm |
20:09:52 | <Pedrosso> | It does |
20:09:54 | <Pedrosso> | Although the site isn't in danger afaik, the coverage is limited and mods can be deleted at any time |
20:10:25 | <pokechu22> | Index (2,103,542) for members - that's a lot of users |
20:10:38 | <Pedrosso> | Ah. The 1 page per sitemap applies to categories but not necessarily for others |
20:11:18 | <Pedrosso> | mod categories. Yeah my main guess was way off. Must be much bigger |
20:12:37 | <aninternettroll> | Wow, I've never seen that many sitemap files |
20:12:57 | <pokechu22> | Also looks like things are pretty JS-based: view-source:https://gamebanana.com/games doesn't have any games in it |
20:24:34 | <Pedrosso> | What about pages that aren't search pages? |
20:26:56 | <pokechu22> | https://gamebanana.com/mods/483831 seems to be blank, depending on at least https://gamebanana.com/apiv11/Mod/483831/ProfilePage / https://gamebanana.com/apiv11/Mod/483831/Config / https://gamebanana.com/apiv11/Mod/483831/Posts?_nPage=1&_nPerpage=15&_sSort=popular / https://gamebanana.com/apiv11/Member/UiConfig?_sUrl=%2Fmods%2F483831 |
20:27:46 | <Pedrosso> | oh wow. So AB is a no-go? Or do the .js files have necessary info? |
20:29:13 | <pokechu22> | AB almost certainly won't work I think |
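[Editor's note] pokechu22's message lists four `apiv11` URLs that the mod page for ID 483831 appears to depend on. As a rough sketch, assuming other mod IDs follow the same pattern (the log only shows this one example), the set of URLs for a given mod could be built like this. The function name `api_urls_for_mod` is hypothetical, and the query parameters are copied from the message rather than from any documented API.

```python
from urllib.parse import quote, urlencode

BASE = "https://gamebanana.com/apiv11"


def api_urls_for_mod(mod_id):
    """Build the four API URLs seen in the log for one mod ID (pattern assumed)."""
    # Query string copied from the example URL in the chat message.
    posts_query = urlencode({"_nPage": 1, "_nPerpage": 15, "_sSort": "popular"})
    # The UiConfig endpoint takes the page path, percent-encoded including slashes.
    page_path = quote(f"/mods/{mod_id}", safe="")
    return [
        f"{BASE}/Mod/{mod_id}/ProfilePage",
        f"{BASE}/Mod/{mod_id}/Config",
        f"{BASE}/Mod/{mod_id}/Posts?{posts_query}",
        f"{BASE}/Member/UiConfig?_sUrl={page_path}",
    ]


example_urls = api_urls_for_mod(483831)
```

Whether these responses alone are enough to replay a page is not established in the log.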
20:31:32 | | bladem quits [Read error: Connection reset by peer] |
20:36:47 | | bladem (bladem) joins |
20:48:43 | | Max|m12 is now known as MaxG |
20:50:52 | | BlueMaxima joins |
20:55:23 | | jacksonchen666 is now authenticated as * |
20:55:23 | | jacksonchen666 is now known as RJHacker77207 |
20:55:28 | | jacksonchen666 (jacksonchen666) joins |
20:56:26 | | RJHacker77207 quits [Remote host closed the connection] |
21:00:06 | <Pedrosso> | Quite long... https://transfer.archivete.am/WXonx/gamebanana.com_full_sitemap.txt.gz |
21:00:20 | <Pedrosso> | 0.7 million, hah. |
21:04:45 | | MaxG is now authenticated as MaxG |
21:05:49 | | jacksonchen666 quits [Client Quit] |
21:15:53 | | bf_ quits [Remote host closed the connection] |
21:25:33 | <Pedrosso> | The mod downloads appear to work off of a https://gamebanana.com/dl/{id} system, where the ID can be gotten from the mod page URL, e.g. https://gamebanana.com/mods/233183, so a full list of those is an easy one. |
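[Editor's note] A minimal sketch of the mapping Pedrosso describes, assuming (as the message suggests, and not verified here) that the numeric ID in a `/mods/` URL is the same ID used under `/dl/`. The helper name `mod_url_to_dl_url` is hypothetical.

```python
import re
from typing import Optional

# Matches mod page URLs of the form shown in the log, e.g. .../mods/233183
MOD_URL_RE = re.compile(r"^https://gamebanana\.com/mods/(\d+)/?$")


def mod_url_to_dl_url(mod_url: str) -> Optional[str]:
    """Return the assumed download URL for a mod page URL, or None if it does not match."""
    match = MOD_URL_RE.match(mod_url.strip())
    if match is None:
        return None
    return f"https://gamebanana.com/dl/{match.group(1)}"


dl_url = mod_url_to_dl_url("https://gamebanana.com/mods/233183")
```

Applied to the mod URLs in a sitemap dump, this would give a candidate list of download URLs; any that do not resolve would indicate the ID assumption is wrong.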
21:28:29 | | DigitalDragons quits [Read error: Connection reset by peer] |
21:41:13 | <@OrIdow6> | Is this shutting down? |
21:51:26 | <Pedrosso> | I stated before it wasn't, rather that it's just low coverage |
21:58:44 | <phuzion> | Can someone take a look at the archivebot job for forums.questionablecontent.net and see if it's worth upping the speed on the job? I think they managed to get the server migrated, it seems to be pretty stable. |
21:59:16 | <phuzion> | Also might be worth looking into increasing the ignores on that job, it seems to go off on full-ass tangents of archiving some random other websites. |
22:05:28 | <pokechu22> | Yeah, archivebot saves outlinks and embedded images by default, and unfortunately a lot of old forum image hosts are dead :/ |
22:06:16 | | icedice (icedice) joins |
22:06:22 | <pokechu22> | I've bumped up the speed to 1s-2s, let's see if it's stable like that |
22:29:47 | | Island joins |
23:14:49 | | DigitalDragons (DigitalDragons) joins |
23:38:20 | | nulldata quits [Ping timeout: 240 seconds] |