00:04:27jacksonchen666 is now known as RJHacker18086
00:04:31jacksonchen666 (jacksonchen666) joins
00:06:51RJHacker18086 quits [Ping timeout: 250 seconds]
00:09:43<Naruyoko>https://docs.google.com/spreadsheets/d/1BtUSgPffbcaW4bMuGClYi8FGvaYmYyc1p4SkfpNty-U/htmlview A list of Yandex Disk links, spreadsheet found from https://siivagunner.fandom.com/wiki/Ripping . I don't know about the archive status or the legality of these song stems files.
00:55:40Earendil7 (Earendil7) joins
00:56:08<nicolas17>there was a terrible storm in Bahia Blanca, Buenos Aires, https://www.lanueva.com/ has news and photos about it
00:56:24<nicolas17>should I make a list of relevant articles to do a non-recursive AB?
01:01:26<pokechu22>That sounds reasonable to me
01:02:10<nicolas17>(since a recursive AB on a news site seems like a bad idea)
01:36:59BlueMaxima quits [Read error: Connection reset by peer]
02:25:59systwi quits [Ping timeout: 272 seconds]
02:35:02systwi (systwi) joins
02:43:44<nicolas17>JAA: https://www.lanueva.com/nota/2023-12-16-20-48-0-fotos-y-videos-del-violento-temporal-que-provoco-destrozos-por-toda-bahia-blanca
02:43:51<nicolas17>has <amp-img src="https://pxcdn.lanueva.com/122023/1702770456117.jpeg">
02:43:57<nicolas17>I guess AB can't follow that?
02:44:13<nicolas17>(on SPN I guess it would execute the Javascript that turns it into an <img>)
02:44:26<pokechu22>AB should extract that normally, since it likes extracting stuff from data attributes too
02:44:42<nicolas17>oh good
02:48:02qwertyasdfuiopghjkl quits [Remote host closed the connection]
02:49:25<nicolas17>https://transfer.archivete.am/QFklN/lanueva.com-20231216-storm.txt
02:49:25<eggdrop>inline (for browser viewing): https://transfer.archivete.am/inline/QFklN/lanueva.com-20231216-storm.txt
02:54:59<pokechu22>Yeah, it extracted that and also https://pxcdn.lanueva.com/122023/1702770456117.jpeg?cw=83&ch=46 and https://pxcdn.lanueva.com/122023/1702770456117.jpeg?cw=168&ch=94
04:12:12nicolas17 quits [Client Quit]
04:13:09DogsRNice_ quits [Read error: Connection reset by peer]
04:26:29Fiszl (Kitty) joins
04:27:19Kitty quits [Quit: WeeChat 3.5]
04:27:54Fiszl quits [Client Quit]
04:28:19Kitty (Kitty) joins
04:29:15Kitty quits [Client Quit]
04:39:39Kitty (Kitty) joins
04:39:47Kitty quits [Client Quit]
04:40:07Kitty (Kitty) joins
04:59:50pabs quits [Ping timeout: 240 seconds]
05:02:03pabs (pabs) joins
05:14:04muu joins
05:20:27muu quits [Ping timeout: 265 seconds]
05:37:03benjins2_ quits [Read error: Connection reset by peer]
05:55:39qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
06:02:04nexusxe (nexusxe) joins
06:10:23muu joins
06:27:06qwertyasdfuiopghjkl quits [Remote host closed the connection]
06:27:06muu quits [Remote host closed the connection]
06:29:05AlsoHP_Archivist quits [Read error: Connection reset by peer]
06:32:51nexusxe quits [Client Quit]
06:33:07qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
06:35:36pabs quits [Remote host closed the connection]
06:37:18pabs (pabs) joins
07:21:57muu joins
07:54:55hitgrr8 joins
08:05:18itachi1706 quits [Quit: Bye :P]
08:05:45muu quits [Ping timeout: 265 seconds]
08:05:49itachi1706 (itachi1706) joins
08:40:39c3manu (c3manu) joins
08:40:44c3manu quits [Max SendQ exceeded]
08:40:53Kitty quits [Client Quit]
08:41:00c3manu (c3manu) joins
08:41:00c3manu quits [Max SendQ exceeded]
08:41:04Kitty (Kitty) joins
08:41:17c3manu (c3manu) joins
08:43:03Kitty quits [Client Quit]
08:43:10Kitty (Kitty) joins
08:48:27muu joins
08:48:47muu quits [Remote host closed the connection]
09:02:26xkey quits [Quit: WeeChat 4.0.4]
09:36:24Island quits [Read error: Connection reset by peer]
09:56:58xkey (xkey) joins
10:00:05Bleo1826 quits [Client Quit]
10:01:21Bleo1826 joins
10:10:47<h2ibot>Exorcism uploaded File:Screenshot-moegirlpedia.png: https://wiki.archiveteam.org/?title=File%3AScreenshot-moegirlpedia.png
10:11:47<h2ibot>Exorcism edited Moegirlpedia (+38): https://wiki.archiveteam.org/?diff=51369&oldid=51335
10:14:47<h2ibot>Exorcism uploaded File:Screenshot-wikiapiary.png: https://wiki.archiveteam.org/?title=File%3AScreenshot-wikiapiary.png
10:15:47<h2ibot>Exorcism uploaded File:Logo-wikiapiary.png: https://wiki.archiveteam.org/?title=File%3ALogo-wikiapiary.png
10:15:48<h2ibot>Exorcism edited WikiApiary (+44): https://wiki.archiveteam.org/?diff=51372&oldid=51328
10:19:53icedice (icedice) joins
10:20:49<h2ibot>Exorcism uploaded File:Screenshot-xeno-canto.png: https://wiki.archiveteam.org/?title=File%3AScreenshot-xeno-canto.png
10:21:49<h2ibot>Exorcism edited Xeno-canto (+36): https://wiki.archiveteam.org/?diff=51374&oldid=51330
12:03:47Megame (Megame) joins
12:39:20Arcorann quits [Ping timeout: 240 seconds]
13:00:35Megame quits [Client Quit]
13:04:53T31M quits [Quit: ZNC - https://znc.in]
13:05:13T31M joins
13:18:13BearFortress quits [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]
13:33:50nulldata quits [Ping timeout: 240 seconds]
13:41:07icedice quits [Ping timeout: 272 seconds]
13:42:25nulldata (nulldata) joins
13:49:25BearFortress joins
13:58:21riku quits [Client Quit]
14:40:55second (second) joins
14:42:37sec^nd quits [Ping timeout: 250 seconds]
14:42:38second is now known as sec^nd
15:05:36Jojo111 joins
15:09:47systwi quits [Ping timeout: 272 seconds]
15:19:47<Jojo111>Hello everyone. I am looking for an old deleted coursera course called "ECEN 5017 Power Electronics For Electric Drive Vehicles". I would be immensely grateful if someone can help me find it. Thanks.
15:19:55riku (riku) joins
15:20:06systwi (systwi) joins
15:38:44andrew7 is now known as andrew
15:58:50tertu quits [Ping timeout: 240 seconds]
16:00:24tertu (tertu) joins
16:08:36Jojo111 quits [Ping timeout: 265 seconds]
16:20:10Jojo111 joins
16:31:30DogsRNice joins
16:33:08Doranwen quits [Remote host closed the connection]
16:33:41Doranwen (Doranwen) joins
16:37:07Jojo111 quits [Ping timeout: 265 seconds]
17:06:29Jojo111 joins
17:21:54a joins
17:22:15a quits [Remote host closed the connection]
17:29:34nicolas17 joins
17:29:39<nicolas17>welp
17:29:41<nicolas17>the storm reached me
17:29:47<nicolas17>that was the strongest wind I've ever seen irl
17:30:57<@JAA>nicolas17: Stay safe, mate!
17:32:32<nicolas17>do we have anything functional for twitter archival?
17:32:36<nicolas17>https://twitter.com/sangarciacorre/status/1736391098140856741
17:32:37<eggdrop>nitter: https://nitter.net/sangarciacorre/status/1736391098140856741
17:34:21<nicolas17>https://twitter.com/pablezlo/status/1736299091942850669
17:34:21<eggdrop>nitter: https://nitter.net/pablezlo/status/1736299091942850669
17:34:22<nicolas17>https://twitter.com/martipueente/status/1736286981271720249
17:35:21<nicolas17>https://twitter.com/TransitoAereoAr/status/1736397747249463450
17:35:21<eggdrop>nitter: https://nitter.net/TransitoAereoAr/status/1736397747249463450
17:37:05hexa- quits [Changing host]
17:37:05hexa- (hexa-) joins
17:38:10hexa- quits [Changing host]
17:38:10hexa- (hexa-) joins
17:48:07<Barto>nicolas17: twitter archival? I run my own nitter instance at nitter.vloup.ch and it works quite well with twitterminator tokens :-) It's not advertised in their public github wiki, and that's the spirit of it. Just be careful about the 429 ;-)
17:51:33Jojo111 quits [Ping timeout: 265 seconds]
18:05:51jacksonchen666 quits [Ping timeout: 250 seconds]
18:06:41jacksonchen666 (jacksonchen666) joins
18:08:20riku quits [Ping timeout: 240 seconds]
18:15:58icedice (icedice) joins
18:53:28Jojo111 joins
18:54:47Matthww119 quits [Quit: The Lounge - https://thelounge.chat]
18:57:21<nicolas17>Barto: well we could throw those nitter.net links into AB
19:06:14qwertyasdfuiopghjkl quits [Remote host closed the connection]
19:15:27riku (riku) joins
19:26:59<nicolas17>I walked around the neighborhood and took 1000+ photos for Mapillary, there wasn't much destruction in this area tho
19:29:13<fireonlive>raw dog nitter.net doesn’t work with AB
19:29:27<fireonlive>but Barto’s does
19:29:45<nicolas17>phrasing
19:29:57<fireonlive>:p
19:36:10Matthww119 joins
19:40:19c3manu quits [Remote host closed the connection]
19:47:05icedice quits [Client Quit]
19:52:28Jojo111 quits [Remote host closed the connection]
19:57:59<Barto>muahaha
20:07:27<Pedrosso>Would https://gamebanana.com/ be good to be saved? It's got lots of mods (such as portal 2 & HL2) It has very limited coverage but seems quite big. https://sitemap.gamebanana.com/index.xml (each sitemap in the index seems to refer to only 1 URL). If it's up to date that's about 19k pages
20:09:42<pokechu22>19k seems kinda small for something like that, hmm
20:09:52<Pedrosso>It does
20:09:54<Pedrosso>Although the site isn't in danger afaik, the coverage is limited and mods can be deleted at any time
20:10:25<pokechu22>Index (2,103,542) for members - that's a lot of users
20:10:38<Pedrosso>Ah. The 1 page per sitemap applies to categories but not necessarily for others
20:11:18<Pedrosso>mod categories. Yeah my main guess was way off. Must be much bigger
20:12:37<aninternettroll>Wow, I've never seen that many sitemap files
20:12:57<pokechu22>Also looks like things are pretty JS-based: view-source:https://gamebanana.com/games doesn't have any games in it
20:24:34<Pedrosso>What about pages that aren't search pages?
20:26:56<pokechu22>https://gamebanana.com/mods/483831 seems to be blank, depending on at least https://gamebanana.com/apiv11/Mod/483831/ProfilePage / https://gamebanana.com/apiv11/Mod/483831/Config / https://gamebanana.com/apiv11/Mod/483831/Posts?_nPage=1&_nPerpage=15&_sSort=popular / https://gamebanana.com/apiv11/Member/UiConfig?_sUrl=%2Fmods%2F483831
20:27:46<Pedrosso>oh wow. So AB is a a no-go? Or do the .js files have necessary info?
20:29:13<pokechu22>AB almost certainly won't work I think
20:31:32bladem quits [Read error: Connection reset by peer]
20:36:47bladem (bladem) joins
20:48:43Max|m12 is now known as MaxG
20:50:52BlueMaxima joins
20:55:23jacksonchen666 is now known as RJHacker77207
20:55:28jacksonchen666 (jacksonchen666) joins
20:56:26RJHacker77207 quits [Remote host closed the connection]
21:00:06<Pedrosso>Quite long... https://transfer.archivete.am/WXonx/gamebanana.com_full_sitemap.txt.gz
21:00:20<Pedrosso>0.7 million, hah.
21:05:49jacksonchen666 quits [Client Quit]
21:15:53bf_ quits [Remote host closed the connection]
21:25:33<Pedrosso>The mod downloads appear to work off of a https://gamebanana.com/dl/{id} system. Where the ID can be gotten from the mod page url, https://gamebanana.com/mods/233183 so a full list of those is an easy one.
21:28:29DigitalDragons quits [Read error: Connection reset by peer]
21:41:13<@OrIdow6>Is this shutting down?
21:51:26<Pedrosso>I stated before it wasn't, rather that it's just low coverage
21:58:44<phuzion>Can someone take a look at the archivebot job for forums.questionablecontent.net and see if it's worth upping the speed on the job? I think they managed to get the server migrated, it seems to be pretty stable.
21:59:16<phuzion>Also might be worth looking into increasing the ignores on that job, it seems to go off on full-ass tangents of archiving some random other websites.
22:05:28<pokechu22>Yeah, archivebot saves outlinks and embedded images by default, and unfortunately a lot of old forum image hosts are dead :/
22:06:16icedice (icedice) joins
22:06:22<pokechu22>I've bumped up the speed to 1s-2s, let's see if it's stable like that
22:29:47Island joins
23:14:49DigitalDragons (DigitalDragons) joins
23:38:20nulldata quits [Ping timeout: 240 seconds]