00:05:02Megame1_ quits [Client Quit]
00:08:16etnguyen03 (etnguyen03) joins
00:09:38Arcorann (Arcorann) joins
00:11:36nic quits [Client Quit]
00:15:15nic (nic) joins
00:16:50Fob joins
00:31:56Fob quits [Ping timeout: 252 seconds]
00:37:49PredatorIWD joins
00:44:02etnguyen03 quits [Ping timeout: 252 seconds]
00:50:27PredatorIWD quits [Client Quit]
01:00:21mindstrut1 quits [Read error: Connection reset by peer]
01:00:21mindstrut quits [Read error: Connection reset by peer]
01:00:44mindstrut joins
01:00:55mindstrut1 joins
01:02:26DogsRNice_ joins
01:04:56DogsRNice quits [Ping timeout: 252 seconds]
01:05:11JC|m joins
01:06:25<JC|m>!help
01:08:22<JC|m>is this the room for archive bot?
01:08:42etnguyen03 (etnguyen03) joins
01:09:06<JC|m>!archive
01:09:35<JC|m>!archive https://github.com
01:10:42systwi_ joins
01:36:08Exorcism quits [Remote host closed the connection]
01:36:49Exorcism (exorcism) joins
01:37:08<pabs>JC|m: the bot is in #archivebot, but only voiced/ops have access.
01:37:26<pabs>github.com is also way too big to be archived using archivebot
01:37:54<pabs>github.com is also not the right way to archive individual GitHub repos/orgs, we have #gitgud for doing that
01:38:31<pabs>if you have any other sites to save via ArchiveBot, state the URLs here and the reasons for archiving them
01:41:14etnguyen03 quits [Ping timeout: 252 seconds]
01:44:42pabs quits [Quit: Don't rest until all the world is paved in moss and greenery.]
01:46:19pabs (pabs) joins
02:07:48etnguyen03 (etnguyen03) joins
02:10:40<DogsRNice_>computer, please archive the entire internet
02:25:21etnguyen03 quits [Ping timeout: 265 seconds]
02:28:12Exorcism quits [Remote host closed the connection]
02:28:38Exorcism (exorcism) joins
02:51:32<h2ibot>PaulWise edited Mailman2 (+964, add more, one done): https://wiki.archiveteam.org/?diff=50948&oldid=50886
02:51:32AmAnd0A quits [Read error: Connection reset by peer]
02:51:35AmAnd0A joins
03:09:42etnguyen03 (etnguyen03) joins
03:17:16<JC|m><pabs> "JC: the bot is in #archivebot..." <- Is there a difference between archiving here or directly from the website?
03:17:50<JC|m>in https://archive.org/web/
03:18:45<imer>Two different systems, yes. archiveteam is not the internet archive
03:19:48<JC|m>https://subscene.com
03:20:09<JC|m>I was looking into archiving this site.
03:21:46<JC|m>imer:
03:22:04<JC|m>pabs:
03:24:44<pabs>JC|m: what is the reason for archiving subscene?
03:25:27<JC|m>They have a big directory of subtitles for movies and TV shows.
03:26:01<JC|m>The website has had some downtime recently.
03:26:11<pabs>hmm, lots of filenames indicating pirate sites.
03:26:26<JC|m>I wanted to be archived if anything happened
03:26:26pabs wonders what the policy is for that sort of stuff
03:27:28<JC|m>The subtitles are not pirated.
03:28:09<JC|m>A lot of famous streaming sites upload their subs on subsence
03:28:54<pabs>streaming sites like Disney+ or?
03:29:03<JC|m>nah
03:29:24<pabs>Netflix?
03:29:51<JC|m>30Nama
03:29:53<JC|m>filimo
03:29:56<JC|m>namava
03:31:08<JC|m>And independent people who translate subtitles
03:44:13<pabs>looks like it was last saved in 2018 https://archive.fart.website/archivebot/viewer/domain/subscene.com
03:44:24<pabs>so time for an update I guess
03:45:22<pabs>interesting, Google deleted one of its results for this site because of https://www.lumendatabase.org/notices/18635190
03:49:59<project10>overzealous court. SRT files (the actual subtitles) are just timestamps and text.
03:50:22<pabs>presumably the dialog/script is copyrighted though :)
03:51:04<pabs>and the movie cover/posters
03:51:09<pabs>anyway, running now
03:51:28<pabs>JC|m: see http://archivebot.com/ if you want to follow the job
03:51:41<pabs>oh, it got a 403 error
03:55:11<pabs>JC|m: no dice, all my attempts got 403 errors for the front page
03:59:02<pabs>maybe something for qwarc from JAA
03:59:24<JC|m>pabs: is it the robots.txt?
03:59:59<pabs>no, AB ignores robots.txt, the first request AB sent (for the front page) got a 403 error
04:00:27<pabs>same goes for the subdomains, including the forum
04:01:36<JC|m>Do you guys also archive forums, right?
04:02:07<sepro>I archived all of subscene last year. Though not in the warc format, but just an archive of all subtitle files.
04:02:12<sepro>The main problem was URL discovery, as there is no easy way to get a list of all shows. Also had some problems with cloudflare.
04:07:38<JC|m>Is v3.2 The latest version of Warrior?
04:07:50<JC|m>from 2021
04:08:31<project10>from https://warriorhq.archiveteam.org/downloads/warrior3/ ? those are virtual machine images
04:09:36<project10>if you're familiar with docker, you can also run a container and get the latest that way. I think the VM images would keep themselves up to date, but I've never used them
04:10:13<project10>there is a dedicated #warrior channel if you need support for either option
04:13:08AmAnd0A quits [Ping timeout: 265 seconds]
04:13:11AmAnd0A joins
04:13:36<pabs>JC|m: yeah, we often safe forums
04:13:41<pabs>er save forums
04:16:26<JC|m>https://discuss.privacyguides.net/
04:16:30<JC|m>https://linustechtips.com/
04:18:30<JC|m>Some forums that you may want to archive
04:20:06<Peroniko>I tried to archive forum.cdm.me before, but there any many ignores that I had to apply
04:20:33<Peroniko>All the pages with this were behind the login screen: ^(?:(?!private\.php\?|register\.php\?|sendmessage\.php\?|itrader_feedback\.php\?|newreply\.php\?|usercp\.php\?|subscription\.php\?).)*$
04:22:49<DigitalDragons>seems like a very aggressive cloudflare configuration on subscene
04:51:05etnguyen03 quits [Client Quit]
04:57:11Exorcism quits [Remote host closed the connection]
04:58:39Exorcism (exorcism) joins
05:00:35hitgrr8 joins
05:01:26@dxrt quits [Ping timeout: 252 seconds]
05:02:32kiska5 quits [Ping timeout: 252 seconds]
05:02:32Ryz quits [Ping timeout: 252 seconds]
05:02:55IDK_ quits [Ping timeout: 265 seconds]
05:05:22kiska5 joins
05:05:25dxrt joins
05:05:27dxrt quits [Changing host]
05:05:27dxrt (dxrt) joins
05:05:27@ChanServ sets mode: +o dxrt
05:05:34IDK_ joins
05:05:46Ryz (Ryz) joins
05:06:19Wohlstand (Wohlstand) joins
05:17:03DogsRNice_ quits [Read error: Connection reset by peer]
05:36:45dumbgoy quits [Ping timeout: 265 seconds]
06:03:35balrog quits [Ping timeout: 252 seconds]
06:04:11sec^nd quits [Ping timeout: 245 seconds]
06:04:30balrog (balrog) joins
06:09:31sec^nd (second) joins
06:12:33Exorcism quits [Remote host closed the connection]
06:13:10Exorcism (exorcism) joins
06:28:01ssss joins
06:29:11<Exorcism>https://twitter.com/PretendoNetwork/status/1710896499700150390
06:29:12<eggdrop>nitter: https://nitter.net/PretendoNetwork/status/1710896499700150390
06:40:51BigBrain quits [Ping timeout: 245 seconds]
06:42:43BigBrain (bigbrain) joins
06:42:54<pabs>hmm, I feel like the AB websocket is not passing on all URL requests/responses
06:54:25<@JAA>pabs: There's a known bug near the end of jobs, where the last couple lines might get swallowed. Other than that, it should only drop messages when it can't keep up. It currently looks like there are two clients that are too slow and get messages dropped regularly.
06:55:25<pabs>hhmm, I have two non-browser clients attached, that must be me
06:57:32eroc19903 (eroc1990) joins
06:59:08eroc1990 quits [Ping timeout: 252 seconds]
06:59:08magmaus3 quits [Ping timeout: 252 seconds]
07:11:18<pabs>JAA: an example: I just redid upload.systems, but the job doesn't show up at all in the browser, id y6rqls8zrwd2r7hc1fa2znok
07:28:14Wohlstand quits [Client Quit]
07:40:39BlueMaxima quits [Read error: Connection reset by peer]
08:08:52Island quits [Read error: Connection reset by peer]
08:19:09threedeeitguy39 quits [Ping timeout: 265 seconds]
08:22:56icedice quits [Client Quit]
08:31:53threedeeitguy39 (threedeeitguy) joins
08:45:07ssss quits [Remote host closed the connection]
09:08:20magmaus3 (magmaus3) joins
09:10:18Exorcism quits [Client Quit]
09:11:33Exorcism (exorcism) joins
09:11:43bf_ joins
09:24:28petrichor (petrichor) joins
09:48:23Deewiant quits [Remote host closed the connection]
09:49:32Deewiant (Deewiant) joins
10:13:48ssss joins
10:33:21sec^nd quits [Ping timeout: 245 seconds]
10:38:29sec^nd (second) joins
10:43:24Chris5010 (Chris5010) joins
11:36:27icedice (icedice) joins
11:46:54justhere66 joins
11:47:03<justhere66>hi
11:47:10<justhere66>could I have this whole website archived on the internet archive? It's a very small website.
11:47:13<justhere66>https://oowmun.org/
11:47:18<justhere66>!archive https://oowmun.org/
11:49:21justhere66 quits [Remote host closed the connection]
11:49:28qwertyasdfuiopghjkl quits [Client Quit]
11:50:45qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
12:25:51Barto quits [Read error: Connection reset by peer]
12:34:57Barto (Barto) joins
13:07:08sonick (sonick) joins
13:09:38AmAnd0A quits [Ping timeout: 265 seconds]
13:10:18AmAnd0A joins
13:13:38<nulldata>PinstripedProspects.com, a blog covering the New York Yankees minor league system, has announced it's shutting down in 2 weeks. https://www.pinstripedprospects.com/pinstriped-prospects-website-shutting-down-65038/
13:20:16lunik173 quits [Ping timeout: 265 seconds]
13:33:58lunik173 joins
13:48:45icedice quits [Client Quit]
13:58:12etnguyen03 (etnguyen03) joins
14:12:44<h2ibot>PaulWise edited Software Heritage (+424, add some more info and related projects): https://wiki.archiveteam.org/?diff=50949&oldid=28671
14:14:44<h2ibot>TheTechRobo edited URLTeam (-564, Improve tiny.cc entry): https://wiki.archiveteam.org/?diff=50950&oldid=50889
14:15:44<h2ibot>TheTechRobo edited URLTeam (+19, Add another t.ly link): https://wiki.archiveteam.org/?diff=50951&oldid=50950
14:16:33T31M quits [Quit: ZNC - https://znc.in]
14:16:53T31M joins
14:17:45<h2ibot>TheTechRobo edited URLTeam (+44, T.ly is non-incremental): https://wiki.archiveteam.org/?diff=50952&oldid=50951
14:24:50katocala joins
14:26:26DogsRNice joins
14:26:58etnguyen03 quits [Ping timeout: 265 seconds]
14:27:46<h2ibot>PaulWise created Trac (+2072, create Trac project page): https://wiki.archiveteam.org/?title=Trac
14:29:35Arcorann quits [Ping timeout: 252 seconds]
14:29:47<h2ibot>PaulWise edited Bugzilla (+0, redhat bugzilla crashed): https://wiki.archiveteam.org/?diff=50954&oldid=50756
14:30:47<h2ibot>PaulWise edited Mailman2 (+157, more): https://wiki.archiveteam.org/?diff=50955&oldid=50948
14:32:48<h2ibot>PaulWise edited GitHub (+19, Category:Code): https://wiki.archiveteam.org/?diff=50956&oldid=50737
14:34:14katocala quits [Remote host closed the connection]
14:34:32katocala joins
14:36:02etnguyen03 (etnguyen03) joins
14:38:32HP_Archivist (HP_Archivist) joins
14:41:49<h2ibot>PaulWise edited IRC/Logs (+31, wordpress logs): https://wiki.archiveteam.org/?diff=50957&oldid=50917
14:44:22AmAnd0A quits [Ping timeout: 265 seconds]
14:44:55AmAnd0A joins
14:47:08AmAnd0A quits [Read error: Connection reset by peer]
14:47:43AmAnd0A joins
14:59:50AmAnd0A quits [Ping timeout: 265 seconds]
15:00:32AmAnd0A joins
15:14:43hitgrr8 quits [Client Quit]
15:15:34<audrooku|m>what's the forecast for AT spinning back up? is it a storage or cpu/net saturation issue on IA's end?
15:19:59lunik173 quits [Client Quit]
15:21:00lunik173 joins
15:25:39lunik173 quits [Client Quit]
15:26:38lunik173 joins
15:30:30sonick quits [Client Quit]
15:32:59mindstrut quits [Read error: Connection reset by peer]
15:33:14mindstrut joins
15:36:32icedice (icedice) joins
15:44:18katocala quits [Ping timeout: 265 seconds]
15:45:27lunik173 quits [Client Quit]
15:51:12dumbgoy joins
15:52:22lunik173 joins
15:57:55lunik173 quits [Client Quit]
15:58:21lunik173 joins
16:00:15AmAnd0A quits [Ping timeout: 265 seconds]
16:00:54AmAnd0A joins
16:29:04<imer>audrooku|m: "soon" I believe. #shreddit is going straight to IA and #// is due to restart (in some capacity) as well I think?
16:29:22<imer>from #archiveteam "<ark_iver>: The problems at IA that prevented us from uploading large amounts of data are getting better. We will now start uploading (part of) the offloaded data to IA, and probably resume projects after. The situation is not completely 'back to normal' yet, but will likely be in about a month."
16:30:08<audrooku|m>that's good to hear, thanks for answering my question as I missed that message
16:30:19<imer>no worries
17:12:56etnguyen03 quits [Ping timeout: 252 seconds]
17:15:47icedice quits [Client Quit]
17:21:41jacksonchen666 quits [Ping timeout: 245 seconds]
17:40:01BigBrain quits [Ping timeout: 245 seconds]
17:42:14BigBrain (bigbrain) joins
17:44:28etnguyen03 (etnguyen03) joins
18:02:03etnguyen03 quits [Ping timeout: 265 seconds]
18:08:49AmAnd0A quits [Ping timeout: 265 seconds]
18:09:36AmAnd0A joins
18:10:22etnguyen03 (etnguyen03) joins
18:11:28Naruyoko5 joins
18:15:05Naruyoko quits [Ping timeout: 252 seconds]
18:15:19Hackerpcs quits [Quit: Hackerpcs]
18:17:22Hackerpcs (Hackerpcs) joins
18:20:19AmAnd0A quits [Read error: Connection reset by peer]
18:20:35AmAnd0A joins
18:23:39that_lurker quits [Quit: Clowning around is not the same as fooling around...I am a clown, not a fool]
18:23:51that_lurker (that_lurker) joins
18:24:46Naruyoko5 quits [Ping timeout: 265 seconds]
18:31:02etnguyen03 quits [Ping timeout: 252 seconds]
18:37:04Naruyoko joins
18:42:17bilboed8 joins
18:44:06bilboed quits [Ping timeout: 265 seconds]
18:44:06bilboed8 is now known as bilboed
18:45:35luna joins
18:46:02Naruyoko quits [Ping timeout: 265 seconds]
18:53:00Naruyoko joins
19:01:38kiryu quits [Remote host closed the connection]
19:04:02AmAnd0A quits [Ping timeout: 252 seconds]
19:04:39AmAnd0A joins
19:13:38Naruyoko quits [Read error: Connection reset by peer]
19:28:34AmAnd0A quits [Ping timeout: 265 seconds]
19:29:07AmAnd0A joins
19:34:45Megame (Megame) joins
19:41:34Naruyoko joins
19:45:45<@JAA>One week after the supposed shutdown, the Canucks forum is still going. I'm running a continuous thing that fetches new posts as they're being made until it does shut down.
19:47:10luna_ joins
19:48:46Exorcism is now known as Exorcism|TheLounge
19:50:19luna quits [Ping timeout: 265 seconds]
19:51:36<@JAA>Also, for the record, the community-chosen successor seems to be https://www.canucksfanforum.com/ (which somehow already has 35k posts since mid-Sept).
19:54:00Exorcism|TheLounge is now known as Exorcism
19:54:20imer quits [Quit: Oh no]
19:55:03imer (imer) joins
19:55:24Exorcism|Matrix (exorcism) joins
19:57:03Naruyoko5 joins
19:58:20threedeeitguy39 quits [Client Quit]
19:59:56threedeeitguy39 (threedeeitguy) joins
20:00:28Naruyoko quits [Ping timeout: 265 seconds]
20:00:42sec^nd quits [Remote host closed the connection]
20:01:13sec^nd (second) joins
20:05:02<pokechu22>JAA: did you see mountainbladder's message about TaleWorlds whitelisting AB pipeline IPs?
20:05:07<pokechu22>it was a few days ago I think
20:05:32<@JAA>pokechu22: Yes, I replied as well, just didn't have time to act on it yet.
20:12:33AmAnd0A quits [Ping timeout: 265 seconds]
20:12:47petrichor quits [Ping timeout: 252 seconds]
20:26:06Island joins
20:38:41luna_ is now known as luna
20:46:54etnguyen03 (etnguyen03) joins
20:54:21BlueMaxima joins
21:11:37katocala joins
21:17:48qwertyasdfuiopghjkl quits [Ping timeout: 265 seconds]
21:18:57qwertyasdfuiopghjkl (qwertyasdfuiopghjkl) joins
21:38:35Island quits [Ping timeout: 265 seconds]
21:44:30AmAnd0A joins
21:50:12Island joins
22:34:10treora quits [Ping timeout: 265 seconds]
22:37:18treora joins
22:44:02katocala quits [Ping timeout: 252 seconds]
22:44:05katocala joins
22:57:14etnguyen03 quits [Ping timeout: 252 seconds]
23:13:54<fireonlive>blast from the past: Archiverse – Archive Team's dump of Nintendo's Miiverse (2012–2017): https://archiverse.guide/ :)
23:21:17luna quits [Client Quit]
23:26:29dumbgoy_ joins
23:28:00kiryu joins
23:28:00kiryu quits [Changing host]
23:28:00kiryu (kiryu) joins
23:29:41dumbgoy quits [Ping timeout: 252 seconds]
23:35:04katocala quits [Ping timeout: 265 seconds]
23:35:59katocala joins
23:37:12abirkill quits [Client Quit]
23:39:05AlsoHP_Archivist joins
23:40:52HP_Archivist quits [Ping timeout: 265 seconds]
23:49:20BlueMaxima quits [Read error: Connection reset by peer]