00:01:26tbc1887 (tbc1887) joins
00:09:05l09wk joins
00:14:25l09wk quits [Client Quit]
00:20:30fl0w joins
00:20:53h4sh quits [Remote host closed the connection]
00:25:31sonick (sonick) joins
00:55:26AlsoTheTechRobo (TheTechRobo) joins
00:58:50TheTechRobo quits [Ping timeout: 264 seconds]
01:14:30Void0 leaves
01:44:09IDK quits [Quit: Connection closed for inactivity]
02:07:35<fuzzy8021>default warrior can probably be changed as nothing left in grab queue
02:12:31luna joins
02:15:15qwertyasdfuiopghjkl quits [Quit: qwertyasdfuiopghjkl]
02:36:17Hackerpcs quits [Quit: Hackerpcs]
02:37:18qwertyasdfuiopghjkl joins
02:38:15Hackerpcs (Hackerpcs) joins
02:43:12<@arkiver>fuzzy8021: right! good call
02:43:16<@arkiver>forgot about that
02:43:26<@arkiver>back to telegram
02:43:42<@arkiver>actually i'll put it on reddit for a bit
03:00:04<h2ibot>JAABot edited CurrentWarriorProject (-8): https://wiki.archiveteam.org/?diff=49332&oldid=49321
03:27:30AlsoTheTechRobo is now known as TheTechRobo
03:44:53xkey quits [Client Quit]
03:56:40<fishingforsoup>Is there any conclusive way to Google search the Internet Archive?
03:57:29<fishingforsoup>Well, specifically the Wayback Machine?
04:00:13<ivan>fishingforsoup: nope
04:00:51<ivan>if you want to full-text search WBM you have to download some wayback (or the underlying WARCs, which you generally can't) and index them yourself
04:01:13<ivan>WBM does have a search for homepage titles
04:01:29<fishingforsoup>AGH.
04:03:10<ivan>it's the real dark web
04:03:39<ivan>to find something you often just have to have been born earlier or know another old
04:04:00<ivan>sometimes there's another dataset you can full text search
04:05:44<ivan>I'm sure for the right donation someone at IA will let you map-reduce over all their WARCs lol
04:11:05<ivan>I sent this idea to someone over there just now, good luck
04:11:39<neggles>I shudder to think at how much infrastructure and processing power it would take to properly full text index and search the entire WBM
04:12:23<neggles>there's gotta be exabytes (compressed) in there
04:45:45xkey (xkey) joins
04:49:05BlueMaxima quits [Client Quit]
04:53:29Jonimus quits [Ping timeout: 250 seconds]
05:18:21Jonimus joins
05:43:35Megame (Megame) joins
06:23:55treora quits [Ping timeout: 265 seconds]
06:24:12treora joins
06:28:56Megame quits [Client Quit]
06:29:15treora quits [Ping timeout: 250 seconds]
06:36:49treora joins
07:59:50chrismeller (chrismeller) joins
08:16:00hitgrr8 joins
08:34:03chrismeller quits [Ping timeout: 250 seconds]
08:35:19sonick quits [Client Quit]
08:52:45Megame (Megame) joins
08:54:52robhagemans joins
08:57:40<robhagemans>Hi all, I've been trying to create an account at fileformats.archiveteam.org and it's seems I'm failing the Turing test - it asks "The Archive Team IRC channel is on what IRC network". It seems to me the answer is hackint, or some variation, but I keep getting rejected with "Login error: Incorrect or missing confirmation code.". Can anyone help?
08:57:41<robhagemans>Thanks!
09:02:31Island quits [Read error: Connection reset by peer]
09:05:29<ivan>try “ether”?
09:05:37<ivan>er “efnet”
09:19:48<Jake>yup, "efnet" should be correct. We've migrated, but haven't updated all of the questions :)
09:27:10Megame quits [Client Quit]
09:30:15<robhagemans>Thanks! I'll give that a try now
09:32:27<robhagemans>Awesome, that worked, thanks both!
09:32:33IDK (IDK) joins
10:23:22Ketchup901 quits [Remote host closed the connection]
10:24:01Ketchup901 (Ketchup901) joins
10:26:04robhagemans quits [Ping timeout: 265 seconds]
11:04:39sonick (sonick) joins
11:10:57spirit joins
11:34:35<jacksonchen666>current status for sourcehut crypto related projects archiving? can't find the wiki page or anything really and the new ToS went into effect today
11:40:41<@JAA>jacksonchen666: We archived the ones that were obviously crypto-related based on a few keyword searches, along with all other repos by the same users. Let me try to find the list.
11:41:23<@JAA>https://transfer.archivete.am/WZZDz/sourcehut.crypto-related.txt
11:42:41<@JAA>Repo bundles are at https://archive.org/details/@justanotherarchivist?query=SourceHut+addeddate%3A2022-12-22 and the web things were run through ArchiveBot and should be in the WBM.
11:43:25<@JAA>Uh, rather this to include the one special case: https://archive.org/details/@justanotherarchivist?query=SourceHut+addeddate%3A%5B2022-12-22+TO+2022-12-30%5D
11:55:21<@JAA>Funny timing, by the way, as I just handed over the list of all SourceHut repos over to Software Heritage as they seemed interested in mirroring them minutes before your question.
11:55:35<@JAA>Speaking of which, they also mirrored those crypto repos.
11:57:17shoghicp quits [Ping timeout: 250 seconds]
12:02:29<h2ibot>Jarshua created Telegra.ph (+368, skeleton page): https://wiki.archiveteam.org/?title=Telegra.ph
12:02:30<h2ibot>JJMC edited Tom's Hardware (+49, Tom's Hardware has a large discussion forum.…): https://wiki.archiveteam.org/?diff=49334&oldid=49017
12:02:31<h2ibot>Nintendofan885 edited In The Media (+299, /* 2020 */ add Hindustan Times): https://wiki.archiveteam.org/?diff=49335&oldid=48453
12:22:26tbc1887 quits [Read error: Connection reset by peer]
12:25:33<h2ibot>Nintendofan885 edited In The Media (+226, /* 2022 */ Input): https://wiki.archiveteam.org/?diff=49336&oldid=49335
13:00:38<h2ibot>JAABot edited Main Page/In The Media (+137): https://wiki.archiveteam.org/?diff=49337&oldid=48454
13:30:42Megame (Megame) joins
13:48:22hogchips joins
13:53:32hogchips is now known as shoghicp
13:54:26shoghicp quits [Changing host]
13:54:26shoghicp (shoghicp) joins
13:54:42shoghicp is now known as hogchips
13:59:14Megame quits [Client Quit]
14:32:41adamus1red quits [Quit: SigTerm]
14:37:53adamus1red (adamus1red) joins
15:10:13HP_Archivist quits [Client Quit]
15:37:57Terbium quits [Quit: http://quassel-irc.org - Chat comfortably. Anywhere.]
15:38:22Terbium joins
15:44:28sonick quits [Client Quit]
16:20:41fl0w_ joins
16:24:13fl0w quits [Ping timeout: 265 seconds]
16:36:31<fl0w_>exit
16:36:33fl0w_ quits [Client Quit]
16:43:06fl0w joins
16:58:02michaelblob quits [Client Quit]
17:36:30<h2ibot>Nemo bis edited Medium (+367, /* Archival */ URL discovery is not that bad): https://wiki.archiveteam.org/?diff=49338&oldid=46671
17:36:31<h2ibot>Nemo bis edited Medium (+1, close header): https://wiki.archiveteam.org/?diff=49339&oldid=49338
18:31:18michaelblob (michaelblob) joins
18:54:13LeGoupil joins
19:03:49fishingforsoup_ joins
19:04:34fishingforsoup__ joins
19:07:09fishingforsoup quits [Ping timeout: 250 seconds]
19:08:26fishingforsoup_ quits [Ping timeout: 264 seconds]
20:12:05Iki1 joins
20:15:37Iki quits [Ping timeout: 250 seconds]
20:28:14LeGoupil quits [Ping timeout: 264 seconds]
21:02:11jacksonchen666 quits [Ping timeout: 245 seconds]
21:05:01fl0w quits [Ping timeout: 250 seconds]
21:10:29Island joins
21:10:42fl0w joins
21:15:47fl0w_ joins
21:16:11LeGoupil joins
21:17:22LeGoupil quits [Client Quit]
21:18:53fl0w quits [Ping timeout: 250 seconds]
21:24:38Wingy quits [Ping timeout: 264 seconds]
22:01:03<tech234a>I made a list of all of ~37.3 million domains that have ever appeared in one of the Chrome UX Reports: https://archive.org/details/crux_origin_list
22:01:18<tech234a>I also included instructions for creating updated versions of the lists in the future
22:01:19jacksonchen666 (jacksonchen666) joins
22:30:20hitgrr8 quits [Client Quit]
22:34:24<h2ibot>Tech234a created Finding subdomains (+1084, Initial page): https://wiki.archiveteam.org/?title=Finding%20subdomains
22:35:24<h2ibot>Tech234a edited Finding subdomains (+4, Improve formatting): https://wiki.archiveteam.org/?diff=49341&oldid=49340
22:37:20<@OrIdow6>tech234a: See also https://wiki.archiveteam.org/index.php/Site_exploration
22:37:25<h2ibot>Tech234a edited Finding subdomains (+12, More formatting fixes): https://wiki.archiveteam.org/?diff=49342&oldid=49341
22:37:43<tech234a>Ah didn't see that in my search
22:41:49<TheTechRobo>tech234a: what about subfinder?
22:42:21<TheTechRobo>https://github.com/projectdiscovery/subfinder
22:42:52<tech234a>oh cool, I haven't seen that before
22:45:42<TheTechRobo>it's been able to find subdomains of my domain that I haven't ever shared before. i don't know how.
22:46:52<@JAA>+ Certificate Transparency logs and https://osint.sh/subdomain/
22:47:49<@JAA>TheTechRobo: CT might be why if you have TLS certificates for those domains.
22:47:59<@JAA>from a public CA*
22:48:00<TheTechRobo>ah, that would explain it!
22:48:12<TheTechRobo>time to use http for my private stuff... :P
22:48:29<@JAA>https://crt.sh/ is one easy way to search those logs.
22:48:54<@JAA>HTTP tunnelled with WireGuard would be the modern way, I guess.
22:49:23<TheTechRobo>i usually use ssh forwarding for ports that probably shouldn't be accessed by others.
22:49:46<TheTechRobo>like for example, admin UIs, #burnthetwitch's server,...
22:49:55<@JAA>Yeah, I do that more often than I'd care to admit.
22:50:22<TheTechRobo>i mean, it's not a bad solution
22:50:25<TheTechRobo>it's hacky tho
22:50:53<TheTechRobo>(and i find ssh seems to increase my internet latency a lot... no hard proof, though)
22:51:05<@JAA>It doesn't scale well if you need communication between more than two hosts, but yeah, if it's stupid but it works...
22:51:35<TheTechRobo>i mean, you just set up ssh on all the servers you need! :P
22:52:35<@JAA>It's also very awkward to get working if you have NATs on both ends.
22:52:39<TheTechRobo>then, if necessary, you set up a server so that the clients can talk to each other, and- oh fuck, this was more work than properly securing the server :P
22:52:50<@JAA>:-)
22:53:28<h2ibot>Tech234a edited Finding subdomains (+313, Add additional suggestions): https://wiki.archiveteam.org/?diff=49343&oldid=49342
22:53:29<h2ibot>Tech234a edited Site exploration (+54, /* Subdomain enumeration */ Add link to new page): https://wiki.archiveteam.org/?diff=49344&oldid=46200
23:01:06BlueMaxima joins
23:31:08Mateon1 quits [Quit: Mateon1]
23:31:10Mateon2 joins
23:33:38Mateon2 is now known as Mateon1
23:34:53<mgrandi>Furaffinity 's forums are now read only, maybe a scrape of that can now commence