| 00:00:03 | | Hackerpcs (Hackerpcs) joins |
| 00:21:36 | | cyan_box joins |
| 00:22:19 | | Arcorann_ joins |
| 00:22:57 | | Mateon1 quits [Ping timeout: 272 seconds] |
| 00:24:18 | | Mateon1 joins |
| 00:25:29 | | cyanbox_ quits [Ping timeout: 272 seconds] |
| 00:26:07 | | Arcorann quits [Ping timeout: 272 seconds] |
| 00:52:20 | | etnguyen03 quits [Client Quit] |
| 01:17:38 | <pabs> | datechnoman: got any URLs for *.in-addr.arpa *.ip6.arpa ? for https://cabforum.org/2025/11/10/ballot-sc-086v3-sunset-the-inclusion-of-ip-reverse-address-domain-names/ |
| 01:18:52 | | etnguyen03 (etnguyen03) joins |
| 01:31:16 | | etnguyen03 quits [Client Quit] |
| 01:35:56 | | etnguyen03 (etnguyen03) joins |
| 01:48:27 | | Mateon1 quits [Ping timeout: 272 seconds] |
| 01:49:29 | | Mateon1 joins |
| 02:01:45 | | Mateon1 quits [Ping timeout: 272 seconds] |
| 02:02:45 | | Mateon1 joins |
| 02:23:06 | | Suika quits [Quit: Server is ded] |
| 02:23:55 | | Suika joins |
| 02:25:11 | | mr_sarge quits [Ping timeout: 272 seconds] |
| 02:25:18 | | mr_sarge (sarge) joins |
| 03:31:45 | | Kotomind_ joins |
| 03:40:33 | | Kotomind_ quits [Ping timeout: 272 seconds] |
| 03:41:55 | | PredatorIWD256 joins |
| 03:43:46 | | PredatorIWD25 quits [Ping timeout: 256 seconds] |
| 03:43:46 | | PredatorIWD256 is now known as PredatorIWD25 |
| 04:02:02 | | BennyOtt quits [Quit: ZNC 1.10.1 - https://znc.in] |
| 04:04:23 | | BennyOtt (BennyOtt) joins |
| 04:05:21 | | etnguyen03 quits [Client Quit] |
| 04:05:50 | | etnguyen03 (etnguyen03) joins |
| 04:13:18 | | etnguyen03 quits [Remote host closed the connection] |
| 04:33:13 | | nexussfan quits [Read error: Connection reset by peer] |
| 04:33:52 | | Island quits [Read error: Connection reset by peer] |
| 05:04:48 | | n9nes quits [Ping timeout: 256 seconds] |
| 05:05:08 | | n9nes joins |
| 05:06:50 | | SootBector quits [Remote host closed the connection] |
| 05:08:01 | | SootBector (SootBector) joins |
| 06:04:05 | | nexussfan (nexussfan) joins |
| 06:23:57 | | nexussfan quits [Client Quit] |
| 06:54:20 | | Wohlstand (Wohlstand) joins |
| 06:57:55 | | hexagonwin (hexagonwin) joins |
| 07:00:11 | | SootBector quits [Remote host closed the connection] |
| 07:01:20 | | SootBector (SootBector) joins |
| 07:09:05 | | sec^nd quits [Remote host closed the connection] |
| 07:09:39 | | sec^nd (second) joins |
| 07:11:39 | | s-crypt quits [Quit: Ping timeout (120 seconds)] |
| 07:11:51 | | s-crypt (s-crypt) joins |
| 08:22:30 | | Shard114 (Shard) joins |
| 08:22:37 | | Wohlstand quits [Client Quit] |
| 08:23:08 | | Shard11 quits [Ping timeout: 256 seconds] |
| 08:23:08 | | Shard114 is now known as Shard11 |
| 08:31:59 | | FireFly quits [Quit: Upgrading...] |
| 08:58:29 | | nine quits [Quit: See ya!] |
| 08:58:42 | | nine joins |
| 08:58:42 | | nine is now authenticated as nine |
| 08:58:42 | | nine quits [Changing host] |
| 08:58:42 | | nine (nine) joins |
| 09:58:31 | | Kotomind_ joins |
| 10:02:15 | | Kotomind_ quits [Client Quit] |
| 10:03:43 | | gogo joins |
| 10:03:48 | | Webuser064005 joins |
| 10:03:57 | | Webuser064005 quits [Client Quit] |
| 10:12:15 | <Guest> | does urls-grab only get websites on the public internet (not tor)? |
| 10:15:51 | <@imer> | Guest: yep |
| 10:22:55 | <Guest> | then how did the warrior end up downloading CSAM? this isnt something i ran into but someone posted about it happening to them in #archiveteam. like what would have happened if it was a residential connection vs hetzner? i wouldnt want my door busted down for running an AT project D: |
| 10:26:02 | <@imer> | as far as we can tell, it's a honeypot - so maybe a site that was previously csam related. urls does grab the front page of domains it sees (and can follow links down based on some heuristics), so if it was linked somewhere else it'll discover it that way |
| 10:27:21 | <@imer> | this is also why we generally don't recommend people run urls |
| 10:29:24 | <@imer> | as for the legal side of things, I'm not a lawyer. I do think that if something like that would happen, we would be able to conclusively prove it wasn't the person, but the (automated) warrior downloading these things on their network |
| 10:30:06 | <@imer> | I don't think aside from occasional abuse emails (most of which are bogus) we've had any issues |
| 10:32:13 | | gogo quits [Ping timeout: 272 seconds] |
| 10:35:45 | | simon816 quits [Quit: ZNC 1.10.1 - https://znc.in] |
| 10:40:42 | | simon816 (simon816) joins |
| 10:49:59 | <Guest> | got it |
| 11:11:29 | | ats quits [Ping timeout: 272 seconds] |
| 11:15:13 | | ats (ats) joins |
| 11:53:10 | | RadRooster quits [Remote host closed the connection] |
| 12:00:01 | | Bleo182600722719623455222 quits [Quit: The Lounge - https://thelounge.chat] |
| 12:02:50 | | Bleo182600722719623455222 joins |
| 12:25:29 | | Dada joins |
| 12:40:27 | | Gadelhas562873784438 quits [Quit: Ping timeout (120 seconds)] |
| 12:43:50 | | irisfreckles13 joins |
| 13:22:58 | | Dada quits [Remote host closed the connection] |
| 13:24:30 | | Dada joins |
| 13:51:11 | <Yakov> | https://www.apkmirror.com/apk/discord/discord-chat-for-gamers/ Has anyone seen this before? Discord name is redacted in apkmirror because of a DMCA... |
| 13:52:46 | <Yakov> | It's really weird that apkmirror would do that yet still redistribute their apk |
| 14:10:32 | <TheTechRobo> | I do think the URLs project should have a stronger warning in the Warrior web UI than just "WARNING IP BLOCK LISTS" given that things like this can happen |
| 14:11:48 | <klea> | Is there some project to grab tor websites? |
| 14:18:04 | | Webuser178095 joins |
| 14:18:08 | | Webuser178095 quits [Client Quit] |
| 14:27:11 | | sec^nd quits [Remote host closed the connection] |
| 14:27:39 | | sec^nd (second) joins |
| 14:31:37 | | Arcorann_ quits [Ping timeout: 272 seconds] |
| 14:35:20 | <TheTechRobo> | There was, but I don't remember what happened to it; it hasn't been running for awhile |
| 15:25:45 | | gosc joins |
| 15:26:06 | <gosc> | apparently amazon disabled downloading mobile apps on everything but an amazon fire tablet last year |
| 15:26:25 | <gosc> | does anyone own a fire tablet/fire tv? |
| 15:27:00 | | stepney141 quits [Ping timeout: 256 seconds] |
| 15:27:08 | <gosc> | there's a whole bunch of free apps on there I wanted to grab, which are very obscure and still downloadable |
| 15:27:34 | <schwarzkatz|m> | Yakov: I have seen that before, it has been like that for some time I think |
| 15:30:34 | | stepney141 (stepney141) joins |
| 15:31:57 | <gosc> | here's the apps I need to grab, I'm kind of busy so I'll just leave them here: https://www.amazon.com/s?i=mobile-apps&rh=p_4%3AJTWebMan&search-type=ss |
| 15:32:00 | <gosc> | and https://www.amazon.com/aibee-Rose-Guns-Days-Season1/dp/B00ANFT1NE/ |
| 15:32:17 | <gosc> | I understand if no one wants to pay for the second one but the first link has entirely free apps |
| 15:36:50 | <gosc> | actually wait, this would make the amazon appstore endangered by archiveteam standards |
| 15:36:52 | <gosc> | let me add that |
| 15:46:07 | <h2ibot> | Calmevening edited Alive... OR ARE THEY (+281): https://wiki.archiveteam.org/?diff=60368&oldid=60359 |
| 16:06:16 | <justauser> | blog.archive.today now doesn't resolve in many places and https://archive-is.tumblr.com/ stopped redirecting. Has two recent posts. |
| 16:06:45 | <anarcat> | quick, archive it |
| 16:06:47 | <anarcat> | oh shit |
| 16:07:40 | <justauser> | Already SPN'd the latter. |
| 16:08:03 | <justauser> | Doing something with AT as whole is planned for ages, AFAIK? |
| 16:09:08 | <justauser> | !ao'd the redirect. |
| 16:16:39 | <Guest> | https://transfer.archivete.am/EtrtB/blogat_spn.png |
| 16:16:40 | <eggdrop> | inline (for browser viewing): https://transfer.archivete.am/inline/EtrtB/blogat_spn.png |
| 16:16:49 | <Guest> | (spn fails to resolve domain) |
| 16:18:29 | | DogsRNice joins |
| 16:19:11 | <justauser> | klea: Pair.com? Don't tell me their security@ is a mailing list. |
| 16:19:52 | <klea> | justauser: i sent a email to webmaster@marcdashevsky.com not to pair.com, maybe they handle mail via mailing list? |
| 16:19:56 | <klea> | would be annoying probably. |
| 16:21:02 | <justauser> | The bounce is from pair.com, and I still remember the rush to save their mailing lists. Thus the conclusion. |
| 16:21:14 | <justauser> | Nothing certain, but a good suspicion. |
| 16:24:48 | <schwarzkatz|m> | does the WBM care a lot about query params? Say, I request ?page=2&foo=bar but only ?page=2 is saved. would I get an answer? |
| 16:24:48 | <schwarzkatz|m> | background: a website I am collecting links for pollutes almost all pagination with some random parameters that do not disappear again from the url after visiting another page; I am thinking about just removing them but am not sure if that is okay |
| 16:24:59 | | lucifer_sam quits [Ping timeout: 272 seconds] |
| 16:26:12 | <justauser> | Almost exact match. |
| 16:26:31 | <justauser> | Details on the wiki. |
| 16:26:41 | <justauser> | However, the opposite sort of works. |
| 16:27:28 | <justauser> | If you go to ?page=2, but only ?page=2&foo=bar is saved, it'll suggest to show pages with this prefix and you'll see the saved one. |
| 16:28:12 | <justauser> | Does it mean URLs grow infinitely? |
| 16:29:40 | <schwarzkatz|m> | justauser: hm, and is that also depending on the order? in this case the page params gets added at the end :/ |
| 16:30:11 | <justauser> | Meh. |
| 16:30:22 | <justauser> | Another case of user-hostility. |
| 16:30:25 | <schwarzkatz|m> | justauser: luckily no, once they are added they stay there |
| 16:30:39 | <justauser> | That's exactly what I meant. |
| 16:30:56 | <justauser> | If parameters are only added and never removed while you browse... |
| 16:31:06 | <justauser> | ..this could explode. |
| 16:32:11 | <schwarzkatz|m> | at least the nav bar on top has hardlinks :D |
| 16:32:11 | <justauser> | arkiver: Two days left for opendiary. Are you still hoping to do something? |
| 16:32:40 | <schwarzkatz|m> | I think I'll just keep them then. thanks! the website in question is https://www.animepro.de/anima/test/dvd-serien in case you are wondering |
| 16:32:49 | <justauser> | I'd say strip. |
| 16:33:09 | <justauser> | Only save a single canonical URL for a page. |
| 16:35:36 | <schwarzkatz|m> | oh? from what you said it sounded like this would break the links :o |
| 16:36:16 | <justauser> | It would, but from what you say they are unpredictable to an outsider anyways. |
| 16:36:44 | <justauser> | If we were doing a crawl, it would make some sense to have every internal link good. |
| 16:37:08 | <justauser> | Unless it results in hundreds of duplicated captures, taht is. |
| 16:37:43 | <schwarzkatz|m> | oh, I'm so sorry, the context is important of course |
| 16:37:43 | <schwarzkatz|m> | am collecting the links for archivebot |
| 16:38:14 | <justauser> | So !ao-ing page list instead of letting it crawl? |
| 16:39:02 | <schwarzkatz|m> | yeah, I always was under the impression that is more accurate, no? |
| 16:42:03 | <justauser> | It depends (tm). |
| 16:42:25 | <justauser> | It's more effort and a risk of making a dumb mistake. |
| 16:42:58 | <justauser> | Has to be used when the site is uncrawlable, but this doesn't seem to be the case here? |
| 16:44:11 | <justauser> | Clicked around a bit, doesn't seem that bad in terms of possible duplicate captures. |
| 16:45:14 | <justauser> | It also creates some links to nothing, but you can't tell without fetching that it's one. |
| 16:45:48 | <justauser> | Or does it? |
| 16:46:05 | <schwarzkatz|m> | I see... in that case an automated crawl of that site would be nice! I don't think there will be any updates to the site in the forseeable future |
| 16:46:20 | <schwarzkatz|m> | justauser: do you have an example? |
| 16:48:10 | <justauser> | I suspected https://www.animepro.de/anima/test/dvd-serien?pimcore_request_source=staticroute&controller=%40AppBundle\Controller\AnimaController&bundle=AppBundle&action=list&filter=Z&page=3 , but it actually works fine. |
| 16:48:47 | <justauser> | Despite pointing to page number 3 out of 1. There is probably some workaround server-side. |
| 16:49:21 | <schwarzkatz|m> | ah, it also keeps the page param when filtering :D |
| 16:50:35 | | gogo joins |