They are planning to release a Mythos-class model (from the initial announcement), but they won't until they can trust their safeguards + the software ecosystem has been sufficiently patched.
Both. There's the risk of them instructing a user on how to produce a known formulation (the Anarchist Cookbook solution, as you say), which is irritating but not that problematic.
The bigger issue is that they are potentially capable of devising novel formulations that cause harm, and of guiding someone through the process. That is, consider a world in which someone with malicious intent has access to a model as capable at chemistry / biology as Mythos is at offensive cybersecurity.
This is obviously limited by the fact that the models don't operate in the physical world, but there's plenty of written material out there.
"Smart people have economic opportunities that align them away from being evil"
For some definition of evil, some of the time, OK. But as economic opportunities compound (look at the behavior of the ultra-rich), there seems to be at least a strong correlation in the other direction, if not full-on "root of all evil" causation.
Sure, but that’s not “slaughter a stadium of people with drones” evil or “poison the water supply” evil or “take out unprotected electrical substations” evil.
So much infrastructure is very soft because the evil people aren’t smart enough to conceive of or conduct an attack.
That’s not quite true. Take a look at all the billionaires destroying society. Being evil is the surest way to get rich. In fact, it’s the only way to amass that level of capital: there’s no ethical billionaire.
Good. This is how we will force the world to reckon with the isolated, the disgruntled, and the "lone wolf" terrorist. Real "sigma males" actually exist, and when they decide "society has to pay", we are all worse off for it. If Ted Kaczynski (the quintessential example of a real, actual sigma) had been in his prime and operating right now, he'd have mail-bombed NeurIPS and ICLR already. I'm not cool with being in crowds of AI professionals right now, for physical-security reasons, given the extreme anti-AI sentiment from nearly everyone outside the valley: https://jonready.com/blog/posts/everyone-in-seattle-hates-ai...
The front page is currently home to the announcement of Qwen 3.6 35B, which has comparable performance to the flagship coding models of a few months ago, and can be run at home by those with a gaming computer or MBP from the last five years. It is happening, but there will always be some lag.
Yes, but every time the capabilities, security, accuracy, or any other quality of LLMs is challenged, the default answer is that we'll essentially have AGI in a quarter or two. It's very tiring to try to argue with people about current quality, when the argument is always to wait and/or pay for a super expensive model.
right on. I certainly empathize with your frustrations about "AGI". but rest assured, I'm firmly in the camp of "not in my lifetime", and even further in the camp of "not without at least 3 more massive breakthroughs about things we currently do not understand at all". so sorry if it sounded like I was asking "what about when local llms get SUPER GOOD", or something. that's not at all what I meant. All I was asking was: "Claude Code can currently be pointed at a directory and then be chatted with about what it needs to do in that directory to make a full code project. That ability is already available on local machines through a ton of convoluted setup, but it's almost certainly going to be a packaged solution within a year (and possibly within the next few months/weeks/days). So when that packaged solution arrives and the choices are 'use the llm for scaffolding, which takes 3 hours of unattended time' or 'build the scaffolding myself, which takes 6 hours of deep-focus time', what will still be objectionable about choosing the former?"
and, to be clear, it's an earnest question. like I've said elsewhere, I have concerns about over-reliance on the tech, but once it all moves local, a lot of those concerns become fairly trivial. so I'm curious if other people have concerns that remain pressing and practical.
ETA: I'm aware that Claude wouldn't take 3 hours to do this while using its massive warehouses of GPUs. I'm estimating what I think is a reasonable time for a single-GPU device to produce something workable.
That's not what the grandparent poster was saying, but sure. They have been steadily improving across those metrics, as Opus 4.6 / 4.7 / Mythos demonstrate. They're certainly not perfect, and I understand your fatigue (it is certainly fatiguing to follow, even if interested!), but each new release pushes it that bit further, and the improvements percolate downwards to the cheaper models.
1. No one with good vision would give a single feature two names. It’s dumb. Here is our pager feature. Cool, how do I access it? Oh, you set the ui.paginate option, of course!! (Sketch below.)
2. It’s almost like we have some established ways to denote arguments that are pretty popular… ‘jj init --git’, for example? By using ‘jj git init’, I would expect all of the git-compatible commands to be ‘jj git xxx’, because that is a reasonable expectation.
This is the problem with the voodoo: these obscure nonsense commands only make sense once you are accustomed to them. There’s no reasonable expectation that you could just figure them out on your own. Go on vacation, come back, and be surprised when you’ve forgotten the voodoo. Not to mention that every tool has to have its own unique voodoo.
Almost like the professional world has figured out that "made by software engineers, for software engineers" will never be popular. And then engineers don’t understand why you might want a tool to be intuitive and popular.
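For reference, the two-names problem from point 1, going from memory on jj's config keys (so treat this as a sketch, not gospel):

    # the feature is announced as the "pager"...
    jj config set --user ui.pager less        # ...one setting is named "pager"
    jj config set --user ui.paginate never    # ...but the on/off toggle is "paginate"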
You're right that, looking solely at `init`, a flag could make sense to choose the backend.
The bigger picture here, though: `jj git` is the subcommand that prefixes all commands that are git-specific rather than backend-agnostic. There is also `jj git clone`, `jj git fetch`, `jj git push`, etc.
For a different backend, say Google's Piper backend, there's `jj piper <whatever>`.
This means that backend-specific features aren't polluting the interface of more general features.
The on-disk repository compatibility is automatic. But if you're trying to fetch something via a specific protocol, you use the command for the protocol you want to use.
There is no extra step between `git push` and `jj git push`, they're both one step.
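To make the "no extra step" point concrete, here's the round trip side by side (example URL, assuming a stock jj install with the default git backend):

    jj git clone https://example.com/repo.git   # cf. git clone
    cd repo
    jj git fetch                                # cf. git fetch
    jj git push                                 # cf. git push

Each git-facing operation is one command; the `jj git` prefix just marks it as protocol-specific rather than adding a layer.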
I meant the extra step being: why would I bother with jj if I’m having to specify git inside of jj?
The issue is pretty obvious to me. Git is the standard, and that likely won’t change for some time. So if jj makes my git life better, awesome. But if it’s just a wrapper and I need to know all the git voodoo plus jj voodoo on top, I don’t quite get it.
While I agree with you, their system did not start privatised, and the Shinkansen predates privatisation by some time. I don't have the evidence to justify this, but I suspect that you need national buy-in - both financial and political - to start an HSR build-out, which could then potentially be privatised at a later stage.
Adding to the chorus: if you need to apply a solution like this, it's probably time to walk away from the platform. (Well, the right time to walk away would have been years ago, but...)
All remotely popular online public spaces are completely infiltrated by bots/propagandists/trolls/morons/etc. If you could successfully filter that type of content out you'd end up with a much larger pool of valid/authentic content to access than if you abandoned the space altogether and switched to some very obscure/niche space that's yet to be manipulated.
And when you are not there, you are not there. We are way too obsessed with missing things, be it a popular figure or someone we know in person. The reality is that it's actually not too bad to miss things; most information still gets through, especially the important kind. You might even be spared a lot of the crap, which gets filtered out before it reaches you.
I am happy on my personal Mastodon instance and occasional visits to HN. You might be too if you allow yourself to be.
The problem is that your definition of "crap" is probably a bit different from others'. Everyone probably has a slightly different definition. Also, your feed is probably mostly stuff that was posted on X first and replicated over somehow. The network effect is real.
That being said, there are clearly multiple automated influence operations active on X at any given time. If Elon wants X to stick around, it would be in his interest to put a stop to those. The default feed is full of posts from those bots; that's a big problem X needs to fix, too.
> Also, your feed is probably mostly stuff that was posted on X first and replicated over somehow.
Possibly. But if it reaches me anyway, then there clearly was no need for me to be there. And if more people realize that, maybe the discussion can move away from that place.
> The problem is that your definition of "crap" is probably a bit different from others.
I was talking about everyone's personal definition of crap. If something doesn't have enough velocity to leave the sphere, it might be relevant only to a small community, or just not relevant enough to discuss. Or something else entirely.
My argument stands. It is okay to not be part of every discussion. A lot of people think that they must be on X to stay in touch and be informed. I am not there and I am informed enough and in touch with all the people I want. If you can't be bothered to make an account outside X then we don't need to talk.
We have a solution like this for HN, but people don't use it: it's the "hide" button, right next to the "flag" button. Yet when users see content they don't like, instead of just hiding it for themselves, they often choose to flag it so that others are blocked from seeing it too.
I'd welcome per-user curation tools like OP's which don't affect the content for the rest of us.
HN is my top candidate for a solution like this, too. Because there's a ton of high quality content here, increasingly buried beneath a small number of sentiments and topics I don't care to see rehashed constantly.
I'd like to see it, too, but for the opposite[1] reason: Others can use this curation (which only affects their own view of HN) instead of flagging (which affects my view and everyone else's too).
I use the flag functionality as per the guidelines:
> Off-Topic: Most stories about politics, or crime, or sports, or celebrities, unless they're evidence of some interesting new phenomenon. If they'd cover it on TV news, it's probably off-topic.
> If a story is spam or off-topic, flag it. Don't feed egregious comments by replying; flag them instead. If you flag, please don't also comment that you did.
Flagging is a way to shape what types of content take up the finite amount of attention available on HN. If everyone used it (only) in the way the guidelines ask, the front page would look very different on any given day.
You need to curate your algorithm. Took me 10 years before I started blocking aggressively and now my feed is amazing with 90% bangers. Twitter is by far the best product in this space. Every other platform is 2+ weeks behind. Twitter is where the news breaks.
I had a well curated feed too (even used word filters) and yet I felt compelled to pack up and walk away. It was simply not enough.
The negative effect the various drivel had on me was nonlinear. Even if 99% of posts were fine, if that 1% was seriously upsetting, it just ruined the whole thing.
Er, what? We've had open models that can outperform ChatGPT 3.5 for several years now, and they can run entirely on your phone these days. There is no metric by which 3.5 has not been exceeded.
Not in the creative writing I care about. I've been looking for years and trying new models practically every month, including closed, hosted models. None of them approach the quality of the logs I have from that original release.