Hacker Newsnew | past | comments | ask | show | jobs | submit | adriand's commentslogin

It’s a deliberate process powered by rightwing and capitalist interests designed to create a dumber, less educated and more distracted population. A war as stupid as the one with Iran would not have been possible three decades ago. As ill-advised as the Iraq war was, Bush at least spent months explaining the rationale and building support for it, successfully. Now that’s not needed.

I saw interviews with young Americans on spring break and they were so utterly uninformed it was mind-blowing. Their priorities are getting drunk and getting laid while their country bombs a nation “into the stone ages”, according to their president. And it’s not their fault: they are the product of a media environment and education system designed for exactly this outcome.


I was there for that war. Kids weren't listening and didn't care back then either. If anything, Gen Z is the most politically-aware generation we've had since we started keeping track.

Trump doesn't have to justify a single thing because the billionaires behind him know that every last bet is off and their very livelihoods are at risk, and his entire base of support up and down the chain are either complicit or fooled.

What the world does when they finally realize Democrats and Republicans are simply two sides of the vast apparatus suppressing the will of the people by any means necessary will be... spectacular.


I was there as well, the bush presidency lasted my entire middle and high school career, and I got the chance to vote for Obama in my senior year.

I remember things very differently. Everyone cared about the Iraq war, gay-straight alliance was one of the most up and coming clubs, and political music was everywhere. Green Day had their big second wave with American Idiot, System Of A Down was on top of the world, Rock Against Bush was huge, anarcho-punk like Rise Against was getting big.

I'm not a teenager anymore obviously, so it's entirely possible I'm just missing it, but I've seen very little that compares to those sort of movements. On the other hand, most millennials I know are still wildly politically active.


In 2002, there war in Iraq had large popular support, something like 70-80 percent. It took a few years for people to realize it was based on a lie and was a massive mistake. It was also morally reprehensible, but that part is not often discussed in mainstream US politics.

If you compare that to the current Iran war, a majority of the population is already against it, however the current administration doesn't seem to care much about public opinion, and there doesn't seem to be much that the public can do about it.


Yeah I was there too and I don’t know what this guy is talking about. Gen X was highly politically active. This was the era of violent in the street anti-globalization clashes like the WTO protests.

Where exactly because in the Midwest we were very vocal about it. We have tons of military families out there and we were poor enough to feel almost like military service was inevitable if we didn’t get scholarships for school. You know the band NOFX had an album, the War on Errorism that was quite successful based on the fuckery of the bush administration. Punk rock and protest music was huge then.

Trump was in part a response to that realization, an outsider from both parties that used the Republicans to get in the door. Bernie Sanders was supposed to be the same, and when he dropped out a lot of us switched to Trump because they represented the same thing.

Tearing down entrenched establishments that don't work for the people is what people were voting for in 2016 and were disappointed he didn't do it. They're glad he's finally doing what he promised a decade ago.


"Bernie Sanders was supposed to be the same, and when he dropped out a lot of us switched to Trump because they represented the same thing."

That is definitely an opinion.


It was a pretty widely shared one in 2016, if I remember correctly.

Is it insane at all? The biggest fees are charged to the biggest providers. With short form video now the dominant form of addictive social media content, it doesn’t seem insane at all that large media companies ought to compensate inventors/owners of patented video technology. A company with 100 million or more subscribers is not a company I feel a lot of empathy for if they’re trying to avoid paying licensing fees.

It goes to $2.5m for 5 million users/subscribers and tops out at $4.5m for 100 million subscribers. It’s not staggered evenly at all IMO. So I worry mainly for the small players. This shouldn’t have any meaningful effect on any big player.

But for small players nothing apparently changed, they keep paying the $100k as usual.

5 million users isn’t a lot unless we’re talking paid subscribers. Their license likely does not make a distinction. For a (free/ad supported) service like a niche YT clone, this could be fatal.

I found myself trying to fill out a captcha the other day whose letters were so skewed and crazy I really had no idea what they were. It took me four tries!

> Anyone who has been doing this professionally will tell you that the "last step" is what takes the majority of time and effort.

This is true, and I bet there are thousands of people who are in this stage right now - having gotten there far faster than they would have without Claude Code - which makes me predict that the point made in the article will not age well. I think it’s just a matter of a bit more time before the deluge starts, something on the order of six more months.


I'd argue that LLMs are not yet capable of the last step, and because most sufficiently large AI-generated codebase are an unmaintainable mess, it's also very hard for a human developer to take over and go the last mile.

So what is the “last step”? I have one shotted a complete AWS CDK app to create infrastructure on an empty AWS account and deploy everything - networking, VPC endpoints, Docker based lambdas, databases, logging, monitoring alerts etc.

Yes I know AWS well and was detailed about the requirements z


Fortunately, fossil fuels are a stable and geopolitically risk-free source of energy.

They're a relatively stable and risk-free source of money for a certain kind of politician.

The energy part is incidental.


Is this the biggest Woosh of the year?

Is this comment on purpose? The whooshes are getting hard to track!

This will not be a learned more robustly in the US until one or both of the only two (edit: major) gas turbine manufacturers in the world (GE Vernova, Siemens Energy) suffer a tail risk event causing their failure. Backlog for new gas turbines is ~7 years, as of this comment. Continued production capacity is a function of how fragile those two companies are.

The White House’s Bet on Fossil Fuels Is Already Losing - https://www.bloomberg.com/opinion/articles/2025-10-28/white-... | https://archive.today/vpvch - October 28th, 2025

Gas-Turbine Crunch Threatens Demand Bonanza in Asia - https://www.bloomberg.com/news/newsletters/2025-10-07/gas-tu... | https://archive.today/z4Ixw - October 7th, 2025

AI-Driven Demand for Gas Turbines Risks a New Energy Crunch - https://www.bloomberg.com/features/2025-bottlenecks-gas-turb... | https://archive.today/b8bhn - October 1st, 2025

(think in systems)


Both of those are big wind tubine manufactres as well.

Luckily, the wind futures market is pretty bullish for the foreseeable future

Isn't there Ansaldo Energia too?

Yes, but their production volume is limited (imho) compared to the two companies I mentioned. Good callout regardless. I'll have a post put together to share here enumerating and comparing.

(i track global fossil generation production capacity as a component of tracking the overall rate of global energy transition to clean energy and electrification, but some of my resources are simply an excel spreadsheet)


Hah, that's super interesting. How are we looking? What under served areas are you seeing? Do you post anywhere your numbers?

They are also organic, all-natural, and fat-free! And renewable on geological timescales.

Contrary to windmills, which slows down the rotation of the earth.

Doesn't that depend whether you point them east or west?

Point them north and you'll increase Earth's axial tilt.


I think you just solved both leap seconds and daylight savings time.

I personally would like more hours in the day.

No problem: Just build a subterranean boat and launch a few nukes close to the core to restart rotation.

Won't someone think of the ~children~ birds?!

The US has their own oil fields.

If they can burn down the EU in that ongoing crisis, they don't care.

That's likely the strategy the administration is running.


And clean. Really, really clean. Just look at coal. A no-brainer. Go for it.

You mean "clean coal", right? Of course it's clean, it's right in the name.

People laugh at this, but anthracite genuinely is cleaner than other coal in every regard save CO2 emissions. People just think it's a joke because they've come to believe that CO2 is the only coal emission worth caring about, which definitely isn't true.

The oxymoronic term "clean coal" refers to carbon-capture-and-storage (CCS) technology [0], touted by the fossil fuel industry as a way to reduce greenhouse gas emissions and continue employing coal workers.

Thus far, it is incredibly expensive, at a time when solar and wind generation is cost-competitive with fossil-fuel plants which don't employ CCS. It is simply a dead end. You can generate more renewable energy, and store it, for far less than it takes to equip and operate CCS in conjunction with a fossil-fuel-fired plant. Only direct government subsidy makes it viable for a vanishingly small amount of GHG emissions.

[0]: https://en.wikipedia.org/wiki/Carbon_capture_and_storage


"Clean coal" is like saying "a fast snail". Sure it can be faster than other snails, but even if it's twice as fast as the second fastest snail, it's still a snail and I'll still laugh when an ant runs circles around it.

No, the criticism isn't because people get caught up about CO2 -- it's because "cleaner than other coal" is a very low bar to meet to be calling something "clean" full stop.

Also "clean coal" is not a type of coal being burnt (although that does matter too) but pollution control systems added to coal plants.


Anthracite burns clean enough to use in a pizza oven. If your neighbor told you he was going to install a new furnace and offered you the choice of it burning wood pellets or anthracite, from a smell standpoint you should absolutely choose the anthracite.

Anthracite, in these regards, is very different from bituminous coal.


Considering the mercury and arsenic in all coal, wood is preferred in ovens.

And both are very different from not burning anything.

Undoubtedly. Doesn't change the fact that one kind of coal burns smokeless with a clean blue flame while the other will cover everything for miles in a film of soot and tar.

>Anthracite burns clean enough to use in a pizza oven.

Yeah, so does wood, which is horribly polluting.


The smell of wood might be nice for flavor, but that's beyond the point of anthracite being clean. That particulate pollution from wood burning is severe compared to the smoke you'll get off anthracite, which is virtually nonexistent.

Regardless of how good it might be at being the cleanest dirty thing, it's not what the US trope of "clean coal" refers to anyway. Anthracite is not used in the US to generate power because it is too expensive.

The doesn't cause acid rain version is called "clean" and that seems pretty fair to me when the other version causes acid rain.

It is still dirtier than all of the alternatives we have.

The US (with Canada and Mexico) is self-sufficient with fossil fuel energy.

> The US (with Canada and Mexico) is self-sufficient with fossil fuel energy.

Oh boy can't wait for the reenactment of third reich intervening peacefully in czechoslovakia, for their own safety and wellbeing of course, and not at all for the resources they're hoarding, the filthy hoarders.


I do find the slow Sovietization of America funny, both mentally and economically. The year is 2050, autarky on energy has been established, the markets cut off, politics in the hands of erratic and geriatric leaders. Americans proudly drive 30 year old Fords the way people used to drive Ladas, while China exports green energy, cars and infrastructure to the world.

Unfortunately, we share the planet and the atmosphere with it.

If the US taunts someone into a nuclear war, the rest of us get to live but should be investing more in cancer research.

It's awesome the US hasn't destabilized one of those neighbors and alienated the other one by declaring it the prospective 51st state. Soft power really is America's super power.

I’d wager the US is self sufficient also in terms of renewable energies.

But it gets traded globally. That means if the price goes up in Asia, it also goes up in NA.

It doesn't have to be traded globally. The US could ban oil and gas exports, and that would decouple local prices from the global market.

Why the US can't use its own oil: it's the wrong type. https://blog.drillingmaps.com/2025/06/this-is-why-us-cant-us...

Imports into the US will experience inflation regardless. Semiconductor imports from East Asia are one example, since they depend on helium and energy from the Gulf.

tbh I’m kind of surprised the admin hasn’t enacted export tariffs on oil and gas already to take the pressure off car owners.

Wouldn’t do anything to the prices of imported products since the entire intl supply chain would be subject to even higher prices, but would reduce pressure at the pump


Sure, if we build out refining capacity for the next ten years. Then we're golden until we run out of the finite well of combustible dead algae. So if you think we can revitalize American manufacturing and resource processing starting now, and you're okay with those investments being worthless in a few decades, and you don't give a shit about rendering the planet significantly less habitable to human life, then yeah, we're totally self-sufficient with fossil fuels.

Or we could, you know, pull energy out of the air and sun, a strategy which will be viable until our star dies.


Alberta tar sands have hundreds of years worth of reserves. They're also expensive and incredibly dirty to extract and emit significantly more CO2 during processing than a light oil well will. (The tar is usually melted by heating with natural gas).

I'm quite confident cheap renewable alternatives will make the tar sands inviable far before they run out.


Some good news though, with the war in Iran the spiking oil price means that Albertan executives can ramp up operations and stay quite profitable! Push the price to 200/barrel and we'll just strip mine the entire province after airlifting out Calgary and Edmonton.

This assumes that there isn't profound demand destruction caused by the stratospheric energy prices.

Fossil fuels were already an inferior energy source when oil was $60/barrel. Electrification has been moving fast and accelerating, even at the pre-energy crisis prices.

Now? Current events are likely to take fossil fuels out back and give 'em the Old Yeller treatment with surprising speed.


I absolutely agree, _in market driven economies_, fossil fuels are slowly pricing themselves out of relevancy. The issue is that for some reason the US specifically subsidizes their usage keeping them artificially lowly priced.

So, how many billions of newly printed debt is Trump willing to throw at the problem to keep those subsidies up so that he can be sheltered from the scary windmills?


another option is not to shit on all countires who do have resources driving the prices up for everyone.

This is an article about paying private industry to not build wind capacity.

I don't agree with redirecting towards fossil fuels instead of wind power, but its not really paying TotalEnergies "for not building wind capacity", its more like changing what was ordered on behalf of the population: first the wind power capacity was ordered, then it was stalled and blocked, and now this president and TotalEnergies have agreed to change the order to another type of meal (investing in fossil fuel facilities within the US).

The US is unable to implement export controls so consuming less than it creates doesnt mean theres enough since producers will export if international prices are better

Ireland during the famine was self sufficient with food production but that didn't stop people from sending food to the highest bidders abroad.

It didn't look like that at the gas pump today.

Ignoring the part where just running everything off fossil fuel is suicidal for the planet, the US actually isn't self-sufficient with just fossil fuels.

https://ember-energy.org/latest-updates/fossil-fuels-fall-be...

Renewables are cheaper to build out, and we're facing a massive energy shortage. We need to be building renewable production as quickly as possible just to keep up with demand.

Insisting that we use obsolete, expensive and dirty technologies while the rest of the planet modernizes is just dumb.


Is SOC 2 legit? I have this on my roadmap but now I’m wondering if it’s just security theatre?


That's a difficult question to answer. It shouldn't be, but it is. The reality is, SOC2 is a sales-enablement tool. You should:

* Run a SOC2/compliance program that is entirely disjoint from your security practice.

* Defer SOC2 until the work required to sell into customers demanding it (phone calls, questionnaires) exceeds the cost of obtaining SOC2.

* Prepare for SOC2 by making simple best-practices engineering decisions, in particular single-signon for virtually everything and protected branches for all your repositories.

* Do not allow SOC2 to force any engineering decisions that you would not have intuitively made yourself (this is a big risk with the evidence-gathering platforms like Drata, Delve, and Vanta).

* Assume your SOC2 Type I report will suffice as a first attestation (ie: buy you 1 year of time) with all your customers, and understand that you cannot fail to obtain a Type I; your Type I is guaranteed.

Over 5-6 years of discussing SOC2 with other security practitioners pretty intensively, the overwhelming weight of the evidence is that ~practically nobody actually reads SOC2 reports; they just check the box for each vendor and move on. Plan accordingly.


Since you know a lot about SOC: is SOC2 Type I (point in time) enough to close enterprise sales? Is it worth getting for a new startup (seems super simple)?


Yes, it is, and no, you should not get it, not until you know you need it. If you have to ask, defer.


Thanks!


It's complicated. In theory, SOC2 forces you to do some important stuff, like define your threat model and say "I can mitigate against the threats and prove that my mitigations are in place". The problem is always that the companies that care don't need it but are burdened with it while the companies that don't care will just checkbox their way through it. It sort of enforces a very baseline security posture, in theory, but the major win of "We've thought our security through" is more of a choice - SOC2 can't actually force you to care.

A ton of these SOC2 vendors take all of the potential good parts of SOC2 out of the equation, building the threat models for you and then you just hook up your gsuite/ github and they check boxes for you or tell you to flip a policy here or there. Delve took this to the extreme by not even asking you to flip the checkboxes.

That said, it doesn't matter if it's legit. Everyone is SOC2, and part of being SOC2 is that the vendors whose products you purchase are SOC2, so it's not a choice - you have to be SOC2 if you want to sell (industry/ product specific, but at some point it'll be clear if it applies). If your goal is security, well, SOC2 is irrelevant.

Ultimately, you'll end up having a separate compliance team to manage SOC2 and you'll actively try to keep "real security" from it because real security has to change over time. You'll encode the absolute minimum possible into your compliance for that reason so that you can easily pass every year and then, if you care about security, you'll invest in that separately.


You can get a long, long way without SOC2; virtually every prospective customer you run into that asks for a SOC2 will have an alternate on-ramp for vendors without it, and the ones that don't will sign a contingent PO on your Type I, which (again) you are guaranteed to get.

The idea that SOC2 forces you to do important stuff gets it backwards; SOC2 documents your existing practice, and demands only extremely high-level controls that you can deliver in any number of ways. Your security practice should (minimally) inform your SOC2, not the other way around.


> You can get a long, long way without SOC2;

Yes, that's true. I edited my post to be a bit clearer about this. When you need a SOC2 is going to depend a lot on your business. Lots of companies can make exceptions very easily. Type 1 is easy, I would highly recommend starting there pretty much no matter what since it'll be good practice before your SOC2.

> The idea that SOC2 forces you to do important stuff gets it backwards;

It's the goal behind SOC2. You're assuming a company has a security practice that informs the SOC2 but I think the idea is that companies have no security practice and the SOC2 is what forces them to sit down and build one. What you're describing is more like what happens when a company that actually cares about security goes through SOC2 - you take what you have, put it into a NIST format, and map minimal controls from your practices to the CCs. Most companies have nothing to start with.


We did SOC 2 a few years ago, I'm glad we did it.

In my mind getting a clean report required three kinds of work:

1. Work that actively improved our security posture. 2. Work that didn't change much, but made our security posture easier to understand. 3. Busy work.

I think for most companies all three kinds of work will be required, but you can also make decisions that will push the percentages around. SOC 2 required us to start doing an annual security table top exercise. You could sit down, run a scenario, run it as fast as you can, and come up with a few pre-determined "improvements" that would help if you actually had that problem in the future. Or you could sit down and really put work into it, and see what works well and what doesn't.

As an example in our last tabletop I "exfiltrated" some data from one of our servers, and challenged the team to figure out what I'd done. The easy way out would have been for someone to say "We'll look at the logs and figure it out", but instead I asked them to actually try and find it. We discovered that the sheer volume of logs for that system made them hard to work with. So we made some changes to make them easier to work with and repeated the exercise later.

It could have been busy work, but instead we got real value from it.


It's security theater. Friendly plug for Oneleet, who actually talked us out of getting it.

We were considering getting certified, but it only really makes sense if your customers require you to have it.


Tangential to this but do ISO certifications make sense or are they security theater as well?

And another question but as a consumer, is there any certification which can meaningfully try to show if people/business take their security carefully or are all things security theater in that aspect and at some point, we just have to trust the enterprise and look for other signals of security (like for example blog posts which might show a deep-dive into security for example comes to my mind)


Not really. As long as current system where auditors are also clients of company being audited, the conflict of interest is too high.

Also, not to mention in many countries, the cost of getting breached is nothing so many companies are willing to just hope for the best and payout in case of the worst.


What about enterprise customers / sales?


For enterprise sales you can get a SOC 2 Type I faster than any enterprise sale goes through. Typically, most enterprises are okay if you show them proof that you are "in the process" of getting the certification by showing them that you have signed up with one of those platforms (Delve, Vanta, etc.), so you would be okay to start only when you are about to close one of those enterprise deals.


Yeah, we got a signed letter of engagement from our auditor, which was enough to unlock a customer without having to go through any sidestepping process.


Thanks!

Great info, thanks!

It’s fine for what it is: some light guardrails that attempt to nudge you towards answering “is this all just a house of cards that will obviously collapse under a light breeze”.

Getting a SOC2 doesn’t mean you’re amazing or secure or stable. If a customer says they’ll write you a fat check but they need you to have a SOC2, tell them you’ll get it within a year if they start paying. Otherwise don’t bother.


All of the audit / certifications are theatre. the only question is if your customer is required to participate in the show.

If you really care about security, you need to separate it from this stuff, it can only hurt you.

Do your own, real, security, and treat this compliance stuff as an opaque customer feature request.


It basically shows clients that you are not doing wildly incompetent things with their data, or if you are, they can more easily sue you, since you probably lied to your auditor about it.

But it’s ultimately not up to you if you do it or not. If all of your potential clients demand it, it’s generally easier to get it than it is to get on the phone with all of your potential clients’ IT departments and explain why you don’t have it.


It will go down in history as one of the most monumental avoidable disasters of all time.


It's so wild to me that the world invests in US treasuries to fund a country that spends like a drunken sailor on wars and stock buybacks, with no plan to ever pay down the debt, nor to invest in its domestic future via infrastructure or state capacity. "You need another $200B for a conflict with no purpose or need? Sure, here you go."


What, you mean this entire administration? Well, you’re right.


I think working with the technology gives you powerful intuitions that improve your skill and lead to better outcomes, but you don't really notice that that's what's happening. Personally speaking - and I suspect this is true of most people in general - I have very poor recollections of what it was like to be really bad/new at things that I am now very skilled at.

If you have try teaching someone something from the absolute ground up, you will quickly realize that a huge number of things you now believe are "standard assumptions" or "obvious" or "intuitive" are actually the result of a lot of learning you forgot you did.


I was about to comment that there was no amount of money I would take in return for spending time in prison but then I realized that of course that’s not true. It would be fun to create a survey that would show a visualization of where people tend to fall on the time/money axis for this.


It logically should track closely to the person's age and life expectancy and "legit job" earning potential. I would spend my years 20-29 in jail for $400M, wealth that I'd enjoy for the rest of my life, without hesitation. Heck, I'd have been willing to spend my twenties in prison for $40M. That's still life-changing never-have-to-work-again money. 30-39? I'd probably do it for $400M. 40-49? Hmm, now that's getting kind of tough. Maybe I'd do it for $1B. 50-59? I don't think I could physically do it, and given the number of years I had left, I probably wouldn't even be able to enjoy whatever sum we are talking about.


> I would spend my years 20-29 in jail for $400M

This is kind of why I want to make this survey now because there’s no way I’d spend a decade of my life in prison for any amount of money. I would do six months for $3M. I’d maybe do 12 for $10M. But beyond that…I don’t know, even a year seems like too long to be behind bars.


Would a guarantee of a different kind of prison environment change your mind? For example, prison conditions in the Netherlands versus the US? If you were allowed 6+ hours of positive, structured activities a day? Less than if you weren't in prison of course, but as we're talking about 'How much is it worth to you...'


Sure - I think it would decrease the amount of money I’d insist on, and/or increase the amount of time I’d tolerate, but only by a factor of 1.5 or so. Conversely, if I had to stay on an American supermax facility, the calculus would swing way in the other direction.


I disagree that it’s “just a text generator” but you are so right about how primed people are to think they’re talking to a person. One of my clients has gone all-in on openclaw: my god, the misunderstanding is profound. When I pointed out a particularly serious risk he’d opened up, he said, “it won’t do that, because I programmed it not to”. No, you tried to persuade it not to with a single instruction buried in a swamp of markdown files that the agent is itself changing!


I insist on the text generator nature of the thing. It’s just that we built harnesses to activate on certain sequences of text.

Think of it as three people in a room. One (the director), says: you, with the red shirt, you are now a plane copilot. You, with the blue shirt, you are now the captain. You are about to take off from New York to Honolulu. Action.

Red: Fuel checked, captain. Want me to start the engines?

Blue: yes please, let’s follow the procedure. Engines at 80%.

Red: I’m executing: raise the levers to 80%

Director: levers raised.

Red: I’m executing: read engine stats meters.

Director: Stats read engine ok, thrust ok, accelerating to V0.

Now pretend the director, when heard “I’m executing: raise the levers to 80%”, instead of roleplaying, she actually issue a command to raise the engine levers of a plane to 80%. When she hears “I’m executing: read engine stats”, she actually get data from the plane and provide to the actor.

See how text generation for a role play can actually be used to act on the world?

In this mind experiment, the human is the blue shirt, Opus 4-6 is the red and Claude code is the director.


For context I've been an AI skeptic and am trying as hard as I can to continue to be.

I honestly think we've moved the goalposts. I'm saying this because, for the longest time, I thought that the chasm that AI couldn't cross was generality. By which I mean that you'd train a system, and it would work in that specific setting, and then you'd tweak just about anything at all, and it would fall over. Basically no AI technique truly generalized for the longest time. The new LLM techniques fall over in their own particular ways too, but it's increasingly difficult for even skeptics like me to deny that they provide meaningful value at least some of the time. And largely that's because they generalize so much better than previous systems (though not perfectly).

I've been playing with various models, as well as watching other team members do so. And I've seen Claude identify data races that have sat in our code base for nearly a decade, given a combination of a stack trace, access to the code, and a handful of human-written paragraphs about what the code is doing overall.

This isn't just a matter of adding harnesses. The fields of program analysis and program synthesis are old as dirt, and probably thousands of CS PhD have cut their teeth of trying to solve them. All of those systems had harnesses but they weren't nearly as effective, as general, and as broad as what current frontier LLMs can do. And on top of it all we're driving LLMs with inherently fuzzy natural language, which by definition requires high generality to avoid falling over simply due to the stochastic nature of how humans write prompts.

Now, I agree vehemently with the superficial point that LLMs are "just" text generators. But I think it's also increasingly missing the point given the empirical capabilities that the models clearly have. The real lesson of LLMs is not that they're somehow not text generators, it's that we as a species have somehow encoded intelligence into human language. And along with the new training regimes we've only just discovered how to unlock that.


> I thought that the chasm that AI couldn't cross was generality. By which I mean that you'd train a system, and it would work in that specific setting, and then you'd tweak just about anything at all, and it would fall over. Basically no AI technique truly generalized for the longest time.

That is still true though, transformers didn't cross into generality, instead it let the problem you can train the AI on be bigger.

So, instead of making a general AI, you make an AI that has trained on basically everything. As long as you move far enough away from everything that is on the internet or are close enough to something its overtrained on like memes it fails spectacularly, but of course most things exists in some from on the internet so it can do quite a lot.

The difference between this and a general intelligence like humans is that humans are trained primarily in jungles and woodlands thousands of years ago, yet we still can navigate modern society with those genes using our general ability to adapt to and understand new systems. An AI trained on jungles and woodlands survival wouldn't generalize to modern society like the human model does.

And this makes LLM fundamentally different to how human intelligence works still.


> And I've seen Claude identify data races that have sat in our code base for nearly a decade

how do you know that claude isn't just a very fast monkey with a very fast typewriter that throws things at you until one of them is true ?


Iteration is inherent to how computers work. There's nothing new or interesting about this.

The question is who prunes the space of possible answers. If the LLM spews things at you until it gets one right, then sure, you're in the scenario you outlined (and much less interesting). If it ultimately presents one option to the human, and that option is correct, then that's much more interesting. Even if the process is "monkeys on keyboards", does it matter?

There are plenty of optimization and verification algorithms that rely on "try things at random until you find one that works", but before modern LLMs no one accused these things of being monkeys on keyboards, despite it being literally what these things are.


Of course it doesn't matter indeed. What I was hinting at is if you forget all the times the LLM was wrong and just remember that one time it was right it makes it seem much more magical than it actually might be.

Also how were the data races significant if nobody noticed them for a decade ? Were you all just coming to work and being like "jeez I dont know why this keeps happening" until the LLM found them for you?


I agree with your points. Answering your one question for posterity:

> Also how were the data races significant if nobody noticed them for a decade ?

They only replicated in our CI, so it was mainly an annoyance for those of us doing release engineering (because when you run ~150 jobs you'll inevitably get ~2-4 failures). So it's not that no one noticed, but it was always a matter of prioritization vs other things we were working on at the time.

But that doesn't mean they got zero effort put into them. We tried multiple times to replicate, perhaps a total of 10-20 human hours over a decade or so (spread out between maybe 3 people, all CS PhDs), and never got close enough to a smoking gun to develop a theory of the bug (and therefore, not able to develop a fix).

To be clear, I don't think "proves" anything one way or another, as it's only one data point, but given this is a team of CS PhDs intimately familiar with tools for race detection and debugging, it's notable that the tools meaningfully helped us debug this.


For someone claiming to be an AI skeptic, your post here, and posts in your profile certainly seem to be at least partially AI written.

For someone claiming to be an AI skeptic, you certainly seem to post a lot of pro-AI comments.

Makes me wonder if this is an AI agent prompted to claim to be against AIs but then push AI agenda, much like the fake "walk away" movement.


I have an old account, you can read my history of comments and see if my style has changed. No need to take my word for it.


Tangential off topic, but reminds me of seeing so many defenses for Brexit that started with “I voted Remain but…”

Nowadays when I read “I am an AI skeptic but” I already know the comment is coming from someone that has just downed the kool aid.


> No, you tried to persuade it not to with a single instruction

Even persuade is too strong a word. These things dont have the motivation needed to enable persuation being a thing. Whay your client did was put one data point in the context that it will use to generate the next tokens from. If that one data point doesnt shift the context enough to make it produce an output that corresponds to that daya point, then it wont. Thats it, no sentience involved


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: