"In 1920, there were 25 million horses in the United States, 25 million horses totally ambivalent to two hundred years of progress in mechanical engines.
And not very long after, 93 per cent of those horses had disappeared.
I very much hope we'll get the two decades that horses did."
I'm reminded of the idiom "be careful what you wish for, as you might just get it." Rapid technological change has historically led to prosperity over the long term, but not in the short term. My fear is that the pace of change this time around is so rapid that the short-term destruction will not be something we can recover from even over the longer term.
So that recreational existence, at the leisure of our own machinery, seems like one possible future humans can hope for too.
Turns out the chart is about farm horses only, as counted by the USDA, not including any recreational horses. So this is more about agricultural machinery vs. horses, not passenger cars.
---
City horses (the ones replaced by cars and trucks) were nearly extinct by 1930 already.
City horses were formerly almost exclusively bred on farms, but because of their practical disappearance such breeding is no longer necessary. They have declined in numbers from 3,500,000 in 1910 to a few hundred thousand in 1930.
My reading of TFA is exactly that: the author is hoping that we'll have at least a generation or so to adapt, like horses did, but is concerned that it might be significantly more rapid.
True, but the horses' population started (slightly) rising again when they went from economic tools to recreational tools for humans. What will happen to humans?
That's what Sandy over the road (born 1932, died last year), used to hitch up every morning at 4am, when he was ten, to sled a tank of water back to the farm from the local spring.
"You're absolutely right!" Thanks for pointing it out. I was expecting that kind of perspective when the author brought up horses, but found the conclusion to be odd. Turns out it was just my reading of it.
Somehow that article totally ignored the insane pricing of cached input tokens set by Anthropic and OpenAI. For agentic coding, typically 90-95% of the inference cost is attributed to cached input tokens, and a scrappy Chinese company can do it almost for free: https://api-docs.deepseek.com/news/news0802
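To illustrate the claim with a rough sketch (the prices and token counts below are made-up illustrative assumptions, not any provider's actual rates), here is why a long, repeatedly re-sent context makes cached input dominate the bill:

```python
# Hypothetical per-million-token prices in USD (assumptions for illustration only)
PRICE_CACHED_IN = 0.30   # cached input tokens, per 1M
PRICE_OUT = 15.00        # output tokens, per 1M

# A typical agentic-coding turn: a huge re-sent context, a short generated diff
cached_in_tokens = 800_000
output_tokens = 1_000

cost_in = cached_in_tokens / 1e6 * PRICE_CACHED_IN   # cost of cached input
cost_out = output_tokens / 1e6 * PRICE_OUT           # cost of output
share = cost_in / (cost_in + cost_out)               # cached input's share of total cost

print(f"cached-input share of cost: {share:.0%}")
```

With these assumed numbers the cached-input share lands in the ~90-95% range the comment describes; the exact figure obviously depends on real prices and workload.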
Regardless of whether you think imposing a $100k fee on H1Bs is a good idea or not, there is no way that a 2 day deadline makes sense from an implementation perspective. On a weekend too. This is just going to cause panic and confusion at the border.
Another funny, possibly sad, coincidence is that the licenses that made open source what it is will probably be absolutely useless going forward, because, as recent precedent has shown, companies can train on what they have legally gained access to.
On the other hand, AGPL continues to be the future of F/OSS.
MIT is also still useful; it lets me release code where I don't really care what other people do with it as long as they don't sue me (an actual possibility in some countries).
The US, for one. You can sue nearly anyone for nearly anything, even something you obviously won't win in court, as long as you find a lawyer willing to do it; you don't need any actual legal standing to waste the target's time and money.
Even the most unscrupulous lawyer is going to look at the MIT license, realize the target can defend it for a trivial amount of money (a single form letter from their lawyer) and move on.
You can sue for damages if they put malware in the code; no license protects you from liability for distributing harmful products, even if you distribute them for free.
And illegally too. Anthropic didn't pay for those books they used.
It's too late at this point. The damage is done. These companies trained on illegally obtained data and they will never be held accountable for that. The training is done and they got what they needed. So even if they can't train on it in the future, it doesn't matter. They already have those base models.
Then punitive measures are in order. Add it to the pile of illegal, immoral, and unethical behavior of the feudal tech oligarchs already long overdue for justice. The harm they have done and are doing to humanity should not remain unpunished.
And the legality of this may vary by jurisdiction. There’s a nonzero chance that they pay a few million in the US for stealing books but the EU or Canada decide the training itself was illegal.
Then the EU and Canada just won't have any sovereign LLMs. They'll have to decide if they'd rather prop up some artificial monopoly or support (by not actively undermining) innovation.
It’s not going to happen. The EU is desperate to stop being in fourth place in technology and will do absolutely nothing to put a damper on this. It’s their only hope to get out of the rut.
If I can reproduce the entirety of most books off the top of my head and sell that to people as a service, it's a copyright violation. If AI does it, it's fair use.
>If I can reproduce the entirety of most books off the top of my head and sell that to people as a service, it's a copyright violation. If AI does it, it's fair use.
Assuming you're referring to Bartz v. Anthropic, that is explicitly not what the ruling said, in fact it's almost the inverse. The judge said that output from an AI model which is a straight up reproduction of copyrighted material would likely be an explicit violation of copyright. This is on page 12/32 of the judgement[1].
But the vast majority of output from an LLM like Claude is not a word for word reproduction; it's a transformative use of the original work. In fact, the authors bringing the suit didn't even claim that it had reproduced their work. From page 7, "Authors do not allege that any infringing copy of their works was or would ever be provided to users by the Claude service." That's because Anthropic is already explicitly filtering out results that might contain copyrighted material. (I've run into this myself while trying to translate foreign language song lyrics to English. Claude will simply refuse to do this)[2]
They should still have to pay damages for possessing the copyrighted material. Possession of unauthorized copies is what courts have found to be copyright infringement. Remember all the 12 year olds who got their parents sued back in the 2000s? They had unauthorized copies.
I don't know what exactly you're referring to here. The model itself is not a copy, you can't find the copyrighted material in the weights. Even if you could, you're allowed under existing case law to make copies of a work for personal use if the copies have a different character and as long as you don't yourself share the new copies. Take the Sony Betamax case, which found that it was legal and a transformative use of copyrighted material to create a copy of a publicly aired broadcast onto a recording medium like VHS and Betamax for the purposes of time-shifting one's consumption.
Now, Anthropic was found to have pirated copyrighted work when they downloaded and trained Claude on the LibGen library. And they will likely pay substantial damages for this. So on those grounds, they're as screwed as the 12 year olds and their parents. The trial to determine damages hasn't happened yet though.
This was immediately my reaction as well, but I'm not a judge so what do I know. In my own mind I mark it as a "spice must flow" moment -- it will seem inevitable in retrospect but my simple (almost surely incorrect) take is that there just wasn't a way this was going to stop AI's progress. AI as a trend has incredible plot armor at this point in time.
Is the hinge that the tools can recall a huge portion (not perfectly, of course) but usually don't? What seems even more straightforward is the substitute-good idea: it seems reasonable to assume people will buy fewer copies of book X when they start generating books heavily inspired by book X.
But, this is probably just a case of a layman wandering into a complex topic, maybe it's the case that AI has just nestled into the absolute perfect spot in current copyright law, just like other things that seem like they should be illegal now but aren't.
Yea, that dipshit judge just opened the flood gates for more problems. The problem is they don't understand how this stuff works and they're in the position of having to make a judgement on it. They're completely unprepared to do so.
Now there's precedent for future cases where theft of code or any other work of art can be considered fair use.
So interestingly, free meant autonomy for Stallman and the original proponents of "copyleft" style licenses too. But autonomy for end users, not developers. Stallman et al. believed the copyleft-style licenses maximized autonomy for end users; rightly or wrongly, that was the intent.
I read through it, and I think the analysis suffers from the fact that in the case where the modifier is the user, it's fine.
Free software refers to user freedoms, not developer freedoms.
I don't think the below is right:
> > Notwithstanding any other provision of this License, if you modify the Program, your modified version must prominently offer all users interacting with it remotely through a computer network (if your version supports such interaction) an opportunity to receive the Corresponding Source of your version by providing access to the Corresponding Source from a network server at no charge, through some standard or customary means of facilitating copying of software.
>
> Let's break it down:
>
> > If you modify the Program
>
> That is if you are a developer making changes to the source code (or binary, but let's ignore that option)
>
> > your modified version
>
> The modified source code you have created
>
> > must prominently offer all users interacting with it remotely through a computer network
>
> Must include the mandatory feature of offering all users interacting with it through a computer network (computer network is left undefined and subject to wide interpretation)
I read the AGPL to mean if you modify the program then the users of the program (remotely, through a computer network) must be able to access the source code.
It has yet to be tested, but that seems like the common-sense reading to me (which matters, because judges do apply judgement). It just seems like they are trying too hard to find a legal gotcha. I'm not a lawyer so I can't speak to that, but I certainly don't read it the same way.
I don't agree with this interpretation of every-change-is-a-violation either:
> Step 1: Clone the GitHub repo
>
> Step 2: Make a change to the code - oops, license violation! Clause 13! I need to change the source code offer first!
>
> Step 1.5: Change the source code offer to point to your repo
This example seems incorrect -- modifying the code does not automatically make people interact with the program over a network...
"free software" was defined by the GNU/FSF... so I generally default to their definitions. I don't think the license falls afoul of their stated definitions.
That said, they're certainly anti-capitalist zealots; that's kind of their thing. I don't agree with that, but that's beside the point.
It's not really "virtually impossible to comply with". It's very restrictive, yes, but not hard to comply with if you want to.
And yes, it is an EULA pretending to be a license. I'd put good odds on it being illegal in my country, and it may even be illegal in the US. But it's well aligned with the goals of GNU.
Hell is, by design, a consequence for poor people. (People could literally pay the church to not go to hell[0]). Rich people have no consequences whatsoever, let alone poor people consequences.
Not "by design", as historically hell came first. It was only much later that the Catholic church started talking about purgatory and the possibility of reducing your punishment by paying money.
The people running AI companies have figured out that there is no such thing as hell. We have to come up with new reasons for people to behave in a friendly way.
We already have such reasons. Besides, all religious "kindness" was never kindness without strings attached, even though they'd like you to think that was the case.
Open source may be necessary but it is not sufficient. You also needed the compute power and architecture discoveries and the realisation that lots of data > clever feature mapping for this kind of work.
A world without open source might still have given birth to 2020s AI, but probably at a slower pace.
This is a common myth. It might explain why Harvard or MIT tuition is high, but not the average college's. Tuition mostly reflects staff costs, and those have been going up due to Baumol's cost disease. Dentistry, like many other industries whose main cost is highly educated staff and which haven't managed to scale production the way online brokerages have, has seen a similar price increase since 1970.
You’re going to have to qualify where you are talking about. Where I am, California, that only describes community colleges. Even state and especially UC have “invested” significantly in infrastructure improvements paid for with loans backed by expectations of tuition income, which has had an absurd effect on growing tuition far outside of inflation. Very little of your tuition at these schools goes towards teaching salaries.
> Even state and especially UC have “invested” significantly in infrastructure improvements paid for with loans backed by expectations of tuition income, which has had an absurd effect on growing tuition far outside of inflation.
What timeframe are you looking at?
Back in 2011, registration fees at UC Berkeley were $7,230 per semester, with $813 allotted to health insurance (which could be waived if you provided proof of existing insurance from your family), so $6,417 ignoring health insurance. Meanwhile, last year, registration fees were an eye-popping $9,847 for new students, but cost of health insurance grew much faster to $1,929 ($7,918 ignoring health insurance). This is about a 23% increase, compared to CPI-measured inflation of about 35% between Sep 2011 and Sep 2023.
(The next biggest driver of the overall increase was the campus fee, which went from $253 to $820.)
Or, if you look at just tuition alone, that went from $5,610 to $6,261, an increase of about 12%.
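For anyone who wants to check the arithmetic, here's the calculation behind those percentages, using the fee figures quoted above:

```python
def pct_increase(old: float, new: float) -> float:
    """Percent increase from old to new."""
    return (new - old) / old * 100.0

# Registration fees excluding health insurance, 2011 vs. 2023
fees = pct_increase(6417, 7918)     # roughly 23%
# Tuition alone
tuition = pct_increase(5610, 6261)  # roughly 12%
# CPI-measured inflation over the same window was about 35%, for comparison

print(f"fees: {fees:.1f}%, tuition: {tuition:.1f}%")
```

Both figures come in well under the ~35% CPI inflation over the same period, which is the point being made.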
I don't disagree, but they support my point that tuition has not changed meaningfully in the past decade (and then some), which is why I asked what timeframe you were looking at.
Inflation is perhaps not a good point of reference anyway, since in 2009, inflation per CPI was actually slightly negative. Cost of borrowing is not the same as cost of goods and services or cost of labor, for reasons such as the ones you point out (changes to banking regulations, increased risk aversion, etc.).
Although, I'm a little surprised that cost of borrowing would have been much higher, seeing as that was the start of the zero interest-rate policy in the US. The average 30-year fixed mortgage rate was hovering around 6-7% pre-crisis and 4-5% in the years immediately following it.
Perhaps some more generous explanations for the rapid tuition growth between 2000 and 2010:
- UC Merced was established in 2005, so I buy the argument on that point regarding investment in infrastructure.
- In 2009, the state's general funds accounted for $2.6 billion, compared to just under $3 billion in 2006. Student fees in that timeframe rose from $1.55 billion to $2 billion, tracking fairly closely with the corresponding shortfall in state funding. [0, 1] Yes, these numbers are also cherrypicked as a representative budget right before the GFC and shortly after, but they represent neither peak funding nor the overall sharpness of the budget cuts. So, I reject the claim that the tuition hikes in the aftermath of the GFC were due to increased borrowing costs for the UC system. I think a more mundane explanation is that the state had a budgetary shortfall due to less tax revenue being collected (income, property) and made cuts across the board; the UC system raised student fees to compensate.
https://thebottomline.as.ucsb.edu/2017/10/a-brief-history-of... provides an overview of how tuition at UCs evolved up through 2017, although it gets the state funding amounts off by 3 orders of magnitude (since the linked governor's budget is measured in thousands of dollars).
Does it help to submit duplicative arguments? I see some pretty strong arguments in the comments already. I wish there was a way to just upvote an existing comment.
While debatably unprofessional to blame your vendor, I found this read to be fascinating. I'm sure there are blog posts that detail how data centers work and fail but it's rare to get that cross over from a software engineering context. It puts into perspective what it takes for an average data center of this class to fail: power outage, generator failure, and then battery loss.
I think what it really does is emphasise how common it is for crap to hit the fan when things go wrong - even with the best laid plans.
The DC almost certainly advertises the redundant power supplies, generator backups and battery failover in order to get the customers. But probably doesn't do the legwork or spend the money to make those things truly reliable. It's a bit like having automated backups - but never testing them and discovering they're empty when they're really needed.
What incentive would there be to take the risk of being the first in the market?
One idea I've been mulling is a progressive corporate tax. It would encourage companies to split up if there aren't massive synergies to justify the increased tax.
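To make the incentive concrete, here's a minimal sketch with entirely made-up brackets and rates (none of this reflects any real tax code), showing why two smaller firms would owe less in total than one merged firm under marginal progressive brackets:

```python
# Hypothetical marginal brackets: (upper profit bound in USD, marginal rate)
BRACKETS = [(1_000_000, 0.10), (10_000_000, 0.20), (float("inf"), 0.35)]

def tax(profit: float) -> float:
    """Tax owed under the hypothetical progressive marginal brackets above."""
    owed, lower = 0.0, 0.0
    for upper, rate in BRACKETS:
        if profit <= lower:
            break
        # Only the slice of profit falling inside this bracket is taxed at its rate
        owed += (min(profit, upper) - lower) * rate
        lower = upper
    return owed

merged = tax(20_000_000)        # one big firm: top bracket applies to half its profit
split = 2 * tax(10_000_000)     # two half-size firms: neither reaches the top bracket

print(f"merged: ${merged:,.0f}  split: ${split:,.0f}")
```

With these numbers the merged firm owes $5.4M versus $3.8M for the two halves combined, so splitting up only makes sense when the synergies of staying merged are worth less than the bracket difference.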