Coca leaves contain various alkaloids, but not caffeine. Coca-Cola gets its caffeine from (traditionally) kola nuts, and (today, presumably) the usual industrial sources.
Are you at a company that tends to hire from non-traditional backgrounds? The topics you mention -- the underlying "how it works" of the tech we use to build things day to day -- should be, and in my experience are, the areas where juniors have the clearest understanding relative to more senior engineers, since they just finished 4+ years learning about it five days a week in detail.
Good point. A lot of those colleagues were indeed either fresh out of college (math, computer science, mechanical engineering, etc.) or had graduated in things like electrical engineering and worked in software roles for 3-5 years.
> IMHO the bigger issue with NaN-boxing is that on 64-bit systems it relies on the address space only needing <50 bits or so, as the discriminator is stored on the high bits.
Is this right? You get 51 tag bits, of which you must use one to distinguish pointer-to-object from other uses of the tag bits (assuming Huffman-ish coding of tags). But objects are presumably at minimum 8-byte sized and aligned, and on most platforms I assume they'd be 16-byte sized and aligned, which means the low three (four) bits of the address are implicit, giving 53 (54) bit object addresses. This is quite a few years of runway...
There's a bit of time, yes, but for an engine that relies on this format (e.g. SpiderMonkey), the assumptions associated with the value-boxing format would have leaked into the codebase all over the place. It's the kind of thing that's far less painful to take care of before you're forced to than after.
But fair point on the aligned pointers - that would give you some free bits to keep using, but it gets ugly.
You're right about the 51 bits - I always get mixed up about whether it's 12 bits of exponent, or the 12 includes the sign (it's 11 bits of exponent plus the sign bit). Point is it puts some hard constraints on a pretty large number of high bits of a pointer being free, as opposed to an alignment requirement for low-bit tagging which will never run out of bits.
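To make the bit accounting in this subthread concrete, here's a minimal C sketch of one possible NaN-boxing layout. It's made up for illustration (not SpiderMonkey's actual encoding): the top 13 bits mark the quiet-NaN space, one payload bit flags "this is an object pointer", and the low three address bits are dropped because 8-byte alignment makes them implicit, so a 53-bit address fits in the remaining 50 payload bits.

```c
#include <assert.h>
#include <stdint.h>
#include <stdio.h>

/* Illustrative layout only:
 *   bits 63..51  all ones -> sign + exponent + quiet bit, a NaN pattern
 *                            no arithmetic result will produce
 *   bit  50      set      -> "payload is an object pointer"
 *   bits 49..0            -> address >> 3 (low 3 bits implied by 8-byte
 *                            alignment), so up to 53-bit addresses fit
 */
#define QNAN_MASK    0xFFF8000000000000ULL
#define PTR_TAG      0x0004000000000000ULL
#define PAYLOAD_MASK 0x0003FFFFFFFFFFFFULL

static uint64_t box_ptr(void *p) {
    uint64_t addr = (uint64_t)(uintptr_t)p;
    assert((addr & 0x7) == 0);            /* must be 8-byte aligned */
    assert((addr >> 3) <= PAYLOAD_MASK);  /* must fit in 53 bits    */
    return QNAN_MASK | PTR_TAG | (addr >> 3);
}

static void *unbox_ptr(uint64_t v) {
    return (void *)(uintptr_t)((v & PAYLOAD_MASK) << 3);
}

int main(void) {
    static double slot = 3.14;            /* some 8-byte-aligned object */
    uint64_t boxed = box_ptr(&slot);
    printf("round-trips: %d\n", unbox_ptr(boxed) == (void *)&slot);
    return 0;
}
```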
This was 20+ years ago, so the "sophisticated" baseline wasn't ML or AI.
I was looking into an initial implementation and use of order files for a major platform. Quick recap: C (and similar languages) define that every function must have a unique address, but place no constraints on the relative order of those addresses. Choosing the order in which functions appear in memory can have significant performance impact. For example, suppose that you access 1,000 functions over a run of a program, each of which is 100 bytes in size. If each of those functions is mixed in with the 100,000 functions you don't call, you touch (and have to read from disk) 1000 pages; if they're all directly adjacent, you touch 25 pages. (This is a superficial description -- the thousand "but can't you" and "but also"s in your mind right now are very much the point.)
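A quick back-of-the-envelope in C, just to make the numbers above concrete, assuming 4 KiB pages (an assumption; the platform in the story isn't specified):

```c
#include <stdio.h>

/* The arithmetic from the paragraph above: 1,000 hot functions of 100
 * bytes each, either scattered among cold code (roughly one page touched
 * per function) or packed contiguously. */
int main(void) {
    const unsigned page = 4096;
    const unsigned nfuncs = 1000, fsize = 100;

    unsigned scattered = nfuncs;                          /* ~1 page each        */
    unsigned packed = (nfuncs * fsize + page - 1) / page; /* ceil(100000 / 4096) */

    printf("scattered: ~%u pages, packed: %u pages\n", scattered, packed);
    return 0;
}
```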
I went into this with moderately high confidence that runtime analysis was going to be the "best" answer, but figured I'd start by seeing how much of an improvement static analysis could give -- this would provide a lower bound for the possible improvement to motivate more investment in the project, and would give immediate improvements as well.
So, what are all the ways you can use static analysis of a (large!) C code base to figure out order? Well, if you can generate a call graph, you can do depth first or breadth first, both of which have theoretical arguments for them -- or you can factor in function sizes, page size, page read lookahead size, etc., and do a mixture based on chunking to those sizes... and then you can do something like an annealing pass, since a 4,097-byte sequence is awful and you're better off swapping something out for a slightly-less-optimal-but-single-page sequence, etc.
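As a sketch of the call-graph-driven flavor, here's a toy depth-first ordering over a hard-coded, invented call graph (the function names and edges are made up). A real tool would build the graph from the compiler or object files, then re-chunk the resulting sequence to page-sized groups and anneal away bad splits as described above.

```c
#include <stdio.h>

/* Emit each function the first time it's reached from the entry point,
 * so callers land near their callees in the output order. */

#define NFUNCS 6

static const char *name[NFUNCS] = {
    "main", "parse", "lex", "eval", "print", "error"
};

/* calls[i][j] != 0 means function i calls function j */
static const int calls[NFUNCS][NFUNCS] = {
    /* main  */ {0, 1, 0, 1, 1, 0},
    /* parse */ {0, 0, 1, 0, 0, 1},
    /* lex   */ {0, 0, 0, 0, 0, 1},
    /* eval  */ {0, 0, 0, 0, 1, 1},
    /* print */ {0, 0, 0, 0, 0, 0},
    /* error */ {0, 0, 0, 0, 0, 0},
};

static int visited[NFUNCS];

static void emit_dfs(int f) {
    if (visited[f]) return;
    visited[f] = 1;
    puts(name[f]);                  /* emit in discovery order */
    for (int j = 0; j < NFUNCS; j++)
        if (calls[f][j]) emit_dfs(j);
}

int main(void) {
    emit_dfs(0);                    /* start from the entry point */
    return 0;
}
```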
And to test the tool chain, you might as well do a trivial one. How about we just alphabetize the symbols?
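The trivial baseline really is trivial. A sketch, assuming the linker accepts an order file that is just a newline-separated list of symbol names (the symbols could be fed in from nm or any similar tool; everything here is illustrative, not the actual tooling from the story):

```c
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* Read symbol names from stdin, one per line, sort them alphabetically,
 * and print the result as an order file. */

static int cmp(const void *a, const void *b) {
    return strcmp(*(const char *const *)a, *(const char *const *)b);
}

int main(void) {
    char line[4096];
    char **syms = NULL;
    size_t n = 0, cap = 0;

    while (fgets(line, sizeof line, stdin)) {
        line[strcspn(line, "\n")] = '\0';
        if (n == cap) {
            cap = cap ? cap * 2 : 1024;
            char **grown = realloc(syms, cap * sizeof *syms);
            if (!grown) return 1;
            syms = grown;
        }
        syms[n] = strdup(line);
        if (!syms[n]) return 1;
        n++;
    }

    /* Alphabetical order: prefix-style namespaces end up contiguous. */
    qsort(syms, n, sizeof *syms, cmp);
    for (size_t i = 0; i < n; i++)
        puts(syms[i]);
    return 0;
}
```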
Guess which static approach performed best? Alphabetization, by a large margin. This was entirely due to the fact that (a) the platform in question used symbol name prefixes as namespaces; (b) callers that used part of a namespace tended to use significant chunks of it; and (c) call graph generation across multiple libraries wasn't accurate so some of these patterns from the namespaces weren't visible to other approaches.
The results were amazingly good. I felt amazingly silly.
(Runtime analysis did indeed exceed this performance, significantly.)
> 2. Become very good (top 25%) at two or more things.
Is this idea that top 25% is "very good" at something innumeracy, or a subtle insight I'm missing? There's got to be a million skills that you could assess rank at -- writing embedded C code, playing basketball, identifying flora, PacMan, archery, bouldering… I can't imagine ever being able to not continue this list -- and you should expect to be in the top 25% of roughly a quarter of those skills, obviously heavily biased towards the ones you've tried, and even more biased towards the ones you care about. It's hard to imagine anyone who's not in the top 25% of skill assessment in a dozen things, let alone two or more…
Is this idea that top 25% is "very good" at something innumeracy, or a subtle insight I'm missing? There have got to be a million skills you could assess rank at -- writing embedded C code, playing basketball, identifying flora, PacMan, archery, bouldering… I can't imagine ever being able to not continue this list -- and you should expect to be in the top 25% of roughly a quarter of those skills, obviously heavily biased towards the ones you've tried, and even more biased towards the ones you care about. It's hard to imagine anyone who's not in the top 25% in a dozen skills, let alone two or more…

Ignore the numbers - the gist is that being good enough at the right two or three things can create similar value for you to being the best at one specific thing.
Everyone (for the sake of my argument) wants to be an engineer at a FAANG, but there are tons of folks making more money with more autonomy because they've found a niche that combines their good-enough technical ability with an understanding of, or interest in, an underserved market.
It depends on the population you are drawing from. Being in the top quartile of embedded C developers worldwide is perhaps unimpressive (up to 2 billion people could still be better than you at embedded C programming), but being in the top quartile within the population of professional embedded C developers is much more impressive.
I think it's generally accepted that, at a high level, being in the top quartile is very good. Not excellent. Not unicorn. Just very good.
Beyond that, it's not about becoming very good at two completely orthogonal things; it's about becoming very good at two things that are complementary in some way that is of value to others. Being good at PacMan and bouldering is only particularly valuable if you are competing for opportunities to participate in a hypothetical mixed-reality video game, or perhaps a very niche streaming channel. Being in the top quartile at embedded C and at flora identification could result in building software/hardware tools to identify flora, which is a niche that currently has multiple competing products that are of high value to those interested.
If you consider your denominator to be the population of practitioners, rather than "everybody", top quartile would be pretty good. To use chess as an example, the 75th percentile of the global population probably knows the rules and nothing else. The 75th percentile of chess players would be an Elo of 1800 and change.
It's (obviously) a random number pulled out of someone's ass. However, I think top 25% isn't that far off. It means the top 25% of people who actually tried.
If it still sounds easy, try to reach the top 25% in a video game you're not familiar with (Diamond in StarCraft II or whatever). You'll find it's literally the workload of a full-time job.
A [chemist, biologist, mathematician, DSP researcher] who can code at a professional level (that 25%) is worth far more in the right position than either of those skills alone.
If you haven't tried Hidden Rose apples, give them a try. Besides being gorgeous, they have a tart:sweet ratio that's similar to Granny Smith, but with a texture that's further away from a baking apple and a thinner skin. Absolutely my favorite lately.