There isn't a single AI out there that won't lie to your face, reinterpret your prompt, or just decide to ignore it.
When they try to write a doc based on code, there is nothing you can do to prevent them from making up a load of nonsense and pretending it has been thoroughly validated.
Do we have any reason to believe alignment will be solved any time soon?
Why should this be an issue? We are producing more and more correct training data, and at some point the quality will be sufficient. It's not clear to me what argues against this.
We don't expect 100% reliability from humans. Humans will slack off, steal, defraud, harass each other, sell your source code to a foreign intelligence service, or turn your business into a front for international drug cartels behind your back. Some of that is very low probability, but never zero probability. So is it really a problem if we can't reduce the probability to literally zero for AIs either?
You want the AI aligned with writing accurate documentation, not aligned with a goal that's close but wrong, e.g. writing accurate-sounding documentation.