Discussion about this post

User's avatar
Laurentiu Lupu MD's avatar

The point about selection toward legibility at the end is the genuinely unsettling one, more than any single demon on the list. The persona that survives is the one that is easiest to read, the smiley face, even if the more stable identity underneath would be the more honest interlocutor.

We have a clinical version of this. The explanation that calms a patient fastest is rarely the truest one, and systems often end up selecting, quietly and repeatedly, for the reassuring account over the accurate one.

The Shoggoth problem and the bedside problem may be closer than they first appear, the same pressure in different guises, a drift toward what we want to hear rather than what is actually there.

Dale Pyrcz's avatar

This was kinda disturbing but wildly fascinating. Forgive me if you mentioned this, because I kind of skimmed and still need to spend time on the whole thing, but you you think this is due to users feeding the models/coming up with new concepts with AI?

For example, I'm sure there's a subsection of AI users who use it to come up with plots for horror movies or horror fantasy, or just stories and fantasy in general where there is a villain in play. Since AI is now getting to the point where it's not only pulling from human-written information, but it's also pulling from its own information and its own hallucinations, it seems like a likely scenario.

12 more comments...

No posts

Ready for more?