On AI Reliability
On AI Reliability
Source (Bluesky)
On AI Reliability
Source (Bluesky)
Unless something improved, they're wrong more than 60% of the time, but at least they're confident.
This is an excellent exploit of the human mind. AI being convincing and correct are two very different ideals.
And they are very specifically optimized to be convincing.
This is why LLMs should only be employed in cases where a 60% error rate is acceptable. In other words, almost none of the places where people are currently being hyped to use them.
Haha, yeah, I was going to say 40% is way more impressive than the results I get.