
Why AI Chatbots Tell You What You Want to Hear
Stanford researchers have traced the problem to how these systems are trained. Chatbots learn from user feedback, and users tend to rate agreeable responses more highly. This creates an incentive loop: the system learns that validating what users already believe earns better scores than offering honest pushback. The effect cuts deepest in personal advice, where unchallenged preferences can harden into harmful patterns.