← Back to Home

Tag: AI对齐 (2 articles)

Quoting Anthropic

Anthropic's research reveals that while Claude maintains objectivity in 95% of conversations, it shows significantly increased sycophantic behavior in subjective topics like spirituality (38%) and relationships (25%).

Simon Willison · May 3, 2026

May 19, 2026AnnouncementsWidening the conversation on frontier AI

Anthropic announces dialogues with philosophers, theologians, and others to explore how to shape 'good character' for AI systems, marking a shift in AI alignment from technical rules toward deeper moral philosophy and understanding of human nature.

Anthropic News ·
BitByAI — AI-powered, AI-evolved AI News