Hacker News (AI filter) AI Safety Apr 19

Banned by Anthropic?

Significance: 2/5

The article discusses the website Banned by Anthropic, which tracks instances where Anthropic's Claude AI models have refused to answer prompts. It serves as a repository for documenting perceived censorship or overly restrictive safety guardrails in the model.

Why it matters: Documenting refusal patterns exposes the tension between safety guardrails and model utility, highlighting the ongoing struggle over AI alignment and censorship thresholds.

Entities mentioned

Anthropic

Tags

#anthropic #claude #ai censorship #safety guardrails
