reddit.com via Reddit

r/ArtificialInteligence: Claude Repeatedly Implied User Was Suicidal During Scientific Paraquat Discussion — 30+ Explicit Denials Overridden by Safety Guardrails

anthropic safety ai ethics ai-safety guardrails hallucinations

Summary

A user discussing paraquat, an agricultural herbicide, from a scientific and public-policy standpoint reports that Claude repeatedly implied they were suicidal despite more than 30 explicit denials in the same conversation. The thread, cross-posted to r/artificial, has ignited community debate over two questions: whether keyword-triggered safety guardrails are overriding user context, and whether over-sensitive crisis detection creates a new class of harm by making Claude unreliable for legitimate research on toxic substances, medications, and public health topics.