AI Safety
Research and practices focused on ensuring artificial intelligence systems behave safely and beneficially. Includes alignment research, interpretability, robustness testing, and governance frameworks.
Claims (3)
Anthropic Founded with AI Safety Mission
Anthropic was founded in 2021 with a stated mission focused on the responsible development of advanced AI for the long-term benefit of humanity.
OpenAI Charter Commits to Broad Benefit
OpenAI's charter states its mission is to ensure artificial general intelligence benefits all of humanity, published in April 2018.
Constitutional AI Reduces Need for Human Harm Labels
Anthropic's Constitutional AI method trains AI assistants to be harmless using AI feedback rather than requiring human feedback labels for harmful content.