Tag
105 articles
Anthropic reveals that Claude now writes over 90% of its code, and the company is calling for a global AI pause to ensure safe development.
Florida sues OpenAI and CEO Sam Altman over ChatGPT's alleged safety failures and lack of age verification, treating the AI as a defective product.
Google AI announced major advancements in multimodal models, safety measures, and enterprise applications in May 2026. The company's Gemini 2.0 release represents a significant leap in AI capabilities and accessibility.
This explainer covers what a verifiable pause in AI development means and why organizations like Anthropic are calling for coordinated safety measures to prevent AI from becoming too powerful too quickly.
Sam Altman urged Congress to fund AI safety testing instead of requiring model approvals, advocating for a balanced approach to AI regulation.
OpenAI proposes an international institute to strengthen AI safeguards and opportunities for young people, calling for global cooperation on youth AI safety.
Learn how researchers test AI systems for honesty using special 'honesty traps' to ensure AI gives accurate information in important fields like law, medicine, and finance.
OpenAI has clarified its approach to AI policy and political advocacy, emphasizing transparency, responsible regulation, and maintaining control over its messaging. The company will not allow external political groups to speak on its behalf, reinforcing its commitment to AI safety and beneficial development.
Anthropic has filed confidentially for an IPO, marking a significant step toward going public as the AI industry continues to expand.
Learn how the Microsoft Agent Governance Toolkit ensures AI agents behave safely by checking their identity, trust score, and actions before they execute tasks.
OpenAI introduces its Frontier Governance Framework to align AI safety practices with EU and California regulations. The framework emphasizes proactive risk management and regulatory compliance as AI systems advance.
New analysis reveals that OpenAI's Opus 4.8 and Anthropic's Claude Mythos Preview have similar misalignment rates, suggesting ongoing challenges in AI alignment across the industry.