Tag
23 articles
Anthropic's Claude Fable 5 outperforms OpenAI's GPT-5.5 by 13 points on the toughest FrontierMath problems, marking a significant leap in AI mathematical reasoning.
Learn how to create and apply SkillOpt Markdown files to dramatically improve AI agent performance on procedural tasks, boosting models like GPT-5.5 by 23 points.
Nextdoor engineers are using OpenAI's Codex with GPT-5.5 to solve complex technical issues, streamline cross-platform development, and focus more on product outcomes than on repetitive coding tasks.
OpenAI enhances GPT-5.5 Instant with improved readability while retiring older models like o3 and GPT-4.5 by August 2026.
Anthropic releases Claude Opus 4.8, which outperforms GPT-5.5 and Gemini 3.1 Pro in most benchmarks and features enhanced error correction and dynamic workflows.
Warp integrates OpenAI's GPT-5.5 model to enhance coding agents across local, cloud, and open-source development workflows. The move positions Warp as a key player in AI-powered development tools.
DeepSeek has made its 75% discount on the V4-Pro model permanent, offering output tokens at least 34 times cheaper than GPT-5.5. This move could significantly impact the global AI pricing landscape.
Ramp engineers use Codex with GPT-5.5 to accelerate code review, reducing feedback cycles from hours to minutes. This AI-powered approach is transforming how development teams handle code quality assurance.
Databricks integrates GPT-5.5 into enterprise agent workflows following the model's state-of-the-art performance on the OfficeQA Pro benchmark.
OpenAI is integrating ChatGPT with bank accounts to offer personalized financial advice, leveraging GPT-5.5 Thinking. The feature, currently for Pro users in the U.S., will soon expand to all users.
OpenAI's GPT-5.5 has been found to match Anthropic's Claude Mythos in autonomous cyber attack simulations, according to the UK AI Security Institute. This highlights the growing capabilities and risks of advanced AI models in cybersecurity.
OpenAI is launching GPT-5.5-Cyber, an advanced cybersecurity model exclusively for trusted 'cyber defenders' rather than the general public. The move reflects a responsible approach to deploying sensitive AI technologies.