• Bay Area Times
  • Posts
  • GPT-5.5 debuts as stronger than Opus 4.7, weaker than Mythos

GPT-5.5 debuts as stronger than Opus 4.7, weaker than Mythos

Brought to you by:

Top stories today:

  1. GPT-5.5 debuts as stronger than Opus 4.7, weaker than Mythos
  2. DeepSeek releases V4 Pro, V4 Flash open-weight models in preview
  3. Anthropic fixes Claude Code issues tied to reasoning, caching
  4. Meta, Microsoft plan cuts, buyouts affecting ~23,000 jobs
  5. Cohere, Aleph Alpha agree merger at ~$20B to build “sovereign” AI
  6. Cognition said to eye raising hundreds of millions at $25B valuation

0. Data and calendar

All values as of 6 AM ET / 3 AM PT, other than S&P500 and NASDAQ close (4 PM ET / 1 PM PT).

All times are ET.

Listen to our AI-generated podcast summarizing today’s newsletter (beware of hallucinations):

1. GPT-5.5 debuts as stronger than Opus 4.7, weaker than Mythos

GPT-5.5 topped the Artificial Analysis Intelligence Index (of public models), overtaking Claude Opus 4.7 and Gemini 3.1 Pro:

Anthropic’s Mythos is taking on GPT-5.5 across benchmarks

Similarly to GPT-5.4, GPT-5.5 (xhigh) has an 86% hallucination rate vs. Opus 4.7 (36%) and Gemini 3.1 Pro (50%)

  • Watch the on-demand webinar to see how Slackbot lives in the flow of work, already understanding your conversations, files, and people—delivering answers and actions that are relevant and personalized to you.

  • Learn from real Slack customers, like Engine and Asymbl, as they share how Slackbot finds answers, analyzes documents, schedules meetings, creates content, and orchestrates specialized agents to execute complex workflows.

  • See contextual intelligence in action as Slackbot moves work forward by respecting your permissions and working with the tools you trust.

*Sponsored.

3. DeepSeek releases V4 Pro, V4 Flash open-weight models in preview

V4 Pro topped open-weight models on the GDPval-AA leaderboard:

*Sponsored.

5. Anthropic fixes Claude Code issues tied to reasoning, caching, verbosity

Default reasoning set to “xhigh” for Opus 4.7 and “high” for other models to boost intelligence:

  • Overlapping experiments and a hard-to-reproduce bug delayed detection; fixed on Apr. 10.

  • Apr. 20: Anthropic reverted a prompt change that reduced verbosity but hurt performance.

  • Anthropic is adding tighter controls on system prompt changes.

  • The company will also have more staff use the exact public build of Claude Code.

Need domain experts for AI training work? Athyna Intelligence connects AI labs with financial analysts, economists, and quant researchers from Latin America who can evaluate models on:

  • Financial modeling & valuation

  • Investment analysis & portfolio theory

  • Risk assessment & derivatives pricing

*Disclaimer: We have equity in Athyna.

7. Meta, Microsoft plan cuts, buyouts affecting ~23,000 jobs as Meta targets ~10% layoffs

Meta plans to cut ~8K jobs starting May 20 as part of streamlining efforts:

Subscribe to keep reading

This content is free, but you must be subscribed to Bay Area Times to continue reading.

Already a subscriber?Sign in.Not now