Become a MacRumors Supporter for $50/year with no ads, ability to filter front page stories, and private forums.

MacRumors

macrumors bot
Original poster
Apr 12, 2001
69,465
40,573


Anthropic today introduced Claude Sonnet 4.5, which the company says is the "best coding model in the world," outperforming GPT-5 and Gemini 2.5 Pro. It's also the strongest model for building complex agents and using computers, plus Anthropic says that it shows substantial gains in reasoning and math compared to Opus 4.1.

anthopic-claude.jpg

Along with Claude Sonnet 4.5, Anthropic added a new terminal interface and checkpoints to Claude Code. Checkpoints serve as save points, saving your progress so you can roll back to a previous state as necessary.

The Claude apps now support code execution and file creation for making spreadsheets, slides, and documents directly in the conversation. Claude for Chrome is available as well, with Anthropic allowing some Max users to beta test the feature. There's also a Claude Agent SDK that's available for building agents.


According to Anthropic, Claude Sonnet 4.5 has improved capabilities and extensive safety training to improve its behavior, so there has been a reduction in sycophancy, deception, power-seeking, and the tendency to encourage delusional thinking.

Claude Sonnet 4.5 is available everywhere starting today, and Anthropic recommends that users upgrade.

Article Link: Anthropic Debuts Claude Sonnet 4.5 With Improved Coding
 
it seems people are so interested in AI, a game that you could play on a potato (assuming you could connect airpods to it) is getting more comment engagement than a major AI model release.
 
I'm not sure I can speak to how well one model does vs another. But I've tried all of them (Gemini, ChatGPT and Claude) and the UI/UX that Antrhopic provides is hands down the best. I know that isn't what most people care about but I really like it. It makes it much easier to work with and create documents and keep track of versions.
 
Will require more testing from me but considering gpt-5-codex is so close by Anthropic's own admission it's kind of hard to justify Sonnet's greater price tag and rate limits. gpt-5-codex through ChatGPT Plus is the best deal in AI coding right now.

Anthropic's platform is pretty solid from a feature perspective though, it's tempting to want to go all in on Anthropic and build everything around Claude's native tools. The Claude website experience is also really good and I know lots of people who prefer it over ChatGPT.

The dark horse is xAI who seem to be catching up fast in terms of best performance to price ratio for coding...
 
I'm not sure I can speak to how well one model does vs another. But I've tried all of them (Gemini, ChatGPT and Claude) and the UI/UX that Antrhopic provides is hands down the best. I know that isn't what most people care about but I really like it. It makes it much easier to work with and create documents and keep track of versions.
Claude is still the overall best of the bunch. The lead has only grown with 4.5
 
"and Anthropic recommends that users upgrade."

Ah really! I thought they would recommend we don't upgrade. No **** sherlock!
 
Claude models are actually really good. Much better than ChatGPT 5.

The problem are the heavy usage limits even for paid users. The £18/month one is so bad I was hitting the limit after an hour. Sometimes even faster.

Once you hit the limit it doesn’t fall back like ChatGPT does to the mini model. Once you hit the limit with Claude that’s you. You need to wait 5 hours to start using it again. They also have added more limits on top of this. For example, you might not hit a daily limit but a weekly limit. You could potentially be locked out for much longer than 5 hours…and I’m talking days.

I was using Claude and when I hit the limit I went back to ChatGPT until it reset.

To be fair, they do at least have an option of I think ~£90/month for 5x more usage on the Max plan. Most other services only have £20 and £200/month options.

Another thing with Claude is after a while it starts to, for lack of a better phrase, **** the bed. I don’t even know how to describe it but it’s like it almost goes crazy and loses focus and context. When coding it doesn’t “see” the code and starts coding gibberish outside it. Gemini does this too after a while so might just be the nature of LLMs. I am not talking about hallucinations.

I ended up deleting my Claude account. The only other AI that is on the same level as Claude is Gemini. Gemini might actually be better but I haven’t tested it enough. From what I’ve used on the free service it is really impressive.

ChatGPT is the best overall AI model but it absolutely sucks at coding. Gemini/Claude is where it’s at but again Claude is held back by it’s aggressive usage limits.
 
Register on MacRumors! This sidebar will go away, and you'll see fewer ads.