OpenAI’s o3 Pro and ChatGPT-5 Pro are much better at math. Claude 4.1 costs 20 credits and performs worse than Gemini 2.5 Pro or GPT-5 Thinking.
o3 Pro — key use cases: PhD-level math function optimization; scientific analysis; math-heavy code synthesis/refactoring/optimization.
There is strong demand across Windsurf and AI-dev Discord, Facebook, and Slack rooms for o3 Pro in Cascade for math/science/code workflows (e.g., PhD-level math function optimization, scientific analysis, math-heavy code synthesis).
Once available, please also include the ChatGPT-5 Pro model.
I’m happy to share reproducible prompt sets and benchmarks, and to help as an early tester to validate output quality.
Thanks!