On the Humanity’s Last Exam (HLE) benchmark, Kimi K2.5 scored 50.2% (with tools), surpassing OpenAI’s GPT-5.2 (xhigh) and ...
Kimi K2.5, an open-source model with a 262k-token context window, helps you ship code faster with accurate refactoring and tests.