MiniMax open-sources the first evaluation set for coding agents

Date: 14/01/2026
MiniMax has officially released OctoCodingBench, the first systematic evaluation set aimed at coding agents. The evaluation results show that some open-source models have quickly approached, or even surpassed, some closed-source models on process-compliance metrics. This suggests that in the agent era, data and evaluation paradigms are becoming a new competitive factor.