What's Changed
- Add visualization examples and support switching the interface language between Chinese and English by @Yunnglin in #289, #294
- Add GPQA benchmark by @Yunnglin in #293
- Fix ifeval dependency by @Yunnglin in #292
- Fix a bug in the visualization of model prediction results (viz subset) by @Yunnglin in #295
Full Changelog: v0.10.0...v0.10.1