Automatic Robustness Stress Testing of LLMs as Mathematical Problem Solvers

Published in In Findings of ACL 2026

Yutao Hou, Zeguan Xiao, Fei Yu, Yihan Jiang, Xuetao Wei, Hailiang Huang, Yun Chen, Guanhua Chen

[arXiv]