Moreover, they exhibit a counter-intuitive scaling limit: their reasoning effort increases with problem complexity up to a point, then declines despite having an adequate token budget. By comparing LRMs with their standard LLM counterparts under equivalent inference compute, we identify three performance regimes: (1) low-complexity tasks https://www.youtube.com/watch?v=snr3is5MTiU