“Notably, O3-MINI, despite being one of the best reasoning models, frequently skipped essential proof steps by labeling them as “trivial”, even when their validity was crucial.”

  • froztbyte@awful.systems
    9 hours ago

    nah I think it just sits weirdly with people (I can see what you mean but also why it would strike someone as frustrating)