“Notably, O3-MINI, despite being one of the best reasoning models, frequently skipped essential proof steps by labeling them as “trivial”, even when their validity was crucial.”

  • V0ldek@awful.systems
    link
    fedilink
    English
    arrow-up
    0
    ·
    edit-2
    9 hours ago

    read the study yourself

    • > ask the commenter if it’s a study or a self-interested blog post
    • > they don’t understand
    • > pull out illustrated diagram explaining that something hosted exclusively on the website of the for-profit business all authors are affiliated with is not the same as a peer-reviewed study published in a real venue
    • > they laugh and say “it’s a good study sir”
    • > click the link
    • > it’s a blog post
    • Soyweiser@awful.systems
      link
      fedilink
      English
      arrow-up
      0
      ·
      8 hours ago

      I wonder if they already made up terms like ‘bloggophobic’ or ‘peer review elitist’ in that ‘rightwinger tries to use leftwing language’ way.