Remember how ChatGPT totally aced the bar exam? Wow! Yeah, turns out that was just a lie.

www.nytimes.com

Opinion | Press Pause on the Silicon Valley Hype Machine

37 comments
  • From Re-evaluating GPT-4’s bar exam performance (linked in the article):

    First, although GPT-4’s UBE score nears the 90th percentile when examining approximate conversions from February administrations of the Illinois Bar Exam, these estimates are heavily skewed towards repeat test-takers who failed the July administration and score significantly lower than the general test-taking population.

    Ohhh, that is sneaky!

    • What I find delightful about this is that I already wasn't impressed! Because, as the paper goes on to say:

      Moreover, although the UBE is a closed-book exam for humans, GPT-4’s huge training corpus largely distilled in its parameters means that it can effectively take the UBE “open-book”

      And here I was thinking it not getting a perfect score on multiple-choice questions was already damning. But apparently it doesn't even get a particularly good score!
