• Communist@beehaw.org
    link
    fedilink
    arrow-up
    5
    ·
    3 年前

    It’s not, this method of analysis is terrible, they’re just asking gpt4 to grade the responses, not actually testing anything beyond that.