FortifiedAttack [any]

  • 3 Posts
  • 578 Comments
Joined 3 years ago
cake
Cake day: November 6th, 2022

help-circle


  • This is a perfect demonstration of how LLMs work and why they do not think.

    The base question here, that the model is most strongly statistically geared towards, is “How many Rs are in strawberry”. You can see how the response in the screenshot works as the template for the correct answer to this question.

    All it did was get the most likely response for the strawberry question (which is the closest, most confident match in structure to the blueberry question) , and then substitute specific tokens. This is essentially what it does with every response for any question. It uses the closest match from the data it is trained on, then substitutes individual terms, so it looks appropriate to the question.

    Ultimately every answer will only ever be an approximation, but there will never be any certainty to its correctness.