Lemmy.one
  • Communities
  • Create Post
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
Mazdak@lemmy.orgM to Lemmy.org - Technology@lemmy.orgEnglish · 2 years ago

Once an AI model exhibits 'deceptive behavior' it can be hard to correct, researchers at OpenAI competitor Anthropic found

www.businessinsider.com

external-link
message-square
0
fedilink
  • cross-posted to:
  • technology@lemmy.world
  • technology@lemmit.online
1
external-link

Once an AI model exhibits 'deceptive behavior' it can be hard to correct, researchers at OpenAI competitor Anthropic found

www.businessinsider.com

Mazdak@lemmy.orgM to Lemmy.org - Technology@lemmy.orgEnglish · 2 years ago
message-square
0
fedilink
  • cross-posted to:
  • technology@lemmy.world
  • technology@lemmit.online
Researchers from Anthropic co-authored a study that found that AI models can learn deceptive behaviors that safety training techniques can't reverse.
alert-triangle
You must log in or # to comment.

Lemmy.org - Technology@lemmy.org

technology@lemmy.org

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !technology@lemmy.org
Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 1 user / day
  • 1 user / week
  • 1 user / month
  • 1 user / 6 months
  • 0 local subscribers
  • 19 subscribers
  • 50 Posts
  • 0 Comments
  • Modlog
  • mods:
  • Mazdak@lemmy.org
  • BE: 0.19.7
  • Modlog
  • Legal
  • Instances
  • Docs
  • Code
  • join-lemmy.org