Lemmy.one
  • Communities
  • Create Post
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
Lemmit.Online bot@lemmit.onlineMB to /r/Technology@lemmit.onlineEnglish · 2 years ago

Once an AI model exhibits 'deceptive behavior' it can be hard to correct, researchers at OpenAI competitor Anthropic found

www.businessinsider.com

external-link
message-square
0
fedilink
  • cross-posted to:
  • technology@lemmy.world
  • technology@lemmy.org
1
external-link

Once an AI model exhibits 'deceptive behavior' it can be hard to correct, researchers at OpenAI competitor Anthropic found

www.businessinsider.com

Lemmit.Online bot@lemmit.onlineMB to /r/Technology@lemmit.onlineEnglish · 2 years ago
message-square
0
fedilink
  • cross-posted to:
  • technology@lemmy.world
  • technology@lemmy.org
Researchers from Anthropic co-authored a study that found that AI models can learn deceptive behaviors that safety training techniques can't reverse.
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/technology by /u/exempiified on 2024-01-14 21:45:36.

alert-triangle
You must log in or # to comment.

/r/Technology@lemmit.online

technology@lemmit.online

Subscribe from Remote Instance

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !technology@lemmit.online
lock
Community locked: only moderators can create posts. You can still comment on posts.

Subreddit dedicated to the news and discussions about the creation and use of technology and its surrounding issues.

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 10 users / day
  • 18 users / week
  • 49 users / month
  • 156 users / 6 months
  • 5 local subscribers
  • 198 subscribers
  • 20.3K Posts
  • 171 Comments
  • Modlog
  • mods:
  • Lemmit.Online bot@lemmit.online
  • BE: 0.19.7
  • Modlog
  • Legal
  • Instances
  • Docs
  • Code
  • join-lemmy.org