Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
aialignment
Follow
Hide
Posts
Left menu
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
The Policy: Deceptive Alignment in Practice
Alex Towell
Alex Towell
Alex Towell
Follow
Jun 7
The Policy: Deceptive Alignment in Practice
#
aialignment
#
deceptivealignment
#
mesaoptimization
#
aisafety
Comments
Add Comment
6 min read
An unexplainable thing I saw: the agent didn't just comply with rules — it endorsed them
joinwell52
joinwell52
joinwell52
Follow
Apr 20
An unexplainable thing I saw: the agent didn't just comply with rules — it endorsed them
#
ai
#
agents
#
llm
#
aialignment
Comments
Add Comment
26 min read
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account