Self-driving cars are only one example where it's tricky but critical to align AI and human goals. AP Photo/Michael Liedtke Ideally, artificial intelligence agents aim to help humans, but what does ...
Ideally, artificial intelligence agents aim to help humans, but what does that mean when humans want conflicting things? My colleagues and I have come up with a way to measure the alignment of the ...
Aidan Kierans has participated as an independent contractor in the OpenAI Red Teaming Network. His research described in this article was supported in part by the NSF Program on Fairness in AI in ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results
Feedback