Skip to main content

Broken Actors ((new)) [2024]

: Multi-agent systems, AI safety, reward misspecification, robustness, failure modes.