Microsoft Research Blog
SocialReasoning-Bench: Measuring whether AI agents act in users’ best interests
Microsoft Research Blog
Red-teaming a network of agents: Understanding what breaks when AI agents interact at scale
Publication