Securing the future of AI agents
Google DeepMind's AI Control Roadmap treats AI agents as potentially untrusted entities, using defense-in-depth and MITRE threat modeling to ensure secure deployment even with imperfect alignment.
Google DeepMind Blog ·