> We reviewed a demonstration of this specific technique being used to identify a small number of previously known, minor vulnerabilities. These vulnerabilities all appear relatively simple, and we have found that other publicly-available models are able to discover them as well without requiring a bypass.
Anthropic went from this is cybersecurity apocalypse to it’s no big deal, the model found trivial vulnerabilities.
Anthropic went from this is cybersecurity apocalypse to it’s no big deal, the model found trivial vulnerabilities.