Microsoft's ExCyTIn-Bench: Open Source AI Cybersecurity Testing Framework
Microsoft has launched ExCyTIn-Bench, an open-source benchmarking framework designed to evaluate how effectively large language models and agentic AI systems perform complex, multi-stage...