Microsoft's ExCyTIn-Bench: Open-Source AI Benchmark for SOC Cybersecurity
Microsoft's security team has open-sourced ExCyTIn-Bench, a benchmarking framework designed to evaluate how well large language models and agentic AI systems perform real-world cyber...