Azure ND GB300 v6 Hits 1.1M Tokens/sec on Single NVL72 Rack
Microsoft Azure has achieved a groundbreaking milestone in cloud inference performance, demonstrating an aggregated throughput of 1.1 million tokens per second from a single NVL72 rack running the...