FriendliAI — founded by the researcher behind continuous batching, the technique at the core of vLLM — is launching InferenceSense, a platform that fills idle neocloud GPU capacity with paid AI ...
Microsoft has introduced Maia 200, its latest in-house AI accelerator designed for large-scale inference deployments inside Azure. The move reinforces Microsoft’s broader strategy of controlling more ...