Adding big blocks of SRAM to collections of AI tensor engines, or better still, a waferscale collection of such engines, turbocharges AI inference, as has ...
DeepSeek’s release of R1 this week was a ...
NEW YORK – VAST Data, the AI Operating System company, today announced a new inference architecture that enables the NVIDIA Inference Context Memory Storage Platform for deployments in the era of ...
Tripling product revenues, comprehensive developer tools, and scalable inference IP for vision and LLM workloads position Quadric as the platform for on-device AI. ACCELERATE Fund, managed by BEENEXT ...
It was only a few months ago that waferscale compute pioneer Cerebras Systems was bragging that a handful of its WSE-3 engines lashed together could run circles around Nvidia GPU instances based on ...
Responses to AI chat prompts not snappy enough? California-based generative AI company Groq has a super quick solution in its LPU Inference Engine, which has recently outperformed all contenders in ...
Iri Trashanski, Chief Strategy Officer at Ceva, is shaping the future of the Smart Edge with extensive experience across tech sectors. AI inference is happening across a network of local ...