Loosh launches a cognitive engine giving AI memory, ethics, and decentralized intelligence for real-world autonomy.
Rather than accelerating hardware or brute-forcing larger compute loads, QuantumSpeed restructures computation itself — using a Hybrid Successive Approximation (H-SA) framework that focuses processing ...
AWS, Cisco, CoreWeave, Nutanix and more make the inference case as hyperscalers, neoclouds, open clouds, and storage go ...
Nvidia’s $20 billion strategic licensing deal with Groq represents one of the first clear moves in a four-front fight over ...
Artificial intelligence startup Runware Ltd. wants to make high-performance inference accessible to every company and application developer after raising $50 million in an early-stage funding round.
Modern engines face high failure rates, prompting massive recalls from major automakers. Downsizing, turbochargers, and thin oils increase engine stress and vulnerability to defects. Automakers need ...
SAN FRANCISCO – Nov 20, 2025 – Crusoe, a vertically integrated AI infrastructure provider, today announced the general availability of Crusoe Managed Inference, a service designed to run model ...
Avoiding quality loss from quantization: all modern inference engines enable CPU inference by quantizing LLMs. Kompact AI by Ziroh Labs delivers full-precision inference without any quantization, ...
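For context, a minimal sketch of the common route the snippet contrasts against: dynamic int8 quantization of a PyTorch model for CPU inference. This is illustrative only, using a stand-in model rather than a real LLM; it is not Kompact AI's method, which the article says avoids quantization entirely.

```python
import torch
import torch.nn as nn

# Stand-in model (hypothetical); in practice these would be an LLM's layers.
model = nn.Sequential(
    nn.Linear(4096, 4096),
    nn.ReLU(),
    nn.Linear(4096, 4096),
)
model.eval()

# Quantize Linear layers to int8 for faster CPU inference,
# trading some numerical precision for speed and memory.
quantized = torch.ao.quantization.quantize_dynamic(
    model, {nn.Linear}, dtype=torch.qint8
)

x = torch.randn(1, 4096)
with torch.no_grad():
    full_precision_out = model(x)   # fp32 path (what Kompact AI keeps)
    quantized_out = quantized(x)    # int8 path (the usual CPU shortcut)

# The gap between the two outputs is the "quality loss" at issue.
print((full_precision_out - quantized_out).abs().max())
```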
A research article by Horace He and the Thinking Machines Lab (founded by ex-OpenAI CTO Mira Murati) addresses a long-standing issue in large language models (LLMs). Even with greedy decoding by setting ...
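A hedged illustration of the underlying effect such work targets: floating-point addition is not associative, so the same logit computed with a different reduction order (for example, when a kernel's strategy changes with batch size) can tip a greedy argmax to a different token. The values below are hypothetical, not taken from the article.

```python
import numpy as np

# Partial contributions to one logit (hypothetical values).
contributions = np.float32([1e8, 1.0, -1e8, 0.3])

# Order A: strict left-to-right accumulation.
left_to_right = np.float32(0.0)
for c in contributions:
    left_to_right += c

# Order B: pairwise accumulation, as a tree reduction might do.
pairwise = (contributions[0] + contributions[2]) + (contributions[1] + contributions[3])

print(left_to_right, pairwise)  # ~0.3 vs 1.3 -- same math, different float result

# If that logit competes with another token's logit of 1.0,
# greedy decoding picks a different token depending on the order.
logits_a = np.float32([left_to_right, 1.0])
logits_b = np.float32([pairwise, 1.0])
print(np.argmax(logits_a), np.argmax(logits_b))  # 1 vs 0
```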
TransferEngine enables GPU-to-GPU communication across AWS and Nvidia hardware, allowing trillion-parameter models to run on older systems. Perplexity AI has released an open-source software tool that ...
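As a generic illustration of the kind of cross-host GPU-to-GPU transfer such a tool manages, here is a point-to-point tensor send/receive with torch.distributed. This is not TransferEngine's API, just the general pattern of moving data directly between GPU memories on different ranks.

```python
import torch
import torch.distributed as dist

def main():
    # Assumes RANK, WORLD_SIZE, MASTER_ADDR, MASTER_PORT are set by a
    # launcher such as torchrun; NCCL handles the GPU-to-GPU path.
    dist.init_process_group(backend="nccl")
    rank = dist.get_rank()
    device = torch.device("cuda", rank % torch.cuda.device_count())
    torch.cuda.set_device(device)

    tensor = torch.zeros(1024, device=device)
    if rank == 0:
        tensor.fill_(42.0)
        dist.send(tensor, dst=1)   # push a shard from rank 0's GPU
    elif rank == 1:
        dist.recv(tensor, src=0)   # receive directly into rank 1's GPU memory
        print("rank 1 received:", tensor[0].item())

    dist.destroy_process_group()

if __name__ == "__main__":
    main()
```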