
Side-by-side comparison of Etched and SambaNova — pricing, features, and use cases. Reviewed by our editorial team in Jun 2026.


Etched's Sohu has not shipped to customers as of March 2026 and no independent benchmarks exist, making it a high-conviction bet rather than a proven platform.
Etched's Sohu is optimized to the millimeter for transformer operations—attention, projections, and feed-forward layers—achieving much higher performance per watt by eliminating the overhead of general programmability. Sohu claims 500K+ tokens/sec on Llama 70B, built on TSMC 4nm with 144GB HBM3E.
The core risk is architectural lock-in: DeepSeek V4 is the most downloaded model on Hugging Face as of early 2026 and it is a 671B MoE architecture that Sohu cannot serve. By contrast, SambaNova's SN50 RDU is built for this future and will start shipping to customers in the second half of 2026.
The SN50 RDU is SambaNova's fifth-generation AI inference processor designed specifically for large-scale and agentic workloads.
It uses its unique Dataflow technology and three-tiered memory architecture to reduce data movement, enabling faster inference, lower latency, and improved energy efficiency compared to traditional accelerator designs. SambaNova Composer is the proprietary layer that compiles your model graph for the RDU.
Unlike CUDA, which accepts arbitrary kernel code, Composer only supports architectures it can map to the RDU dataflow. This means no vLLM, no SGLang, no custom attention kernels. In February 2026, SambaNova raised a funding round led by Vista Equity, with Intel participating as a co-investor.
The raise came after reported acquisition talks with Intel failed and followed a period in which the company had struggled to close a new funding round amid intensifying competition with Nvidia.
Etched targets inference-only transformer-dominated workloads at hyperscale; SambaNova targets enterprise agentic AI and heterogeneous workloads that mix training and inference.
Etched's Sohu makes a permanent architectural bet; SambaNova's RDUs reconfigure per model, accepting broader model support at the cost of proprietary compilation.
For organizations serving homogeneous transformer inference at massive scale, Etched's specialization wins on throughput-per-watt if the transformer thesis holds.
For enterprises deploying diverse models, agents, and mixed training-inference, SambaNova's flexibility and current availability make it the lower-risk choice.
Pure transformer inference at hyperscale
Etched claims the total speedup reaches 20x over the H100 for Llama 70B inference, targeting inference-only deployments where flexibility is not required.
Agentic AI and multi-model inference
SambaNova RDUs take on high-speed decoding and can handle the prefill and decode phases of agentic inference, with support for MoE and diverse architectures.
Available hardware today
SambaNova raised a funding round in February 2026 and has existing deployments, while Etched is still ramping production.
4 use cases scored. Etched wins 0, SambaNova wins 2.
Neither tool publishes a starting price.
Neither tool offers a free tier or trial.
SambaNova averages 4.8 / 5 vs 4.5 / 5 on the other side.
SambaNova has 161 ratings vs 90 on the other.
Where each tool earns its rating — and where it falls short.



Every spec on one page. Live-pulled from each tool's detail page.
Quick answers to the questions readers ask before picking between these two.
No. DeepSeek V4 is the most downloaded model on Hugging Face as of early 2026 and it is a 671B MoE architecture that Sohu cannot serve. Sohu only supports dense transformer architectures.
As of April 2026 Sohu is not publicly available for purchase or rental. Etched is in customer engagement, but no public availability timeline has been confirmed.
SambaNova. The SN50 RDU will start shipping to customers in the second half of 2026, and earlier-generation SN40L systems are already deployed at enterprises and sovereign AI centers.
Sohu features 144GB of HBM3E memory per chip, while the SN40L RDU features a novel three-tier memory system with 520 MiB of on-chip SRAM, 64 GiB of on-package HBM, and up to 1.5 TiB of off-package DDR DRAM. Sohu optimizes for high bandwidth; SambaNova optimizes for capacity and flexible tiering.
Sohu's circuitry is optimized to the millimeter for the key operations of Transformers: attention, projections, and feed-forward layers, making it fixed-function. SambaNova's Dataflow Architecture allows data to flow from one AI operation to the next as an assembly pipeline, eliminating frequent, energy-intensive memory bottlenecks, and reconfigures per model.
SambaNova. The SambaRack can hot-swap between models in milliseconds and manage many models on the same infrastructure without the latency spikes common in shared cloud queues and GPU clusters, essential for agent-driven workflows.
Etched Sohu and SambaNova RDUs target overlapping but distinct personas within enterprise AI infrastructure.
Etched is an all-in bet on transformer dominance: if your workload is pure transformer inference at massive scale—serving models like Llama 70B with minimal model diversity—and you can wait for production availability and tolerate the risk of architectural obsolescence, Sohu offers unmatched throughput-per-watt and cost-per-token claims.
It suits hyperscalers and inference-only service providers who amortize fixed-function silicon over enormous inference volumes. SambaNova is a hedge against architectural fragmentation: its reconfigurable RDUs run transformers, MoE models like DeepSeek, and other workloads on the same silicon.
The dataflow architecture minimizes memory movement, making it efficient for agentic AI where models switch frequently and latency is critical.
With recent funding, Intel partnership, and fifth-generation hardware shipping in H2 2026, SambaNova offers lower execution risk for enterprises building production AI agents and sovereign AI. Choose Etched if you believe transformers are permanent and can commit to specialized ASIC hardware. Choose SambaNova if you need flexibility, agentic inference support, and hardware available now.
More ai infrastructure head-to-heads.
Receive weekly updates so you can stay up-to-date with the world of AI
Receive weekly updates so you can stay up-to-date with the world of AI