• Home
  • Events Calendar
  • Blueprint Guidelines
  • Privacy Policy
  • Subscribe to Daily Newsletter
  • NextGenInfra.io
No Result
View All Result
Converge Digest
Friday, April 10, 2026
  • Home
  • Events Calendar
  • Blueprint Guidelines
  • Privacy Policy
  • Subscribe to Daily Newsletter
  • NextGenInfra.io
No Result
View All Result
Converge Digest
No Result
View All Result

Home » Positron AI Raises $51.6M Series A to Scale Custom Inference Silicon

Positron AI Raises $51.6M Series A to Scale Custom Inference Silicon

July 28, 2025
in Semiconductors, Start-ups
A A

Positron AI, a Reno-based startup focused on inference-optimized AI hardware, secured $51.6 million in an oversubscribed Series A funding round, bringing its total capital raised this year to over $75 million. Valor Equity Partners, Atreides Management, and DFJ Growth led the round, with participation from Flume Ventures, Resilience Reserve, 1517 Fund, and Unless. The funds will accelerate deployment of Positron’s first-generation Atlas inference accelerator and development of its next-generation system, Titan, powered by the company’s upcoming “Asimov” custom silicon.

Positron’s Atlas system, already deployed in production, is designed specifically for generative AI inference and claims 3.5x better performance-per-dollar and up to 66% lower power consumption than NVIDIA’s H100. Unlike general-purpose GPUs, the FPGA-based Atlas achieves 93% memory bandwidth utilization and supports models with up to 500 billion parameters per 2-kilowatt server. Atlas is compatible with Hugging Face models and OpenAI API endpoints and is currently in use by customers such as Parasail, SnapServe, and Cloudflare.

The upcoming Titan platform will feature up to two terabytes of directly attached high-speed memory per chip, enabling single-system inference for models with up to 16 trillion parameters. Titan also eliminates the traditional 1:1 GPU-to-model constraint with multi-agent hosting and is engineered for standard data centers without requiring exotic cooling solutions. The company plans to launch Titan in 2026, leveraging learnings from its current FPGA deployments and a software-first development approach.

  • Series A funding: $51.6M, led by Valor, Atreides, and DFJ Growth
  • Total 2025 funding: Over $75M
  • First-gen product: Atlas (FPGA-based), now in production
  • Next-gen product: Titan, powered by “Asimov” custom silicon (launching 2026)
  • Key specs: Up to 2TB memory per accelerator, 16T parameter model support
  • Deployment: Customers include Cloudflare, Parasail, and SnapServe
  • Founders: Thomas Sohmers (CTO), Edward Kmett (Chief Scientist), Mitesh Agrawal (CEO)
  • Headquarters: Reno, NV; remote-first team
  • Core advantage: High performance/$ and performance/watt for AI inference at scale

“We founded Positron to meet the demands of modern AI: aiming to run the frontier models at the lowest cost per token generation with the highest memory capacity of any chip available,” said Mitesh Agrawal, CEO of Positron AI.

🌐 Why it Matters: With AI inference costs rising sharply due to the dominance of NVIDIA GPUs, Positron’s custom silicon offers a compelling alternative for large-scale deployments. Its focus on memory capacity, energy efficiency, and API compatibility positions it as a serious contender in the rapidly expanding inference hardware space.

Founded in 2023 and based in Reno, Nevada, Positron AI is a privately held semiconductor startup that delivers inference‑optimized hardware for generative AI workloads—designed, fabricated, and assembled entirely in the United States  . The company’s mission is to challenge Nvidia’s dominance by offering energy‑efficient, cost‑effective AI inference capabilities with significantly lower power consumption and higher performance per dollar and per watt  .

Positron was co‑founded by Thomas Sohmers (CTO, a Thiel Fellow and serial entrepreneur) and Edward Kmett (Chief Scientist, a noted mathematician and programming expert). In 2025 they appointed Mitesh Agrawal—formerly COO at Lambda, where he helped scale revenue from $0.5 M to $500 M annually—as CEO to lead commercial scaling and go‑to‑market expansion  .

🌐 We’re tracking the latest developments in networking silicon. Follow our ongoing coverage at: https://convergedigest.com/category/semiconductors/

ShareTweetShare
Previous Post

Crusoe and Tallgrass to Build 1.8-GW AI Data Center Campus in Wyoming

Next Post

10 Years Ago: Chuck Robbins Takes over at Cisco, Ending the John Chambers Era

Jim Carroll

Jim Carroll

Editor and Publisher, Converge! Network Digest, Optical Networks Daily - Covering the full stack of network convergence from Silicon Valley

Related Posts

Cisco, G42, and AMD to Build AI Infrastructure in the UAE
AI Infrastructure

DigitalBridge Teams with KT for AI Data Centers in Korea

November 26, 2025
BerryComm Expands Central Indiana Fiber with Nokia
5G / 6G / Wi-Fi

Telefónica Germany Awards Nokia a 5-Year RAN Modernization Deal

November 26, 2025
AMD’s Compute + Pensando Network Architecture Powers Zyphra’s AI 
AI Infrastructure

AMD’s Compute + Pensando Network Architecture Powers Zyphra’s AI 

November 25, 2025
Bleu, the “Cloud de Confiance” from Capgemini and Orange
Clouds and Carriers

Orange Business Begins Migration of 70% of IT Infrastructure to Bleu Cloud

November 25, 2025
Dell’s server and networking sales rise 16% yoy
Financials

Dell Raises FY26 AI Infrastructure Outlook as AI Server Shipments Surge 150%

November 25, 2025
GlobalFoundries acquires Tagore Technology’s GaN IP
Optical

GlobalFoundries Acquires InfiniLink for Silicon-Photonics Expertise

November 25, 2025
Next Post
10 Years Ago: Chuck Robbins Takes over at Cisco, Ending the John Chambers Era

10 Years Ago: Chuck Robbins Takes over at Cisco, Ending the John Chambers Era

Categories

  • 5G / 6G / Wi-Fi
  • AI Infrastructure
  • All
  • Automotive Networking
  • Blueprints
  • Clouds and Carriers
  • Data Centers
  • Enterprise
  • Explainer
  • Feature
  • Financials
  • Last Mile / Middle Mile
  • Legal / Regulatory
  • Optical
  • Quantum
  • Research
  • Security
  • Semiconductors
  • Space
  • Start-ups
  • Subsea
  • Sustainability
  • Video
  • Webinars

Archives

Tags

5G All AT&T Australia AWS Blueprint columns BroadbandWireless Broadcom China Ciena Cisco Data Centers Dell'Oro Ericsson FCC Financial Financials Huawei Infinera Intel Japan Juniper Last Mile Last Mille LTE Mergers and Acquisitions Mobile NFV Nokia Optical Packet Systems PacketVoice People Regulatory Satellite SDN Service Providers Silicon Silicon Valley StandardsWatch Storage TTP UK Verizon Wi-Fi
Converge Digest

A private dossier for networking and telecoms

Follow Us

  • Home
  • Events Calendar
  • Blueprint Guidelines
  • Privacy Policy
  • Subscribe to Daily Newsletter
  • NextGenInfra.io

© 2025 Converge Digest - A private dossier for networking and telecoms.

No Result
View All Result
  • Home
  • Events Calendar
  • Blueprint Guidelines
  • Privacy Policy
  • Subscribe to Daily Newsletter
  • NextGenInfra.io

© 2025 Converge Digest - A private dossier for networking and telecoms.

This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy and Cookie Policy.
Go to mobile version