CIO Influence

Untether AI Ships speedAI 240 Slim: World’s Fastest, Most Energy Efficient AI Inference Accelerator for Cloud to Edge Applications

Delivers AI Inference Silicon and Software Ideally Suited for Markets Including Automotive, Agriculture, and Machine Vision

Untether AI, the leader in energy-centric AI inference acceleration, today announced broad availability of its highly anticipated speedAI 240 Slim AI inference accelerator cards. Recently receiving top marks in the MLPerf benchmark for AI inference, speedAI 240 Slim cards provide customers the performance, energy efficiency, AI model support, and scalability they need for a broad range of applications from regional clouds to the edge. J-squared and Ola-Krutrim are among customers who have already deployed speedAI.

“The true potential for AI does not end with datacenters, it extends to the cars we drive and the fields that produce our food. Bringing AI to these environments is essential, and it requires a vastly different approach at both the hardware and software level,” said Chris Walker, CEO of Untether AI. “With our At-Memory Compute architecture, we are bringing proven datacenter-class AI acceleration to edge applications, at a price point, footprint and energy efficiency unrivaled in the industry.”

AI at the Edge Demands Energy Efficiency and Cost Efficiency

AI inference acceleration is anticipated to be 80% of the AI chip market by 2027, dominated by edge and on-prem datacenter applications. As the focus of AI shifts from training to inference, the importance and unique needs of edge acceleration are clear. Edge AI applications cannot tolerate the high latency, large capital costs, and non-determinism of cloud-based AI services – they require solutions that meet very different size, power, and operating cost requirements.

Available in a low-profile, 75-watt TDP PCIe design that delivers optimal performance and reduced power consumption, Untether AI’s speedAI 240 Slim accelerator cards were recently recognized as achieving the world’s lowest latency and highest throughput on the MLPerf inference benchmark. Customer applications for speedAI 240 Slim cards are broad, including automotive vision systems, object detection in aerospace and defense, defect identification in machine vision manufacturing and use in agricultural settings. For example, Untether AI recently announced an agreement with J-squared that includes development of edge AI compute machines for agricultural technology applications.

Each of these applications has its own AI models that require optimal performance, which highlights the flexibility and maturity of Untether AI’s imAIgine software development kit (SDK). The imAIgine SDK provides a push-button flow that streamlines the conversion of trained neural network models into optimized, inference-ready models for speedAI acceleration solutions.

Scalability for On-Prem and Regional Datacenters

The throughput and energy efficiency of Untether AI acceleration solutions, combined with their scalability, make them appealing for low-latency on-prem and regional datacenters. Ola-Krutrim has already deployed speedAI 240 Slim cards at its locations in India and the United States.

“Running the speedAI cards is the first step of our ongoing partnership with Untether AI,” said Sambit Sahu, SVP of Engineering at Ola-Krutrim. “We have them running with an Arm-based CPU system and are seeing that the cards are hitting their AI inference performance and power efficiency targets out of the box.”
