Delivers AI Inference Silicon and Software Ideally Suited for Markets Including Automotive, Agriculture, and Machine Vision
Untether AI, the leader in energy-centric AI inference acceleration, today announced broad availability of its highly anticipated speedAI 240 Slim AI inference accelerator cards. Recently receiving top marks in the MLPerf benchmark for AI inference, speedAI 240 Slim cards provide customers the performance, energy efficiency, AI model support, and scalability they need for a broad range of applications from regional clouds to the edge. J-squared and Ola-Krutrim are among customers who have already deployed speedAI.
“We have them running with an Arm-based CPU system and are seeing that the cards are hitting their AI inference performance and power efficiency targets out of the box.”
“The true potential for AI does not end with datacenters, it extends to the cars we drive and the fields that produce our food. Bringing AI to these environments is essential, and it requires a vastly different approach at both the hardware and software level,” said Chris Walker, CEO of Untether AI. “With our At-Memory Compute architecture, we are bringing proven datacenter-class AI acceleration to edge applications, at a price point, footprint and energy efficiency unrivaled in the industry.”
Also Read: A Comprehensive Guide to DDoS Protection Strategies for Modern Enterprises
AI at the Edge Demands Energy Efficiency and Cost Efficiency
AI inference acceleration is anticipated to be 80% of the AI chip market by 2027, dominated by edge and on-prem datacenter applications. As the focus of AI shifts from training to inference, the importance and unique needs of edge acceleration are clear. Edge AI applications cannot tolerate the high latency, large capital costs, and non-determinism of cloud-based AI services – they require solutions that meet very different size, power, and operating cost requirements.
Available in a low-profile, 75-watt TDP PCIe design that delivers optimal performance and reduced power consumption, Untether AI’s speedAI 240 Slim accelerator cards were recently recognized as achieving the world’s lowest latency and highest throughput on the MLPerf inference benchmark. Customer applications for speedAI 240 Slim cards are broad, including automotive vision systems, object detection in aerospace and defense, defect identification in machine vision manufacturing and use in agricultural settings. For example, Untether AI recently announced an agreement with J-squared that includes development of edge AI compute machines for agricultural technology applications.
Also Read: A Comprehensive Guide to DDoS Protection Strategies for Modern Enterprises
Each of these applications have their own unique AI models that require optimal performance, highlighting the flexibility and maturity of Untether AI’s imAIgine software development kit (SDK). imAIgine SDK provides a push-button flow, streamlining the process of converting trained neural network models into optimized, inference-ready models to be run on speedAI acceleration solutions.
Scalability for on-prem and regional datacenters
The throughput and energy efficiency of Untether AI acceleration solutions, combined with their scalability, make them appealing for low-latency on-prem and regional datacenters. Ola-Krutrim has already deployed speedAI 240 Slim cards at their locations in India and the United States.
“Running the speedAI cards is the first step of our ongoing partnership with Untether AI,” said Sambit Sahu, SVP of Engineering at Ola-Krutrim. “We have them running with an Arm-based CPU system and are seeing that the cards are hitting their AI inference performance and power efficiency targets out of the box.”