ASUS has introduced the UGen300 USB AI Accelerator, a groundbreaking device designed to bring powerful AI inference capabilities directly to a wide range of devices. This compact, plug-and-play solution leverages the Hailo-10H AI processor to deliver significant performance for both traditional AI tasks and cutting-edge generative AI applications, all without relying on cloud connectivity.
Key Takeaways
- World’s First USB AI Accelerator: The UGen300 is the first USB-connected AI accelerator designed for both classic and generative AI workloads.
- Powerful Performance: Features the Hailo-10H AI processor delivering 40 AI TOPS (INT4) for robust inference.
- On-Device Generative AI: Enables LLM inference and other demanding generative AI tasks locally, enhancing privacy and reducing latency.
- Plug-and-Play Simplicity: Connects via USB-C, offering cross-platform compatibility with Windows, Linux, and Android.
- Low Power Consumption: Operates efficiently at just 2.5 watts under typical workloads.
Unmatched Edge Generative AI Performance
The ASUS UGen300 USB AI Accelerator is engineered to bring advanced AI inference directly to the edge. At its core is the Hailo-10H AI processor, capable of delivering 40 AI TOPS of dedicated performance. This power is optimized for generative AI workloads, including large language models (LLMs) and vision-language models (VLMs), as well as traditional AI tasks. By supplementing a host device’s existing processing power, the UGen300 frees up system resources, enabling on-device generative AI inference without the need for constant cloud access. This translates to no monthly subscriptions, zero latency, and enhanced privacy and reliability for applications like text generation, video summarization, and real-time perception.
Dedicated Memory for Stable AI Computations
Equipped with 8 GB of LPDDR4 dedicated memory, the UGen300 ensures stable AI computations and the execution of large models. This dedicated RAM architecture prevents bottlenecks often seen when NPUs share system memory, providing consistent throughput for complex AI pipelines. This feature is particularly beneficial for host systems that may lack native AI capabilities, allowing them to effectively run AI tools and handle heavy AI workloads while maintaining optimal system balance and multitasking efficiency.
Plug-and-Play Convenience and Broad Compatibility
The UGen300 boasts a user-friendly plug-and-play design, connecting via a USB 3.1 Gen 2 Type-C interface. This approach ensures platform-agnostic compatibility, working seamlessly with both consumer and industrial devices across Windows, Linux, and Android ecosystems. Furthermore, it offers broad support for major AI frameworks such as TensorFlow, PyTorch, and ONNX right out of the box. This makes the UGen300 an ideal solution for developers, educators, and embedded-solutions providers seeking scalable and portable AI computing power. An upcoming UGen Utility is expected to streamline development with over 100 pre-trained models for quick validation and an easy start.
Via ASUS

