DeepCube announced the launch of a new suite of products and services to help drive enterprise adoption of deep learning, at scale, on intelligent edge devices and in data centers.
The offerings build on DeepCube’s patented platform, which is the industry’s first software-based deep learning accelerator that drastically improves performance on any existing hardware.
Now, DeepCube will offer solutions for neural network training and inference, allowing users to leverage DeepCube’s technology to address challenges in their deep learning pipeline.
Additionally, a new service offering will make available DeepCube’s team of leading AI experts to support deep learning projects.
DeepCube’s new offerings include:
- CubeIQ: the first fully automated training framework that can take a model and surgically eliminate unnecessary parameters to ensure that physical, real-world constraints and profiles are met. CubeIQ trains models with a significant reduction in size, and with pre-knowledge of the end target and environment. This leads to a drastic speed increase, minimized compute footprint, and efficient edge deployment – all while maintaining accuracy.
- CubeEngine: an inference engine designed to run next-generation deep learning models for optimal performance. CubeEngine is designed to accelerate CubeIQ generated models by dynamically assigning optimal kernels suitable for the specific hardware and model execution. CubeEngine is architected as a composable inference engine, unlike prior generation monolithic inference engines.
- CubeAdvisor: an expert-level service offered to leverage DeepCube’s wide-ranging ML experience, with guidance from some of the world’s leading AI experts and PhDs. It helps customers design, optimize, and deploy deep learning models, ensuring that customers achieve the best performing model that fits their strict cost, performance, power, and latency requirements.
To trial the new suite of products, DeepCube utilized 2nd Gen AMD EPYC based cloud instances and the new DeepCube solutions to showcase high levels of inference performance on a multitude of popular neural networks, including ResNet-50, BERT-Large, and DLRM.
“The offerings announced by DeepCube today are the culmination of decades of work and research by some of the world’s leading experts in deep learning,” said Michael Zimmerman, CEO at DeepCube.
“We have long been focused on solving the technical challenges of training and inference for next-generation deep learning models, which is no easy feat – this is proven by the fact that so many enterprises are still unable to take their AI models out of the research stage.
“But we’re confident in the power of our patented technology, and by commercializing it through CubeIQ, CubeEngine and CubeAdvisor, we’re taking steps toward democratizing deep learning across industries.”
“AMD worked with DeepCube to preview the new solutions on cloud instances using 2nd Gen AMD EPYC processors, and saw fantastic performance for deep learning workloads,” said Kumaran Siva, corporate vice president, EPYC Cloud Business, AMD.
“We believe that DeepCube’s innovative CubeIQ and CubeEngine products, coupled with optimizations specific to current and future generation AMD EPYC CPUs, will set a new bar for performance and business metrics, such as inference throughput, latency, and performance/dollar.