Deliver Superior AI-Powered Solutions Faster

Deliver Superior AI-Powered
Solutions Faster

The Challenge

Implementing deep learning on edge devices can revolutionize security applications. However, limited computational resources and power constraints pose challenges for running complex deep learning models in real-time on resource-constrained devices.

Enable New Applications on Edge Devices

Improve model inference and reduce model size and memory footprint to run on resource constrained devices.

Scale up Inference on Existing Edge Devices

Make the most of your devices and scale up inference more cost efficiently with better hardware utilization.

Improve User Experience with Better Inference Speed

Boost inference speed and achieve real-time performance without compromising on accuracy.

How Companies in the Public Sector are Leveraging Deci

Achieve Real Time Inference on the Edge & Maximize HW Utilization

A defense company developing electro-optics solutions for space, airborne, ground, and maritime applications was looking to improve the throughput of an image-denoising model for video stream analysis to deliver real-time insights on the edge. The team was also looking for a way to free up GPU resources to support additional parallel tasks on the same edge device.

Using the Deci platform and its NAS-based engine, the team built a new architecture that delivered a 1.58x acceleration of throughput while maintaining the original model’s accuracy. The team achieved the desired performance within 10 days. Once the team trained the model, they easily compiled and quantized it to TensorRT FP16 using Deci’s platform.

Ship Superior Product to Market
within a Few Weeks

A company developing an AI-powered video analytics technology for security applications was looking to improve one of its existing products by providing real-time alerts to its users. While the solution was already in production, running on a wide range of Intel CPUs, the team struggled to improve the model’s speed without compromising accuracy.

Using the Deci platform, the company’s team generated a new model architecture that delivered a 6.5x acceleration of throughput over the original SOTA architecture while maintaining the original model’s accuracy. The team trained, compiled, and quantized the new model on-premise, all within 14 days.

Enabling a New Detection Solution to Run on Highly Constrained Compute Resources

A defense company specializing in advanced land systems needed to process high-resolution images for object detection and tracking tasks running on an NVIDIA Jetson Xavier NX device. For the system to become operational, the team needed to run in a 10-watt power mode and achieve a throughput of 10 FPS.

Using Deci’s platform and its NAS-based engine, the team generated a new model that delivered a 3.08x increase in the throughput from 3.5 FPS to 10.8 FPS, and the model size was reduced by 40% from 40.04MB to 24.14 MB. As a result, the team obtained smooth object detection and tracking, allowing the company to deliver a new and superior product to the market.

"Deci's AutoNAC engine quickly produced a new model architecture that surpassed our optimized model - a model we’d been working on for 18 months prior. The development time and cost saved is remarkable and provides a significant advantage for our business. Moving forward, we intend to utilize Deci for every new model we develop."

Computer Vision & Deep Learning Team Lead
Large Security Company

Build Better Models Faster with Deci’s Deep Learning Development Platform

Main Capabilities Overview

The Deci platform is used by data scientists and machine learning engineers to build, optimize, and deploy highly accurate and efficient models to production. Teams can easily develop production grade models and gain unparalleled accuracy and speed tailored for any performance targets and hardware environment. Deci is powered by AutoNAC (Automated Neural Architecture Construction), the most advanced and commercially scalable Neural Architecture Search engine in the market. AutoNAC performs a multi-constraints search to find the architecture that delivers the highest accuracy for any performance targets and hardware environment.

Gain Superior Performance with Custom Architectures

Build accurate & efficient architectures tailored to your hardware and application’s performance targets with Deci’s Neural Architecture Search engine.

Simplify Runtime Optimization

Easily compile and quantize your models (FP16/INT8) and evaluate different production settings with a click of a button.

Maximize Accuracy with Advanced Training Techniques

Train models with SuperGradients. Leverage custom recipes and advanced training techniques (e.g. knowledge distillation, quantization-aware training) with one line of code.

Streamline Deployment with 3 Lines of Code

Deploy your models with Infery, Deci’s simple-to-use, unified, model inference API. Streamline deployment and boost serving performance with parallelism and concurrent execution. Compatible with multiple frameworks and hardware.

Related resources

Computer Vision

Deci

Video

Deci

Case Study

Deci Team