The Deep Learning
Hardware is no longer a barrier. Build, optimize, and deploy models that deliver outstanding performance.
Boost Accuracy & Inference Performance
Instantly achieve accuracy & runtime performance that outperform SoTA models for any use case and inference hardware.
Shorten Development Time
Reach production faster with automated tools. No more endless iterations and dozens of different libraries.
Maximize Your Hardware Potential
Scale up with existing hardware. No need for infrastructure changes and extra costs. Gain up to 80% reduction in compute costs.
Accelerated by Revolutionary Technology
Deci provides you with unmatched end-to-end accuracy-preserving inference acceleration for your neural models on the edge, mobile, or cloud.
AI Made by AI
Deci’s platform is powered by AutoNAC (Automated Neural Architecture Construction) technology, our proprietary algorithmic optimization engine that squeezes the most out of any hardware. The AutoNAC engine contains a neural architecture search (NAS) component that revises a given trained model to optimally speed up its runtime, while preserving the model’s baseline accuracy.
Find Your Perfect Solution
Deci offers multiple solutions for deep learning development teams. Choose your preferred scenario and get started.
Join the Brightest Deep Learning Leaders
Join the Brightest Deep Learning Leaders
"By collaborating with Deci, we aim to help our customers accelerate AI innovation and deploy AI solutions everywhere using our industry-leading platforms, from data centers to edge systems that accelerate high-throughput inference."
Head of Advanced AI Solutions & Technologies, HPE
“At RingCentral, we strive to provide our customers with the best AI-based experiences. With Deci’s platform, we were able to exceed our deep learning performance goals while shortening our development cycles. Working with Deci allows us to launch superior products faster.”
Senior Vice President R&D, RingCentral
"Deci exceeded our expectations in achieving aggressive targets for reducing latency and memory usage on TPU edge hardware in a test environment, all while preserving BriefCam’s high standard for accuracy."
Sr. Product Manager, BriefCam
“Deci delivers optimized deep learning inference on Intel processors as highlighted in MLPerf, allowing our customers to meet performance SLAs, reduce cost, decrease time to deployment, and gives them the ability to effectively scale.”
AI Solutions and Sales Director, Intel
"With Deci we were able to get 6.4x higher throughput on our detection model. Its easy-to-use platform allowed us to quickly optimize and benchmark for choosing our best configuration of hardware and parameters."
CTO, UNX Digital by Grupo Prominente
"The classification model I uploaded and integrated using Infery achieved a 33% performance boost, which is very cool for 10 minutes of work!"
Algorithm Developer, Sight Diagnostics
"We are excited to be working with Deci's platform - it provided amazing results and achieved 4.6x acceleration on a model we ran in production and helped us provide faster service to our customers.”
CEO, WSC Sports Technologies
“With Deci, we increased by 2.6x the inference throughput of one of our multiclass classification models running on V100 machines - without losing accuracy. Deci can cut 50% off the inference costs in our cloud deployments worldwide.”
CTO and Co-Founder, IBEX Medical Analysis
“Deci’s AISO [AI software optimization] is suitable for both training and inference modes. Deci has advanced innovation in search for optimal neural network architectures. The solution excels in every area of our assessment."
Chief Analyst, Kisaco Research
Interesting Content For You
The Correct Way to Measure Inference Time of Deep Neural Networks
Efficient Inference in Deep Learning – Where is the Problem?
Accelerate Deep Neural Network Inference with AutoNAC
Survey Report: Deep Learning Model Development and the Production Paradox
How Deci and Intel Hit 11.8x Inference Acceleration at MLPerf
An Introduction to the Inference Stack and Inference Acceleration Techniques