Deep learning inference acceleration is the end-to-end process of speeding up a neural model's inference while preserving its baseline accuracy. The process is aware of the target inference hardware, whether GPU, CPU, or an ASIC accelerator. Powered by AutoNAC, it helps AI teams squeeze maximum utilization out of any hardware, speed up a trained model's runtime, and shrink its memory footprint.
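To make the runtime and memory trade-off concrete, here is a minimal sketch of one common acceleration technique, post-training int8 quantization, using NumPy. This is only an illustrative example of the general idea, not AutoNAC or any specific product's implementation; all names and values are hypothetical.

```python
import numpy as np

# Illustrative post-training quantization: compress float32 weights to
# int8 (about 4x less memory), then run inference by dequantizing.
rng = np.random.default_rng(0)
W = rng.standard_normal((256, 256)).astype(np.float32)  # "trained" weights

# Symmetric per-tensor quantization: map [-max|W|, +max|W|] onto int8.
scale = np.abs(W).max() / 127.0
W_q = np.round(W / scale).astype(np.int8)

def infer(x: np.ndarray) -> np.ndarray:
    # Dequantize the weights inside the matmul; the accuracy loss is
    # bounded by the quantization step size.
    return x @ (W_q.astype(np.float32) * scale)

x = rng.standard_normal((1, 256)).astype(np.float32)
full = x @ W          # full-precision output
quant = infer(x)      # quantized output

print(W.nbytes // W_q.nbytes)  # memory reduction factor: 4
print(float(np.max(np.abs(full - quant))))  # small approximation error
```

Real inference accelerators combine techniques like this with hardware-aware graph compilation and architecture search, which is where awareness of the target GPU, CPU, or ASIC matters.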