Model Serving


Model serving refers to how trained models are made available for others to use. Choosing a model serving strategy is often the first step in model deployment, and the choice is shaped by factors such as user expectations, production requirements, business rules, and the existing technology stack. There are four common model serving strategies: batch inference, model as a service, online model as a service, and edge deployment.
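As a minimal sketch of the batch inference strategy, the snippet below scores a whole set of accumulated inputs in one offline pass. The `predict` function here is a hypothetical stand-in for a trained model's forward pass, used only for illustration; in practice the model would be loaded from a checkpoint or model registry, and the inputs and predictions would be read from and written to storage.

```python
def predict(x):
    # Hypothetical stand-in for a trained model's forward pass.
    return 1 if x > 0.5 else 0

def batch_inference(inputs):
    """Score a batch of inputs offline and return all predictions at once."""
    return [predict(x) for x in inputs]

# A batch job typically runs on a schedule: it reads the inputs that have
# accumulated since the last run, scores them together, and persists the
# results so they can be looked up later without invoking the model again.
predictions = batch_inference([0.2, 0.9, 0.7])
print(predictions)  # [0, 1, 1]
```

The other strategies differ mainly in when and where this scoring happens: model as a service and online model as a service expose the model behind an endpoint that answers requests on demand, while edge deployment runs the model on the user's device.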

For example, a pretrained model hosted on the Hugging Face Hub can be loaded for serving in just a few lines:
from transformers import AutoFeatureExtractor, AutoModelForImageClassification

# Load the preprocessing pipeline that matches the model's training setup.
extractor = AutoFeatureExtractor.from_pretrained("microsoft/resnet-50")

# Load the pretrained ResNet-50 image classifier, ready for inference.
model = AutoModelForImageClassification.from_pretrained("microsoft/resnet-50")