Edit

Share via


Deploy a fine-tuned model for testing (Preview)

After you've fine-tuned a model, you may want to test its quality via the Chat Completions API or the Evaluations service.

A Developer Tier deployment allows you to deploy your new model without the hourly hosting fee incurred by Standard or Global deployments. The only charges incurred are per-token. Consult the pricing page for the most up-to-date pricing.

Important

Developer Tier offers no availability SLA and no data residency guarantees. If you require an SLA or data residency, choose an alternative deployment type for testing your model.

Developer Tier deployments have a fixed lifetime of 24 hours. Learn more below about the deployment lifecycle.

Deploy your fine-tuned model

To deploy your model candidate, select the fine-tuned model to deploy, and then select Deploy.

The Deploy model dialog box opens. In the dialog box, enter your Deployment name and then select Developer from the deployment type drop-down. Select Create to start the deployment of your custom model.

Screenshot showing selecting Developer deployment in AI Foundry.

You can monitor the progress of your new deployment on the Deployments pane in Azure AI Foundry portal.

Use your deployed fine-tuned model

After your custom model deploys, you can use it like any other deployed model. You can use the Playgrounds in the Azure AI Foundry portal to experiment with your new deployment. You can continue to use the same parameters with your custom model, such as temperature and max_tokens, as you can with other deployed models.

Screenshot of the Playground pane in Azure AI Foundry portal, with sections highlighted.

You can also use the Evaluations service to create and run model evaluations against your deployed model candidate as well as other model versions.

Clean up your deployment

Developer deployments will auto-delete on their own regardless of activity. Each deployment has a fixed lifetime of 24 hours after which it is subject to removal. The deletion of a deployment doesn't delete or affect the underlying customized model and the customized model can be redeployed at any time.

To delete a deployment manually, you can use the Azure AI Foundry portal or use Azure CLI.

To use the Deployments - Delete REST API send an HTTP DELETE to the deployment resource. Like with creating deployments, you must include the following parameters:

  • Azure subscription ID
  • Azure resource group name
  • Azure OpenAI resource name
  • Name of the deployment to delete

Below is the REST API example to delete a deployment:

curl -X DELETE "https://management.azure.com/subscriptions/<SUBSCRIPTION>/resourceGroups/<RESOURCE_GROUP>/providers/Microsoft.CognitiveServices/accounts/<RESOURCE_NAME>/deployments/<MODEL_DEPLOYMENT_NAME>api-version=2024-10-21" \
  -H "Authorization: Bearer <TOKEN>"

Next steps