---
title: Azure API Management policy reference - azure-openai-semantic-cache-store
description: Reference for the azure-openai-semantic-cache-store policy available for use in Azure API Management. Provides policy usage, settings, and examples.
services: api-management
author: dlepow
ms.service: azure-api-management
ms.collection: ce-skilling-ai-copilot
ms.custom:
ms.topic: reference
ms.date: 12/13/2024
ms.author: danlep
---
[!INCLUDE api-management-availability-all-tiers]
The `azure-openai-semantic-cache-store` policy caches responses to Azure OpenAI Chat Completion API requests in a configured external cache. Response caching reduces bandwidth and processing requirements on the backend Azure OpenAI API and lowers latency perceived by API consumers.
> [!NOTE]
> - This policy must have a corresponding Get cached responses to Azure OpenAI API requests (`azure-openai-semantic-cache-lookup`) policy.
> - For prerequisites and steps to enable semantic caching, see Enable semantic caching for Azure OpenAI APIs in Azure API Management.
> - Currently, this policy is in preview.
[!INCLUDE api-management-policy-generic-alert]
[!INCLUDE api-management-azure-openai-models]
```xml
<azure-openai-semantic-cache-store duration="seconds" />
```
| Attribute | Description | Required | Default |
|-----------|-------------|----------|---------|
| duration | Time-to-live of the cached entries, specified in seconds. Policy expressions are allowed. | Yes | N/A |
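Because policy expressions are allowed for `duration`, the TTL doesn't have to be hard-coded. As a sketch, the value could be read from a context variable set earlier in the pipeline; the variable name `cache-ttl` and the fallback of `120` seconds are assumptions for illustration, not values from this article:

```xml
<!-- TTL taken from a context variable (assumed to be set in an earlier policy),
     falling back to 120 seconds when the variable is absent. -->
<azure-openai-semantic-cache-store duration="@(context.Variables.GetValueOrDefault<string>("cache-ttl", "120"))" />
```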
- Policy sections: outbound
- Policy scopes: global, product, API, operation
- Gateways: classic, v2, consumption
- This policy can only be used once in a policy section.
- If the cache lookup fails, the API call that uses the cache-related operation doesn't raise an error, and the cache operation completes successfully.
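To illustrate the pairing requirement noted above, here is a minimal policy sketch that looks up the semantic cache in the inbound section and stores responses for 60 seconds in the outbound section. The backend ID `embeddings-backend`, the authentication mode, and the score threshold are assumptions for illustration, not values prescribed by this article:

```xml
<policies>
    <inbound>
        <base />
        <!-- Look up semantically similar cached responses before calling the backend.
             Backend ID, auth mode, and threshold below are illustrative assumptions. -->
        <azure-openai-semantic-cache-lookup
            score-threshold="0.05"
            embeddings-backend-id="embeddings-backend"
            embeddings-backend-auth="system-assigned" />
    </inbound>
    <outbound>
        <!-- Store the backend response in the semantic cache for 60 seconds. -->
        <azure-openai-semantic-cache-store duration="60" />
        <base />
    </outbound>
</policies>
```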
[!INCLUDE api-management-semantic-cache-example]
[!INCLUDE api-management-policy-ref-next-steps]