Skip to content

microsoft/amplifier-module-provider-chat-completions

Repository files navigation

amplifier-module-provider-chat-completions

Amplifier provider module for any server implementing the OpenAI Chat Completions wire format (/v1/chat/completions).

Unlike amplifier-module-provider-openai (which targets the OpenAI Responses API on api.openai.com) and amplifier-module-provider-azure-openai (which extends it for Azure-hosted deployments), this provider is deliberately endpoint-agnostic. Point it at any Chat Completions-compatible server with a base_url and you are done.

Tested against:

  • llama-server (llama.cpp)
  • vLLM (Chat Completions mode)
  • SGLang
  • LocalAI
  • LM Studio
  • text-generation-inference
  • Any other server speaking the OpenAI Chat Completions wire format

Capabilities

  • Tool calling (including parallel tool calls, configurable)
  • Streaming and non-streaming modes
  • JSON mode
  • Automatic retries with exponential backoff (amplifier-core managed, SDK retries disabled)
  • JIT tool-sequence repair for local models that emit non-compliant conversation history
  • Per-instance naming so multiple chat-completions providers can be mounted in the same session with distinct routing identities

Configuration

Key Type Default Env Var
base_url text (required) CHAT_COMPLETIONS_BASE_URL
api_key secret not-needed CHAT_COMPLETIONS_API_KEY
default_model / model text default
max_tokens int 4096
temperature float 0.7
timeout float 300.0
max_retries int 3
min_retry_delay float 1.0
max_retry_delay float 30.0
use_streaming bool true
top_p float None
stop list[str] None
seed int None
parallel_tool_calls bool true
priority int 100
filtered bool true
raw bool false

Silent-skip behavior

If base_url is not configured in either the module config or CHAT_COMPLETIONS_BASE_URL, mount() returns None and logs an info-level message rather than raising. This allows the provider to be pre-installed and always available without forcing every user to configure it.

Installation

This module is pre-installed by amplifier-app-cli alongside the standard providers. To use it, install Amplifier and configure the provider interactively:

uv tool install git+https://github.com/microsoft/amplifier@main
amplifier provider add
# Select "OpenAI-Compatible (self-hosted)" and supply base_url + model name.

Installing standalone

amplifier module add provider-chat-completions \
  --source git+https://github.com/microsoft/amplifier-module-provider-chat-completions@main

Installing for development

git clone https://github.com/microsoft/amplifier-module-provider-chat-completions.git
cd amplifier-module-provider-chat-completions
uv sync

Related

Contributing

Note

This project is not currently accepting external contributions, but we're actively working toward opening this up. We value community input and look forward to collaborating in the future. For now, feel free to fork and experiment!

Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit Contributor License Agreements.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.

About

Chat completions provider module for the Amplifier project

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages