299 lines
7.0 KiB
Plaintext
299 lines
7.0 KiB
Plaintext
---
|
|
title: "Passthrough"
|
|
description: "Forward provider-native requests through Bifrost with full core pipeline processing, including logs and observability."
|
|
icon: "route"
|
|
---
|
|
|
|
## Overview
|
|
|
|
Passthrough integrations let you call provider-native API paths and payloads through Bifrost without route-level request/response conversion.
|
|
|
|
When you use passthrough endpoints, the request still flows through Bifrost core logic. You keep Bifrost features such as logging and observability while sending provider-native paths and bodies.
|
|
|
|
---
|
|
|
|
## Endpoints
|
|
|
|
- `/openai_passthrough`
|
|
Default provider: `openai`
|
|
- `/anthropic_passthrough`
|
|
Default provider: `anthropic`
|
|
- `/azure_passthrough`
|
|
Default provider: `azure`
|
|
- `/genai_passthrough`
|
|
Default provider: `gemini` (with automatic Vertex detection for clients configured to use Vertex)
|
|
|
|
---
|
|
|
|
## How It Works
|
|
|
|
1. Send your request to a passthrough endpoint (OpenAI, Anthropic, Azure, or GenAI passthrough).
|
|
2. The integration strips the passthrough prefix and forwards the remaining provider-native path/body.
|
|
3. Bifrost handles provider execution through core inference and plugin pipelines.
|
|
4. Response status, headers, and body are returned as passthrough output (for both stream and non-stream requests).
|
|
|
|
---
|
|
|
|
## Provider Selection Rules
|
|
|
|
### OpenAI Passthrough
|
|
|
|
- Uses `openai` as the default provider.
|
|
|
|
### Anthropic Passthrough
|
|
|
|
- Uses `anthropic` as the default provider.
|
|
|
|
### Azure Passthrough
|
|
|
|
- Uses `azure` as the default provider.
|
|
- Requires an Azure key with `endpoint` configured. `api-version` is injected automatically:
|
|
- **Key config `api_version`** takes priority (consistent with how auth is handled).
|
|
- Falls back to any `api-version` the client supplied in the query string.
|
|
|
|
### GenAI Passthrough
|
|
|
|
- Uses `gemini` by default.
|
|
- Automatically switches to `vertex` when Vertex patterns are detected, such as:
|
|
- URL path containing `/projects/{PROJECT_ID}/locations/{LOCATION}/`
|
|
- Request body `model` containing a Vertex resource path
|
|
- OAuth token pattern typically used for Vertex (`Bearer ya29...`)
|
|
|
|
---
|
|
|
|
## Usage Examples
|
|
|
|
### OpenAI Passthrough
|
|
|
|
<Tabs group="openai-passthrough">
|
|
<Tab title="Python SDK">
|
|
|
|
```python
|
|
import openai
|
|
|
|
client = openai.OpenAI(
|
|
base_url="http://localhost:8080/openai_passthrough/v1",
|
|
api_key="dummy-key"
|
|
)
|
|
|
|
response = client.chat.completions.create(
|
|
model="gpt-4o-mini",
|
|
messages=[{"role": "user", "content": "hello from passthrough"}]
|
|
)
|
|
|
|
print(response.choices[0].message.content)
|
|
```
|
|
|
|
</Tab>
|
|
<Tab title="cURL">
|
|
|
|
```bash
|
|
curl -X POST "http://localhost:8080/openai_passthrough/v1/chat/completions" \
|
|
-H "content-type: application/json" \
|
|
-H "authorization: Bearer sk-your-openai-key" \
|
|
-d '{
|
|
"model": "gpt-4o-mini",
|
|
"messages": [{"role":"user","content":"hello from passthrough"}]
|
|
}'
|
|
```
|
|
|
|
</Tab>
|
|
</Tabs>
|
|
|
|
### Anthropic Passthrough
|
|
|
|
<Tabs group="anthropic-passthrough">
|
|
<Tab title="Python SDK">
|
|
|
|
```python
|
|
import anthropic
|
|
|
|
client = anthropic.Anthropic(
|
|
base_url="http://localhost:8080/anthropic_passthrough",
|
|
api_key="dummy-key"
|
|
)
|
|
|
|
response = client.messages.create(
|
|
model="claude-sonnet-4-20250514",
|
|
max_tokens=1024,
|
|
messages=[{"role": "user", "content": "hello from passthrough"}]
|
|
)
|
|
|
|
print(response.content[0].text)
|
|
```
|
|
|
|
</Tab>
|
|
<Tab title="cURL">
|
|
|
|
```bash
|
|
curl -X POST "http://localhost:8080/anthropic_passthrough/v1/messages" \
|
|
-H "content-type: application/json" \
|
|
-H "x-api-key: your-anthropic-key" \
|
|
-H "anthropic-version: 2023-06-01" \
|
|
-d '{
|
|
"model": "claude-sonnet-4-20250514",
|
|
"max_tokens": 1024,
|
|
"messages": [{"role":"user","content":"hello from passthrough"}]
|
|
}'
|
|
```
|
|
|
|
</Tab>
|
|
</Tabs>
|
|
|
|
### Azure Passthrough
|
|
|
|
<Tabs group="azure-passthrough">
|
|
<Tab title="Azure OpenAI SDK">
|
|
|
|
```python
|
|
from openai import AzureOpenAI
|
|
|
|
client = AzureOpenAI(
|
|
azure_endpoint="http://localhost:8080/azure_passthrough",
|
|
api_key="dummy-key",
|
|
api_version="2024-10-21", # overridden by key config api_version if set
|
|
)
|
|
|
|
response = client.chat.completions.create(
|
|
model="gpt-4o", # your Azure deployment name
|
|
messages=[{"role": "user", "content": "hello from azure passthrough"}]
|
|
)
|
|
|
|
print(response.choices[0].message.content)
|
|
```
|
|
|
|
</Tab>
|
|
<Tab title="OpenAI SDK">
|
|
|
|
```python
|
|
from openai import OpenAI
|
|
|
|
client = OpenAI(
|
|
base_url="http://localhost:8080/azure_passthrough/openai/v1/",
|
|
api_key="dummy-key",
|
|
)
|
|
|
|
response = client.responses.create(
|
|
model="gpt-4.1", # your Azure deployment name
|
|
input="hello from azure passthrough",
|
|
)
|
|
|
|
print(response.output_text)
|
|
```
|
|
|
|
</Tab>
|
|
<Tab title="Anthropic SDK (Anthropic on Azure)">
|
|
|
|
```python
|
|
import anthropic
|
|
|
|
client = anthropic.Anthropic(
|
|
base_url="http://localhost:8080/azure_passthrough",
|
|
api_key="dummy-key",
|
|
)
|
|
|
|
response = client.messages.create(
|
|
model="claude-sonnet-4-20250514",
|
|
max_tokens=1024,
|
|
messages=[{"role": "user", "content": "hello from azure passthrough"}]
|
|
)
|
|
|
|
print(response.content[0].text)
|
|
```
|
|
|
|
</Tab>
|
|
<Tab title="cURL">
|
|
|
|
```bash
|
|
curl -X POST "http://localhost:8080/azure_passthrough/openai/deployments/gpt-4o/chat/completions" \
|
|
-H "content-type: application/json" \
|
|
-d '{
|
|
"messages": [{"role": "user", "content": "hello from azure passthrough"}]
|
|
}'
|
|
```
|
|
|
|
</Tab>
|
|
</Tabs>
|
|
|
|
### GenAI Passthrough (Gemini)
|
|
|
|
<Tabs group="genai-passthrough">
|
|
<Tab title="Python SDK">
|
|
|
|
```python
|
|
from google import genai
|
|
from google.genai.types import HttpOptions
|
|
|
|
client = genai.Client(
|
|
api_key="dummy-key",
|
|
http_options=HttpOptions(base_url="http://localhost:8080/genai_passthrough")
|
|
)
|
|
|
|
response = client.models.generate_content(
|
|
model="gemini-2.5-flash",
|
|
contents="hello from passthrough"
|
|
)
|
|
|
|
print(response.text)
|
|
```
|
|
|
|
</Tab>
|
|
<Tab title="cURL">
|
|
|
|
```bash
|
|
curl -X POST "http://localhost:8080/genai_passthrough/v1beta/models/gemini-2.5-flash:generateContent" \
|
|
-H "content-type: application/json" \
|
|
-H "x-goog-api-key: your-gemini-key" \
|
|
-d '{
|
|
"contents":[{"parts":[{"text":"hello from passthrough"}]}]
|
|
}'
|
|
```
|
|
|
|
</Tab>
|
|
</Tabs>
|
|
|
|
### GenAI Passthrough (Vertex-style request)
|
|
|
|
<Tabs group="vertex-passthrough">
|
|
<Tab title="Python SDK">
|
|
|
|
```python
|
|
from google import genai
|
|
from google.genai.types import HttpOptions
|
|
|
|
client = genai.Client(
|
|
vertexai=True,
|
|
api_key="dummy-key",
|
|
http_options=HttpOptions(base_url="http://localhost:8080/genai_passthrough")
|
|
)
|
|
|
|
response = client.models.generate_content(
|
|
model="gemini-2.5-flash",
|
|
contents="hello from vertex passthrough"
|
|
)
|
|
|
|
print(response.text)
|
|
```
|
|
|
|
</Tab>
|
|
<Tab title="cURL">
|
|
|
|
```bash
|
|
curl -X POST "http://localhost:8080/genai_passthrough/v1/projects/my-project/locations/us-central1/publishers/google/models/gemini-2.5-flash:generateContent" \
|
|
-H "content-type: application/json" \
|
|
-H "authorization: Bearer ya29.your-vertex-token" \
|
|
-d '{
|
|
"contents":[{"parts":[{"text":"hello from vertex passthrough"}]}]
|
|
}'
|
|
```
|
|
|
|
</Tab>
|
|
</Tabs>
|
|
|
|
---
|
|
|
|
## Notes
|
|
|
|
- Use passthrough when you need a provider endpoint that is not directly supported by Bifrost integration routes yet.
|
|
- For Azure passthrough, auth headers (`api-key`, `x-api-key`, OAuth token) are always sourced from the Bifrost key config and never forwarded from the client request.
|