first commit

2026-04-26 21:52:23 +03:00
commit 880f412e2c
2662 changed files with 866266 additions and 0 deletions
--- a/docs/integrations/vector-databases/pinecone.mdx
+++ b/docs/integrations/vector-databases/pinecone.mdx
@@ -0,0 +1,97 @@
+---
+title: "Pinecone"
+description: "Pinecone vector database integration for semantic caching in Bifrost."
+icon: "database"
+---
+
+## Pinecone
+
+[Pinecone](https://www.pinecone.io/) is a managed vector database service designed for machine learning applications, offering both serverless and pod-based deployment options.
+
+### Key Features
+
+- **Managed Service**: Fully managed with no infrastructure to maintain
+- **Serverless Option**: Pay-per-use pricing with automatic scaling
+- **High Performance**: Optimized for low-latency vector search
+- **Metadata Filtering**: Advanced filtering on vector metadata
+- **Namespaces**: Organize vectors into separate namespaces within an index
+
+### Setup & Installation
+
+**Pinecone Cloud:**
+- Sign up at [pinecone.io](https://www.pinecone.io/)
+- Create a new index with the desired dimensions
+- Get your API key and index host URL from the console
+
+**Local Development (Pinecone Local):**
+```bash
+docker run -d \
+  --name pinecone-local \
+  -p 5081:5081 \
+  ghcr.io/pinecone-io/pinecone-index:latest
+```
+
+### Configuration Options
+
+<Tabs group="pinecone-config">
+
+<Tab title="Go SDK">
+
+```go
+vectorConfig := &vectorstore.Config{
+    Enabled: true,
+    Type:    vectorstore.VectorStoreTypePinecone,
+    Config: vectorstore.PineconeConfig{
+        APIKey:    "your-pinecone-api-key",
+        IndexHost: "your-index-host.svc.environment.pinecone.io",
+    },
+}
+
+store, err := vectorstore.NewVectorStore(context.Background(), vectorConfig, logger)
+```
+
+</Tab>
+
+<Tab title="config.json">
+
+**Cloud Setup:**
+```json
+{
+  "vector_store": {
+    "enabled": true,
+    "type": "pinecone",
+    "config": {
+      "api_key": "your-pinecone-api-key",
+      "index_host": "your-index-host.svc.environment.pinecone.io"
+    }
+  }
+}
+```
+
+**Local Development:**
+```json
+{
+  "vector_store": {
+    "enabled": true,
+    "type": "pinecone",
+    "config": {
+      "api_key": "pclocal",
+      "index_host": "localhost:5081"
+    }
+  }
+}
+```
+
+</Tab>
+
+</Tabs>
+
+<Note>
+For local development with Pinecone Local, any API key value works (e.g., "pclocal"). The index host should point to localhost:5081 by default.
+</Note>
+
+<Warning>
+Pinecone requires all IDs to be unique strings. Namespaces are created automatically when you first upsert vectors.
+</Warning>
+
+For the VectorStore interface API and usage examples, see [Vector Store Architecture](/architecture/framework/vector-store). For semantic caching setup, see [Semantic Caching](/features/semantic-caching).
--- a/docs/integrations/vector-databases/qdrant.mdx
+++ b/docs/integrations/vector-databases/qdrant.mdx
@@ -0,0 +1,94 @@
+---
+title: "Qdrant"
+description: "Qdrant vector database integration for semantic caching in Bifrost."
+icon: "database"
+---
+
+## Qdrant
+
+[Qdrant](https://qdrant.tech/) is a high-performance vector search engine built in Rust.
+
+### Setup & Installation
+
+**Local Qdrant:**
+```bash
+# Using Docker
+docker run -d \
+  --name qdrant \
+  -p 6333:6333 \
+  -p 6334:6334 \
+  -v $(pwd)/qdrant_storage:/qdrant/storage \
+  qdrant/qdrant:latest
+```
+
+**Qdrant Cloud:**
+Sign up at [cloud.qdrant.io](https://cloud.qdrant.io)
+
+### Configuration Options
+
+<Tabs group="qdrant-config">
+
+<Tab title="Go SDK">
+
+```go
+vectorConfig := &vectorstore.Config{
+    Enabled: true,
+    Type:    vectorstore.VectorStoreTypeQdrant,
+    Config: vectorstore.QdrantConfig{
+        Host:   "localhost",
+        Port:   6334,
+        APIKey: "",
+        UseTLS: false,
+    },
+}
+
+store, err := vectorstore.NewVectorStore(context.Background(), vectorConfig, logger)
+```
+
+</Tab>
+
+<Tab title="config.json">
+
+**Local Setup:**
+```json
+{
+  "vector_store": {
+    "enabled": true,
+    "type": "qdrant",
+    "config": {
+      "host": "localhost",
+      "port": 6334
+    }
+  }
+}
+```
+
+**Cloud Setup:**
+```json
+{
+  "vector_store": {
+    "enabled": true,
+    "type": "qdrant",
+    "config": {
+      "host": "your-qdrant-cluster.cloud.qdrant.io",
+      "port": 6334,
+      "api_key": "your-qdrant-api-key",
+      "use_tls": true
+    }
+  }
+}
+```
+
+</Tab>
+
+</Tabs>
+
+<Note>
+Qdrant uses port 6334 for gRPC and port 6333 for REST. Bifrost uses the gRPC port.
+</Note>
+
+<Warning>
+Qdrant requires all IDs to be valid UUIDs. Use `uuid.New().String()` to generate IDs.
+</Warning>
+
+For the VectorStore interface API and usage examples, see [Vector Store Architecture](/architecture/framework/vector-store). For semantic caching setup, see [Semantic Caching](/features/semantic-caching).
--- a/docs/integrations/vector-databases/redis.mdx
+++ b/docs/integrations/vector-databases/redis.mdx
@@ -0,0 +1,241 @@
+---
+title: "Redis / Valkey"
+description: "Redis and Valkey vector store integration for semantic caching in Bifrost."
+icon: "database"
+---
+
+## Redis
+
+Redis provides high-performance in-memory vector storage using RediSearch-compatible APIs, ideal for applications requiring sub-millisecond response times and fast semantic search capabilities. Valkey deployments that expose compatible `FT.*` commands are supported through the same configuration.
+
+### Key Features
+
+- **High Performance**: Sub-millisecond cache retrieval with Redis's in-memory storage
+- **Cost Effective**: Open-source solution with no licensing costs
+- **HNSW Algorithm**: Fast vector similarity search with excellent recall rates
+- **Connection Pooling**: Advanced connection management for high-throughput applications
+- **TTL Support**: Automatic expiration of cached entries
+- **Streaming Support**: Full streaming response caching with proper chunk ordering
+- **Flexible Filtering**: Advanced metadata filtering with exact string matching
+
+### Setup & Installation
+
+**Redis Cloud:**
+- Sign up at [cloud.redis.io](https://cloud.redis.io)
+- Create a new database with RediSearch module enabled
+- Get your connection details
+
+**Local Redis with RediSearch:**
+```bash
+# Using Docker with Redis Stack (includes RediSearch)
+docker run -d --name redis-stack -p 6379:6379 redis/redis-stack:latest
+```
+
+**Local Valkey Bundle:**
+```bash
+# Example Valkey bundle with search/vector support
+docker run -d --name valkey-bundle -p 6379:6379 valkey/valkey-bundle:9.0.0
+```
+
+### Configuration Options
+
+<Tabs group="redis-config">
+
+<Tab title="Go SDK">
+
+```go
+// Configure Redis-compatible vector store (Redis or Valkey endpoint)
+vectorConfig := &vectorstore.Config{
+    Enabled: true,
+    Type:    vectorstore.VectorStoreTypeRedis, // Keep type as "redis" for Valkey too
+    Config: vectorstore.RedisConfig{
+        Addr:     "localhost:6379",        // Redis/Valkey server address - REQUIRED
+        Username: "",                      // Optional: Redis username
+        Password: "",                      // Optional: Redis password
+        DB:       0,                       // Optional: Redis database number (default: 0)
+
+        // Optional: TLS and cluster settings
+        UseTLS:             false,             // Enable TLS for encrypted connections
+        InsecureSkipVerify: false,             // Skip TLS cert verification
+        ClusterMode:        false,             // Use Redis Cluster client for cluster endpoints
+
+        // Optional: Connection pool settings
+        PoolSize:        10,               // Maximum socket connections
+        MaxActiveConns:  10,               // Maximum active connections
+        MinIdleConns:    5,                // Minimum idle connections
+        MaxIdleConns:    10,               // Maximum idle connections
+
+        // Optional: Timeout settings
+        DialTimeout:     5 * time.Second,  // Connection timeout
+        ReadTimeout:     3 * time.Second,  // Read timeout
+        WriteTimeout:    3 * time.Second,  // Write timeout
+        ContextTimeout:  10 * time.Second, // Operation timeout
+    },
+}
+
+// Create vector store
+store, err := vectorstore.NewVectorStore(context.Background(), vectorConfig, logger)
+if err != nil {
+    log.Fatal("Failed to create vector store:", err)
+}
+```
+
+</Tab>
+
+<Tab title="config.json">
+
+```json
+{
+  "vector_store": {
+    "enabled": true,
+    "type": "redis",
+    "config": {
+      "addr": "localhost:6379",
+      "username": "",
+      "password": "",
+      "db": 0,
+      "use_tls": false,
+      "insecure_skip_verify": false,
+      "ca_cert_pem": "",
+      "cluster_mode": false,
+      "pool_size": 10,
+      "max_active_conns": 10,
+      "min_idle_conns": 5,
+      "max_idle_conns": 10,
+      "dial_timeout": "5s",
+      "read_timeout": "3s",
+      "write_timeout": "3s",
+      "context_timeout": "10s"
+    }
+  }
+}
+```
+
+**For Redis Cloud or Valkey service endpoints:**
+```json
+{
+  "vector_store": {
+    "enabled": true,
+    "type": "redis",
+    "config": {
+      "addr": "your-redis-host:port",
+      "username": "your-username",
+      "password": "your-password",
+      "db": 0,
+      "use_tls": true,
+      "ca_cert_pem": "-----BEGIN CERTIFICATE-----\n...\n-----END CERTIFICATE-----",
+      "cluster_mode": false,
+      "context_timeout": "10s"
+    }
+  }
+}
+```
+
+**For managed Redis Cluster endpoints:**
+```json
+{
+  "vector_store": {
+    "enabled": true,
+    "type": "redis",
+    "config": {
+      "addr": "your-cluster-endpoint:6379",
+      "username": "your-username",
+      "password": "your-password",
+      "db": 0,
+      "use_tls": true,
+      "ca_cert_pem": "-----BEGIN CERTIFICATE-----\n...\n-----END CERTIFICATE-----",
+      "cluster_mode": true,
+      "context_timeout": "10s"
+    }
+  }
+}
+```
+
+</Tab>
+
+</Tabs>
+
+### Redis-Specific Features
+
+**Vector Search Algorithm:**
+Redis uses the **HNSW (Hierarchical Navigable Small World)** algorithm for vector similarity search, which provides:
+
+- **Fast Search**: O(log N) search complexity
+- **High Accuracy**: Excellent recall rates for similarity search
+- **Memory Efficient**: Optimized for in-memory operations
+- **Cosine Similarity**: Uses cosine distance metric for semantic similarity
+
+**Connection Pool Management:**
+Redis provides extensive connection pool configuration:
+
+```go
+config := vectorstore.RedisConfig{
+    Addr:            "localhost:6379",
+    UseTLS:          true,                     // Enable TLS
+    ClusterMode:     true,                     // Enable cluster mode
+    PoolSize:        20,                    // Max socket connections
+    MaxActiveConns:  20,                    // Max active connections
+    MinIdleConns:    5,                     // Min idle connections
+    MaxIdleConns:    10,                    // Max idle connections
+    ConnMaxLifetime: 30 * time.Minute,      // Connection lifetime
+    ConnMaxIdleTime: 5 * time.Minute,       // Idle connection timeout
+    DialTimeout:     5 * time.Second,       // Connection timeout
+    ReadTimeout:     3 * time.Second,       // Read timeout
+    WriteTimeout:    3 * time.Second,       // Write timeout
+    ContextTimeout:  10 * time.Second,      // Operation timeout
+}
+```
+
+### Performance Optimization
+
+**Connection Pool Tuning:**
+For high-throughput applications, tune the connection pool settings:
+
+```json
+{
+  "vector_store": {
+    "config": {
+      "pool_size": 50,           // Increase for high concurrency
+      "max_active_conns": 50,    // Match pool_size
+      "min_idle_conns": 10,      // Keep connections warm
+      "max_idle_conns": 20,      // Allow some idle connections
+      "conn_max_lifetime": "1h", // Refresh connections periodically
+      "conn_max_idle_time": "10m" // Close idle connections
+    }
+  }
+}
+```
+
+**Memory Optimization:**
+- **TTL**: Use appropriate TTL values to prevent memory bloat
+- **Namespace Cleanup**: Regularly clean up unused namespaces
+
+**Batch Operations:**
+Redis supports efficient batch operations:
+
+```go
+// Batch retrieval
+results, err := store.GetChunks(ctx, namespace, []string{"id1", "id2", "id3"})
+
+// Batch deletion
+deleteResults, err := store.DeleteAll(ctx, namespace, queries)
+```
+
+### Production Considerations
+
+<Info>
+**TLS and Cluster Mode**: Set `use_tls: true` to enable TLS encryption for the Redis connection, and `insecure_skip_verify: true` if using self-signed certificates. Set `cluster_mode: true` when connecting to a Redis Cluster endpoint. When cluster mode is enabled, the `db` field must be `0` (Redis Cluster does not support database selection).
+</Info>
+
+<Info>
+**Search Module Required**: Redis/Valkey integration requires a search module/API that supports `FT.*` commands (index creation and vector search). If `FT.INFO` or `FT.SEARCH` is unavailable, semantic caching will not work.
+</Info>
+
+<Warning>
+**Production Considerations**:
+- Use Redis AUTH for production deployments
+- Configure appropriate connection timeouts
+- Monitor memory usage and set appropriate TTL values
+</Warning>
+
+For the VectorStore interface API and usage examples, see [Vector Store Architecture](/architecture/framework/vector-store). For semantic caching setup, see [Semantic Caching](/features/semantic-caching).
--- a/docs/integrations/vector-databases/weaviate.mdx
+++ b/docs/integrations/vector-databases/weaviate.mdx
@@ -0,0 +1,146 @@
+---
+title: "Weaviate"
+description: "Weaviate vector database integration for semantic caching in Bifrost."
+icon: "database"
+---
+
+## Weaviate
+
+Weaviate is a production-ready vector database solution that provides advanced querying capabilities, gRPC support for high performance, and flexible schema management for production deployments.
+
+### Key Features
+
+- **gRPC Support**: Enhanced performance with gRPC connections
+- **Advanced Filtering**: Complex query operations with multiple conditions
+- **Schema Management**: Flexible schema definition for different data types
+- **Cloud & Self-Hosted**: Support for both Weaviate Cloud and self-hosted deployments
+- **Scalable Storage**: Handle millions of vectors with efficient indexing
+
+### Setup & Installation
+
+**Weaviate Cloud:**
+- Sign up at [cloud.weaviate.io](https://cloud.weaviate.io)
+- Create a new cluster
+- Get your API key and cluster URL
+
+**Local Weaviate:**
+```bash
+# Using Docker
+docker run -d \
+  --name weaviate \
+  -p 8080:8080 \
+  -e QUERY_DEFAULTS_LIMIT=25 \
+  -e AUTHENTICATION_ANONYMOUS_ACCESS_ENABLED='true' \
+  -e PERSISTENCE_DATA_PATH='/var/lib/weaviate' \
+  semitechnologies/weaviate:latest
+```
+
+### Configuration Options
+
+<Tabs group="weaviate-config">
+
+<Tab title="Go SDK">
+
+```go
+// Configure Weaviate vector store
+vectorConfig := &vectorstore.Config{
+    Enabled: true,
+    Type:    vectorstore.VectorStoreTypeWeaviate,
+    Config: vectorstore.WeaviateConfig{
+        Scheme: "http",           // "http" for local, "https" for cloud
+        Host:   "localhost:8080", // Your Weaviate host
+        APIKey: "your-weaviate-api-key", // Required for Weaviate Cloud; optional for local/self-hosted
+
+        // Enable gRPC for improved performance (optional)
+        GrpcConfig: &vectorstore.WeaviateGrpcConfig{
+            Host:    "localhost:50051", // gRPC port
+            Secured: false,             // true for TLS
+        },
+    },
+}
+
+// Create vector store
+store, err := vectorstore.NewVectorStore(context.Background(), vectorConfig, logger)
+if err != nil {
+    log.Fatal("Failed to create vector store:", err)
+}
+```
+
+</Tab>
+
+<Tab title="config.json">
+
+**Local Setup:**
+```json
+{
+  "vector_store": {
+    "enabled": true,
+    "type": "weaviate",
+    "config": {
+      "scheme": "http",
+      "host": "localhost:8080"
+    }
+  }
+}
+```
+
+**Cloud Setup with gRPC:**
+```json
+{
+  "vector_store": {
+    "enabled": true,
+    "type": "weaviate",
+    "config": {
+      "scheme": "https",
+      "host": "your-weaviate-host",
+      "api_key": "your-weaviate-api-key",
+      "grpc_config": {
+        "host": "your-weaviate-grpc-host",
+        "secured": true
+      }
+    }
+  }
+}
+```
+
+</Tab>
+
+</Tabs>
+
+<Note>
+gRPC host should include the port. If no port is specified, port 80 is used for insecured connections and port 443 for secured connections.
+</Note>
+
+### Advanced Features
+
+**gRPC Performance Optimization:**
+Enable gRPC for better performance in production:
+
+```go
+vectorConfig := &vectorstore.Config{
+    Type: vectorstore.VectorStoreTypeWeaviate,
+    Config: vectorstore.WeaviateConfig{
+        Scheme: "https",
+        Host:   "your-weaviate-host",
+        APIKey: "your-api-key",
+
+        // Enable gRPC for better performance
+        GrpcConfig: &vectorstore.WeaviateGrpcConfig{
+            Host:    "your-weaviate-grpc-host:443",
+            Secured: true,
+        },
+    },
+}
+```
+
+### Production Considerations
+
+<Info>
+**Performance**: For production environments, consider using gRPC configuration for better performance and enable appropriate authentication mechanisms for your Weaviate deployment.
+</Info>
+
+<Warning>
+**Authentication**: Always use API keys for Weaviate Cloud deployments and configure proper authentication for self-hosted instances in production.
+</Warning>
+
+For the VectorStore interface API and usage examples, see [Vector Store Architecture](/architecture/framework/vector-store). For semantic caching setup, see [Semantic Caching](/features/semantic-caching).