Files
bifrost/docs/enterprise/invpc-deployments.mdx
Beyhan Oğur 880f412e2c first commit
2026-04-26 21:52:23 +03:00

109 lines
4.2 KiB
Plaintext
Raw Permalink Blame History

This file contains ambiguous Unicode characters
This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
---
title: "In-VPC Deployments"
description: "Deploy Bifrost within your private cloud infrastructure with VPC isolation, custom networking, and enhanced security controls for enterprise environments."
icon: "cloud"
---
In-VPC (Virtual Private Cloud) deployments allow you to run Bifrost entirely within your private cloud infrastructure, providing maximum security, compliance, and control over your AI gateway deployment.
## Supported Cloud Providers
Bifrost supports INVPC deployments across all major cloud providers:
<div className="grid grid-cols-2 md:grid-cols-3 gap-4 my-6">
<div className="flex items-center gap-2 p-3 border rounded-lg">
<span>Google Cloud Platform</span>
</div>
<div className="flex items-center gap-2 p-3 border rounded-lg">
<span>Amazon Web Services</span>
</div>
<div className="flex items-center gap-2 p-3 border rounded-lg">
<span>Microsoft Azure</span>
</div>
<div className="flex items-center gap-2 p-3 border rounded-lg">
<span>Cloudflare</span>
</div>
<div className="flex items-center gap-2 p-3 border rounded-lg">
<span>Vercel</span>
</div>
</div>
## Architecture Benefits
### Security & Compliance
- **Network Isolation**: Complete isolation within your VPC with no external network dependencies
- **Data Sovereignty**: All data processing occurs within your controlled environment
- **Compliance Ready**: Meets requirements for HIPAA, SOC2, GDPR, and other regulatory frameworks
- **Zero Trust Architecture**: Implements principle of least privilege with granular access controls
### Performance & Reliability
- **Low Latency**: Direct communication between services within your network
- **High Availability**: Multi-zone deployment with automatic failover capabilities
- **Guaranteed Uptime**: 99.95% SLA with comprehensive monitoring and alerting
### Control & Customization
- **Custom Networking**: Configure subnets, routing, and security groups to your specifications
- **Resource Management**: Full control over compute, storage, and network resources
- **Scaling Policies**: Define auto-scaling rules based on your usage patterns
## Service Level Agreement
### Availability Commitment
- **Uptime Guarantee**: 99.95% monthly uptime for all core components
- **Downtime Calculation**: `(Total Minutes - Downtime Minutes) / Total Minutes × 100`
- **Partial Downtime**: Reduced functionality counted as 50% downtime
### Core Components Covered
The following components are monitored for SLA compliance:
- Gateway instance
- Log ingestion pipeline
### Exclusions
SLA excludes downtime due to:
- Scheduled maintenance (14-day advance notice)
- Downstream provider incidents
- Client hardware/software/network issues
- Third-party AI provider outages
- Client misuse or unauthorized modifications
## Support & Maintenance
### Technical Support
- **24/7 Critical Support**: Available for core component issues
- **Multiple Channels**: Platform, email (contact@getmaxim.ai), or Slack Connect
- **Audit Trail**: Detailed logs for any data access during troubleshooting
### Maintenance Windows
- **Scheduled Maintenance**: 14-day advance notice for major updates
- **Security Patches**: Immediate or 14-day delayed application (your choice)
- **Continuous Updates**: Regular feature improvements with 7-day advance notice
## Getting Started
### Prerequisites
- VPC with appropriate CIDR ranges
- Kubernetes cluster (GKE, EKS, or AKS)
- Container registry access
- DNS configuration for internal routing
### Deployment Process
1. **Infrastructure Setup**: Configure VPC, subnets, and security groups
2. **Cluster Preparation**: Set up Kubernetes cluster with required permissions
3. **Bifrost Installation**: Deploy using provided Helm charts or manifests
4. **Configuration**: Apply your specific settings and integrations
5. **Validation**: Run connectivity and performance tests
6. **Go Live**: Begin routing production traffic
## Cost Optimization
### Resource Sizing
- **Development**: 2 vCPU, 4GB RAM minimum
- **Production**: 4+ vCPU, 8GB+ RAM recommended
- **High Availability**: Multi-zone deployment with load balancing
### Scaling Strategies
- **Horizontal Pod Autoscaling**: Based on CPU/memory utilization
- **Vertical Pod Autoscaling**: Automatic resource adjustment
- **Cluster Autoscaling**: Node pool expansion/contraction