Deployment Documentation
This section provides comprehensive deployment documentation for QDrant Loader, covering production deployment strategies, environment setup, monitoring, and performance optimization. All examples are verified against the actual implementation.
🎯 Deployment Overview
QDrant Loader can be deployed in various environments and configurations to meet different scale and reliability requirements:
🚀 Deployment Options
QDrant Loader supports the following deployment patterns:
- Local Installation - Direct Python package installation for development and small-scale use
- PyPI Package Deployment - Official package distribution via PyPI
- Workspace-Based Deployment - Organized multi-project configurations
- MCP Server Deployment - Optional server component for AI assistant integration
🏗️ Architecture Patterns
┌─────────────────────────────────────────────────────────────┐
│ QDrant Loader Deployment │
├─────────────────────────────────────────────────────────────┤
│ CLI Tool │ MCP Server (Optional) │
│ ┌───────────────┐ │ ┌─────────────────────────────────┐ │
│ │ qdrant-loader │ │ │ mcp-qdrant-loader │ │
│ │ │ │ │ (AI Assistant Integration) │ │
│ │ - init │ │ │ │ │
│ │ - ingest │ │ │ - semantic_search │ │
│ │ - config │ │ │ - hierarchy_search │ │
│ │ - project │ │ │ - attachment_search │ │
│ └───────────────┘ │ └─────────────────────────────────┘ │
├─────────────────────────────────────────────────────────────┤
│ External Dependencies │
│ ┌───────────────┐ │ ┌─────────────────────────────────┐ │
│ │ QDrant │ │ │ OpenAI API │ │
│ │ Vector DB │ │ │ (Embeddings) │ │
│ └───────────────┘ │ └─────────────────────────────────┘ │
└─────────────────────────────────────────────────────────────┘
🚀 Quick Start Deployment
Single Server Setup
# Create deployment directory
mkdir qdrant-loader-deployment
cd qdrant-loader-deployment
# Create virtual environment
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
# Install QDrant Loader (and optional MCP server)
pip install qdrant-loader
pip install qdrant-loader-mcp-server  # optional
# Create workspace structure
mkdir -p {data,logs}
# Create configuration files
cat > config.yaml << EOF
global:
  qdrant:
    url: "http://localhost:6333"
    collection_name: "documents"
  llm:
    provider: "openai"
    base_url: "https://api.openai.com/v1"
    api_key: "${LLM_API_KEY}"
    models:
      embeddings: "text-embedding-3-small"
      chat: "gpt-4o-mini"
  state_management:
    state_db_path: "./data/state.db"
projects:
  docs:
    display_name: "Documentation"
    sources:
      git:
        main-docs:
          base_url: "https://github.com/company/docs"
          branch: "main"
          token: "${REPO_TOKEN}"
EOF
# Create environment file
cat > .env << EOF
QDRANT_URL=http://localhost:6333
QDRANT_COLLECTION_NAME=documents
LLM_API_KEY=your-openai-key
OPENAI_API_KEY=your-openai-key  # Legacy support
REPO_TOKEN=your-github-token
EOF
# Initialize and start
qdrant-loader init --workspace .
qdrant-loader ingest --workspace .
Production Environment Setup
# Create production user
sudo useradd -m -s /bin/bash qdrant-loader
sudo su - qdrant-loader
# Setup application directory
mkdir -p /opt/qdrant-loader/{data,logs}
cd /opt/qdrant-loader
# Install Python and dependencies
python -m venv venv
source venv/bin/activate
pip install qdrant-loader qdrant-loader-mcp-server
# Setup configuration (see Configuration section below)
# Edit config.yaml and .env with your settings
# Initialize workspace
qdrant-loader init --workspace /opt/qdrant-loader
🖥️ Environment Setup
System Requirements
Minimum Requirements
- CPU: 2 cores
- RAM: 4 GB
- Storage: 10 GB available space
- Python: 3.12 or higher
- Network: Internet access for API calls
Recommended Requirements
- CPU: 4+ cores
- RAM: 8+ GB
- Storage: 50+ GB SSD
- Python: 3.12+
- Network: High-speed internet connection
Operating System Support
| OS | Support Level | Notes | 
|---|---|---|
| Ubuntu 20.04+ | ✅ Fully Supported | Recommended for production | 
| CentOS 8+ | ✅ Fully Supported | Enterprise environments | 
| macOS 12+ | ✅ Fully Supported | Development and testing | 
| Windows 10+ | ✅ Fully Supported | Development environments | 
Dependencies
System Dependencies
# Ubuntu/Debian
sudo apt update
sudo apt install -y python3.12 python3.12-venv python3.12-dev git curl
# CentOS/RHEL
sudo yum install -y python3.12 python3.12-venv python3.12-devel git curl
# macOS (with Homebrew)
brew install python@3.12 git curl
Python Dependencies
# Core dependencies are automatically installed
pip install qdrant-loader qdrant-loader-mcp-server
# Optional development dependencies
pip install qdrant-loader[dev] qdrant-loader-mcp-server[dev]
QDrant Database Setup
Local QDrant Installation
# Using Docker (recommended)
docker run -p 6333:6333 -p 6334:6334 \ -v $(pwd)/qdrant_storage:/qdrant/storage:z \ qdrant/qdrant
# Using binary installation
wget https://github.com/qdrant/qdrant/releases/latest/download/qdrant-x86_64-unknown-linux-gnu.tar.gz
tar xzf qdrant-x86_64-unknown-linux-gnu.tar.gz
./qdrant
Cloud QDrant Setup
# QDrant Cloud configuration
export QDRANT_URL="https://your-cluster.qdrant.io"
export QDRANT_API_KEY="your-api-key"
export QDRANT_COLLECTION_NAME="documents"
🔧 Configuration Management
Environment Variables
# Production environment variables
cat > /opt/qdrant-loader/.env << EOF
# QDrant Configuration
QDRANT_URL=http://localhost:6333
QDRANT_COLLECTION_NAME=documents
QDRANT_API_KEY=your-api-key
# LLM Configuration
LLM_API_KEY=your-openai-api-key
OPENAI_API_KEY=your-openai-api-key  # Legacy support
# Data Source Credentials
REPO_TOKEN=your-github-token
CONFLUENCE_TOKEN=your-confluence-token
CONFLUENCE_EMAIL=your-email@domain.com
JIRA_TOKEN=your-jira-token
JIRA_EMAIL=your-email@domain.com
# Application Settings
STATE_DB_PATH=./data/state.db
EOF
Configuration File
# /opt/qdrant-loader/config.yaml
global:
  qdrant:
    url: "${QDRANT_URL}"
    api_key: "${QDRANT_API_KEY}"
    collection_name: "${QDRANT_COLLECTION_NAME}"
  llm:
    provider: "openai"
    base_url: "https://api.openai.com/v1"
    api_key: "${LLM_API_KEY}"
    models:
      embeddings: "text-embedding-3-small"
      chat: "gpt-4o-mini"
  state_management:
    state_db_path: "${STATE_DB_PATH}"
  chunking:
    chunk_size: 1200
    chunk_overlap: 300
  file_conversion:
    max_file_size: "100MB"
    conversion_timeout: 300
projects:
  production:
    project_id: "production"
    display_name: "Production Documentation"
    description: "Production documentation and knowledge base"
    sources:
      git:
        docs-repo:
          source_type: "git"
          source: "docs-repo"
          base_url: "https://github.com/company/docs"
          branch: "main"
          token: "${REPO_TOKEN}"
          include_paths:
            - "**/*.md"
            - "**/*.rst"
      confluence:
        company-wiki:
          source_type: "confluence"
          source: "company-wiki"
          base_url: "https://company.atlassian.net/wiki"
          deployment_type: "cloud"
          space_key: "DOCS"
          token: "${CONFLUENCE_TOKEN}"
          email: "${CONFLUENCE_EMAIL}"
🔄 Service Management
Systemd Service
# /etc/systemd/system/qdrant-loader.service
[Unit]
Description=QDrant Loader Service
After=network.target
Wants=network.target
[Service]
Type=simple
User=qdrant-loader
Group=qdrant-loader
WorkingDirectory=/opt/qdrant-loader
Environment=PATH=/opt/qdrant-loader/venv/bin
ExecStart=/opt/qdrant-loader/venv/bin/qdrant-loader ingest --workspace /opt/qdrant-loader
Restart=always
RestartSec=10
StandardOutput=journal
StandardError=journal
[Install]
WantedBy=multi-user.target
MCP Server Service
# /etc/systemd/system/mcp-qdrant-loader.service
[Unit]
Description=QDrant Loader MCP Server
After=network.target
Wants=network.target
[Service]
Type=simple
User=qdrant-loader
Group=qdrant-loader
WorkingDirectory=/opt/qdrant-loader
Environment=PATH=/opt/qdrant-loader/venv/bin
ExecStart=/opt/qdrant-loader/venv/bin/mcp-qdrant-loader --workspace /opt/qdrant-loader/config
Restart=always
RestartSec=10
StandardOutput=journal
StandardError=journal
[Install]
WantedBy=multi-user.target
Service Management Commands
# Enable and start services
sudo systemctl enable qdrant-loader
sudo systemctl enable mcp-qdrant-loader
sudo systemctl start qdrant-loader
sudo systemctl start mcp-qdrant-loader
# Check status
sudo systemctl status qdrant-loader
sudo systemctl status mcp-qdrant-loader
# View logs
sudo journalctl -u qdrant-loader -f
sudo journalctl -u mcp-qdrant-loader -f
# Restart services
sudo systemctl restart qdrant-loader
sudo systemctl restart mcp-qdrant-loader
📊 Monitoring and Observability
Log Management
Log Configuration
# logging.yaml
version: 1
formatters: default: format: '%(asctime)s - %(name)s - %(levelname)s - %(message)s' json: format: '{"timestamp": "%(asctime)s", "logger": "%(name)s", "level": "%(levelname)s", "message": "%(message)s"}'
handlers: console: class: logging.StreamHandler level: INFO formatter: json stream: ext://sys.stdout file: class: logging.handlers.RotatingFileHandler level: DEBUG formatter: default filename: /opt/qdrant-loader/logs/app.log maxBytes: 10485760 # 10MB backupCount: 5
loggers: qdrant_loader: level: DEBUG handlers: [console, file] propagate: false
root: level: INFO handlers: [console]
Log Rotation
# /etc/logrotate.d/qdrant-loader
/opt/qdrant-loader/logs/*.log { daily missingok rotate 30 compress delaycompress notifempty create 644 qdrant-loader qdrant-loader postrotate systemctl reload qdrant-loader endscript }
Health Monitoring
Health Check Script
#!/bin/bash
# /opt/qdrant-loader/bin/health-check.sh
set -e
WORKSPACE="/opt/qdrant-loader/config"
LOG_FILE="/opt/qdrant-loader/logs/health-check.log"
# Function to log with timestamp
log() { echo "$(date '+%Y-%m-%d %H:%M:%S') - $1" >> "$LOG_FILE"
}
# Check QDrant Loader configuration
if qdrant-loader config --workspace "$WORKSPACE" >/dev/null 2>&1; then
  log "QDrant Loader: HEALTHY - Configuration valid"
  exit 0
else
  log "QDrant Loader: UNHEALTHY - Configuration invalid"
  exit 1
fi
Cron Job for Health Checks
# Add to crontab
*/5 * * * * /opt/qdrant-loader/bin/health-check.sh
Performance Monitoring
System Metrics
# Monitor system resources
htop
iostat -x 1
free -h
df -h
Application Metrics
# Check project status
qdrant-loader config --workspace /opt/qdrant-loader/config
# Check configuration and project status
qdrant-loader config --workspace /opt/qdrant-loader/config
# Monitor system services
systemctl status qdrant-loader
systemctl status mcp-qdrant-loader
Prometheus Metrics
QDrant Loader includes built-in Prometheus metrics support:
# Available metrics (from prometheus_metrics.py)
INGESTED_DOCUMENTS = Counter("qdrant_ingested_documents_total", "Total number of documents ingested")
CHUNKING_DURATION = Histogram("qdrant_chunking_duration_seconds", "Time spent chunking documents")
EMBEDDING_DURATION = Histogram("qdrant_embedding_duration_seconds", "Time spent embedding chunks")
UPSERT_DURATION = Histogram("qdrant_upsert_duration_seconds", "Time spent upserting to Qdrant")
CHUNK_QUEUE_SIZE = Gauge("qdrant_chunk_queue_size", "Current size of the chunk queue")
EMBED_QUEUE_SIZE = Gauge("qdrant_embed_queue_size", "Current size of the embedding queue")
CPU_USAGE = Gauge("qdrant_cpu_usage_percent", "CPU usage percent")
MEMORY_USAGE = Gauge("qdrant_memory_usage_percent", "Memory usage percent")
🔒 Security Configuration
File Permissions
# Set proper file permissions
sudo chown -R qdrant-loader:qdrant-loader /opt/qdrant-loader
sudo chmod 750 /opt/qdrant-loader
sudo chmod 640 /opt/qdrant-loader/config/.env
sudo chmod 644 /opt/qdrant-loader/config/config.yaml
sudo chmod 755 /opt/qdrant-loader/bin/health-check.sh
Firewall Configuration
# Ubuntu/Debian (ufw)
sudo ufw allow ssh
sudo ufw allow 6333/tcp # QDrant HTTP
sudo ufw allow 6334/tcp # QDrant gRPC
sudo ufw enable
# CentOS/RHEL (firewalld)
sudo firewall-cmd --permanent --add-service=ssh
sudo firewall-cmd --permanent --add-port=6333/tcp
sudo firewall-cmd --permanent --add-port=6334/tcp
sudo firewall-cmd --reload
SSL/TLS Configuration
# Generate SSL certificates for QDrant
openssl req -x509 -newkey rsa:4096 -keyout qdrant-key.pem -out qdrant-cert.pem -days 365 -nodes
# Configure QDrant with SSL
# Add to QDrant configuration
🚀 Scaling Strategies
Horizontal Scaling
Multiple Worker Processes
# Run multiple ingestion processes for different projects
qdrant-loader ingest --workspace /opt/qdrant-loader --project project1 &
qdrant-loader ingest --workspace /opt/qdrant-loader --project project2 &
qdrant-loader ingest --workspace /opt/qdrant-loader --project project3 &
wait
Load Balancing
# Use nginx for load balancing MCP servers
# /etc/nginx/sites-available/qdrant-loader
upstream mcp_servers { server 127.0.0.1:8001; server 127.0.0.1:8002; server 127.0.0.1:8003;
}
server { listen 80; server_name qdrant-loader.example.com; location / { proxy_pass http://mcp_servers; proxy_set_header Host $host; proxy_set_header X-Real-IP $remote_addr; }
}
Vertical Scaling
Resource Optimization
# Optimize for high-memory systems
# Configure larger chunk sizes in config.yaml:
# global:
# chunking:
# chunk_size: 2000
# chunk_overlap: 400
# Run ingestion with specific projectqdrant-loader ingest --workspace /opt/qdrant-loader/config --project high-priority
📚 Deployment Documentation
Detailed Deployment Guides
- Environment Setup - Complete environment setup guide
- Monitoring and Observability - Comprehensive monitoring setup
- Performance Optimization - Production optimization guide
Best Practices
- Use virtual environments - Isolate Python dependencies
- Implement health checks - Monitor application health
- Monitor everything - Comprehensive observability
- Plan for scale - Design for growth
- Secure by default - File permissions, firewall, SSL
- Automate deployments - Use scripts and configuration management
Deployment Checklist
- [ ] System requirements met
- [ ] Dependencies installed
- [ ] Configuration files created and validated
- [ ] Environment variables set
- [ ] QDrant database accessible
- [ ] Services configured and started
- [ ] Health checks implemented
- [ ] Monitoring and logging configured
- [ ] Security measures applied
- [ ] Backup and recovery tested
- [ ] Documentation updated
🆘 Getting Help
Deployment Support
- GitHub Issues - Report deployment issues
- GitHub Discussions - Ask deployment questions
- Deployment Examples - Reference configurations
Community Resources
- Configuration Examples - Community configurations
- Deployment Guides - Community deployment guides
Ready to deploy? Start with Environment Setup for detailed setup instructions or jump to Monitoring and Observability for production monitoring. Don't forget to check Performance Optimization for optimization tips.
Performance Optimization
Configure chunking and processing parameters in your workspace configuration:
# config.yaml - Performance tuning
global:
  chunking:
    chunk_size: 1200
    chunk_overlap: 300
  file_conversion:
    max_file_size: "100MB"
    conversion_timeout: 300