---
title: Intel GPU / Tiber Cloud
layout: default
nav_order: 2.7
parent: Deployment
grand_parent: TrustGraph Documentation
---

# Intel Tiber Cloud Deployment

Deploy TrustGraph on Intel Tiber Cloud with Intel GPU- and Gaudi-accelerated systems for high-performance AI workloads.

## Overview

TrustGraph provides specialized deployment configurations for Intel's advanced hardware platforms, including Intel Tiber Cloud and bare-metal Intel GPU/Gaudi systems. This deployment method is designed for:

- **High-performance AI inference** with Intel accelerators
- **Self-hosted deployments** on Intel hardware
- **Research and development** with cutting-edge Intel AI technologies
- **Enterprise workloads** requiring Intel-optimized performance

⚠️ **Work in Progress**: This deployment method is actively being developed and optimized for Intel's latest hardware platforms.

## What You Get

The Intel Tiber Cloud deployment includes:

- **Intel accelerated systems** (Gaudi, GPU, or GR platforms)
- **Optimized AI inference** with vLLM or TGI servers
- **Large language models** like Llama 3.3 70B
- **Complete TrustGraph stack** deployed via containerization
- **SSH-based deployment** with automated scripts
- **Port forwarding setup** for secure access
- **Monitoring and observability** with Grafana
- **Web workbench** for document processing and Graph RAG

## Intel Hardware Platforms

### Intel Gaudi Systems
- **Software**: TrustGraph 1.0.13 + vLLM (HabanaAI fork)
- **Model**: Llama 3.3 70B
- **Deployment**: `deploy-gaudi-vllm.zip`
- **Optimized for**: AI training and inference workloads

### Intel Multi-GPU 1550 Systems
- **Software**: TrustGraph 1.0.13 + TGI Server 3.3.1-intel-xpu
- **Deployment**: `deploy-gpu-tgi.zip`
- **Optimized for**: High-throughput GPU inference

### Intel GR Systems
- **Software**: TrustGraph 1.0.13 + TGI Server 3.3.1-intel-xpu
- **Deployment**: `deploy-gr.zip`
- **Optimized for**: Specialized Intel AI workloads

## Why Choose Intel Tiber Cloud?

### 🚀 **Cutting-Edge AI Hardware**
- **Intel Gaudi**: Purpose-built for AI training and inference
- **Intel GPU**: High-performance parallel processing
- **Specialized Architecture**: Optimized for AI/ML workloads

### ⚡ **Performance Optimization**
- **Hardware-Accelerated Inference**: Native Intel optimization
- **Large Model Support**: Handle models like Llama 3.3 70B efficiently
- **Reduced Latency**: Direct hardware acceleration

### 🔒 **Self-Hosted Control**
- **Data Sovereignty**: Complete control over data and models
- **Custom Configuration**: Tailor deployments to specific needs
- **Enterprise Security**: Self-hosted infrastructure

### 🛠️ **Developer Access**
- **Research Platform**: Access to latest Intel AI technologies
- **Experimentation**: Test advanced AI configurations
- **Direct Hardware Access**: Low-level optimization capabilities

## Deployment Method

The Intel deployment uses automated deployment scripts that do the following (a rough sketch follows the list):

- Connect via an SSH jump host to Intel Tiber Cloud
- Deploy pre-configured TrustGraph packages
- Set up Intel-optimized AI inference servers
- Configure port forwarding for secure access
- Initialize monitoring and web interfaces
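
Conceptually, the flow looks something like the sketch below. This is a hand-written illustration with placeholder addresses, not the repository's actual script; the package name, the unpack step, and the use of Docker Compose are assumptions to verify against the repository.

```bash
# Hypothetical sketch only; the real logic lives in the repository's
# deploy-tiber script and may differ in detail.
JUMP=guest@203.0.113.10        # jump host (placeholder address)
TARGET=sdp@100.80.243.74       # allocated instance (placeholder address)
PACKAGE=deploy-gaudi-vllm.zip  # package matching your platform

# Copy the deployment package to the target through the jump host
scp -o ProxyJump="$JUMP" "$PACKAGE" "$TARGET":

# Unpack and bring up the containerized stack (assuming a Compose-based package)
ssh -J "$JUMP" "$TARGET" "unzip -o $PACKAGE && docker compose up -d"
```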

## Quick Process Overview

1. **Obtain access** to an Intel Tiber Cloud instance
2. **Create a HuggingFace token** and accept model licenses
3. **Choose a deployment type** (Gaudi, GPU, or GR)
4. **Deploy via script** with SSH parameters
5. **Connect via SSH** with port forwarding
6. **Access services** through forwarded ports

## Access Configuration

Intel Tiber Cloud uses SSH jump host access:

```bash
# SSH connection format
ssh -J guest@[jump-host] sdp@[target-host]

# Deployment command
./deploy-tiber guest@[jump-host] sdp@[target-host] [deployment-package]

# Port forwarding
./port-forward guest@[jump-host] sdp@[target-host]
```
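
For example, a Gaudi deployment with placeholder addresses (substitute the jump host and target you were allocated) would look like:

```bash
# Addresses below are placeholders for illustration only
./deploy-tiber guest@203.0.113.10 sdp@100.80.243.74 deploy-gaudi-vllm.zip

# Keep this running in a second terminal while you work
./port-forward guest@203.0.113.10 sdp@100.80.243.74
```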

## Access Points

Once deployed, you'll have access to:

- **TrustGraph API**: Port 8089 (forwarded from 8088)
- **Web Workbench**: Port 8889 (forwarded from 8888)
- **Grafana Monitoring**: Port 3001 (forwarded from 3000)
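
The `port-forward` helper presumably wraps standard SSH local forwarding; if you prefer to manage the tunnel yourself, an equivalent command would look roughly like this (placeholder addresses, and the script's exact flags may differ):

```bash
# Forward the three service ports listed above through the jump host:
#   8089 -> API (8088), 8889 -> Workbench (8888), 3001 -> Grafana (3000)
ssh -J guest@203.0.113.10 -N \
    -L 8089:localhost:8088 \
    -L 8889:localhost:8888 \
    -L 3001:localhost:3000 \
    sdp@100.80.243.74

# From another terminal, confirm a tunnel is answering
curl -s -o /dev/null -w "%{http_code}\n" http://localhost:8889/
```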

## Model Support

- **Large Language Models**: Llama 3.3 70B and other HuggingFace models
- **License Requirements**: HuggingFace account with model access
- **Hardware Optimization**: Intel-specific optimizations for inference
- **Inference Engines**: vLLM (Gaudi) and TGI (GPU/GR)
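
Gated models such as Llama 3.3 need an authenticated HuggingFace session wherever the weights are pulled. A typical setup is shown below; whether the deployment packages read `HF_TOKEN` or the CLI login cache is an assumption, so check the repository's instructions.

```bash
# Interactive login; stores the token under ~/.cache/huggingface
huggingface-cli login

# Or export the standard environment variable for non-interactive use
export HF_TOKEN=hf_xxxxxxxxxxxxxxxxxxxx   # placeholder, use your own token
```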

## Complete Documentation

For detailed step-by-step instructions, deployment packages, and troubleshooting, visit:

**[TrustGraph Intel Tiber Cloud Guide](https://github.com/trustgraph-ai/trustgraph-tiber-cloud)**

The repository contains:
- Deployment automation scripts
- Intel platform-specific configurations
- Pre-built deployment packages
- SSH connection utilities
- Port forwarding setup
- Detailed setup instructions
- Intel hardware optimization guides

## Prerequisites

- **Intel Tiber Cloud Access**: Account and instance allocation
- **HuggingFace Token**: For model downloads and licensing
- **SSH Access**: Familiarity with SSH jump host connections
- **Model Licenses**: Acceptance of required model licenses (e.g., Llama)
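
If you connect repeatedly, a `~/.ssh/config` entry saves retyping the jump-host chain (addresses are placeholders):

```
# ~/.ssh/config: substitute the addresses from your allocation
Host tiber
    # Allocated target instance
    HostName 100.80.243.74
    User sdp
    # Intel Tiber Cloud jump host
    ProxyJump guest@203.0.113.10
```

With this in place, `ssh tiber` replaces the full `ssh -J` invocation.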

## Performance Benefits

- **Hardware Acceleration**: Native Intel AI acceleration
- **Large Model Efficiency**: Optimized for 70B+ parameter models
- **Reduced Infrastructure Costs**: Efficient hardware utilization
- **Custom Optimization**: Direct access to hardware-level tuning

## Use Cases

- **Research & Development**: Access to cutting-edge Intel AI hardware
- **High-Performance Inference**: Large model deployment with optimal performance
- **Self-Hosted AI**: Complete control over AI infrastructure
- **Intel Ecosystem Integration**: Leverage Intel's AI software stack

## Next Steps

After deployment, you can:

- Load documents through the web workbench
- Test Graph RAG queries with Llama 3.3 70B
- Monitor Intel hardware performance through Grafana
- Experiment with Intel-optimized AI configurations
- Benchmark performance against other platforms
- Integrate with Intel's AI development tools