Overview
The Create Cluster endpoint allows you to provision new GPU clusters with flexible configurations including GPU types, storage options, networking, and security settings. Perfect for ML training, development environments, and production AI workloads.Endpoint
Request Body
Parameter | Type | Required | Description |
---|---|---|---|
name | string | Yes | Cluster name (3-64 characters, alphanumeric and hyphens) |
description | string | No | Optional cluster description |
gpu_type | string | Yes | GPU type: A100 , H100 , RTX4090 , V100 , T4 , RTX3090 |
gpu_count | integer | Yes | Number of GPUs (1-8 depending on GPU type) |
cpu_cores | integer | No | CPU cores (auto-calculated if not specified) |
memory_gb | integer | No | RAM in GB (auto-calculated if not specified) |
storage_gb | integer | Yes | Persistent storage in GB (minimum 50GB) |
region | string | Yes | Deployment region |
project_id | string | Yes | Project ID for organization |
template_id | string | No | Template ID for pre-configured environments |
docker_image | string | No | Custom Docker image (if not using template) |
environment_variables | object | No | Environment variables for the cluster |
ssh_enabled | boolean | No | Enable SSH access (default: true) |
ssh_public_keys | array | No | SSH public keys for access |
port_mappings | array | No | Port forwarding configuration |
auto_start | boolean | No | Start cluster immediately (default: true) |
auto_terminate | object | No | Auto-termination settings |
network_config | object | No | Advanced networking configuration |
security_groups | array | No | Security group IDs |
tags | object | No | Resource tags for organization |
Request Examples
Response Schema
Configuration Options
GPU Types and Availability
GPU Type | Memory | Cores | Max Count | Hourly Rate | Best For |
---|---|---|---|---|---|
A100 | 80GB | 6912 | 8 | $2.50+ | Large model training, inference |
H100 | 80GB | 16896 | 8 | $4.00+ | Latest generation, fastest training |
RTX4090 | 24GB | 16384 | 4 | $0.80+ | Development, medium models |
V100 | 32GB | 5120 | 8 | $1.20+ | Legacy support, cost-effective |
T4 | 16GB | 2560 | 4 | $0.50+ | Inference, light training |
Storage Options
Type | Min Size | Max Size | Performance | Use Case |
---|---|---|---|---|
ssd | 50GB | 10TB | High IOPS | OS, applications, fast data access |
nvme | 100GB | 5TB | Ultra-high IOPS | Training data, checkpoints |
hdd | 100GB | 50TB | Standard | Archives, large datasets |
Auto-termination Settings
Use Cases
ML Model Training
Create powerful multi-GPU clusters for training large language models and computer vision models.Development Environment
Set up interactive development environments with Jupyter, VSCode, and debugging tools.Production Inference
Deploy production-ready inference clusters with load balancing and auto-scaling.Error Handling
Security Considerations
- SSH Keys: Always use strong SSH key pairs and rotate them regularly
- Network Security: Configure security groups and firewall rules appropriately
- Environment Variables: Never store secrets in plain text; use encrypted secrets
- Access Control: Ensure proper project-based access controls
- Cost Monitoring: Implement cost alerts to prevent unexpected charges
Best Practices
- Resource Planning: Choose GPU types based on your specific workload requirements
- Cost Optimization: Use auto-termination to prevent runaway costs
- Data Management: Plan storage requirements and backup strategies
- Security: Implement proper access controls and network security
- Monitoring: Set up alerts for cluster status and performance metrics
- Template Usage: Use templates for consistent, repeatable deployments
Authorizations
API key authentication. Use 'Bearer YOUR_API_KEY' format.
Body
application/json
Cluster configuration
The body is of type object
.
Response
Cluster created successfully
The response is of type object
.