Overview
TensorOne API implements rate limiting to ensure fair usage and maintain service quality for all users. Rate limits vary by endpoint type, subscription plan, and API key permissions.Rate Limit Tiers
Free Tier
- Read Operations: 100 requests/hour
- Write Operations: 20 requests/hour
- AI Services: 50 requests/hour
- Training Jobs: 5 jobs/day
Pro Tier
- Read Operations: 1,000 requests/hour
- Write Operations: 200 requests/hour
- AI Services: 500 requests/hour
- Training Jobs: 20 jobs/day
Enterprise Tier
- Read Operations: 10,000 requests/hour
- Write Operations: 2,000 requests/hour
- AI Services: 5,000 requests/hour
- Training Jobs: Unlimited
Rate Limit Headers
Every API response includes rate limit information:Header Descriptions
X-RateLimit-Limit
: Maximum requests allowed in the time windowX-RateLimit-Remaining
: Requests remaining in current windowX-RateLimit-Reset
: Unix timestamp when the rate limit resetsX-RateLimit-Window
: Rate limit window in secondsRetry-After
: Seconds to wait before making another request (when rate limited)
Endpoint-Specific Limits
Account Management
GPU Clusters
Serverless Endpoints
AI Services
Training Jobs
Rate Limit Strategies
1. Exponential Backoff
Implement exponential backoff when rate limited:2. Request Batching
Batch multiple operations when possible:3. Caching
Cache responses when appropriate:Monitoring Rate Limits
Check Current Usage
Usage Analytics
Rate Limit Errors
429 Too Many Requests
Handling in Code
Optimization Tips
1. Use Appropriate HTTP Methods
- Use
HEAD
requests to check resource existence - Use
PATCH
instead ofPUT
for partial updates - Implement conditional requests with
If-Modified-Since
2. Optimize Polling
3. Request Prioritization
Use priority headers for critical requests:Increasing Rate Limits
Upgrade Your Plan
Higher tier plans come with increased rate limits:- Pro Plan: 5x increase across all endpoints
- Enterprise Plan: 50x increase with custom limits available
Request Limit Increase
For specific use cases, contact support with:- Expected request volume
- Use case description
- Timeline requirements
- Current plan tier
Temporary Limit Boosts
For events or migrations:Consider using webhooks instead of polling to reduce API calls and stay within rate limits.