Rate Limits

All API endpoints are subject to two tiers of rate limiting:

Tier 1: Global Rate Limits (All Endpoints)

All API calls are subject to global rate limits per client:

These limits apply to ALL endpoints including:

Job creation endpoints (/v1/transcribe, /v1/translate, /v1/voiceover, /v1/dub)
Management endpoints (/v1/account, /v1/keys, /v1/jobs, etc.)
Upload endpoints (/v1/upload/*)

Job creation endpoints have additional per-product rate limits:

Important: These are in addition to the global limits. A request must pass both checks to be allowed.

If you make 5 transcribe requests in one minute:

If you make 4 transcribe + 30 translate + 10 account requests in one minute:

Implement exponential backoff - Wait before retrying after rate limit errors
Monitor your usage - Track API calls to avoid hitting limits
Use webhooks - Avoid polling for job status; use webhooks instead
Cache results - Cache API responses to reduce request volume