Rate Limits & Retries

Rate limits

Augment APIs allow up to 10 requests per second (600 requests per minute) per customer, unless a different limit is explicitly documented for a specific endpoint.

Retrying failed requests

Only retry when the failure is likely transient. A good rule of thumb: if the same request would probably succeed without any changes, it's worth retrying.

Retry these responses

Status / Condition Reason
500 Internal Server Error Unexpected server-side failure
502 Bad Gateway Upstream service unavailable
503 Service Unavailable Server temporarily unable to handle the request
504 Gateway Timeout Upstream service did not respond in time
Network-level failures Timeouts, connection resets, refused connections

Do not retry these responses

Status Reason
400 Bad Request Request is malformed or missing required fields
401 Unauthorized API key is invalid, missing, or revoked
403 Forbidden Authenticated but not authorized for this action
404 Not Found Resource does not exist
422 Unprocessable Entity Request is well-formed but fails validation

These errors indicate a problem with the request itself. Retrying without changing the request will produce the same result.

We monitor 4xx and 5xx errors across all API traffic. If you're running into persistent errors or need help troubleshooting, don't hesitate to reach out to your customer success representative or our integrations team.

Backoff strategy

Use exponential backoff with full jitter. Suggested parameters:

Parameter Value
Initial base delay ~500 ms
Maximum delay cap ~30 seconds
Maximum attempts 5
Maximum total wait ~2 minutes

Stop retrying when you hit either the attempt limit or the total time budget — whichever comes first.