When an LLM call is rate-limited, our system detects the rate-limit error and retries the request, which avoids an unnecessary failover.
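
The detect-and-retry behavior can be sketched roughly as follows. This is a minimal illustration, not the system's actual implementation: `RateLimitError`, `call_with_retry`, and the parameters `max_attempts`, `base_delay`, and `sleep` are hypothetical names, and the exponential backoff with jitter is an assumed retry policy that the text above does not specify.

```python
import random
import time


class RateLimitError(Exception):
    """Hypothetical error for a rate-limited LLM call (for example, an HTTP 429)."""


def call_with_retry(call, max_attempts=4, base_delay=0.5, sleep=time.sleep):
    """Run call(); on RateLimitError, wait and retry, up to max_attempts tries in total."""
    for attempt in range(1, max_attempts + 1):
        try:
            return call()
        except RateLimitError:
            if attempt == max_attempts:
                # Retries are exhausted: re-raise so the caller can decide to fail over.
                raise
            # Assumed policy: the delay doubles on each attempt and is scaled by a
            # random factor in [0.5, 1.0] so concurrent clients do not retry in lockstep.
            delay = base_delay * (2 ** (attempt - 1)) * random.uniform(0.5, 1.0)
            sleep(delay)
```

In this sketch only the rate-limit error triggers a retry; any other exception propagates immediately, and the rate-limit error itself is re-raised once the attempts run out, at which point a failover would be the caller's decision. Passing `sleep` as a parameter lets the wait be replaced in tests.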