Distributed Rate Limiter

[ OK ] 241 — full content available

[ INFO ] category: System Design difficulty: hard freq: high first seen: 2026-01-13

[HARD][SYSTEM DESIGN][HIGH]data_engineeringDistributed Systemswebmachine_learningSystem DesignBackendbackendinfrastructure

$ cat problem.md

Apple's "Distributed Rate Limiter" interview question focuses on designing a scalable system to enforce request limits across distributed services, often tagged with data engineering, distributed systems, backend infrastructure, and system design. It typically tests handling high-throughput scenarios like 1M requests/second for 100M users with low latency (<10ms). No exact full problem statement with coding inputs/outputs was found in public sources, as Apple questions are proprietary, but common formulations emphasize token bucket or sliding window algorithms using Redis for atomic counters.[1]

Core Problem Statement

Design a distributed rate limiter that identifies clients (by user ID, IP, or API key) and enforces configurable rules (e.g., 100 requests/minute per user). On excess, return HTTP 429 with headers for remaining quota and reset time. Support per-user, per-IP, global, and endpoint-specific limits in a multi-node setup.[2][1]

Functional Requirements

Client identification via user ID/IP/API key.
Rule-based limits (requests per window).
Reject over-limit requests with 429 status, X-RateLimit-Remaining, and X-RateLimit-Reset.
Out-of-scope: analytics, long-term storage.[1]

Non-Functional Requirements

Latency: <10ms p99.
Scale: 1M req/s, 100M DAU.
Availability: High, eventual consistency OK.
Fault tolerance: Fail-open (allow if unsure).[3][1]

Key Constraints & Examples

No verbatim input/output examples (e.g., LeetCode-style functions) appear publicly, but typical pseudocode interface is:

isRequestAllowed(clientId: str, ruleId: str) -> {passes: bool, remaining: int, resetTime: timestamp}

Example Scenario (token bucket, 3 req/window):

Req1 (t=0): Allow, remaining=2.
Req2 (t=1): Allow, remaining=1.
Req3 (t=2): Allow, remaining=0.
Req4 (t=3): Deny (429).[9][1]

Design Highlights

Use Redis for sharded counters with INCR + EXPIRE for atomicity, avoiding race conditions in distributed gateways. Handle hot keys via consistent hashing; scale with Redis Cluster.[1]

user@intervues:~/apple$