LinkedIn's "Design Top K Search Words" interview question focuses on building a scalable system to track and rank the most frequent search queries in real time. It draws on heavy-hitters algorithms and stream processing to handle high-volume web data.
Design a service that continuously ingests user search queries from LinkedIn's platform and returns the top K most searched words or phrases over a recent time window (e.g., the last hour or day). The system must support high throughput (millions of queries per second), low-latency reads (<100ms), and real-time updates while using bounded memory. The problem touches ranking (frequency/count scores), data engineering (aggregation pipelines), web/backend (API serving), machine learning (optional learned ranking), and stream processing/system design (distributed heavy hitters).[1][2][6][7]
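The bounded-memory requirement is what makes an exact hash-map counter infeasible at this scale, and it is why heavy-hitters sketches come up. As one illustrative approach (not the only acceptable answer), a Count-Min Sketch can approximate per-query counts in fixed memory, with a candidate map plus heap extracting the top K. The class and function names below are hypothetical, and the width/depth parameters are placeholder choices:

```python
import hashlib
import heapq

class CountMinSketch:
    """Approximate frequency counts in O(width * depth) memory.

    Estimates only ever overcount (hash collisions add, never subtract),
    so min-over-rows bounds the error. Parameters here are illustrative.
    """
    def __init__(self, width=1024, depth=4):
        self.width = width
        self.depth = depth
        self.table = [[0] * width for _ in range(depth)]

    def _columns(self, item):
        # One independent-ish hash per row, derived by salting with the row index.
        for i in range(self.depth):
            digest = hashlib.md5(f"{i}:{item}".encode()).hexdigest()
            yield int(digest, 16) % self.width

    def add(self, item):
        for row, col in enumerate(self._columns(item)):
            self.table[row][col] += 1

    def estimate(self, item):
        # Minimum across rows: the row least inflated by collisions.
        return min(self.table[row][col] for row, col in enumerate(self._columns(item)))

def top_k(stream, k, sketch):
    """Consume a stream of query strings; return the k highest estimated counts."""
    candidates = {}
    for query in stream:
        sketch.add(query)
        candidates[query] = sketch.estimate(query)
    return heapq.nlargest(k, candidates.items(), key=lambda kv: kv[1])
```

In a real deployment the candidate map itself must be bounded (e.g., keep only queries whose estimate exceeds a threshold, or use Space-Saving instead); the sketch above is the in-interview starting point, not a production design.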
No verbatim examples are published by official LinkedIn sources, but standard formulations of the problem use inputs and outputs like the following:[1][2]
Stream Input (continuous events):
search: "machine learning", timestamp: 2026-02-02T04:00:00Z, user_id: 123
search: "system design", timestamp: 2026-02-02T04:00:01Z, user_id: 456
search: "machine learning", timestamp: 2026-02-02T04:00:02Z, user_id: 789
... (millions more)
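Because results are scoped to a recent window, the aggregation layer typically buckets events by time and evicts expired buckets as the window slides. A minimal sketch of that idea, assuming per-minute buckets and a class name of my own choosing:

```python
from collections import Counter, deque

class SlidingWindowCounter:
    """Per-minute count buckets for a sliding window.

    Bucket granularity (one minute) and the window length are assumptions;
    production systems tune both against memory and freshness needs.
    """
    def __init__(self, window_minutes=60):
        self.window = window_minutes
        self.buckets = deque()  # entries: (minute_epoch, Counter of query -> count)

    def record(self, query, ts_minute):
        # Open a new bucket when the event's minute advances.
        if not self.buckets or self.buckets[-1][0] != ts_minute:
            self.buckets.append((ts_minute, Counter()))
        self.buckets[-1][1][query] += 1
        # Evict buckets that have fallen out of the window.
        while self.buckets and self.buckets[0][0] <= ts_minute - self.window:
            self.buckets.popleft()

    def counts(self):
        """Merge live buckets into one Counter for the current window."""
        total = Counter()
        for _, bucket in self.buckets:
            total.update(bucket)
        return total
```

Merging all buckets on every read is O(buckets x distinct queries); a serving layer would instead maintain a running total incrementally, subtracting each bucket as it expires.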
API Output (JSON):
```json
{
  "top_k": [
    {"query": "machine learning", "count": 150000, "rank": 1},
    {"query": "system design", "count": 120000, "rank": 2},
    {"query": "data engineering", "count": 80000, "rank": 3}
  ],
  "window": "1h",
  "as_of": "2026-02-02T04:23:00Z"
}
```
The example shows K=3 over a 1-hour window.[2]