Hey r/DarkInterview — sharing a Netflix coding question from https://darkinterview.com .

Auto-Expire Cache

Design and implement a key-value cache with automatic expiration. This is a classic interview question asked at Netflix (also known as the "Log Rate Limiter" problem) that tests your understanding of data structures, memory management, and system design trade-offs.

Part 1: Basic Auto-Expire Cache

Implement an AutoExpireCache class with set and get:

`python import time

cache = AutoExpireCache()

cache.set("user_session", {"user_id": 123}, 2) print(cache.get("user_session")) # {"user_id": 123}

time.sleep(3)

print(cache.get("user_session")) # None (expired)

False is a valid value — don't confuse it with "not found"

cache.set("feature_flag", False, 60) print(cache.get("feature_flag")) # False (not None!) `

This part is straightforward — store (value, expire_timestamp) in a dict and use lazy deletion: check expiry on get() and delete if stale.

Key design decision: Return None for cache misses, not False, since False can be a valid cached value.

Part 2: Preventing Memory Leaks

> Interviewer: "If we set millions of keys that are never accessed again, they stay in memory forever. How do you fix this?"

This is where it gets interesting. Three approaches:

Strategy 1: Periodic Cleanup (Background Thread)

Run a daemon thread that scans and removes expired entries on an interval:

python def _cleanup_expired(self): current = time.time() with self.lock: expired = [k for k, (_, exp) in self.cache.items() if current > exp] for k in expired: del self.cache[k]

Trade-off: O(N) scan, but simple and effective. Requires thread safety with locks.

Strategy 2: Min-Heap for Efficient Expiration

Use a min-heap sorted by expiry time so you only process entries that have actually expired — O(E log N) where E is expired entries:

python heapq.heappush(self.expiry_heap, (expire_at, key, version))

Trick: Use version tracking to handle stale heap entries when keys are updated with new TTLs.

| Approach | Set | Get | Cleanup | Memory Safety | |---|---|---|---|---| | Lazy only | O(1) | O(1) | N/A | ❌ | | Periodic scan | O(1) | O(1) | O(N) | ✅ | | Min-heap | O(log N) | O(1) | O(E log N) | ✅ |

Part 3: Fixed-Size LRU Cache with Expiration

> Interviewer: "What if memory is still a concern? Enforce a maximum cache size with LRU eviction."

Combine TTL expiration with LRU eviction using OrderedDict:

`python from collections import OrderedDict

class AutoExpireCache: def init(self, capacity: int): self.capacity = capacity self.cache = OrderedDict()

def set(self, key, value, ttl_seconds):
    if key in self.cache:
        del self.cache[key]
    while len(self.cache) &gt;= self.capacity:
        self.cache.popitem(last=False)  # Evict LRU
    self.cache[key] = (value, time.time() + ttl_seconds)

def get(self, key):
    if key not in self.cache:
        return None
    value, expire_at = self.cache[key]
    if time.time() &gt; expire_at:
        del self.cache[key]
        return None
    self.cache.move_to_end(key)  # Mark as recently used
    return value

The interviewer may also ask you to implement LRU from scratch using a doubly-linked list + hash map for O(1) operations.

Follow-up Discussion Topics

The interviewer may extend the conversation to:

Thread safety — global lock vs. read-write lock? What about concurrent.futures?
Distributed cache — how does this scale across servers? Consistency between nodes? (Redis/Memcached as production solutions)
Sliding vs. absolute expiration — should get() refresh the TTL?
Bulk operations — mset(), mget() for multiple keys at once
Out-of-order timestamps — how to handle LRU with events arriving out of order?

Full question + Python solutions with all 3 implementation strategies: https://darkinterview.com/collections/n3f6x9k2/questions/ef32a27f-b05a-4e44-a838-1bfd7979ccaf

Hey r/DarkInterview — sharing a Netflix coding question from https://darkinterview.com .

Auto-Expire Cache

Part 1: Basic Auto-Expire Cache

Implement an AutoExpireCache class with set and get:

`python import time

cache = AutoExpireCache()

cache.set("user_session", {"user_id": 123}, 2) print(cache.get("user_session")) # {"user_id": 123}

time.sleep(3)

print(cache.get("user_session")) # None (expired)

False is a valid value — don't confuse it with "not found"

cache.set("feature_flag", False, 60) print(cache.get("feature_flag")) # False (not None!) `

This part is straightforward — store (value, expire_timestamp) in a dict and use lazy deletion: check expiry on get() and delete if stale.

Key design decision: Return None for cache misses, not False, since False can be a valid cached value.

Part 2: Preventing Memory Leaks

> Interviewer: "If we set millions of keys that are never accessed again, they stay in memory forever. How do you fix this?"

This is where it gets interesting. Three approaches:

Strategy 1: Periodic Cleanup (Background Thread)

Run a daemon thread that scans and removes expired entries on an interval:

python def _cleanup_expired(self): current = time.time() with self.lock: expired = [k for k, (_, exp) in self.cache.items() if current > exp] for k in expired: del self.cache[k]

Trade-off: O(N) scan, but simple and effective. Requires thread safety with locks.

Strategy 2: Min-Heap for Efficient Expiration

Use a min-heap sorted by expiry time so you only process entries that have actually expired — O(E log N) where E is expired entries:

python heapq.heappush(self.expiry_heap, (expire_at, key, version))

Trick: Use version tracking to handle stale heap entries when keys are updated with new TTLs.

Part 3: Fixed-Size LRU Cache with Expiration

> Interviewer: "What if memory is still a concern? Enforce a maximum cache size with LRU eviction."

Combine TTL expiration with LRU eviction using OrderedDict:

`python from collections import OrderedDict

class AutoExpireCache: def init(self, capacity: int): self.capacity = capacity self.cache = OrderedDict()

def set(self, key, value, ttl_seconds):
    if key in self.cache:
        del self.cache[key]
    while len(self.cache) &gt;= self.capacity:
        self.cache.popitem(last=False)  # Evict LRU
    self.cache[key] = (value, time.time() + ttl_seconds)

def get(self, key):
    if key not in self.cache:
        return None
    value, expire_at = self.cache[key]
    if time.time() &gt; expire_at:
        del self.cache[key]
        return None
    self.cache.move_to_end(key)  # Mark as recently used
    return value

The interviewer may also ask you to implement LRU from scratch using a doubly-linked list + hash map for O(1) operations.

Follow-up Discussion Topics

The interviewer may extend the conversation to:

Thread safety — global lock vs. read-write lock? What about concurrent.futures?
Distributed cache — how does this scale across servers? Consistency between nodes? (Redis/Memcached as production solutions)
Sliding vs. absolute expiration — should get() refresh the TTL?
Bulk operations — mset(), mget() for multiple keys at once
Out-of-order timestamps — how to handle LRU with events arriving out of order?

Full question + Python solutions with all 3 implementation strategies: https://darkinterview.com/collections/n3f6x9k2/questions/ef32a27f-b05a-4e44-a838-1bfd7979ccaf

Coding - Auto-Expire Cache

Auto-Expire Cache

Part 1: Basic Auto-Expire Cache

False is a valid value — don't confuse it with "not found"

Part 2: Preventing Memory Leaks

Strategy 1: Periodic Cleanup (Background Thread)

Strategy 2: Min-Heap for Efficient Expiration

Part 3: Fixed-Size LRU Cache with Expiration

Follow-up Discussion Topics

Coding - Auto-Expire Cache

Auto-Expire Cache

Part 1: Basic Auto-Expire Cache

False is a valid value — don't confuse it with "not found"

Part 2: Preventing Memory Leaks

Strategy 1: Periodic Cleanup (Background Thread)

Strategy 2: Min-Heap for Efficient Expiration

Part 3: Fixed-Size LRU Cache with Expiration

Follow-up Discussion Topics