Day 14 · Hard · Feed System · Push/Pull/Hybrid · Fanout · Ranking

Feed System: The Fanout Art of a 100M-DAU Following Timeline
Push vs Pull vs Hybrid, Fanout-on-write Limits, Timeline Storage, Ranking Pipeline

Problem & Constraints

Design the backend for a 100M-DAU following timeline (Twitter/X Home, Instagram Feed, Weibo following stream): you follow a few hundred accounts, and pull-to-refresh must return their newest, sorted posts within p99 < 200ms. The hard part isn't storing posts — it's the read/write amplification asymmetry: one post must reach millions of followers, while one refresh must aggregate and sort across hundreds of followees. The first question: do you do the merge work at write time or read time?

Scale: 100M DAU, hundreds of followees each, post peak of a few thousand to 10K QPS, timeline read peak in the hundreds of thousands of QPS (Twitter has publicly cited ~300K QPS of timeline reads).
Read:write ratio: heavily read-skewed, read:write ≈ 100:1 — the root motivation for "do more work on write, save on read".
Latency SLO: refresh p99 < 200ms; new-post "visibility delay" in seconds (Twitter targeted delivery within 5s).
Uneven fanout: a normal user has hundreds of followers, a celebrity tens of millions — one fanout strategy can't serve both ends.
Ranking: pure chronological or ML-ranked? The latter must score hundreds of candidates after aggregation, on a tight budget.
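
A quick back-of-envelope pass over these numbers shows why the write path deserves attention even before celebrities enter the picture. The sketch below is illustrative only: the 5,000 posts/s peak and the 200-follower average are assumed values picked from inside the ranges above, not measured figures.

```python
# Back-of-envelope load estimate (all inputs are assumptions, not measurements).
POST_QPS = 5_000          # assumed peak, inside the "few thousand to 10K" range
AVG_FOLLOWERS = 200       # assumed average for a normal (non-celebrity) author
READ_QPS = 300_000        # timeline read peak cited above


def fanout_writes_per_sec(post_qps: int, avg_followers: int) -> int:
    """Cache writes per second if every post is pushed to every follower."""
    return post_qps * avg_followers


def read_write_ratio(read_qps: int, post_qps: int) -> float:
    """Timeline reads per post at peak."""
    return read_qps / post_qps


writes = fanout_writes_per_sec(POST_QPS, AVG_FOLLOWERS)   # 1_000_000 cache writes/s
ratio = read_write_ratio(READ_QPS, POST_QPS)              # 60.0 reads per post
```

At these assumed values, pure push already means about a million cache writes per second for normal users alone, and the read:write ratio lands in the same order of magnitude as the 100:1 quoted above.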

High-Level Architecture (Hybrid Fanout)

graph TD
    POST["Post / Write API"] --> TW[("Tweet Store<br/>source of truth")]
    POST --> FO["Fanout service<br/>look up social graph"]
    FO -->|normal: push| TLC[("Home Timeline Cache<br/>Redis · one bounded list/user")]
    FO -.celeb: skip fanout.-> CEL[("Celebrity posts<br/>pulled at read time")]

    READ["Refresh / Read API"] --> MIX["Timeline mixing service"]
    MIX -->|① read pre-materialized| TLC
    MIX -->|② pull celeb latest| CEL
    MIX --> RANK["Rank + diversity"]
    RANK --> HYDRATE["Hydrate<br/>fetch post bodies by id"]
    HYDRATE --> TW
    HYDRATE --> OUT["Return 20-post feed"]

    classDef w fill:#1a2530,stroke:#64c8ff,color:#e8eef5
    classDef cache fill:#1a1a30,stroke:#ffb450,color:#e8eef5
    classDef store fill:#2a1530,stroke:#ff7ab6,color:#e8eef5
    class POST,READ,FO,MIX,RANK,HYDRATE,OUT w
    class TLC,CEL cache
    class TW store

Write path pre-materializes normal users' timelines; read path merges "pre-materialized" + "celebrity real-time pull" then ranks

Component roles: Tweet Store is the source of truth for posts (a KV sharded by tweet_id). The Fanout service, on each post, looks up the social graph and pushes the tweet_id into every follower's Home Timeline Cache — that's "do more on write". Celebrities are the exception: fanout cost is too high, so their posts aren't pre-materialized and are pulled at read time. The mixing service is the crux: at read time it merges the pre-materialized timeline with celebrities' latest posts, ranks them, then hydrates only the top-N (timelines store only ids; bodies are fetched from Tweet Store to save memory).

Key Technical Points

1. Push vs Pull vs Hybrid: merge at write time or read time

One-line trade-off: trade "write amplification at post time" for "read amplification at refresh time" — you can only save on one end, and celebrities force you to manage both.

Principle: Push (fanout-on-write) writes the post id into every follower's timeline at post time, read takes it out in one shot — read O(1), write O(followers). Pull (fanout-on-read) precomputes nothing; on refresh it queries all your followees' recent posts and merge-sorts them — write O(1), read O(followees × per-query). The choice depends on read:write ratio and fanout distribution: read-heavy (100:1) with most users having few followers → push amortizes cost onto the sparse writes, a win; but a celebrity with tens of millions of followers triggers tens of millions of writes per post, and push collapses. So industry is almost always Hybrid: push for normal users, pull for celebrities, merge at read time.

Aspect	Push (write fanout)	Pull (read fanout)	Hybrid
Post cost	high (O(followers))	low (one write)	normal high / celeb low
Refresh cost	low (read one list)	high (merge N sources)	read 1 list + pull few celebs
New-post delay	after fanout done	real-time	normal lag / celeb real-time
Failure mode	celebrity write blowup	active-user read blowup	merge logic complexity
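
The asymmetry in the table can be made concrete with a toy in-memory model. This is a minimal sketch, not how any production system stores timelines: the class names `PushFeed` and `PullFeed` are hypothetical, posts are assumed to arrive in increasing timestamp order, and the `writes` counter exists only to expose the write amplification.

```python
import heapq
from collections import defaultdict
from itertools import islice


class PushFeed:
    """Fanout-on-write: each post is copied into every follower's timeline."""

    def __init__(self, followers):
        self.followers = followers            # author -> list of follower ids
        self.timelines = defaultdict(list)    # user -> [(ts, post_id)], newest last
        self.writes = 0

    def post(self, author, ts, post_id):
        for f in self.followers.get(author, []):
            self.timelines[f].append((ts, post_id))
            self.writes += 1                  # O(followers) work per post

    def read(self, user, limit=20):
        # One list lookup; newest entries sit at the end of the list.
        return [pid for _, pid in reversed(self.timelines[user][-limit:])]


class PullFeed:
    """Fanout-on-read: store each post once, merge followees' posts on refresh."""

    def __init__(self, followees):
        self.followees = followees            # user -> list of followed authors
        self.posts = defaultdict(list)        # author -> [(ts, post_id)], newest last
        self.writes = 0

    def post(self, author, ts, post_id):
        self.posts[author].append((ts, post_id))
        self.writes += 1                      # O(1) work per post

    def read(self, user, limit=20):
        # One newest-first stream per followee, merged lazily.
        streams = [reversed(self.posts[a]) for a in self.followees.get(user, [])]
        merged = heapq.merge(*streams, reverse=True)
        return [pid for _, pid in islice(merged, limit)]
```

Both classes return the same feed for the same inputs; they differ only in where the work happens. `PushFeed.writes` grows with follower count per post, while `PullFeed.read` touches one stream per followee on every refresh.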

Trade-off:

Pure push: ✅ very fast reads, ranking can be precomputed offline; ❌ celebrity write amplification is fatal; inactive users get materialized too (computing timelines for dormant accounts is pure waste).
Pure pull: ✅ cheap writes, less storage, always real-time; ❌ every refresh redoes the merge-sort, peak read QPS overwhelms the backend, p99 jitters.
Hybrid difficulty: where to set the celebrity threshold (follower count? activity?), and how to keep ordering consistent and dedup when merging two sources at read time.

Real-world cases:

Twitter: push-dominant (Redis pre-materialized home timelines), celebrities go hybrid — their posts skip fanout and merge at read time. Raffi Krikorian's Timelines at Scale is the classic material.
Facebook News Feed (Multifeed): a pull design — aggregators fan out at read time to a set of leaf nodes for friends' recent actions, then rank/filter and return (Meta engineering blog, 2015).
LinkedIn FollowFeed: also pull — the broker (followfeed-query) queries multiple storage nodes' activity timelines in parallel at read time, then merges (LinkedIn engineering blog).

2. Fanout-on-write amplification and the celebrity problem

One-line trade-off: push converts read cost into write cost, and write cost explodes linearly with follower count — unsustainable at celebrity scale.

Principle: the fanout write count for one post = the author's follower count. A normal person triggers a few hundred cache writes, easily absorbed; but an account with 50M followers triggers 50M Redis writes per post. Even at 100µs each, doing them serially takes about 5,000 seconds (roughly 83 minutes), far outside a seconds-level visibility target, and the burst of writes saturates cache-cluster bandwidth. Worse is thundering-herd overlap: celebrities have many online followers at once, so fanout storm and read storm collide. The fix is a fanout threshold: accounts above it join a "celebrity list", their posts skip fanout and are pulled and merged into each reader's timeline at read time. The cost is one extra "pull celeb latest + merge" on the read path, but it avoids the write-side avalanche.
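
The serial-fanout estimate above is simple arithmetic, and it is worth seeing how much parallelism it takes to pull it back under a seconds-level target. In this sketch the helper name `fanout_seconds` and the figure of 1,000 concurrent writers are assumptions for illustration, and writes are assumed to divide evenly across workers with no contention.

```python
def fanout_seconds(followers: int, per_write_s: float, parallelism: int = 1) -> float:
    """Wall-clock time to push one post, assuming writes split evenly across workers."""
    return followers * per_write_s / parallelism


serial = fanout_seconds(50_000_000, 100e-6)            # a single writer
parallel = fanout_seconds(50_000_000, 100e-6, 1_000)   # 1,000 hypothetical writers
```

Under these assumptions the serial case comes to about 5,000 seconds, and even 1,000 perfectly parallel writers only bring one celebrity post down to about 5 seconds, which consumes the whole visibility budget for a single post.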

# Hybrid fanout: split by threshold at post time (pseudo-code)
# tweet_store, social_graph, celeb_recent, redis and chunks are stand-ins for real clients/helpers;
# zadd uses the redis-py 3.x mapping form: zadd(key, {member: score})
FANOUT_THRESHOLD = 100_000  # followers above this -> read-time pull

def on_post(author_id, tweet_id, ts):
    tweet_store.put(tweet_id, ...)          # always write source of truth first
    n = social_graph.follower_count(author_id)
    if n > FANOUT_THRESHOLD:
        celeb_recent.zadd(author_id, {tweet_id: ts})  # celeb latest, pulled at read
        return                                # no fanout
    # normal user: batched pipeline; run this loop in a background worker so it doesn't block the post
    for batch in chunks(social_graph.followers(author_id), 1000):
        pipe = redis.pipeline()
        for fid in batch:
            pipe.zadd(f"tl:{fid}", {tweet_id: ts})
            pipe.zremrangebyrank(f"tl:{fid}", 0, -801)  # bounded: keep newest 800
        pipe.execute()

Trade-off:

Threshold too high: a few big accounts still push, occasional write storms; upside is a simpler read path (fewer celebs to merge).
Threshold too low: many mid-tier accounts become pull, the read-time celeb merge list grows, read amplification rises.
Activity-aware: an advanced move is to fanout only to active followers (logged in within 30 days), materializing the rest on demand — cutting wasted writes to dormant accounts.
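
The activity-aware variant amounts to partitioning the follower list before fanout. A minimal sketch, assuming a hypothetical `last_login` lookup (follower id to timezone-aware datetime) is available; how that lookup is stored and kept fresh is outside the scope of this example.

```python
from datetime import datetime, timedelta, timezone

ACTIVE_WINDOW = timedelta(days=30)   # "logged in within 30 days", as in the text


def split_followers(followers, last_login, now=None):
    """Partition followers into (active, dormant) by last login time.

    A follower with no recorded login is treated as dormant.
    """
    now = now or datetime.now(timezone.utc)
    active, dormant = [], []
    for fid in followers:
        seen = last_login.get(fid)
        if seen is not None and now - seen <= ACTIVE_WINDOW:
            active.append(fid)
        else:
            dormant.append(fid)
    return active, dormant
```

Only the `active` list would go through the push pipeline; a dormant user's timeline would be rebuilt by pull the next time they log in.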

Real-world cases:

Twitter: high-follower accounts are explicitly listed as "celebrity" to bypass fanout and merge at read; the home timeline in Redis is bounded (public material mentions a ~800-entry cap), older entries truncated.
Instagram: uses Cassandra to asynchronously pre-materialize feeds for non-celebrities; celebrities (millions of followers) are not pre-materialized, since fanning out to all followers is too compute/I/O intensive (Instagram engineering material).

3. Timeline storage: bounded list, store ids not bodies

One-line trade-off: trade "one extra hydrate to fetch bodies at read time" for "shrinking the timeline cache memory by one or two orders of magnitude".

Principle: the home timeline is an ordered structure in Redis (a sorted set / list scored by time), one per user, storing only tweet_ids, not bodies. Why: ① a body is a few KB; storing 100M users × 800 entries of full bodies would take hundreds of TB of memory, while 8-byte ids shrink that by roughly 100×; ② posts can be edited/deleted, and a body redundantly copied into every timeline would need to be re-fanned-out on update. At read time you take the top-N ids and batch-hydrate bodies from Tweet Store (one MGET), filtering out deleted/blocked ones. The timeline must also be bounded: keep only the newest few hundred entries and truncate older ones, otherwise the timelines of users who follow prolific accounts grow without bound. Cold data (scrolling far back) falls back to pull.

# Read path: merge + hydrate (pseudo-code)
def get_home_timeline(uid, limit=20):
    ids = redis.zrevrange(f"tl:{uid}", 0, 399)        # ① pre-materialized part (newest 400; end index is inclusive)
    for cid in following_celebs(uid):                  # ② pull celeb live
        ids += celeb_recent.zrevrange(cid, 0, 49)      #    newest 50 per celebrity
    ids = dedup(ids)
    ranked = rank(uid, ids)[:limit * 2]                # ③ rank; over-fetch so filtering can't starve the page
    tweets = tweet_store.mget(ranked)                  # ④ hydrate only the top candidates
    shown = [t for t in tweets if t is not None and visible(uid, t)]  # ⑤ filter deleted/blocked
    return shown[:limit]
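
The merge and dedup steps in the read path above are where the two sources have to agree on ordering. One possible sketch, assuming each source yields `(ts, tweet_id)` pairs already sorted newest-first; that tuple shape and the helper name `merge_newest_first` are assumptions of this example (a real Redis client may return `(member, score)` pairs that need swapping first).

```python
import heapq


def merge_newest_first(*sources, limit=None):
    """Merge newest-first (ts, tweet_id) lists into one newest-first id list.

    Each source must already be sorted by ts descending. A tweet id that
    appears more than once keeps only its first (newest) occurrence.
    """
    seen = set()
    merged = []
    for _ts, tweet_id in heapq.merge(*sources, reverse=True):
        if tweet_id in seen:
            continue
        seen.add(tweet_id)
        merged.append(tweet_id)
        if limit is not None and len(merged) >= limit:
            break
    return merged
```

Because `heapq.merge` is lazy, passing a `limit` stops the merge as soon as enough candidates are collected, which matters when a user follows many celebrities.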

Trade-off:

Store ids (recommended): ✅ 100× less memory, edit/delete changes one place; ❌ one extra hydrate RTT, Tweet Store becomes a read hotspot (needs its own cache layer).
Store bodies redundantly: ✅ zero re-fetch on read, single return; ❌ memory blowup, delete/edit requires full re-fanout — almost nobody does this.
Unbounded list: skip truncation for convenience → big accounts' followers' timelines bloat, both Redis memory and single-key size go out of control.

Real-world cases:

Twitter: home timelines live in a Redis cluster, bounded (~800 entries), each entry id-level data, with async pipelined writes to optimize throughput (Krikorian material).
Instagram: migrated feed/activity data from Redis to Cassandra (then to in-house Rocksandra storage engine) to push down GC stalls and p99 — showing timeline storage choices evolve with scale (Instagram engineering blog).

4. Ranking Pipeline: from chronological to ML ranking

One-line trade-off: trade "engagement uplift" for "real-time, explainability, and compute budget" — the ranking model can only run on the small candidate set after aggregation.

Principle: early feeds were pure reverse-chronological — simple, real-time, predictable. After information overload, the shift was to ML ranking: after aggregating candidates (push-materialized + pull-celebs, a few hundred), a model predicts multiple engagement signals per post (like/comment/reshare/dwell probability), weighted into a score. This echoes Day 13's multi-stage funnel — the difference is feed candidates come from the follow graph not full-corpus retrieval, so the candidate set is far smaller (hundreds vs billions), allowing a fairly heavy ranking model directly. After ranking comes re-ranking to inject diversity (spread same-author, avoid a single-topic streak) and business rules (ad insertion, recency boosts).
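
The "weighted score, then diversity re-rank" step can be sketched in a few lines. Everything concrete here is an assumption for illustration: the weights, the `Candidate` shape, and the greedy author-penalty rule are stand-ins, and the predicted probabilities would come from a model that is not shown.

```python
from dataclasses import dataclass

# Illustrative weights only; real systems tune these per objective.
WEIGHTS = {"like": 1.0, "comment": 2.0, "reshare": 3.0, "dwell": 0.5}


@dataclass
class Candidate:
    tweet_id: str
    author_id: str
    probs: dict   # predicted engagement probabilities, e.g. {"like": 0.1}


def score(c: Candidate) -> float:
    """Weighted sum of predicted engagement probabilities."""
    return sum(w * c.probs.get(k, 0.0) for k, w in WEIGHTS.items())


def rerank_with_diversity(cands, limit=20, author_penalty=0.5):
    """Greedy re-rank: every post already picked from an author multiplies that
    author's remaining scores by author_penalty, breaking up same-author runs."""
    remaining = list(cands)
    picked, per_author = [], {}
    while remaining and len(picked) < limit:
        best = max(
            remaining,
            key=lambda c: score(c) * author_penalty ** per_author.get(c.author_id, 0),
        )
        remaining.remove(best)
        picked.append(best)
        per_author[best.author_id] = per_author.get(best.author_id, 0) + 1
    return picked
```

The greedy loop is quadratic in the candidate count, which is acceptable only because the candidate set is a few hundred posts, as the paragraph above notes.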

Trade-off:

Chronological: ✅ real-time, explainable, zero model cost, no confusion about "why am I seeing this"; ❌ high miss rate (good content buried while you slept), lower engagement than a ranked feed.
ML ranking: ✅ higher engagement, rescues "posted early but high quality"; ❌ compute-expensive, poor explainability, prone to echo chambers and "why didn't I see their post" complaints; needs pCTR calibration and multi-objective fusion.
Push + offline ranking conflict: if the timeline is sorted at pre-materialization time, new posts and ranking-model updates can't be reflected live — so re-ranking usually lives at read time, with push only responsible for stuffing candidates in.

Real-world cases:

Facebook News Feed: Multifeed's aggregator ranks and filters the leaf-fetched candidates at read time before returning — ranking is part of the read path (Meta engineering blog).
LinkedIn: one of FollowFeed's goals was to support "more computationally complex scoring and ranking" without sacrificing real-time, with the broker ranking after read-time merge (LinkedIn engineering blog).

Scaling & Optimization

Activity-aware fanout: pre-materialize only for recently active followers; generate on demand for dormant ones, cutting huge amounts of wasted writes.
Tiered timeline: hot (recent few hundred) in Redis, warm/cold falls back to Cassandra/disk for on-demand pull, controlling memory cost.
Multi-region: deploy timeline caches near the user's home region; cross-region fanout on post via async replication (echoes Day 5).
Decouple ranking from retrieval: split "fill candidates" and "ranking" into separate services so the ranking model can iterate and canary independently (Day 13 funnel thinking).
Bottleneck spotting: monitor fanout latency p99, single-key timeline size, hydrate re-fetch QPS, celeb-merge time — celeb merge is often the root of read-path tail latency.

Common Pitfalls + Interview Questions

1. Agonizing over push vs pull from the start. The right answer is almost always hybrid. The interviewer wants the threshold thinking ("split by follower count/activity") and how to merge two sources at read time — not a binary choice.

2. Stuffing bodies into every timeline. Memory blowup + delete/edit needs full re-fanout. Timelines store only ids, hydrate at read.

3. Forgetting timelines must be bounded. Without truncating old entries, a big account's followers' single keys bloat unboundedly, and a Redis big-key drags down the whole shard.

4. Ranking at write time. Fixing order at push means new posts and model updates can't reflect live; re-ranking belongs on the read path.

5. Ignoring visibility filtering. If blocked/deleted/private posts aren't filtered at read, deleted content gets served — you must filter after hydrate.

Likely follow-ups: ① A celebrity with tens of millions of followers posts — how do you avoid blowing up the system? ② How do you set the push/pull threshold, on what metric? ③ When I newly follow someone, how do their historical posts enter my timeline (backfill)? ④ How do you budget the end-to-end post-to-visible latency? ⑤ Chronological vs ranked feed — what does each sacrifice, and which product should pick which?

Deep Resources

Raffi Krikorian, Timelines at Scale (QCon/InfoQ) + High Scalability's breakdown of Twitter's timeline architecture: firsthand material on fanout, Redis materialization, celebrity hybrid.
Meta engineering blog, Serving Facebook Multifeed (2015): the pull-style aggregator-leaf architecture and read-time ranking in practice.
LinkedIn engineering blog, FollowFeed: LinkedIn's Feed Made Faster and Smarter: the trade-offs of pull/read-time merge + complex ranking.
Designing Data-Intensive Applications, Ch. 1 (Kleppmann): uses Twitter's timeline write-fanout vs read-fanout as the opening "describing load" case study.

Deep Thinking

1. Push's write amplification grows linearly with follower count, pull's read amplification grows linearly with followee count. Can you write an inequality to estimate the "push or pull" crossover?

Modeling: for a single account, push is worthwhile roughly when "the number of times it's read ≫ the number of fanout writes its posts trigger" — the crossover is essentially the marginal benefit of a fanout write: when one fanout write saves less expected read cost than the write itself costs, switch to pull.

Intuition: a normal person posts rarely and is refreshed repeatedly → reads ≫ writes → push (write once, save countless reads). A celebrity's follower count both raises write cost and, because their posts get buried by ranking with low per-person exposure, lowers write benefit — squeezed from both ends, they land in the pull region. This is the theoretical basis for the hybrid threshold.
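
One way to turn the modeling statement above into an inequality, as a rough sketch under assumed symbols (none of them come from the cited sources): $F$ is the author's follower count, $p$ the author's posts per day, $r$ the refreshes per active follower per day, $\alpha$ the fraction of followers who are active, $c_w$ the cost of one timeline write, $c_r$ the cost of pulling one author's recent posts at read time, $t_w$ the latency of one write, $W$ the number of parallel fanout workers, and $D$ the delivery deadline.

```latex
% Average-cost view: push is worthwhile when daily fanout cost is below daily pull cost.
\[
\underbrace{p \, F \, c_w}_{\text{push cost per day}}
\;<\;
\underbrace{\alpha \, F \, r \, c_r}_{\text{pull cost per day}}
\quad\Longleftrightarrow\quad
\frac{p}{r} \;<\; \alpha \, \frac{c_r}{c_w}
\]

% Burst view: one post's fanout must finish within the delivery deadline.
\[
\frac{F \, t_w}{W} \;\le\; D
\quad\Longleftrightarrow\quad
F \;\le\; \frac{W \, D}{t_w}
\]
```

In this simplified model $F$ cancels out of the average-cost inequality, so follower count bites mainly through the burst constraint: with $t_w = 100\,\mu\text{s}$, $W = 1000$ and $D = 5\,\text{s}$ the bound is $F \le 5 \times 10^7$, and only if the whole worker pool serves that one post. If $\alpha$ is lower for a celebrity's audience, as the intuition above suggests, the first inequality tips toward pull as well.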

2. User A just followed big-V B (5M followers). Should B's historical posts appear in A's timeline immediately? How to backfill without stepping on landmines?

Crux: B is a celebrity served by pull and isn't pre-materialized — so "at follow time" you actually don't need to write historical posts into A's timeline; A's next refresh will auto-merge B's latest. This is a hidden benefit of hybrid: following a celebrity needs no backfill.

If B is a normal account (push): B's historical posts aren't in A's pre-materialized timeline. Options: pull B's recent posts and merge at read time on first refresh (simple, common), or async zadd them into A's timeline in the background. Pitfall: don't backfill inside the synchronous "follow" request — if A follows a batch at once (contacts import), synchronous backfill amplifies into a mini fanout storm; it must be async, rate-limited, and respect the bounded truncation.
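
The backfill itself can be a small bounded merge. A minimal sketch, assuming timelines are held as newest-first `(ts, tweet_id)` lists; the function name and the `max_backfill` limit are hypothetical, and the async queue and rate limiter that should invoke it are deliberately left out.

```python
def backfill_on_follow(timeline, author_recent, cap=800, max_backfill=50):
    """Merge a newly followed author's recent posts into one follower's timeline.

    timeline and author_recent are lists of (ts, tweet_id), newest first.
    Returns a new newest-first list with duplicates dropped, trimmed to cap.
    """
    combined = {tid: ts for ts, tid in timeline}
    for ts, tid in author_recent[:max_backfill]:   # bound the work per follow
        combined.setdefault(tid, ts)               # never duplicate an existing entry
    merged = sorted(((ts, tid) for tid, ts in combined.items()), reverse=True)
    return merged[:cap]                            # respect the bounded timeline
```

Capping both the number of backfilled posts and the final length keeps a bulk follow (such as a contacts import) from turning into many large writes.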

3. Timelines store ids in a Redis sorted set. If a Redis shard goes down, that batch of users' timelines is gone. Is this data loss?

Key insight: the home timeline is derived data, not the source of truth — the real posts are in Tweet Store. Losing the timeline cache isn't "data loss", it's a "materialized view invalidation" that can be rebuilt.

Recovery: ① short-term degrade to pull — that shard's users refresh by live-merging their followees' latest, slow but usable, buying time for rebuild; ② background re-fanout rebuild, at the cost of a degraded read path + a write storm (rate-limit it). This also explains why timelines must be bounded — rebuild only needs to backfill the recent few hundred, not full history. Counter-lesson: if you'd made the timeline the only store (no Tweet Store), this would be true data loss — a derived layer must never be authoritative.

4. Switching from chronological to ML ranking, DAU engagement rose, but "why didn't I see friend X's post" complaints surged. How does this second-order effect arise? How to mitigate?

How it arises: under chronological, "follow = guaranteed delivery", users have a stable mental model. After switching to ranking, the model filters/downranks by predicted engagement, and low-interaction friends' posts get buried; users don't know ranking exists and just feel "I followed them but didn't see it". This is the inherent cost of a ranked feed: trading individual predictability and trust for aggregate engagement uplift.

Mitigation: ① give strong-tie/explicit subscriptions a guaranteed exposure slot; ② offer a "Latest" chronological view as a switchable tab (Twitter/Instagram both did this); ③ add diversity and coverage constraints in ranking so top-interaction accounts don't dominate; ④ clearly label "For You vs Latest". Fundamentally it's admitting offline engagement metrics ≠ user trust, needing ecosystem-side metrics as a backstop (echoes Day 13's echo-chamber discussion).

5. Estimate: 100M users, 800-entry bounded timelines each, 8 bytes per tweet_id. How much memory just for the timeline cache? What does this tell you architecturally?

Naive estimate: 1e8 users × 800 entries × 8 bytes ≈ 640 GB of raw id data alone. But a Redis sorted set element also stores a score (8-byte double) plus skiplist/dict pointer overhead, so realistically 50 to 100 bytes per element, which brings the total to roughly 4 to 8 TB.

Architectural implications: ① one machine can't hold it → the timeline cache must be sharded (hash by uid, echoes Day 4); ② this is just ids; with bodies it would be 800 × a few KB × 1e8 = hundreds of TB, approaching PB-scale, which is the hard constraint behind "store ids + hydrate"; ③ memory is too expensive → activity-aware materialization and tiering (hot in memory, cold to Cassandra) are necessities not optimizations; ④ it also explains why the bounded 800 matters: it directly sets the cache tier's total memory budget; double the cap, double the cost.
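
The estimate is easy to reproduce and to re-run with different caps. In this sketch the 50 and 100 bytes-per-element figures are the overhead range quoted above, and the 2 KB body size is an assumed value standing in for "a few KB".

```python
GB, TB = 10**9, 10**12


def timeline_cache_bytes(users: int, entries_per_user: int, bytes_per_entry: int) -> int:
    """Total bytes for one bounded timeline per user."""
    return users * entries_per_user * bytes_per_entry


raw_ids = timeline_cache_bytes(100_000_000, 800, 8)       # ids only: 640 GB
low = timeline_cache_bytes(100_000_000, 800, 50)          # with overhead, low end: 4 TB
high = timeline_cache_bytes(100_000_000, 800, 100)        # with overhead, high end: 8 TB
bodies = timeline_cache_bytes(100_000_000, 800, 2_000)    # assumed 2 KB bodies: 160 TB
```

At the assumed 2 KB per body the redundant-body design already needs 160 TB; larger bodies push it toward the PB range. Doubling `entries_per_user` doubles every figure.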
