Phase 3 — Advanced Data Structures

Target level: Medium → Hard Expected duration: 3 weeks (12-week track) / 3 weeks (6-month track) / 4 weeks (12-month track) Weekly cadence: ~8 advanced structures per week + 30–60 problems applying them under the framework

Why Advanced Data Structures Unlock Hards

Phase 2 gave you 28 patterns that solve the vast majority of Mediums. The patterns work because each one carries an O(N) or O(N log N) algorithm in its template — you recognize the signal, instantiate the template, and the runtime falls out for free.

Hards are different. The signal still fires — you still recognize “this is sliding window with a tricky max”, “this is DP with a state transition”, “this is shortest path with a constraint” — but the vanilla template’s complexity is one factor too high. A sliding-window max over a stream of N updates becomes O(N²) with a sorted list. A DP with N=20 and “set of visited” in the state explodes to O(2^N · N²) without bitmask compression. A range-sum problem with both updates and queries blows past prefix sums. A string match against a pattern of length M in a text of length N is O(N·M) with naive comparison; that’s 10^10 ops at N=10^5, M=10^5.

The advanced data structures in this phase are the augmented engines that bring Hard problems back into reach. Each one is a 1–2 log-factor improvement over a naive structure. They are not “tricks”. They are well-defined, well-proven structures with known invariants, well-understood failure modes, and known operating ranges. The skill is not to invent them — it is to recognize when the vanilla template is one log factor short, identify which augmented structure plugs the hole, and instantiate it correctly under interview pressure.

There are roughly three families:

Range query / range update structures — segment tree, Fenwick tree, sparse table, sqrt decomposition. These turn O(N) per range query into O(log N) or O(√N), and (with augmentation) handle range updates the same way. They show up whenever the problem has a sequence and you need both updates and aggregates over arbitrary subranges in the same workload.
String-matching / hashing / suffix structures — KMP, Z, Manacher, rolling hash, suffix array, suffix automaton, Aho-Corasick, tries. These bring per-character work down from O(M) (full pattern recompare) to O(1) amortized, enabling O(N+M) or O(N log N) algorithms over strings. They show up whenever the problem mentions “substring”, “match”, “occurrence”, “palindrome”, or “common”.
State-compression and amortization — bitmask DP, meet-in-the-middle, DSU with α(N), bit manipulation idioms, Bloom/skip/LRU-LFU. These exploit problem-specific structural facts (small N, splittable input, near-constant amortized work, probabilistic acceptance) to clear constraints that naive DP/search cannot.

You will not memorize 24 implementations cold. You will understand the invariants well enough to derive each implementation in 5–15 minutes under pressure, and instantly recognize which one is needed from the problem signal.

After this phase you can solve unmistakably-Hard LeetCode problems on first attempt: range queries with updates, palindromic counts in linear time, multi-pattern matching, exact-cover by bitmask DP, subset-sum at N=40 by meet-in-the-middle, equation-solving by weighted DSU, dynamic LRU caches. You also become visibly stronger in mock interviews because you no longer flinch at “what if the input is updated?”, “what if N is 40?”, or “what if there are 10^5 patterns to match?”.

What You Will Be Able To Do After This Phase

For any range-query Hard, identify within 60 seconds whether vanilla prefix sums suffice, or whether Fenwick / segment tree / sparse table / sqrt decomposition is required, and why.
Implement a segment tree (point update, range query) from memory in <12 minutes.
Add lazy propagation when the problem demands range updates, and articulate the lazy-tag-push invariant.
Recognize a string-match Hard and pick the right tool: KMP (single pattern), Z (border + offsets), Manacher (palindromes), rolling hash (probabilistic equality, multi-substring), Aho-Corasick (multi-pattern), suffix array/automaton (overview-level for “longest common substring”, “distinct substrings”).
Build a trie augmented with counts / deletion / prefix-sum cache for word-search and autocomplete-class problems.
Recognize bitmask DP from N ≤ 20 constraint, formulate the state, and implement the transition without bugs.
Recognize meet-in-the-middle from N ≤ 40 (split into 20+20) and code the two-half merge.
Implement DSU with path compression + union by rank, and prove the α(N) amortized bound.
Use bit manipulation idioms (popcount, lowbit, isolate trailing one, parity tricks) without thinking.

How To Read This Phase

Read the inline reference below in two passes. Pass 1: linear, end to end, to assemble a mental map of which structure plugs which hole. Pass 2: as you work through the labs, refer back to the structure entries to clarify invariants and pitfalls. Each entry has a fixed shape:

When to use — the problem signal that should fire this structure within 2 minutes of reading.
Complexity — build, query, update, space.
Implementation pitfalls — the bugs that consume the most interview minutes.
Classic problems — 3–6 representative problems where the structure is the intended solution.

Where labs cover the structure hands-on, the entry references the lab. Where the structure is overview-only (rare in interviews but expected of strong candidates), the entry says so explicitly.

Problem Signal	Structure
Range query + point update, sum/min/max	Segment tree (#1) or Fenwick (#3, if invertible)
Range query + range update	Lazy segment tree (#2)
Static range min/max with O(1) queries	Sparse table (#5)
Range distinct count, hard-to-segment-tree aggregate	Sqrt decomposition / Mo’s (#6)
Single pattern in text	KMP (#10) or Z (#11)
Longest palindrome / count palindromes	Manacher (#12)
Many-substring equality / longest duplicate	Rolling hash (#13)
Multi-pattern dictionary in text	Aho–Corasick (#17)
Prefix queries, autocomplete, word-on-grid	Trie variants (#16)
Probabilistic membership	Bloom filter (#18)
Cache with O(1) get/put	LRU / LFU (#20)
Connectivity / equation graphs	DSU (#21)
N ≤ 20, subset / assignment optimum	Bitmask DP (#23)
N ≤ 40, subset existence / closest sum	Meet-in-the-middle (#24)
Bit-level state mechanics	Bit idioms (#22)

#	Lab	Structure	Canonical Problem
01	Segment tree (range query)	Point update + range sum/min/max	LC 307
02	Segment tree with lazy propagation	Range update + range query	Range-add + range-sum
03	Fenwick tree (BIT)	Coord-compressed Fenwick	LC 315
04	Sparse table for RMQ	Static O(1) RMQ	Range-min array
05	KMP string matching	Failure function + match	LC 28 / 459
06	Rolling hash	Double hashing	LC 187 / 1044
07	Trie applications	Trie with `is_end` + DFS-on-trie	LC 208 / 212
08	Bitmask DP	Permutation DP over subsets	LC 847
09	Meet-in-the-middle	Split-sort-merge	LC 1755
10	Z-function & Manacher	Linear-time prefix-match + palindrome radii	LC 5 / 214
11	LCA via binary lifting	Up-table ancestor jumps + path aggregates	LC 1483 / 236
12	Sqrt decomposition & Mo’s algorithm	Block decomposition + offline query ordering	SPOJ DQUERY
13	Treap & ordered set	Split/merge BST with random priorities	LC 315

LeetCode Interview — Extreme Coding