Knapsack Problem Explained from First Principles
The knapsack problem explained through the binary decision behind the recurrence. Table walkthrough, 1D optimization, and variants.
- Why the knapsack recurrence follows from a single binary decision
- How to build and trace the 2D DP table step by step
- When to optimize from 2D to 1D and why reverse traversal matters
- Which problems are knapsack variants in disguise
You've probably memorized the knapsack recurrence. You can write dp[i][w] = max(dp[i-1][w], dp[i-1][w-weight[i]] + value[i]) from memory. But ask yourself why that recurrence is correct, why those are the only two terms, why i-1 and not i, and the explanation falls apart. Most resources don't explain knapsack from the decision that produces the recurrence; they teach it as a formula to apply.
The knapsack problem explained in plain terms
The 0/1 knapsack problem asks you to select items with given weights and values to maximize total value without exceeding a weight capacity. Each item can be included once or not at all. Every item presents one binary decision: you take it or you leave it.
That binary constraint makes brute force exponential. With n items, you've got 2^n possible subsets. For 20 items, that's over a million combinations, and for 40 it's over a trillion. No amount of pruning makes exhaustive search practical at scale.
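To make that search space concrete, here's a brute-force sketch (function name illustrative) that enumerates every subset of items:

```python
from itertools import combinations

def knapsack_brute_force(weights, values, capacity):
    """Try every subset of item indices: 2^n candidates total."""
    n = len(weights)
    best = 0
    for r in range(n + 1):
        for subset in combinations(range(n), r):
            weight = sum(weights[i] for i in subset)
            if weight <= capacity:  # only feasible subsets count
                best = max(best, sum(values[i] for i in subset))
    return best

print(knapsack_brute_force([2, 3, 4, 5], [3, 4, 5, 8], 5))  # 8
```

This works fine for 4 items (16 subsets), but the loop count doubles with every item added, which is exactly the blowup described above.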
But knapsack has optimal substructure. The best solution for n items at capacity w depends only on the best solutions for n-1 items at specific smaller capacities. That's the dependency that dynamic programming exploits.
The DP solution takes O(n * W) time, where W is the capacity. The term for this is pseudo-polynomial, because W is a numeric value in the input, not the size of the input. For interview purposes, the DP approach is the expected solution.

The knapsack problem's recurrence explained
Start with the last item, item n. You have exactly two choices:
If you don't include item n, the best value you can get is whatever you could get from items 1 through n-1 with the full capacity still available. If you include item n, you gain value[n], but you've consumed weight[n] of your capacity. The best value for the remaining items (1 through n-1) is now constrained to capacity w - weight[n].
The optimal answer is whichever choice gives more value. That's the entire recurrence:
```
dp[i][w] = max(dp[i-1][w], dp[i-1][w - weight[i]] + value[i])
```

There's also a guard condition. If weight[i] > w, item i can't fit, so the only option is to exclude it and dp[i][w] = dp[i-1][w].
Why i-1 and not i? Because this is 0/1 knapsack. Each item is used at most once. When you're deciding about item i, the remaining subproblem only involves items before i. If the recurrence used dp[i][w - weight[i]], you'd be allowing item i to be included multiple times. That's a different problem entirely (unbounded knapsack).
The base cases follow from the definition. With zero items, the best value at any capacity is 0, so dp[0][w] = 0 for all w. With zero capacity, you can't include anything, so dp[i][0] = 0 for all i.
“The knapsack recurrence isn't memorized. It's derived from the only two things you can do with each item.”
Building the knapsack table frame by frame
Recurrences make more sense when you trace them on real numbers. Take these 4 items.
| Item | Weight | Value |
|------|--------|-------|
| A    | 2      | 3     |
| B    | 3      | 4     |
| C    | 4      | 5     |
| D    | 5      | 8     |

Capacity = 5. Build a table where rows are items (plus a row 0 for the empty set) and columns are capacities 0 through 5.
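A minimal sketch of the full 2D build, using items A through D from the table above (function name is illustrative):

```python
def knapsack_2d(weights, values, capacity):
    """0/1 knapsack with the full 2D table. Row 0 is the empty set;
    column w is the best value achievable at capacity w."""
    n = len(weights)
    dp = [[0] * (capacity + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        for w in range(capacity + 1):
            dp[i][w] = dp[i - 1][w]            # exclude item i
            if weights[i - 1] <= w:            # guard: item i must fit
                dp[i][w] = max(dp[i][w],
                               dp[i - 1][w - weights[i - 1]] + values[i - 1])
    return dp

# Items A-D: weights and values from the table, capacity 5
table = knapsack_2d([2, 3, 4, 5], [3, 4, 5, 8], 5)
print(table[4][5])  # 8: item D alone beats A+B
```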
You can trace through a few cells to see the decision playing out.
At dp[1][2] (item A, capacity 2), A weighs 2 and fits. Excluding gives dp[0][2] = 0, while including gives dp[0][0] + 3 = 3. The max is 3, so you take A. Moving to dp[2][5] (items A-B, capacity 5), B weighs 3 and also fits. Excluding gives dp[1][5] = 3 (just A), while including gives dp[1][2] + 4 = 3 + 4 = 7. The max is 7, so you take both A and B.
At dp[3][5] (items A-C, capacity 5), C weighs 4 and fits, but excluding gives dp[2][5] = 7 (A+B), while including gives dp[2][1] + 5 = 0 + 5 = 5. The exclude path wins, so you don't take C.
At dp[4][5] (items A-D, capacity 5), D weighs 5 and fits. Excluding gives dp[3][5] = 7 (A+B), while including gives dp[3][0] + 8 = 0 + 8 = 8. The include path wins, so you take D alone.
The answer is 8, taking only item D. Items A and B together weigh 5 and give value 7, but item D alone weighs 5 and gives value 8. Having more items doesn't guarantee more value, because both branches get compared at every cell and the recurrence picks the winner.
To recover which items were chosen, backtrack from dp[n][W]. At each cell dp[i][w], if dp[i][w] != dp[i-1][w], then item i was included. Move to dp[i-1][w - weight[i]] and repeat. If they're equal, item i was excluded. Move to dp[i-1][w].

From 2D to 1D: the space optimization
The recurrence only looks at row i-1 to fill row i. That means you don't need all n rows in memory at once. A single 1D array of size W+1 is enough; this is the classic space optimization for this problem.
There's a catch, though. If you fill the array left to right (increasing w), you'll overwrite values that later cells in the same row still need. When you compute dp[w], the value dp[w - weight[i]] might already reflect the current item, not the previous row. That accidentally creates the unbounded knapsack behavior, allowing items to be reused.
The fix: iterate capacity in reverse (from W down to weight[i]). That way, when you read dp[w - weight[i]], it still holds the value from the previous item's pass.
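A sketch of the 1D version with the reverse inner loop (same items A through D; function name is illustrative):

```python
def knapsack_1d(weights, values, capacity):
    """0/1 knapsack in O(capacity) space. The inner loop runs from
    capacity down to the item's weight, so dp[w - weight] still holds
    the previous item's value when it is read."""
    dp = [0] * (capacity + 1)
    for weight, value in zip(weights, values):
        for w in range(capacity, weight - 1, -1):  # reverse: 0/1 constraint
            dp[w] = max(dp[w], dp[w - weight] + value)
    return dp[capacity]

print(knapsack_1d([2, 3, 4, 5], [3, 4, 5, 8], 5))  # 8, same answer as the 2D table
```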
You get the same answer with O(W) space instead of O(n * W). The reverse iteration isn't a trick. It's what the 0/1 constraint requires, because each item can be used at most once. If you see a knapsack variant where items can be reused, iterate forward instead. The direction of the inner loop tells you the reuse policy.
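For contrast, here's a sketch of the unbounded variant: the only change is the inner loop direction (function name illustrative):

```python
def knapsack_unbounded(weights, values, capacity):
    """Unbounded knapsack: forward iteration means dp[w - weight] may
    already include the current item, so items can repeat."""
    dp = [0] * (capacity + 1)
    for weight, value in zip(weights, values):
        for w in range(weight, capacity + 1):  # forward: reuse allowed
            dp[w] = max(dp[w], dp[w - weight] + value)
    return dp[capacity]

# At capacity 4, item A (weight 2, value 3) can now be taken twice
print(knapsack_unbounded([2, 3, 4, 5], [3, 4, 5, 8], 4))  # 6, vs 5 for 0/1
```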
The knapsack problem family
Once the include or exclude decision is clear, you'll start recognizing it in problems that don't mention "knapsack" at all.
Subset Sum: Asks whether a set of integers contains a subset that sums to exactly a target. This is knapsack where weight equals value and you're checking if dp[n][target] == target. The decision stays the same, include this number or skip it.
Equal Partition: Asks whether you can split an array into two subsets with equal sum. You first check if total sum is even. If it is, this reduces to Subset Sum with target = total/2, using the same binary decision and the same table shape. Minimum Partitioning is a close relative that splits an array into two subsets to minimize the difference between their sums. You find the largest achievable sum that doesn't exceed total/2 (a Subset Sum variant), then the answer is total - 2 * that sum.
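Both variants above fall out of the same 1D shape. A sketch, with boolean states instead of values (function names are illustrative):

```python
def subset_sum(nums, target):
    """Can some subset of nums sum exactly to target? Same 0/1 decision,
    boolean table, same reverse inner loop."""
    dp = [False] * (target + 1)
    dp[0] = True                                # empty subset sums to 0
    for num in nums:
        for s in range(target, num - 1, -1):    # reverse: each num used once
            dp[s] = dp[s] or dp[s - num]
    return dp[target]

def can_partition(nums):
    """Equal Partition reduces to Subset Sum with target = total // 2."""
    total = sum(nums)
    return total % 2 == 0 and subset_sum(nums, total // 2)

print(subset_sum([3, 1, 7], 8))       # True: 1 + 7
print(can_partition([1, 5, 11, 5]))   # True: {1, 5, 5} and {11}
```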
All of these share the same core decision, at each element, include it or skip it. The table dimensions and the value function change, but the recurrence shape stays the same. That's why understanding why the recurrence works matters more than memorizing it. If you can reason about the decision, you can rebuild the recurrence for any variant without looking it up.
In unbounded knapsack, where items can be reused, the recurrence becomes dp[i][w] = max(dp[i-1][w], dp[i][w - weight[i]] + value[i]). Notice dp[i], not dp[i-1], in the include branch. And the 1D optimization iterates forward, not in reverse.

Recognizing knapsack in the wild
Knapsack is one of several DP pattern families. Recognizing when a new problem is a knapsack variant means training your identification layer, where you learn to read a problem statement and spot the triggers that point to this pattern. If a problem has binary include/exclude decisions over items with a resource constraint, you already know the recurrence shape.
For the full progression across DP pattern families (Coin Change, LCS, LIS, Edit Distance, Grid DP, and knapsack), see our complete guide to dynamic programming. If you're still working on how to identify which problems need DP at all, start there.
Codeintuition's Dynamic Programming course covers knapsack with this same derivation first approach. You build the recurrence before seeing the solution, trace the table before writing code, then apply it to variants with increasing difficulty. The free Arrays course uses the same teaching model, so you can test whether it fits your learning style before committing to the full platform at $79.99/year.
Six months ago, you stared at a knapsack problem and thought "I know there's a 2D table involved." You wrote the recurrence from memory, got the indices wrong, spent 20 minutes debugging. Now you derive it from the decision. At each item you include it or exclude it, and the only state you need is the item index and remaining capacity. The recurrence follows directly from those two questions, and understanding why the formula exists is what actually got you there.
Derive DP recurrences instead of memorizing them
Codeintuition's Dynamic Programming course teaches knapsack, LCS, coin change, and three other DP families from the decision that produces the recurrence. Trace the table before writing code. Start with the FREE Arrays course to see the derivation first model.
0/1 knapsack's include branch reads dp[i-1][w - weight] (previous row, item not available again), while unbounded knapsack reads dp[i][w - weight] (current row, item still available). This single index difference (i-1 vs i) is what separates the two variants at the implementation level, and it's why the 1D optimization iterates in reverse for 0/1 and forward for unbounded.

Adding a second resource constraint, say volume alongside weight, turns the state into dp[i][w][v], and the state space grows with each added constraint. The decision at each item doesn't change, just include or exclude. Most interview problems stick to one constraint to keep things practical; three-constraint variants are rare because O(n * W * V) time gets impractical fast.