Revenue Management

Applied Economics · Difficulty: █████ · Depth: 11 · Unlocks: 0

Capacity-constrained pricing for perishable inventory. Booking limits, bid-price controls, markdown optimization. Littlewood's rule and network revenue management.

Selling a fixed number of perishable units (seats, rooms, ad impressions) at the right time and to the right customer can double or triple revenue — revenue management gives you the rules and math to decide who to sell to, when, and at what price.

TL;DR:

Revenue management studies optimal allocation and pricing of capacity-constrained perishable inventory using booking limits, bid-price controls, and markdown policies; it combines dynamic programming, Bayesian decision rules, and deterministic approximations to produce practically effective controls such as Littlewood's rule and network bid prices.

What Is Revenue Management?

Revenue management (RM) is the study of how to maximize expected revenue from a perishable, capacity-constrained resource by controlling whether to accept requests (or which price to post) over time. Classic examples are airline seats, hotel rooms, spoilable advertising impressions, and limited edition product runs. The inventory is perishable because unsold units expire after the selling horizon (a flight departs; a night passes). Capacity is fixed and often small relative to demand variability, so correctly withholding capacity from low-paying customers in anticipation of high-paying customers can be valuable.

Formally, consider a finite time horizon $t = 0,1,\dots,T$ (e.g. booking periods before departure), capacity (inventory) $c_0 \in \mathbb{Z}_{\ge 0}$, and a sequence of stochastic demand arrivals for $K$ fare classes or products. Let $r_k$ be the fare (price) for class $k$. At each arrival (or decision epoch) the manager chooses an action (accept or reject a request, or set a price). The goal is to choose a policy $\pi$ to maximize expected total revenue:

$$\max_{\pi} \mathbb{E}^{\pi}\left[ \sum_{t=0}^T R_t \right]$$

where $R_t$ is realized revenue at epoch $t$, subject to the constraint that the total units sold never exceed initial capacity $c_0$.

In Dynamic Programming (a prerequisite), this becomes a stochastic dynamic program: define the value function $V_t(c)$ as the maximum expected revenue from time $t$ onward with $c$ units remaining. The Bellman recursion for a single arrival in period $t$ with an arriving class-$k$ request is:

$$V_t(c) = \mathbb{E}_{\text{arrival}} \left[ \max\{ V_{t+1}(c),\; r_k + V_{t+1}(c-1) \cdot 1_{c>0} \} \right],$$

with boundary $V_{T+1}(c)=0$. Example (numeric): suppose $T=1$ (one decision epoch), capacity $c_0=1$, and a single arrival that is class 1 with probability 1 and fare $r_1=150$. Then $V_1(1)=\max\{0,150\}=150$; the Bellman equation gives a trivial decision to accept. For multi-period, stochastic arrivals the Bellman recursion quickly grows in state and action branching.
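The recursion above can be run directly for a toy single-leg instance. A minimal backward-induction sketch; the arrival probabilities, fares, and horizon below are illustrative assumptions (at most one request per period), not numbers from the text:

```python
# Backward induction for a single-leg accept/reject DP (illustrative numbers).
T = 3                      # decision epochs t = 0..T-1
fares = [100, 200]         # class fares r_k
p_arrive = [0.6, 0.3]      # P(class-k request arrives in a period)
C = 2                      # initial capacity c_0

# V[t][c] = max expected revenue from period t onward with c seats left
V = [[0.0] * (C + 1) for _ in range(T + 1)]
for t in range(T - 1, -1, -1):
    for c in range(C + 1):
        v = (1.0 - sum(p_arrive)) * V[t + 1][c]           # no arrival this period
        for r, p in zip(fares, p_arrive):
            reject = V[t + 1][c]
            accept = r + V[t + 1][c - 1] if c > 0 else float("-inf")
            v += p * max(reject, accept)                   # accept iff fare beats opportunity cost
        V[t][c] = v
print(round(V[0][C], 2))   # expected revenue under the optimal policy
```

Note how the accept branch charges the opportunity cost implicitly: accepting is chosen only when $r_k + V_{t+1}(c-1) \ge V_{t+1}(c)$.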

Key conceptual tools that RM uses (and that you know from prerequisites):

  • From Price Discrimination: RM operationalizes third-degree segmentation as fare classes and can be seen as dynamic price discrimination across time and willingness-to-pay segments. In Price Discrimination, we learned to separate customers by elasticity; RM enforces this separation dynamically via controls like booking limits and prices.
  • From Dynamic Programming: RM uses backward induction and value functions; the marginal value of one additional unit of capacity is $V_t(c)-V_t(c-1)$ and serves as the opportunity cost (shadow price) for selling now.
  • From Bayesian Decision Theory: accepting/rejecting a request is a one-step decision where you compare immediate reward to the posterior expected opportunity cost of capacity. The decision rule that minimizes posterior expected loss is: accept iff fare $\ge$ expected opportunity cost.

Intuition: the manager should accept a low fare only if the expected future revenue from holding the unit (the option value) is less than the low fare. Littlewood's rule (derived shortly) gives a crisp form of this intuition for two classes; bid-price controls generalize it to many resources and products by pricing capacity via dual variables.

Why perishable + capacity? Because perishable inventory destroys inter-temporal substitution — unsold inventory is forever lost — so the manager faces a stochastic stopping/acceptance problem. In contrast to static price discrimination (one-shot), RM trades off present vs. future sales under capacity scarcity, leveraging Dynamic Programming and posterior expectations (Bayesian Decision Theory) to produce implementable heuristics.

Core Mechanic 1 — Booking Limits and Littlewood's Rule

Booking limits and Littlewood's rule are the foundation of two-class revenue management. Consider two fare classes: high ($H$) with price $r_H$, and low ($L$) with price $r_L$, where $r_H > r_L$. Future high-fare demand is stochastic; we want a simple rule to decide whether to accept an arriving low-fare request given remaining capacity $c$ and the remaining booking horizon.

Littlewood's rule (classic statement): Maintain a protection level $y$ (number of seats reserved for high-fare passengers). Accept a low-fare booking only if the remaining capacity $c$ strictly exceeds $y$ (i.e., accept if $c > y$). The protection level $y$ is chosen to solve

$$P(D_H > y) = \frac{r_L}{r_H}\,, \tag{Littlewood target}$$

where $D_H$ is the random number of arriving high-fare passengers in the remainder of the selling horizon. Equivalently, set $y$ as the smallest integer such that $P(D_H \le y) \ge 1 - r_L/r_H$.

Derivation (short, rigorous sketch): Consider the moment a single low-fare request arrives, with remaining capacity $c$ and future high-fare demand random variable $D_H$ (not counting the current low request). Rejecting the low fare leaves capacity $c$ to capture future high fares; accepting reduces capacity to $c-1$. The expected incremental value of keeping one seat for high fares equals the high-fare revenue times the probability that at least one high-fare customer will arrive to use that seat: $r_H \cdot P(D_H > c-1)$. If we accept the low fare, we get $r_L$ now; if we reject, we expect to earn $r_H$ with probability $P(D_H > c-1)$. Accept iff

$$r_L \ge r_H \cdot P(D_H > c-1).$$

Rearrange and define $y = c-1$; then accept iff $P(D_H > y) \le r_L/r_H$. Choosing $y$ to satisfy the equality gives the canonical protection level. This uses Bayesian decision thinking: the posterior expectation of future reward (here $r_H \cdot P(\text{sale})$) is compared to the immediate reward $r_L$, and the action minimizing expected loss (regret) is chosen.

Concrete numeric example: Suppose $r_H=200$, $r_L=100$, so $r_L/r_H=0.5$. Suppose future high-fare demand is Poisson($\lambda=1.2$). Then compute tail probabilities:

  • $P(D_H>0) = 1 - e^{-1.2} = 1 - 0.3012 = 0.6988$ (approx)
  • $P(D_H>1) = 1 - (P(0)+P(1)) = 1 - (e^{-1.2} + 1.2 e^{-1.2}) = 1 - (0.3012 + 0.3614) = 0.3374$
  • $P(D_H>2) = 1 - (P(0)+P(1)+P(2)) = 1 - (0.3012 + 0.3614 + 0.2169) = 0.1205$

We need the smallest $y$ with $P(D_H>y) \le 0.5$. From above, $P(D_H>0)=0.6988>0.5$ and $P(D_H>1)=0.3374<0.5$. So the solution is $y=1$. Interpretation: protect $y=1$ seat for the high fare; accept a low-fare booking only when $c>1$ (i.e., when at least 2 seats remain). If $c=2$, accept low; if $c=1$, reject low.
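This tail-probability search is easy to script. A sketch for Poisson high-fare demand (the function name is ours):

```python
import math

def littlewood_protection(r_high, r_low, lam):
    """Smallest integer y with P(D_H > y) <= r_low / r_high, for Poisson(lam) demand."""
    ratio = r_low / r_high
    cdf, y = 0.0, 0
    while True:
        cdf += math.exp(-lam) * lam ** y / math.factorial(y)   # add P(D_H = y)
        if 1.0 - cdf <= ratio:      # tail P(D_H > y) has dropped to the price ratio
            return y
        y += 1

print(littlewood_protection(200, 100, 1.2))  # prints 1, matching the example
```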

Mechanistic intuition: Littlewood uses only the distribution of future high demand and the two prices; it doesn't require computing the full DP. It is exact for two classes under the standard assumptions that low-fare demand books before high-fare demand and that demand is independent of the controls. It is also expressible via a marginal-value comparison: accept iff $r_L \ge$ the expected increment from saving the seat, $r_H P(D_H>c-1)$.

Multiple classes and nested protection levels: If there are more than two fare classes ordered $r_1 < r_2 < \dots < r_m$, we can sequentially apply Littlewood's rule to compute a nested set of protection levels $y_{m-1} \le y_{m-2} \le \cdots \le y_1$. For example, to decide whether to accept class $k$ we compute the protection level against all higher classes combined (treating their aggregate demand distribution) and accept only if $c > y_k$.

Limitations and when Littlewood is exact: Littlewood is exact under independent, stationary arrivals and for the two-class case. For more general arrival processes (choice-based demand, nonstationary Poisson), it is an approximation; it is often a very good one in practice when demand forecasts are accurate.

Connection to Dynamic Programming and Bayesian Decision Theory: The inequality $r_L \ge r_H P(D_H>c-1)$ is a one-step Bayes-risk-minimizing rule: the posterior expected value of rejecting equals $r_H P(D_H>c-1)$; accept if the immediate reward exceeds that posterior expectation. From Dynamic Programming, this rule is equivalent to comparing the fare $r_L$ with the marginal value of capacity $V_t(c)-V_t(c-1)$ when the value function has particular structure (monotone marginal values); this is why Littlewood emerges as a closed-form policy.

Core Mechanic 2 — Bid-Price Controls, Deterministic LP, and Networks

Real problems rarely have a single resource and two fares. Airline problems involve many itineraries that consume multiple flight legs; hotels combine room-types and dates; advertising sales allocate impression bundles across campaigns. In multi-resource, multi-product settings we use bid-price controls derived from a deterministic linear program (DLP) approximation, whose dual variables act as shadow prices (bid prices) for scarce resources.

Network model: Let there be $I$ resources (e.g., flight legs) with capacities $C_i$ for $i=1,\dots,I$. There are $J$ products (offers), each with price $r_j$ and a deterministic or expected demand $\bar{D}_j$ over the horizon (the DLP uses expected demand). Product $j$ consumes $a_{ij}$ units of resource $i$ (often 0 or 1). The deterministic LP (DLP) allocates expected sales $x_j$ to maximize expected revenue:

$$\max_{x\ge 0} \sum_{j=1}^J r_j x_j \quad\text{s.t.}\quad \sum_{j=1}^J a_{ij} x_j \le C_i\ \forall i, \qquad x_j \le \bar{D}_j\ \forall j. \tag{DLP}$$

Numeric example: Two legs ($I=2$) with capacities $C=(100,100)$ seats; three products ($J=3$): product 1 is nonstop on leg 1 only ($a_{1,1}=1, a_{2,1}=0$) with price $r_1=200$ and expected demand $\bar{D}_1=80$; product 2 is nonstop on leg 2 only ($a_{1,2}=0, a_{2,2}=1$) with $r_2=150$, $\bar{D}_2=90$; product 3 is a connecting itinerary using both legs ($a_{1,3}=1, a_{2,3}=1$) with $r_3=300$, $\bar{D}_3=120$. The DLP is:

maximize $200 x_1 + 150 x_2 + 300 x_3$ s.t.

  • leg 1: $x_1 + x_3 \le 100$
  • leg 2: $x_2 + x_3 \le 100$
  • demand bounds: $0 \le x_1 \le 80$, $0 \le x_2 \le 90$, $0 \le x_3 \le 120$.

Solving by inspection: a connecting sale pays $r_3=300$ but consumes a seat on both legs, while the two nonstops together pay $r_1+r_2 = 200+150 = 350 > 300$, so where both legs are scarce we prefer a pair of nonstop sales to one connecting sale; connecting demand is still worth serving with residual capacity. Selling all nonstop demand ($x_1=80$, $x_2=90$) leaves slack 20 on leg 1 and 10 on leg 2, so $x_3=10$ is feasible, but swapping one unit of product 2 (fare 150) for one more unit of product 3 (fare 300) gains 150 per swap and stays feasible while leg 1 has slack. The optimal corner solution is therefore $x_1=80$, $x_2=80$, $x_3=20$, with revenue $200\cdot 80 + 150\cdot 80 + 300\cdot 20 = 34{,}000$. The dual variables (shadow prices) associated with leg capacities, call them $\pi_1,\pi_2$, together with the duals $\mu_j$ of the demand bounds, satisfy the dual constraints:

$$\pi_1 a_{1j} + \pi_2 a_{2j} + \mu_j \ge r_j \quad \text{for all } j, \qquad \pi_i \ge 0,\ \mu_j \ge 0.$$

Dual (informal): minimize $100\pi_1 + 100\pi_2 + \sum_j \bar{D}_j \mu_j$ subject to the covering constraints above; the dual values $\pi_i$ give the marginal value of an extra unit of capacity on leg $i$. At the primal optimum $x_1=80, x_2=80, x_3=20$, both legs are full, and complementary slackness pins the duals down: since $0 < x_2 < \bar{D}_2$ we need $\pi_2 = r_2 = 150$, and since $0 < x_3 < \bar{D}_3$ we need $\pi_1 + \pi_2 = r_3 = 300$, so $\pi_1 = 150$ as well; product 1's excess fare is absorbed by its demand-bound dual, $\mu_1 = 200 - 150 = 50$.

How to use duals as a control: At runtime, when a booking for product $j$ arrives, compute its resource cost under the duals: $\text{opportunity cost} = \sum_{i} \pi_i a_{ij}$. Accept the booking if

$$r_j \ge \sum_{i} \pi_i a_{ij}. \tag{Bid-price rule}$$

This is exactly the expected-revenue criterion: accept if the fare exceeds the expected value of the capacity consumed (the posterior expected opportunity cost). Numeric demo: suppose the LP dual yields $\pi_1=60$ and $\pi_2=40$. For product 3 (uses both legs), opportunity cost $= 60+40=100$; since $r_3 = 300 \ge 100$, accept. For product 1 (uses leg 1 only), opportunity cost $= 60$; since $r_1 = 200 \ge 60$, accept.
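The runtime rule is a one-line comparison. A sketch, reusing the illustrative duals $\pi=(60,40)$ from the demo above (the function name is ours):

```python
def accept(fare, consumption, bid_prices):
    """Bid-price control: accept iff the fare covers the duals of all consumed resources."""
    opportunity_cost = sum(pi * a for pi, a in zip(bid_prices, consumption))
    return fare >= opportunity_cost

pi = [60.0, 40.0]               # illustrative duals from the demo above
print(accept(300, [1, 1], pi))  # product 3: 300 >= 100 -> True
print(accept(200, [1, 0], pi))  # product 1: 200 >= 60  -> True
print(accept(50, [1, 1], pi))   # a cheap bundle on both legs: 50 < 100 -> False
```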

Relation to Littlewood: For a single resource ($I=1$) with two fares, the dual price $\pi$ reduces to $\pi = r_H P(D_H > y)$, interpreted as the marginal expected value of capacity, and Littlewood's inequality $r_L \ge r_H P(D_H > c-1)$ is the same as $r_L \ge \pi$. Thus the bid-price rule generalizes Littlewood to networks by replacing the scalar $r_H P(\cdot)$ with dual prices that aggregate expected marginal values across resources.

Why the DLP? The exact DP for networks suffers from the curse of dimensionality (the state space is the inventory vector in $\mathbb{Z}^I$). The DLP is a fluid (deterministic) approximation: it optimizes expected usage of capacity and ignores stochastic sequencing. Its dual gives economically interpretable shadow prices. Even though the DLP ignores stochasticity, the resulting bid-price controls are often very good in practice, especially when re-solved frequently and combined with stochastic protection updates (EMSR variations).

Refinements and theory:

  • EMSR (Expected Marginal Seat Revenue) is an operational heuristic that computes per-leg protection levels by approximating the distribution of high-fare demand per leg and applying Littlewood in an aggregated way.
  • Re-solve heuristics: periodically re-solve the DLP with updated remaining capacity and updated expected demands to get time-varying bid prices — this asymptotically tracks the optimal DP under mild conditions.
  • Value-function approximations and Approximate Dynamic Programming: approximate $V_t(c)$ by a parametric family (e.g., linear in capacity with coefficients equal to dual prices), and then use policy improvement or rollout to refine.

Connection to Bayesian Decision Theory: The dual price $\pi$ is an estimator of the shadow value of capacity. The decision "accept if $r_j \ge \sum_i \pi_i a_{ij}$" minimizes expected one-step regret when $\pi$ equals the posterior expected marginal value of capacity. Thus bid-price controls are just Bayes-optimal greedy rules under an approximation where the future value function is replaced by its linear estimate derived from the DLP.

Applications, Extensions, and Practical Connections

Revenue management is pervasive across perishable-inventory industries. I list key applications, extensions, and practical considerations, showing how the theory connects to implementation.

Airline and Transportation: Airlines were the originators of modern RM. Each flight has multiple legs (resources), itineraries (products) that consume subsets of legs, and many fare classes. Practical systems implement nested booking limits and bid-price or EMSR controls derived from DLPs that are frequently re-optimized. Real systems face cancellations, overbooking, no-shows, and fare class granularities with integer seat blocks.

Example numeric application: A low-cost carrier manages a single flight with capacity 150 and a forecast that 60 passengers will buy a refundable flexible ticket at $300 and 200 will buy a nonrefundable budget ticket at $100, with time-varying arrival rates over 30 days. Using Littlewood one can compute the protection level for flexible fares; using the DLP one can allocate expected sales across fare types and set bid prices; a dual price of, say, $\pi=180$ implies rejecting any request below $180 (so accept only flexible or large-group budget requests).

Hotels and Perishable Retail: Hotels have per-night, per-room perishability. Markdown optimization (dynamic pricing) is typically used near-date, where remaining inventory is large relative to remaining demand. A canonical markdown model: continuous price $p_t$, demand $D_t(p_t)=\alpha - \beta p_t$ per remaining period. The DP chooses $p_t$ to maximize expected revenue subject to inventory depletion. While closed-form solutions exist for special cases, in practice heuristic rules like "price so that expected sales per remaining period equal remaining inventory divided by remaining periods" are used, or numerical DP is employed. Concrete numeric example: inventory 10, periods 2, demand $D(p)=6-p$. One computes the optimal $p_2$ and $p_1$ by enumerating feasible sales and their values.
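The run-out heuristic quoted above can be written directly for the linear-demand model. A sketch; `heuristic_price` is our name, and the inputs come from the example's assumptions:

```python
def heuristic_price(alpha, beta, inventory, periods_left):
    """Run-out heuristic for linear demand D(p) = alpha - beta * p:
    price so that expected per-period sales equal inventory / periods_left."""
    target = inventory / periods_left            # sales pace needed to run out exactly
    return max(0.0, (alpha - target) / beta)     # invert demand; never price below zero

# Text's example: D(p) = 6 - p, 10 units, 2 periods -> target 5 per period, price 1
print(heuristic_price(6, 1, 10, 2))
```

When inventory is scarce relative to demand, the same formula automatically raises the price (e.g., 2 units over 2 periods gives a target of 1 sale per period and a price of 5).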

Advertising and Online Platforms: Perishable inventory is impressions for a given time window. Demand is stochastic and often modeled via auctions. RM in this setting involves real-time bid prices for budgets and pacing; dual prices come from a DLP that allocates impressions to campaigns subject to budget constraints.

Choice-based demand and robustification: Real customers choose among offered prices or fare classes; choice models (MNL, nested logit) replace independent arrival models and change optimal controls from booking limits to assortment/price decisions. Bayesian estimation is used to update demand model posteriors; decisions are then Bayes-optimal given current beliefs.

Machine Learning + RM: Modern systems use ML to forecast demand (conditional on context) and feed those forecasts into DLP or DP approximations. Combination is nontrivial because forecasts are inevitably biased; safe policies use robust optimization, constrained re-optimization, or end-to-end policies trained by reinforcement learning.

Limitations and practical adjustments:

  • Overfitting forecast noise: DLP with point forecasts can produce poor duals; stochastic LPs or safety buffers (protection margins) are used.
  • Strategic customers: Customers may delay purchases expecting markdowns; this requires modeling strategic behavior (game-theoretic extensions) or commitment to posted-pricing rules.
  • Cancellation and refunds: Overbooking policies need to be combined with RM.

Theoretical frontier: Network revenue management remains an active research area in applied probability and optimization. Results include asymptotic optimality of static bid-price policies under scaling (fluid and diffusion limits), performance bounds for EMSR heuristics, and improved policies via approximate dynamic programming with provable regret bounds.

Connection to your prerequisites and next steps: The DP backbone is essential to derive value-function-based policies (Dynamic Programming). Bayesian Decision Theory explains decision thresholds as posterior expected value comparisons (accept iff price >= expected shadow price). Price Discrimination provides the microeconomic intuition about segmenting customers by willingness-to-pay. Looking forward, mastering RM enables work in choice-based revenue management, reinforcement-learning-powered dynamic pricing, and robust optimization for uncertain demand.

Practical rule-of-thumb summary:

  • Two classes: use Littlewood — compute protection level via demand tails and price ratio.
  • Many products or network: solve (or re-solve) DLP to get duals — use bid-price accept/reject criterion.
  • Near-departure: prefer markdown/dynamic pricing with demand elasticity models.
  • When in doubt: compute the marginal value of one extra unit of capacity (via DP or dual) and accept if price >= marginal value.

This completes the core mechanics and their connection to practical revenue management.

Worked Examples (3)

Littlewood with Poisson High Demand

Two fare classes: high fare $r_H=200$, low fare $r_L=100$. Future high-fare demand $D_H \sim \text{Poisson}(\lambda=1.2)$. Remaining capacity $c$ can be 0, 1, or 2. Use Littlewood to compute the protection level $y$ and specify when to accept the low fare.

  1. Compute the price ratio: $r_L/r_H = 100/200 = 0.5$.

  2. For Poisson($\lambda=1.2$) compute tail probabilities: $P(D_H>0) = 1 - e^{-1.2} = 1 - 0.3012 = 0.6988$ (approx).

  3. Compute $P(D_H>1) = 1 - (P(0)+P(1)) = 1 - (e^{-1.2} + 1.2 e^{-1.2}) = 1 - (0.3012 + 0.3614) = 0.3374$ (approx).

  4. Find the smallest integer $y$ with $P(D_H>y) \le 0.5$. From the numbers, $P(D_H>0)=0.6988>0.5$ and $P(D_H>1)=0.3374<0.5$. So choose $y=1$.

  5. Interpretation: Protect $y=1$ seat for the high fare. Accept a low-fare request only if $c > y$, i.e., accept only when $c \ge 2$; if $c=1$, reject the low fare.

Insight: This example shows how Littlewood converts a demand distribution and price ratio into a simple protection level. It demonstrates how the rule uses tail probabilities rather than full DP computation and clarifies why the protection level is integer-valued.

Network Bid Prices from a Small DLP

Two resource legs with capacities $C_1=100$, $C_2=100$. Three products: product 1 uses leg 1 only with $r_1=200$ and $\bar{D}_1=80$; product 2 uses leg 2 only with $r_2=150$ and $\bar{D}_2=90$; product 3 uses both legs with $r_3=300$ and $\bar{D}_3=120$. Solve the DLP and compute dual prices $\pi_1,\pi_2$; then state the accept/reject rule for an arriving product 3 booking.

  1. Write the DLP: maximize $200x_1 + 150x_2 + 300x_3$ subject to $x_1 + x_3 \le 100$, $x_2 + x_3 \le 100$, and demand bounds $x_1 \le 80$, $x_2 \le 90$, $x_3 \le 120$.

  2. Look for capacity-binding constraints. Check whether selling all demand violates capacity: total leg-1 use if we sell everything is $80 + 120 = 200 > 100$, so capacity binds.

  3. Start from a candidate: sell the full nonstop demands $x_1=80$, $x_2=90$; the legs then limit $x_3$ to $100-80=20$ on leg 1 and $100-90=10$ on leg 2, so $x_3=10$ and revenue is $200\cdot 80 + 150\cdot 90 + 300\cdot 10 = 32{,}500$. This is feasible but not optimal: swapping one unit of product 2 (fare 150) for one more unit of product 3 (fare 300) gains 150 and stays feasible while leg 1 has slack. Repeating the swap until leg 1 fills gives $x_1=80$, $x_2=80$, $x_3=20$ with revenue $200\cdot 80 + 150\cdot 80 + 300\cdot 20 = 34{,}000$.

  4. To compute duals, note that both leg constraints are tight at this solution: leg-1 usage $= 80+20 = 100$ and leg-2 usage $= 80+20 = 100$. Complementary slackness pins them down: $x_2$ is strictly between 0 and its demand bound, so $\mu_2=0$ and its dual constraint is tight, giving $\pi_2 = r_2 = 150$; likewise $x_3$ is interior, giving $\pi_1 + \pi_2 = r_3 = 300$, hence $\pi_1 = 150$. Product 1, which sits at its demand bound, has $\mu_1 = r_1 - \pi_1 = 50$.

  5. The opportunity cost for product 3 is $\pi_1+\pi_2 = 150 + 150 = 300$. Since $r_3 = 300 \ge 300$, the bid-price rule (weak inequality) accepts product 3 bookings; the product is exactly marginal, consistent with the LP selling it up to the leg capacity.

  6. Thus the runtime rule: accept product 3 while both legs have remaining capacity ($300 \ge \pi_1+\pi_2$ holds with equality); acceptance stops once either leg is exhausted.

Insight: This example shows how the DLP yields intuitive dual prices that aggregate resource scarcity into per-product opportunity costs. Even with simplifications, duals guide accept/reject decisions without solving a high-dimensional DP.
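This DLP is small enough to verify by brute force: for each trial value of the connecting allocation $x_3$, the best nonstop allocations simply fill leftover leg capacity up to their demand bounds (their objective coefficients are positive). A sketch:

```python
# Brute-force scan of the two-leg DLP from this worked example.
C1, C2 = 100, 100
best_rev, best_x = -1, None
for x3 in range(0, min(C1, C2, 120) + 1):
    x1 = min(80, C1 - x3)          # product 1: demand bound vs leg-1 slack
    x2 = min(90, C2 - x3)          # product 2: demand bound vs leg-2 slack
    rev = 200 * x1 + 150 * x2 + 300 * x3
    if rev > best_rev:
        best_rev, best_x = rev, (x1, x2, x3)
print(best_rev, best_x)  # 34000 (80, 80, 20)
```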

Simple Markdown DP (small dynamic pricing)

Inventory of 5 identical units, horizon of 3 discrete selling periods ($t=1,2,3$). Demand in a period at price $p$ is deterministic: $d(p)=\max\{0, 6-p\}$ (i.e. linear inverse demand with integer prices). No replenishment and no salvage value. Compute optimal posted prices $p_1, p_2, p_3$ and the resulting revenue via dynamic programming.

  1. State definition: $V_t(s)$ = max revenue from periods $t,\dots,3$ with $s$ units remaining. Terminal $V_4(s)=0$ for all $s$.

  2. At period 3, choose price $p$ to maximize $p \cdot \min\{d(p), s\}$ since only one period remains. Enumerate candidate prices $p=0,1,2,\dots,6$ because $d(p)=6-p$ becomes 0 at $p \ge 6$.

  3. Construct a small table for $s=0,1,\dots,5$: for each $s$ compute the best $p$ and $V_3(s)$. For example, if $s=1$: $p=5$ gives $d(5)=1$, revenue $5\cdot 1 = 5$; $p=4$ gives $d(4)=2$ but capped at $s=1$, revenue 4; best is $p=5$, so $V_3(1)=5$. Similarly $V_3(2)=8$ (at $p=4$), and $V_3(3)=V_3(4)=V_3(5)=9$ (at $p=3$, since the single-period revenue $p(6-p)$ peaks at $p=3$).

  4. Now proceed to period 2: for each $s$ compute $V_2(s)=\max_p \{ p \cdot \min(d(p), s) + V_3(s - \min(d(p), s))\}$. For $s=2$: $p=4$ sells 2 now for revenue 8 and leaves 0, giving $8 + V_3(0) = 8$; but $p=5$ sells 1 for revenue 5 and leaves 1, giving $5 + V_3(1) = 10$, which is better. So $V_2(2)=10$. Working through all states gives $V_2(3)=13$, $V_2(4)=16$, $V_2(5)=17$.

  5. Finally compute $V_1(5)$ by enumerating prices: $p=4$ sells 2 (revenue 8) and leaves 3 with $V_2(3)=13$, total 21; $p=5$ sells 1 (revenue 5) and leaves 4 with $V_2(4)=16$, also 21; $p=3$ gives only $9 + V_2(2) = 19$. So $V_1(5)=21$, achieved e.g. by $p_1=4$ (sell 2), $p_2=4$ (sell 2), $p_3=5$ (sell 1): $8+8+5=21$.

  6. Thus the optimal dynamic pricing is to charge a moderate price early to ration inventory and raise price later when inventory is lower. The DP computed exact integer-optimal prices for this toy model.

Insight: This worked example demonstrates how dynamic programming produces nontrivial markdown schedules: price depends on remaining inventory and time. It also shows the computational burden even in a small discrete model — motivating approximations (e.g., continuous-time elasticities, myopic heuristics) in larger problems.
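The whole backward induction fits in a few lines of memoized recursion, a sketch of the enumeration described above:

```python
from functools import lru_cache

T = 3                     # selling periods t = 1..T
PRICES = range(0, 7)      # candidate integer prices; demand d(p) = 6 - p

@lru_cache(maxsize=None)
def V(t, s):
    """Max revenue from periods t..T with s units remaining (deterministic demand)."""
    if t > T or s == 0:
        return 0
    best = 0
    for p in PRICES:
        sold = min(6 - p, s)                       # sales capped by inventory
        best = max(best, p * sold + V(t + 1, s - sold))
    return best

print(V(1, 5))  # prints 21, e.g. via prices 4, 4, then 5
```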

Key Takeaways

  • Revenue management frames selling of perishable, capacity-constrained inventory as a stochastic dynamic optimization where the marginal value of capacity drives acceptance decisions.

  • Littlewood's rule gives an exact acceptance/protection rule for two-class problems: protect $y$ seats so that $P(D_H>y)=r_L/r_H$, and accept the low fare iff remaining capacity exceeds $y$.

  • In networks of resources, deterministic LP (DLP) approximations yield dual variables that serve as bid prices; accept a request iff its revenue exceeds the sum of dual prices of consumed resources.

  • Bid-price rules are Bayes-optimal one-step greedy policies when duals estimate the posterior marginal value of capacity; frequent re-solving and stochastic adjustments improve performance.

  • Markdown optimization is the pricing analogue: dynamically choose prices to ration inventory over remaining periods according to demand elasticity and remaining stock.

  • Practical systems combine forecasting (ML) with DLP/DP approximations, and must cope with cancellations, strategic customers, and forecast uncertainty using robust or stochastic extensions.

  • Mastering DP, Bayesian decision comparisons, and price discrimination intuition allows you to derive, justify, and refine RM heuristics used in production.

Common Mistakes

  • Treating DLP dual prices as exact shadow prices without accounting for stochasticity — DLP ignores variance and sequencing; duals are approximations and should be updated periodically or combined with stochastic buffers.

  • Misapplying Littlewood's rule by equating $P(D_H>y)$ to $r_H/r_L$ instead of $r_L/r_H$ (i.e., swapping numerator and denominator); such direction errors reverse protection decisions and cause revenue loss.

  • Assuming bid prices are static through the horizon — capacities and remaining demand change, so bid prices should be recomputed or adjusted over time for good performance.

  • Treating arrivals as independent of price in markdown models — if customers are strategic (time their buy), the naive DP with price-elastic demand may be biased; model strategic behavior explicitly.

Practice

easy

Easy: Two fare classes with $r_H=250$, $r_L=100$, future high demand $D_H \sim \text{Poisson}(\lambda=0.8)$. Compute the protection level $y$ via Littlewood's rule (choose the smallest integer with $P(D_H>y) \le r_L/r_H$) and state for which remaining capacities $c$ you accept low fares.

Hint: Compute $r_L/r_H = 100/250 = 0.4$. Evaluate $P(D_H>0), P(D_H>1), \ldots$ using Poisson formulas until the tail probability drops to $\le 0.4$.

Show solution

Compute the Poisson pmf: $P(0)=e^{-0.8}=0.4493$, $P(1)=0.8 e^{-0.8}=0.3595$, so $P(D_H>0)=1-0.4493=0.5507>0.4$ and $P(D_H>1)=1-(0.4493+0.3595)=0.1912<0.4$. So choose $y=1$. Accept the low fare iff remaining capacity $c>1$, i.e., accept when $c \ge 2$ and reject when $c=1$ or $c=0$.

medium

Medium: Consider a network with two legs, capacities $C_1=50$, $C_2=50$. There are two products: product A uses leg 1 only ($r_A=120$, expected demand $\bar{D}_A=40$) and product B uses both legs ($r_B=200$, $\bar{D}_B=60$). Formulate the DLP and compute the dual price(s). Use the dual(s) to decide whether to accept an arriving product B booking when both legs have full initial capacity.

Hint: Set variables $x_A, x_B$, constraints $x_A + x_B \le 50$ (leg 1), $x_B \le 50$ (leg 2) plus demand bounds. Note which product earns more per unit of the scarce leg-1 capacity, solve for the primal corner solution, and derive dual prices by complementary slackness.

Show solution

DLP: max $120 x_A + 200 x_B$ s.t. $x_A + x_B \le 50$, $x_B \le 50$, $x_A \le 40$, $x_B \le 60$. Selling all demand would require leg-1 usage $40+60=100>50$, so capacity binds, and every leg-1 seat goes either to A (worth 120) or to B (worth 200). Since B pays more per leg-1 seat, fill with B first: $x_B=50$ (both legs exhausted), $x_A=0$, revenue $200\cdot 50 = 10{,}000$. Both leg constraints are tight and only $x_B$ is positive, so the duals are not unique: complementary slackness on B ($0 < x_B < 60$, so $\mu_B=0$) gives $\pi_1+\pi_2=200$, and dual feasibility for A ($x_A=0 < 40$, so $\mu_A=0$) requires $\pi_1 \ge 120$; any $\pi_1 \in [120, 200]$ with $\pi_2 = 200 - \pi_1$ is valid. Under any such dual, the opportunity cost of product B is $\pi_1+\pi_2 = 200 = r_B$, so the bid-price rule (weak inequality) accepts product B; it is exactly marginal, consistent with the LP selling B up to capacity.
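A quick brute-force check of this DLP's optimum (a sketch; for each $x_B$ the best $x_A$ simply fills leftover leg-1 capacity up to A's demand bound):

```python
# Scan the medium-problem DLP: legs C = (50, 50); A uses leg 1 only, B uses both.
best_rev, best_x = -1, None
for xB in range(0, min(50, 50, 60) + 1):
    xA = min(40, 50 - xB)          # A's demand bound vs leftover leg-1 capacity
    rev = 120 * xA + 200 * xB
    if rev > best_rev:
        best_rev, best_x = rev, (xA, xB)
print(best_rev, best_x)  # 10000 (0, 50)
```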

hard

Hard: Consider a single product with inventory $I=3$ and a selling horizon of $T=2$ periods. Demand per period is stochastic: if price $p$ is set, demand is Poisson with mean $\lambda(p)=5-p$ for integer $p \in \{1,2,3,4\}$ (so $\lambda \ge 1 > 0$). No cancellations. Formulate the exact DP for optimal posted prices and compute the optimal prices and expected revenue by enumerating the state space.

Hint: The state is (period, remaining inventory). For each state compute the expected revenue for each candidate price by summing over Poisson probabilities truncated at the inventory level. Use backward induction from period 2 to period 1.

Show solution

Define $V_3(s)=0$. For period 2, $V_2(s)=\max_p p \cdot E[\min(K,s)]$ where $K \sim \text{Poisson}(\lambda(p))$, because leftover inventory has no salvage; use $E[\min(K,s)] = \sum_{k=1}^{s} P(K \ge k)$. For example, for $s=1$ and $p=4$: $\lambda=1$, $P(K\ge 1)=1-e^{-1}=0.6321$, revenue $4\cdot 0.6321 = 2.528$; for $p=3$: $\lambda=2$, $P(K\ge 1)=0.8647$, revenue $2.594$, which is best, so $V_2(1)=2.594$ at $p=3$. Enumerating all states gives $V_2(2)=4.376$ and $V_2(3)=5.346$, both at $p=3$. Then compute $V_1(3)=\max_p \sum_{k=0}^{2} P_{\lambda(p)}(k)\,[p k + V_2(3-k)] + P(K\ge 3)\cdot 3p$. Evaluating $p=1,\dots,4$: $p=4$ gives $\approx 7.960$, $p=3$ gives $\approx 7.956$, $p=2$ gives $\approx 6.16$. So the optimal initial price is $p_1=4$ (with $p=3$ nearly tied) and the expected revenue is $V_1(3)\approx 7.96$. The key point is that a DP with a small state space is computable by brute-force enumeration and yields a nontrivial pricing policy that depends on remaining inventory.
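The enumeration is mechanical; a sketch that truncates the Poisson sum at the inventory level:

```python
import math

T, PRICES = 2, [1, 2, 3, 4]       # periods; candidate prices, Poisson mean 5 - p

def pmf(lam, k):
    """Poisson probability P(K = k)."""
    return math.exp(-lam) * lam ** k / math.factorial(k)

def V(t, s):
    """Exact DP value: max expected revenue from period t with s units left."""
    if t > T or s == 0:
        return 0.0
    best = 0.0
    for p in PRICES:
        lam = 5 - p
        val, tail = 0.0, 1.0
        for k in range(s):                    # demand k < s: sell k, keep s - k
            val += pmf(lam, k) * (p * k + V(t + 1, s - k))
            tail -= pmf(lam, k)
        val += tail * p * s                   # demand >= s: sell out, earn p * s
        best = max(best, val)
    return best

print(round(V(1, 3), 2))  # expected revenue with 3 units over 2 periods
```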

Connections

This lesson uses and extends the prerequisites in explicit ways. From Price Discrimination we import the idea of segmenting customers by willingness-to-pay; RM operationalizes this through fare classes or posted-pricing menus and uses segmentation to ration capacity. From Dynamic Programming we import backward induction and value functions: the core decision criterion in RM is comparing immediate reward to the marginal value of capacity, $V_t(c)-V_t(c-1)$, which is the DP notion of opportunity cost. From Bayesian Decision Theory we import posterior expected-loss comparisons: accept/reject rules (e.g., Littlewood's inequality, the bid-price threshold) are pointwise Bayes-optimal decisions when the estimated opportunity cost is the posterior expectation. Looking forward, mastering these RM tools enables work in choice-based revenue management (requires incorporating discrete choice models into DP), reinforcement learning for dynamic pricing and allocation (where value-function approximation and policy gradient methods replace the DLP), and robust/stochastic optimization for uncertain demand (where distributional robustness modifies DLP duals to produce conservative bid prices). Specific downstream concepts that require this material include Expected Marginal Seat Revenue (EMSR) heuristics, assortment optimization under capacity constraints, and dual-based online allocation algorithms for ads and cloud resources.
