Multi-armed Bandit Allocation Indices

by ; ;
Edition: 2nd
Format: Hardcover
Pub. Date: 2011-03-21
Publisher(s): Wiley
List Price: $148.21

Buy New

Usually Ships in 8 - 10 Business Days.
$147.47

Rent Textbook

Select for Price
There was a problem. Please try again later.

Used Textbook

We're Sorry
Sold Out

eTextbook

We're Sorry
Not Available

How Marketplace Works:

  • This item is offered by an independent seller and not shipped from our warehouse
  • Item details like edition and cover design may differ from our description; see seller's comments before ordering.
  • Sellers much confirm and ship within two business days; otherwise, the order will be cancelled and refunded.
  • Marketplace purchases cannot be returned to eCampus.com. Contact the seller directly for inquiries; if no response within two days, contact customer service.
  • Additional shipping costs apply to Marketplace purchases. Review shipping costs at checkout.

Summary

Statisticians are familiar with bandit problems, operational researchers with scheduling problems, and economists with problems of resource allocation. Most such problems are computationally intractable and cannot be solved in polynomial time, meaning that accurate solutions are unobtainable except for small-scale problems. This is particularly true under conditions of uncertainty. This book shows that there is, however, a large class of allocation problems for which the optimal solution is expressible in terms of a priority index which is defined for each of the competing projects independently of the properties of the other projects. Such problems are therefore solved once the appropriate index has been found. In some cases there is a concise formula for the index; at worst it can usually be determined by a manageable calculation. Since the discovery of the index, which has become known as the Gittins index, its properties and its range of applicability have been worked out in some detail. This book gives an account of these developments and includes extensive tables of index values. The Gittens Index has had a great influence on the analysis of cost benefit trade-offs in a range if areas, from computer science and engineering to finance and marketing. This book re-introduces the topic and illustrates its relevance to modern statistical, economic and operations research projects.

Author Biography

John Gittins, Statistics Department, University of Oxford, UK

Kevin Glazebrook, Department of Management Science, Lancaster University, UK

Richard Weber, Statistical Laboratory, University of Cambridge, UK

Table of Contents

Forewordp. ix
Foreword to the first editionp. xi
Prefacep. xiii
Preface to the first editionp. xv
Introduction or explorationp. 1
Exercisesp. 16
Main ideas: Gittins indexp. 19
Introductionp. 19
Decision processesp. 20
Simple families of alternative bandit processesp. 21
Dynamic programmingp. 23
Gittins index theoremp. 24
Gittins indexp. 28
Gittins index and the multi-armed banditp. 28
Coins problemp. 29
Characterization of the optimal stopping timep. 30
The restart-in-state formulationp. 31
Dependence on discount factorp. 32
Myopic and forwards induction policiesp. 32
Proof of the index theorem by interchanging bandit portionsp. 33
Continuous-time bandit processesp. 36
Proof of the index theorem by induction and interchange argumentp. 40
Calculation of Gittins indicesp. 43
Monotonicity conditionsp. 44
Monotone indicesp. 44
Monotone jobsp. 45
History of the index theoremp. 47
Some decision process theoryp. 49
Exercisesp. 50
Necessary assumptions for indicesp. 55
Introductionp. 55
Jobsp. 56
Continuous-time jobsp. 58
Definitionp. 58
Policies for continuous-time jobsp. 58
The continuous-time index theorem for a SFABP of jobsp. 61
Necessary assumptionsp. 61
Necessity of an infinite time horizonp. 61
Necessity of constant exponential discountingp. 62
Necessity of a single processorp. 63
Beyond the necessary assumptionsp. 64
Bandit-dependent discount factorsp. 64
Stochastic discountingp. 66
Undiscounted rewardsp. 68
A discrete search problemp. 70
Multiple processorsp. 73
Exercisesp. 76
Superprocesses, precedence constraints and arrivalsp. 79
Introductionp. 79
Bandit superprocessesp. 80
The index theorem for superprocessesp. 83
Stoppable bandit processesp. 88
Proof of the index theorem by freezing and promotion rulesp. 90
Freezing rulesp. 93
Promotion rulesp. 95
The index theorem for jobs with precedence constraintsp. 97
Precedence constraints forming an out-forestp. 102
Bandit processes with arrivalsp. 105
Tax problemsp. 106
Ongoing bandits and tax problemsp. 106
Klimov's modelp. 108
Minimum EWFT for the M/G/1 queuep. 110
Near optimality of nearly index policiesp. 111
Exercisesp. 113
The achievable region methodologyp. 115
Introductionp. 115
A simple examplep. 116
Proof of the index theorem by greedy algorithmp. 119
Generalized conservation laws and indexable systemsp. 124
Performance bounds for policies for branching banditsp. 132
Job selection and scheduling problemsp. 136
Multi-armed bandits on parallel machinesp. 139
Exercisesp. 147
Restless bandits and Lagrangian relaxationp. 149
Introductionp. 149
Restless banditsp. 150
Whittle indices for restless banditsp. 152
Asymptotic optimalityp. 155
Monotone policies and simple proofs of indexabilityp. 155
Applications to multi-class queueing systemsp. 159
Performance bounds for the Whittle index policyp. 162
Indices for more general resource configurationsp. 169
Exercisesp. 171
Multi-population random sampling (theory)p. 173
Introductionp. 173
Jobs and targetsp. 179
Use of monotonicity propertiesp. 181
General methods of calculation: use of invariance propertiesp. 185
Random sampling timesp. 195
Brownian reward processesp. 201
Asymptotically normal reward processesp. 205
Diffusion banditsp. 210
Exercisesp. 211
Multi-population random sampling (calculations)p. 213
Introductionp. 213
Normal reward processes (known variance)p. 213
Normal reward processes (mean and variance both unknown)p. 218
Bernoulli reward processesp. 221
Exponential reward processesp. 225
Exponential target processp. 229
Bernoulli/exponential target processp. 234
Exercisesp. 239
Further exploitationp. 241
Introductionp. 241
Website morphingp. 242
Economicsp. 243
Value of informationp. 244
More on job-scheduling problemsp. 244
Military applicationsp. 245
Referencesp. 249
Tablesp. 261
Indexp. 285
Table of Contents provided by Ingram. All Rights Reserved.

An electronic version of this book is available through VitalSource.

This book is viewable on PC, Mac, iPhone, iPad, iPod Touch, and most smartphones.

By purchasing, you will be able to view this book online, as well as download it, for the chosen number of days.

Digital License

You are licensing a digital product for a set duration. Durations are set forth in the product description, with "Lifetime" typically meaning five (5) years of online access and permanent download to a supported device. All licenses are non-transferable.

More details can be found here.

A downloadable version of this book is available through the eCampus Reader or compatible Adobe readers.

Applications are available on iOS, Android, PC, Mac, and Windows Mobile platforms.

Please view the compatibility matrix prior to purchase.