TITLE: Dynamic policies to learn and earn in a customized pricing context
SPEAKER: J. Michael Harrison
ABSTRACT:
Motivated
by applications in financial services, we consider the following customized
pricing problem. A seller of some good
or service (like auto loans or small business loans) confronts a sequence of
potential customers numbered 1, 2, … , T. These customers are drawn at random from a
population characterized by a price-response function r(p). That is, if the seller offers price p, then the probability of a successful
sale is r(p). The profit realized from
a successful sale is p(p) = p – c, where c > 0 is known.
If
the price-response function r(×) were also known, then
the problem of finding a price p* to
maximize r(p)p(p) would be simple, and the
seller would offer price p* to each
of the T customers. We consider the more complicated case where r(×) is fixed but initially unknown: roughly speaking, the seller wants to
choose prices sequentially so as to maximize the total profit earned from the T potential customers; each successive
choice involves a trade-off between refined estimation of the unknown
price-response function (learning) and immediate profit (earning).
*
Joint work with Bora Keskin and Assaf Zeevi