Machine Learning & Mechanism Design -...

Machine Learning & Mechanism Design:Dynamic and Discriminatory Pricing in Auctions

Jason D. HartlineMicrosoft Research – Silicon Valley

(Joint with with Maria-Florina Balcan,

Avrim Blum, and Yishay Mansour)

August 5, 2005

The Problem

Sellers can extract more of surplus with discriminatory pricing.

DISCRIMINATORY AUCTIONS – AUGUST 5, 20051

The Problem

Two approaches:

1. Distinguish between products.(E.g., software versioning, airline tickets, etc.)

2. Price discriminate with observable customer features.(E.g., college tuition, DVDs, car insurance, shipping)

The Problem

Two approaches:

1. Distinguish between products.(E.g., software versioning, airline tickets, etc.)

2. Price discriminate with observable customer features.(E.g., college tuition, DVDs, car insurance, shipping)

Goal: design mechanism to optimally price discriminate.

Optimal Mechanism Design

Typical Economic approach to optimal mechanism design:

• Assume valuations are from known distribution.

• Design optimal auction for distribution.

Notes on optimal mechanism design problem:

• Solved by Myerson (for single-parameter case).

• non-identical distributions =⇒ discriminatory pricing.

• Assumed known distribution ignores:

– incentives (of acquiring distribution)

– performance (from inaccurate distribution)

• Assumed known distribution ignores:

– incentives (of acquiring distribution)

– performance (from inaccurate distribution)

Goal: understand how quality and incentives of learning distributionaffect profit.

Setting

1. Unlimited supply of stuff to sell.

2. bidders with private valuations for stuff.

3. make each bidder an offer.

4. revenue is incentive compatible function of offer and valuation.

Setting

1. Unlimited supply of stuff to sell.(Example 1: MS Office Professional (PV) & Student Version (SV))

2. bidders with private valuations for stuff.

Setting

2. bidders with private valuations for stuff.(Example 1: Bidder: “PV worth $400, SV worth $300”)

Setting

3. make each bidder an offer.(Example 1: Seller: “PV costs $369.88, SV costs $124.99”)

Setting

4. revenue is incentive compatible function of offer and valuation.(Example 1: Sold: SV for $124.99!)

Setting

1. Unlimited supply of stuff to sell.(Example 1: MS Office Professional (PV) & Student Version (SV))(Example 2: Tuition for in state (IS) and out of state (OS) students)

Setting

2. bidders with private valuations for stuff.(Example 1: Bidder: “PV worth $400, SV worth $300”)(Example 2: Bidder (OS): “Tuition worth $15,000”)

Setting

3. make each bidder an offer.(Example 1: Seller: “PV costs $369.88, SV costs $124.99”)(Example 2: Seller: “IS costs $9,256.80, OS costs $16,855.30,”)

Setting

3. make each bidder an offer.(Example 1: Seller: “PV costs $369.88, SV costs $124.99”)(Example 2: Seller: “IS costs $9,256.80, OS costs $16,855.30,”)

4. revenue is incentive compatible function of offer and valuation.(Example 1: Sold: SV for $124.99!)(Example 2: No Sale!)

Overview

1.=⇒ Auction Problem

(a) Random Sampling Solution

(b) Retrospective bounds.

(c) Software Versioning Example.

2. Online Auction Problem

(a) Expert Learning based Auction.

(b) Expert Learning with non-uniform bounds.

3. Conclusions

Auction Problem

The Unlimited Supply Auction Problem:

Given:

• unlimited supply of stuff.

• Set S of n bidders with valuations for stuff.

• class G of reasonable offers.

Design: Auction with profit near that of optimal single offer.

Auction Problem

The Unlimited Supply Auction Problem:

Given:

• Set S of n bidders with valuations for stuff.

Design: Auction with profit near that of optimal single offer.

Notation:

• g(i) = payoff from bidder i when offered g.

• g(S) =∑

i∈S g(i).

• optG(S) = argmaxg∈G g(S).

• OPTG(S) = maxg∈G g(S).

Random Sampling Auction

Random Sampling Optimal Offer Auction, RSOOG

1. Randomly partition bidders into two sets: S1 and S2.

2. compute g1 (resp. g2), optimal offer for S1 (resp. S2)

3. Offer g1 to S2 and g2 to S1.

(Random Sampling Auction from [Goldberg, Hartline, Wright 2001])

g1 = opt(S1)

g2 = opt(S2)

g1 = opt(S1)

g2 = opt(S2)

g1 = opt(S1)

g2 = opt(S2)

g1 = opt(S1)

g2 = opt(S2)

g1 = opt(S1)

g2 = opt(S2)

Question: when is RSOOG good?

Performance Analysis

Lemma: For g and random partitions S1 and S2:

Pr[|g(S1) − g(S2)| > εmax(p, g(S))] ≤ 2e−ε2p/2h.

Consider:

• Use p = OPTG .

• If |G| e−ε2 OPTG /2h ≤ δ,

• union bound probability any g ∈ G is bad by δ.

Consider:

• Use p = OPTG .

Theorem: With probability 1 − δ profit from RSOOG is at least

(1 − ε)OPTG −O( hε2 log |G|

Consider:

• Use p = OPTG .

Theorem: With probability 1 − δ profit from RSOOG is at least

(1 − ε)OPTG −O( hε2 log |G|

Interpretation: O(h log |G|) is convergence time.

Example

Example: Selling tee shirts. (discretized prices)

• Bidders with valuations in [1, h] for a tee shirt.

• Reasonable offers: G = {price 2i for i ∈ {1, . . . , log h}}.

• Convergence Time: O(h log |G|) = O(h log log h)

Overview

1. Auction Problem

(b)=⇒ Retrospective bounds.

3. Conclusions

|G| = ∞?

What if |G| = ∞?

|G| = ∞?

What if |G| = ∞?

Example: Selling tee shirts. (non-discretized prices)

• Bidders with valuations v1, . . . , vn in [1, h] for a tee shirt.

• Reasonable offers: G = {price p ∈ [1, h]}.

• Convergence Time:

|G| = ∞?

What if |G| = ∞?

Observation:

• Suppose RSOOG on S only offers g ∈ GS ⊂ G.

• Then RSOOGS(S) is same as RSOOG(S).

• Retrospectively perform analysis on GS instead of G.

|G| = ∞?

What if |G| = ∞?

Observation:

Consider: GS = {“offer vi” : i ∈ S}.

|G| = ∞?

What if |G| = ∞?

• Convergence Time: O(h |GS |) = O(h log n)

Observation:

Consider: GS = {“offer vi” : i ∈ S}.

Example: Software Versioning

Example: Software versioning.

• Microsoft has M possible versions of MS Office,

• but can only package and sell m � M distinct versions.(E.g., Student and Professional).

• Reasonable offers: G = {p ∈ (R ∪ {∞})M with pj = ∞ for allbut m items}.

• What is |GS |?

Lemma: |GS | = O((nm2M)m).

• What is |GS |?

Convergence time = O(hm log(nm2M)) .

• What is |GS |?

Convergence time = O(hm log(nm2M)) ≈ O(hm log n).

Proof of Lemma

Proof:

1. Fix set of m items to sell.

2. Bidder i’s valuation divides price space into m + 1 convex regions.

Proof of Lemma

Proof:

3. Regions are joined by (m + 1)2 hyperplanes.

Proof of Lemma

Proof:

4. n bidders total for n(m + 1)2 hyperplanes.

Proof of Lemma

Proof:

5. RSOOG offer price must be at intersection of hyperplanes.

Proof of Lemma

Proof:

6. K = n(m + 1)2 hyperplanes in m dimensions intersect in Km.

Proof of Lemma

Proof:

6. K = n(m + 1)2 hyperplanes in m dimensions intersect in Km.

7. Sum over Mm possible m-item sets.

Other Results

See paper for details on:

• Bounds for RSOOG for item-pricing in combinatorial auctions.

• Bounds for RSOOG on bidders with observable features.

• Better bounds with ε-covers of G.

• Better random sampling auction with structural risk minimization.

• Using approximation algorithms in RSOOG .

Overview

1. Auction Problem

2.=⇒ Online Auction Problem

3. Conclusions

Online Auction Problem

Online Auction Problem:

• Bidders arrive one at a time and place bids, b1, b2, . . .

• Auctioneer makes offer g from G before next bidder arrives.

• Goal: Auction with profit close to optimal single offer.

Two Difficulties:

1. Incentive Compatibility requirement:

2. Online Requirement (do not know future):

Two Difficulties:

offer to bidder i not function of bi.

Two Difficulties:

price offered bidder i not function of future bids.

Two Difficulties:

price offered bidder i not function of future bids.

Conclusion: offer for bidder i based only on prior bids: b1, . . . , bi−1.

Assumptions

1. We learn each bidders full valuation.

Assumptions

for partial information case see multi-armed bandit solutions:[Blum, Kumar, Rudra, Wu ’03][Kleinberg, Leighton ’03][Blum, Hartline 05]

Assumptions

2. Bidders cannot come back.

3. Bidders cannot lie about their arrival time.

Assumptions

for temporal strategyproofness see: [Hajiaghayi, Kleinberg, Parkes ’04]

Assumptions

4. items in unlimited supply.

Assumptions

4. items in unlimited supply.

for limited supply see:[Hajiaghayi, Kleinberg, Parkes ’04][Kleinberg ’05]

Online Learning

Expert Online Learning Problem:

In round i:

1. Each of k experts propose a strategy.

2. We choose an expert’s strategy.

3. Find out how each strategy performed (payoff)

Goal: Obtain payoff close to single best expert overall (in hindsight).

Online Learning

In round i:

Weighted Majority Algorithm: (for round i)

Let h be maximum payoff. For expert j, let sj be total payoff thus far.

Choose expert j ’s strategy with probability proportional to (1+2ε)sj/h.

Online Learning

In round i:

Weighted Majority Algorithm: (for round i)

Let h be maximum payoff. For expert j, let sj be total payoff thus far.

Choose expert j ’s strategy with probability proportional to (1+2ε)sj/h.

Result: E[payoff] = (1 − ε)OPT− h2ε log k.

Application to Online Auctions

Application: (to online auctions) [Blum Kumar Rudra Wu 2003]

1. Expert for each g ∈ G

2. Best expert ⇒ best offer.

Result: E[profit] = (1 − ε)OPTG −hε log |G|.

Note: Same convergence time as for RSOOG .

Example

• Convergence Time: O(h log |G|)

Example

• Convergence Time: O(h log |G|)= O(h log log h).

Better Bounds?

Can we get better bounds?

Retrospective technique like using GS does not work.

Overview

1. Auction Problem

(b)=⇒ Expert Learning with non-uniform bounds.

3. Conclusions

Non-uniform Bounds on Payoff

Expert Online Learning Problem: In round i:

4. Expert i’s payoff is always less than hi.

Non-uniform Experts Algorithm: [Kalai ’01][Blum, Hartline ’05]

1. (initialization) For each expert, j, add initial score, sj , as:

hi × number of heads flipped in a row.

2. Run deterministic “go with best expert” algorithm.

Non-uniform Experts Algorithm: [Kalai ’01][Blum, Hartline ’05]

1. (initialization) For each expert, j, add initial score, sj , as:

hi × number of heads flipped in a row.

2. Run deterministic “go with best expert” algorithm.

Result: E[profit] ≥ OPT /2 −∑

Application: (to online auctions)

1. Bound hg for each g ∈ G.

Result: E[profit] = OPTG /2 −∑

g∈G hg .

Result: E[profit] = OPTG /2 −∑

g∈G hg .

Note: Convergence time =∑

g∈G hg

Example

• Convergence Time:∑

g∈G hg .

Example

g∈G hg =∑log h

i 2i .

Example

g∈G hg =∑log h

i 2i ≤ 2h.

Example

g∈G hg =∑log h

i 2i ≤ 2h.

Note: this is optimal up to constant factors.

Conclusions

1. Used machine learning techniques for auction design/analysis.

2. Prior-free discriminatory optimal mechanism design.

(a) distinguishing between products (and selecting products to sell).

(b) price discriminate based on observable customer features.

3. Similar bounds for offline and online auctions.

4. Retrospective analysis for offline auctions.

Conclusions

1. Used machine learning techniques for auction design/analysis.

2. Prior-free discriminatory optimal mechanism design.

(a) distinguishing between products (and selecting products to sell).

(b) price discriminate based on observable customer features.

3. Similar bounds for offline and online auctions.

4. Retrospective analysis for offline auctions.

5. Open: ε-cover arguments for online auctions?

6. Open: limited supply?

7. Open: general cost function on outcomes?

Machine Learning & Mechanism Design -...

Documents

Transcript of Machine Learning & Mechanism Design -...

TERMINOLOGY for the MECHANISM and MACHINE SCIENCE · MECHANISM and MACHINE SCIENCE Proceedings of the Scientific Seminar International Federation for the Promotion of Mechanism and

Mechanism and Machine Theory - bioeng.nus.edu.sg Design and... · Mechanism and Machine Theory journal homepage:. Therefore, there is a great need for portable wearable robotic systems

Tseng 2004 Mechanism and Machine Theory

Mechanism and Machine Theory - NTU · Mechanism and Machine Theory journal homepage: structured XYZ translational compliant parallel micromanipulator is designed by Li and ...

TMG E1580 Electric Point Machine - Westinghouse 84M Mechanism

Mechanism and Machine Theory - Stanford Universitystanford.edu/~tkuchida/papers/Uchida-2012-GroebnerBasesGenerate... · Mechanism and Machine Theory journal homepage: . In this work,

THE INDEXING MECHANISM OF GEAR HOBBING MACHINE

sewing machine Feed mechanism

Mechanism and Machine Theory - Kingston Universityeprints.kingston.ac.uk/38988/1/Rayner-R-38988-VoR.pdf · Mechanism and Machine Theory 118 (2017) ... The design and optimisation

Mechanism and Machine Theory - Aalborg Universitethomes.m-tech.aau.dk/shb/papers/2017-MMT-CouplerLinkMobility-Bai.… · S. Bai / Mechanism and Machine Theory 118 (2017) 53–64 55

Development of Compliant Mechanism for Real-Time Machine … · 2018-01-10 · Development of Compliant Mechanism for Real-Time Machine Tool Accuracy Enhancement Using Dual Servo

Double Acting Hacksaw Machine Using Scotch Yoke Mechanism

Mechanism and Machine Theory - NTU · Do et al. / Mechanism and Machine Theory 85 (2015) 14–24. complexshapes. In addition,it is abletocapturethe friction forceatnearzero velocities

Games and Mechanism Design in Machine Scheduling - An Introductionuetzm/Preprints/gam.pdf · 2019-11-11 · Games and Mechanism Design in Machine Scheduling - An Introduction Birgit

Decentralization & Mechanism Design for Online Machine Scheduling

Mechanism and Machine Theory - Technion

FRom maChine to meChanism RieF histoRy oF ConstRuCtion oF ...

Mechanism and Machine Theory - gearlab.com.cn · Mechanism and Machine Theory 113 (2017) 40–52 Contents lists available at ScienceDirect Mechanism and Machine Theory journal homepage:

A Parallel Robotic Mechanism Replacing a Machine Bed for ...

THE ELEVENTH WORLD CONGRESS IN MECHANISM AND MACHINE …jc.fauroux.free.fr/PUB/ARTICLES/CONF/2003_IFToMM.pdf · THE ELEVENTH WORLD CONGRESS IN MECHANISM AND MACHINE SCIENCE FINAL