OBUST - Bhuvesh Kumarbhuveshkumar.com/pdf/LearnAuction_poster.pdf · Traditional auction design...

1
L EARNING A UCTIONS WITH R OBUST I NCENTIVE G UARANTEES { Jacob Abernethy, Rachel Cummings, Bhuvesh Kumar, Jamie Morgenstern} @ Georgia Tech, Samuel Taggart @ Oberlin College I NTRODUCTION We study the problem of designing revenue maximizing repeated auctions. Traditional auction design requires knowl- edge of a prior distribution over buyer valuations and constructs an auction as a function of that prior to maximize some objective. For the rev- enue objective, Myerson’s auction (Myerson81) characterizes the Bayesian-optimal auction. Known Priors (Eg Myerson81) Existing work has largely focused on the com- putational and sample complexity of the task ig- noring the possibility that bidders we learn from show up in the learned auction. Sample Complexity results (Eg CR15, DHP15, MR16) Our work focuses on an iterative setting 1. The seller does not know the prior and learns from the samples 2. Strategic bidders show up in multiple rounds Multiple rounds, strategic bidders show up multiple times (Eg LHW18, ACKMT19) S ETTING We consider a T -round auction, where a seller sells a supply of J identical items each round. n different populations of bidders & one bidder from each population i participates in each round t with value v i,t for winning an item. Each sam- ple from population i is distributed according to a distribution D i . U TILITY -A PPROXIMATE BIC O NLINE A UCTION Algorithm 1: Utility-Approx BIC Online Auc- tion Parameters: discretization β , privacy , upper bound h, num. of rounds T Initialize: H 0 i,0 Uni(0,h) for i =1, ··· ,n for t =1, ··· ,T do Receive bid profile b t rounded down to multiple of β Run Myerson with H 0 t-1 as prior and b t as bid for allocations/payments. for i =1,...,n do Update H 0 i,t via two-fold tree aggregation end end Results: Theorem 1. Algorithm 1 is kh ( 2+ 1 T ) - utility approximate BIC, i.e for each bid- der i, if everyone else bids truthfully then truthful bidding is an approximately utility- maximizing behavior for bidder i. Theorem 2. With probability at least 1 - α, the average expected revenue obtained by Al- gorithm 1 for T rounds OPT - βJ - 4hn 2 ˜ O q log( nT α ) T + 1 T for regular dis- tributions D and < 1. B ID -A PPROXIMATE BIC O NLINE A UCTION Algorithm 2: Bid-approx BIC online Auction Parameters: discretization β , privacy , upper bound h, num. of rounds T Initialize: H 0 i,0 Uni(0,h) for i =1, ··· ,n for t =1, ··· ,T do Receive vector of bids b t =(b 1,t ,...,b n,t ) rounded down to multiple of β With probability ρ t select a random allocation and reserve price else for i =1,...,n do Use H 0 i,t-1 to calculate φ i,t (b i,t )= b 1,t - 1-H 0 i,t-1 (b 1,t ) h 0 i,t (b 1,t ) end Use exponential mechanism to select allocation Use Black box payments to calculate payments p t (b t ). end for i =1,...,n do Update H 0 i,t via two-fold tree aggregation end end Results: Theorem 3. Algorithm 2 is is η t -bid ap- proximate BIC in round t for η t = c + ˜ O (t -1/4 ) for small constants c 0 , i.e there ex- ists a Bayesian Nash Equilibrium where in every round t, every bidder bids within η t of their truthful value. Theorem 4. With probability at least 1 - α, the average expected revenue obtained by Al- gorithm 2 for T rounds OPT - c 0 - ˜ O (T -1/4 ) for small constant c. Link to paper A SSUMPTION No bidder participates in more than k rounds of the T -round auction. K EY I DEAS 1. Differential Privacy: Control the impact of any individual buyer’s strategy on her future utility 2. Similar revenue on similar distributions: We show that if kD - D 0 k τ then |Rev(M; D) - Rev(M; D 0 )|≤ O (n 2 τ ) DP DISTRIBUTION ESTIMATION We use a 2 fold tree based aggregation proto- col (LHW18) with appropriate noise to estimate the value distributions of the bidders in a (, /T ) differentially private manner. Differential private empirical distribution estimation E XPONENTIAL M ECHANISM Exponential Mechanism (MT07) is an DP algorithm to sample allocation x as P[x] exp Q( ˆ φ(b),x) Δ where the quality function Q( ˆ φ(b), x) we use is n i=0 ˆ φ i (b i )x i B LACK B OX P AYMENTS We use a black box payment rule from APTT04 to decide payments which are truthful in expectation and bidder i’s payment depends on her own bid, and the differentially private es- timates of the distributions but does not depend on other’s bids.

Transcript of OBUST - Bhuvesh Kumarbhuveshkumar.com/pdf/LearnAuction_poster.pdf · Traditional auction design...

Page 1: OBUST - Bhuvesh Kumarbhuveshkumar.com/pdf/LearnAuction_poster.pdf · Traditional auction design requires knowl-edge of a prior distribution over buyer valuations and constructs an

LEARNING AUCTIONS WITH ROBUST INCENTIVE GUARANTEES{ Jacob Abernethy, Rachel Cummings, Bhuvesh Kumar, Jamie Morgenstern} @ Georgia Tech, Samuel Taggart @ Oberlin College

INTRODUCTIONWe study the problem of designing revenue

maximizing repeated auctions.Traditional auction design requires knowl-

edge of a prior distribution over buyer valuationsand constructs an auction as a function of thatprior to maximize some objective. For the rev-enue objective, Myerson’s auction (Myerson81)characterizes the Bayesian-optimal auction.

Known Priors (Eg Myerson81)Existing work has largely focused on the com-

putational and sample complexity of the task ig-noring the possibility that bidders we learn fromshow up in the learned auction.

Sample Complexity results (Eg CR15, DHP15, MR16)Our work focuses on an iterative setting

1. The seller does not know the prior andlearns from the samples

2. Strategic bidders show up in multiplerounds

Multiple rounds, strategic bidders show up multiple times(Eg LHW18, ACKMT19)

SETTINGWe consider a T -round auction, where a seller

sells a supply of J identical items each round.n different populations of bidders & one bidderfrom each population i participates in each roundt with value vi,t for winning an item. Each sam-ple from population i is distributed according toa distribution Di.

UTILITY-APPROXIMATE BIC ONLINE AUCTION

Algorithm 1: Utility-Approx BIC Online Auc-tion

Parameters: discretization β, privacy ε, upperbound h, num. of rounds T

Initialize: H ′i,0 ← Uni(0, h) for i = 1, · · · , nfor t = 1, · · · , T do

Receive bid profile bt rounded down tomultiple of β

Run Myerson with H′t−1 as prior and bt asbid for allocations/payments.

for i = 1, . . . , n doUpdate H ′i,t via two-fold tree aggregation

endend

Results:

Theorem 1. Algorithm 1 is khε(2 + 1

T

)-

utility approximate BIC, i.e for each bid-der i, if everyone else bids truthfully thentruthful bidding is an approximately utility-maximizing behavior for bidder i.

Theorem 2. With probability at least 1−α,the average expected revenue obtained by Al-gorithm 1 for T rounds ≥ OPT − βJ −

4hn2O

(√log(nTα )

T + 1Tε

)for regular dis-

tributions D and ε < 1.

BID-APPROXIMATE BIC ONLINE AUCTION

Algorithm 2: Bid-approx BIC online AuctionParameters: discretization β, privacy ε, upperbound h, num. of rounds T

Initialize: H ′i,0 ← Uni(0, h) for i = 1, · · · , nfor t = 1, · · · , T do

Receive vector of bids bt = (b1,t, . . . , bn,t)rounded down to multiple of β

With probability ρt select a random allocationand reserve price

elsefor i = 1, . . . , n do

Use H ′i,t−1 to calculate

φi,t(bi,t) = b1,t −1−H′i,t−1(b1,t)

h′i,t(b1,t)

endUse exponential mechanism to selectallocation

Use Black box payments to calculatepayments pt(bt).

endfor i = 1, . . . , n do

Update H ′i,t via two-fold tree aggregationend

end

Results:

Theorem 3. Algorithm 2 is is ηt-bid ap-proximate BIC in round t for ηt = c +O(t−1/4) for small constants c′, i.e there ex-ists a Bayesian Nash Equilibrium where inevery round t, every bidder bids within ηt oftheir truthful value.

Theorem 4. With probability at least 1−α,the average expected revenue obtained by Al-gorithm 2 for T rounds ≥ OPT − c′ −O(T−1/4) for small constant c.

Link to paper

ASSUMPTIONNo bidder participates in more than k rounds

of the T -round auction.

KEY IDEAS1. Differential Privacy: Control the impact of any

individual buyer’s strategy on her future utility2. Similar revenue on similar distributions: We

show that if ‖D−D′‖∞ ≤ τ then |Rev(M;D)−Rev(M;D′)| ≤ O(n2τ)

DP DISTRIBUTION ESTIMATIONWe use a 2 fold tree based aggregation proto-

col (LHW18) with appropriate noise to estimatethe value distributions of the bidders in a (ε, ε/T )differentially private manner.

Differential private empirical distribution estimation

EXPONENTIAL MECHANISMExponential Mechanism (MT07) is an

ε DP algorithm to sample allocation x asP[x] ∝ exp εQ(φ(b),x)

∆ where the quality functionQ(φ(b),x) we use is

∑ni=0 φi(bi)xi

BLACK BOX PAYMENTSWe use a black box payment rule from

APTT04 to decide payments which are truthfulin expectation and bidder i’s payment dependson her own bid, and the differentially private es-timates of the distributions but does not dependon other’s bids.