714 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 59,...

14
714 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 59, NO. 3, MARCH 2014 Stackelberg Routing on Parallel Networks With Horizontal Queues Walid Krichene, Graduate Student Member, IEEE, Jack D. Reilly, Graduate Student Member, IEEE, Saurabh Amin, Member, IEEE, and Alexandre M. Bayen, Member, IEEE Abstract—This paper presents a game theoretic framework for studying Stackelberg routing games on parallel networks with hor- izontal queues, such as transportation networks. First, we intro- duce a new class of latency functions that models congestion due to the formation of physical queues. For this new class, some re- sults from the classical congestion games literature (in which la- tency is assumed to be a non-decreasing function of the ow) do not hold. In particular, we nd that there may exist multiple Nash equi- libria that have different total costs. We provide a simple polyno- mial-time algorithm for computing the best Nash equilibrium, i.e., the one which achieves minimal total cost. Then we study the Stack- elberg routing game: assuming a central authority has control over a fraction of the ow on the network (compliant ow), and that the remaining ow (non-compliant) responds selshly, what is the best way to route the compliant ow in order to minimize the total cost? We propose a simple Stackelberg strategy, the Non-Com- pliant First (NCF) strategy, that can be computed in polynomial time. We show that it is optimal for this new class of latency on parallel networks. This work is applied to modeling and simulating congestion relief on transportation networks, in which a coordi- nator (trafc management agency) can choose to route a fraction of compliant drivers, while the rest of the drivers choose their routes selshly. Index Terms— Game theory, Nash equilibrium, network analysis and control, routing, Stackelberg game, transportation networks. I. INTRODUCTION AND MAIN RESULTS A. Motivation and Related Work R OUTING games model the interaction between players on a network, where the cost for each player on an edge depends on the total congestion of that edge. Extensive work Manuscript received July 25, 2012; revised May 07, 2013 and October 14, 2013; accepted October 30, 2013. Date of current version February 19, 2014. This work was supported in part by the National Science Foundation (NSF) under Award CNS-0845076 and by the Foundations Of Resilient CybEr-phys- ical Systems (FORCES), which receives support from the NSF (under Awards CNS-1238959, CNS-1238962, CNS-1239054, and CNS-1239166). This paper was recommended by Associate Editor A. Ozdaglar. W. Krichene is with the Department of Electrical Engineering and Com- puter Sciences, University of California, Berkeley, CA 94704 USA (e-mail: [email protected]). J. D. Reilly is with the Department of Civil Engineering, University of Cali- fornia, Berkeley, CA, 94704 USA (e-mail: [email protected]). S. Amin is with the Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139 USA (e-mail: [email protected]). A. M. Bayen is with the Department of Electrical Engineering and Computer Sciences and the Department of Civil and Environmental Engineering, Univer- sity of California, Berkeley, CA 94704 USA (e-mail: [email protected]). Color versions of one or more of the gures in this paper are available online at http://ieeexplore.ieee.org. has been dedicated to the study of Nash equilibria (or user op- timal assignments) for routing games, in which players selshly choose the routes that minimize their individual costs (latencies) [4], [7], [8]. In general, Nash equilibria are inefcient compared to a system optimal assignment that minimizes the total cost on the network [16]. This inefciency has been characterized for different classes of latency functions and network topologies [27], [29]. This helps understand the inefciencies caused by congestion on numerous real-life networks, such as communi- cation networks and trafc networks. In order to reduce the inefciencies due to selsh routing, many instruments have been studied, including congestion pricing [21], capacity allocation [15], and Stackelberg routing [2], [14], [25], [29]. In the Stackelberg routing game, a subset of the players, corresponding to a fraction of the total ow, hereafter called the compliant ow, is centrally assigned by a coordinator (leader), and then the remaining players (fol- lowers) choose their routes selshly. The objective of the leader is to assign the compliant ow in a manner that minimizes a system-wide cost function, while anticipating the followers’ selsh response. This setting is relevant in the planning and operation of transportation and communication networks.. In transportation networks, advances in traveler information sys- tems have made it possible to interact with individual drivers and exchange information through GPS-enabled smartphone applications or vehicular navigation systems [31]. These de- vices can be used by a a trafc control center to provide routing advice that can improve the overall efciency of the network. Naturally, the question arises on how the trafc control center should coordinate with the compliant drivers while accounting for the selsh response of other drivers: hence the importance of the Stackelberg routing framework. One might argue that the drivers who are offered routing advice are not guaranteed to follow the suggested routes, especially when these routes do not have minimal latency (in order to improve the system-wide efciency, some drivers will be assigned routes that are sub-op- timal in the Nash sense). However, in some cases, it can be reasonably assumed that a fraction of the drivers will choose the routes suggested by the coordinator, despite immediate fair- ness concerns. For example, some drivers may have sufcient external incentives to be compliant with the coordinator. In addition, the compliant ow may also include altruistic drivers who care about the system-wide efciency (e.g., pollution levels). Stackelberg routing on parallel networks has been studied ex- tensively for the class of non-decreasing latency functions, and it is known that computing the optimal Stackelberg strategy is 0018-9286 © 2014 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.

Transcript of 714 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 59,...

Page 1: 714 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 59, …jackdreilly.github.io/papers/stackelberg.pdf · analysis and control, routing, Stackelberg game, transportation networks. I.

714 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 59, NO. 3, MARCH 2014

Stackelberg Routing on Parallel NetworksWith Horizontal Queues

Walid Krichene, Graduate Student Member, IEEE, Jack D. Reilly, Graduate Student Member, IEEE,Saurabh Amin, Member, IEEE, and Alexandre M. Bayen, Member, IEEE

Abstract—This paper presents a game theoretic framework forstudying Stackelberg routing games on parallel networks with hor-izontal queues, such as transportation networks. First, we intro-duce a new class of latency functions that models congestion dueto the formation of physical queues. For this new class, some re-sults from the classical congestion games literature (in which la-tency is assumed to be a non-decreasing function of the flow) do nothold. In particular, we find that theremay exist multiple Nash equi-libria that have different total costs. We provide a simple polyno-mial-time algorithm for computing the best Nash equilibrium, i.e.,the onewhich achievesminimal total cost. Thenwe study the Stack-elberg routing game: assuming a central authority has control overa fraction of the flow on the network (compliant flow), and thatthe remaining flow (non-compliant) responds selfishly, what is thebest way to route the compliant flow in order to minimize the totalcost? We propose a simple Stackelberg strategy, the Non-Com-pliant First (NCF) strategy, that can be computed in polynomialtime. We show that it is optimal for this new class of latency onparallel networks. This work is applied tomodeling and simulatingcongestion relief on transportation networks, in which a coordi-nator (trafficmanagement agency) can choose to route a fraction ofcompliant drivers, while the rest of the drivers choose their routesselfishly.

Index Terms— Game theory, Nash equilibrium, networkanalysis and control, routing, Stackelberg game, transportationnetworks.

I. INTRODUCTION AND MAIN RESULTS

A. Motivation and Related Work

R OUTING games model the interaction between playerson a network, where the cost for each player on an edge

depends on the total congestion of that edge. Extensive work

Manuscript received July 25, 2012; revised May 07, 2013 and October 14,2013; accepted October 30, 2013. Date of current version February 19, 2014.This work was supported in part by the National Science Foundation (NSF)under Award CNS-0845076 and by the Foundations Of Resilient CybEr-phys-ical Systems (FORCES), which receives support from the NSF (under AwardsCNS-1238959, CNS-1238962, CNS-1239054, and CNS-1239166). This paperwas recommended by Associate Editor A. Ozdaglar.W. Krichene is with the Department of Electrical Engineering and Com-

puter Sciences, University of California, Berkeley, CA 94704 USA (e-mail:[email protected]).J. D. Reilly is with the Department of Civil Engineering, University of Cali-

fornia, Berkeley, CA, 94704 USA (e-mail: [email protected]).S. Amin is with the Department of Civil and Environmental Engineering,

Massachusetts Institute of Technology, Cambridge, MA 02139 USA (e-mail:[email protected]).A. M. Bayen is with the Department of Electrical Engineering and Computer

Sciences and the Department of Civil and Environmental Engineering, Univer-sity of California, Berkeley, CA 94704 USA (e-mail: [email protected]).Color versions of one or more of the figures in this paper are available online

at http://ieeexplore.ieee.org.

has been dedicated to the study of Nash equilibria (or user op-timal assignments) for routing games, in which players selfishlychoose the routes that minimize their individual costs (latencies)[4], [7], [8]. In general, Nash equilibria are inefficient comparedto a system optimal assignment that minimizes the total cost onthe network [16]. This inefficiency has been characterized fordifferent classes of latency functions and network topologies[27], [29]. This helps understand the inefficiencies caused bycongestion on numerous real-life networks, such as communi-cation networks and traffic networks.In order to reduce the inefficiencies due to selfish routing,

many instruments have been studied, including congestionpricing [21], capacity allocation [15], and Stackelberg routing[2], [14], [25], [29]. In the Stackelberg routing game, a subsetof the players, corresponding to a fraction of the total flow,hereafter called the compliant flow, is centrally assigned bya coordinator (leader), and then the remaining players (fol-lowers) choose their routes selfishly. The objective of the leaderis to assign the compliant flow in a manner that minimizes asystem-wide cost function, while anticipating the followers’selfish response. This setting is relevant in the planning andoperation of transportation and communication networks.. Intransportation networks, advances in traveler information sys-tems have made it possible to interact with individual driversand exchange information through GPS-enabled smartphoneapplications or vehicular navigation systems [31]. These de-vices can be used by a a traffic control center to provide routingadvice that can improve the overall efficiency of the network.Naturally, the question arises on how the traffic control centershould coordinate with the compliant drivers while accountingfor the selfish response of other drivers: hence the importanceof the Stackelberg routing framework. One might argue thatthe drivers who are offered routing advice are not guaranteedto follow the suggested routes, especially when these routes donot have minimal latency (in order to improve the system-wideefficiency, some drivers will be assigned routes that are sub-op-timal in the Nash sense). However, in some cases, it can bereasonably assumed that a fraction of the drivers will choosethe routes suggested by the coordinator, despite immediate fair-ness concerns. For example, some drivers may have sufficientexternal incentives to be compliant with the coordinator. Inaddition, the compliant flow may also include altruistic driverswho care about the system-wide efficiency (e.g., pollutionlevels).Stackelberg routing on parallel networks has been studied ex-

tensively for the class of non-decreasing latency functions, andit is known that computing the optimal Stackelberg strategy is

0018-9286 © 2014 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.

Page 2: 714 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 59, …jackdreilly.github.io/papers/stackelberg.pdf · analysis and control, routing, Stackelberg game, transportation networks. I.

KRICHENE et al.: STACKELBERG ROUTING ON PARALLEL NETWORKS WITH HORIZONTAL QUEUES 715

Fig. 1. Map of a parallel highway network connecting San Francisco to SanJose.

NP-hard in the size of the network [25]. This led to the de-sign of polynomial time approximate strategies such as Scaleand Largest Latency First [25], [29]. While this class of latencyfunctions provides a goodmodel of congestion for a broad rangeof networks with vertical queues, such as communication net-works, it does not entirely capture congestion in networks withhorizontal queues, such as transportation networks, in whichqueuing results in an increase in density [9], [17], [19], [24],[31], which in turn affects the latency. In order to better modelthe effects of density, we introduce a new class of latency func-tions, and we study Stackelberg routing games for this new classon parallel networks. User-equilibria for routing games withhorizontal queues have been studied for example in [5], [12],[20], [30]. However, to the best of our knowledge, Stackelbergrouting with horizontal queues has not been addressed so far.We restrict our present study to parallel networks. This simple

network topology is of practical importance in several situa-tions, including traffic planning on parallel highway networksthat connect two highly populated areas [6]. Fig. 1 shows onesuch network that connects San Francisco to San Jose. We willconsider this network in Section V.

B. Congestion on Horizontal Queues

The classical model for vertical queues assumes that the la-tency on a link is a non-decreasing function of theflow on that link [3], [4], [8], [26], [29]. However, for net-works with horizontal queues [17], [19], [24], the latency notonly depends on the flow, but also on the density. For example,on a transportation network, the latency depends on the densityof cars on the road (e.g., in cars per meter), and not only on theflow (e.g., in cars per second), since for a fixed value of flow, alower density means higher velocity, hence lower latency. Theseeffects of changing density are not captured by models of ver-tical queues. In this section we describe a simplified model ofcongestion that takes into account both flow and density.Let be the density on link , assumed to be uniform, for

simplicity, and let the flow be given by a continuous, concavefunction of the density

Here, is the maximum flow or capacity of the link,and is the maximum density that the link can hold. The

Fig. 2. Examples of flux functions for horizontal queues (left) and cor-responding latency as a function of the density (middle) and as afunction of the flow and the congestion state (right). The free-flow(respectively congested) regime is shaded in green (respectively red).

function is determined by the physical properties of the link.It is termed the flux function in conservation law theory [11],[18] and the fundamental diagram in traffic flow theory [9], [13],[23]. In general, it is a non-injective function. We make the fol-lowing assumptions:• There exists a unique density such that

, called critical density. When, the link is said to be in free-flow, and when

, it is said to be congested.• In the congested regime, is continuous decreasingfrom onto . In particular,

(the flow reduces to zerowhen the density approaches the maximum density).

These are standard assumptions on the flux function, followingtraffic flow theory [9], [13], [23]. Additionally, we assumethat in the free-flow regime, is linearly increasing in ,and since , we have in the free-flow regime

(as a result, the flux function isnon-differentiable at the critical density). The assumption oflinearity in free-flow is the only restrictive assumption, andit is essential in deriving the results on optimal Stackelbergstrategies. Although somewhat restrictive, this assumptionis common, and the resulting flux model is widely used inmodeling transportation networks, such as in [9], [22]. Fig. 2shows examples of such flux functions.Since the density and the flow are assumed to be uni-

form on the link, the velocity is given by andthe latency is simply given by whereis the length of link . Thus to a given value of the flow, theremay correspond more than one value of the latency, since theflux function is non-injective in general. To illustrate this withan example, we consider a transportation setting. A given valueof flow of cars on a road-segment can correspond to• either a large concentration of cars moving slowly (highdensity, the road is congested), in which case the latency islarge,

Page 3: 714 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 59, …jackdreilly.github.io/papers/stackelberg.pdf · analysis and control, routing, Stackelberg game, transportation networks. I.

716 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 59, NO. 3, MARCH 2014

• or few cars moving fast (low density, the road is in free-flow), in which case the latency is small.

Therefore, the basic premise that the latency is a function of theflow does not hold for networks with horizontal queues, i.e., net-works in which the density may change and impact the latency.

C. Latency Function for Horizontal Queues

Given a flux function , the latency can be easily expressedas a non-decreasing function of the density

(1)

From the assumptions on the flux function, we have thefollowing.• In the free-flow regime, the flux function is linearly in-creasing, . Thus the latency isis single-valued in free-flow, . We

will denote its value by , called hence-forth the free-flow latency.

• In the congested regime, is bijective fromto . Let

be its inverse. It maps the flow to the unique congestiondensity that corresponds to that flow. Thus in the congestedregime, latency can be expressed as a function of the flow,

. This function is decreasing as thecomposition of the decreasing function and the in-creasing function .

We can therefore express the latency as a function of the flowif we additionally specify the congestion state using a binaryvariable , such that if is in free-flow,and if is congested.Definition 1: HQSF Latency Class: A function

(2)

defined on the domain1

is a HQSF latency function if it satisfies the followingproperties:

(A1) In the free-flow regime, the latency is single-valued.(A2) In the congested regime, the latencyis decreasing on .(A3) .

1The latency in congestion is defined on the open interval .In particular, if or then the link is always considered tobe in free-flow. When the link is empty , it is naturally in free-flow.When it is at maximum capacity it is in fact on the boundary ofthe free-flow and congestion regions, and we say by convention that the link isin free-flow.

Fig. 3. Network with parallel links under demand .

Property (A1) is equivalent to the assumption that the fluxfunction is linear in free-flow, and is the only restrictive propertyin the sense discussed above, hence the name of the latencyclass. Property (A2) results from the expression of the latencyas the composition , where is increasing, and

is decreasing. Property (A3) is equivalent to the continuityof the underlying flux function .Although it may be more natural to think of the latency as a

non-decreasing function of the density, the above representationin terms of flow and congestion state will be useful inderiving properties of the Nash equilibria of the routing game.Finally, we observe, as an immediate consequence of these

properties, that the latency in congestion is always greater thanthe free-flow latency: , . Someexamples of HQSF latency functions (and the underlying fluxfunctions) are illustrated in Fig. 2. For a detailed derivation ofan example latency function in a traffic setting, see Appendix A.

D. Model

We consider a non-atomic routing game on a parallel net-work, shown in Fig. 3. Here non-atomic means that the gameinvolves a continuum of players, where each player correspondsto an infinitesimal (non-atomic) amount of flow, [27], [28]. Thenetwork has a single source and a single sink. Connecting thesource and sink are parallel links indexed by .We assume, without loss of generality, that the links are orderedby increasing free-flow latencies. To simplify the discussion, wefurther assume that free-flow latencies are distinct. Thereforewe have . The network is subject to aconstant positive flow demand at the source. We will denoteby an instance of the routing game played on a networkwith parallel links subject to demand . The state of the net-work is given by a feasible flow assignment vectorsuch that where is the flow on link , and acongestion state vector where if the linkis in free-flow and if the link is congested, as definedabove. All physical quantities (density and flow) are assumed tobe static and uniform on the link.Every non-atomic player chooses a route in order to minimize

his/her individual latency [26]. If a player chooses link , his/herlatency is given by , where is a HQSF latencyfunction. We assume that players know the latency functions.Pure Nash equilibria of the game (which we will simply refer

to as Nash equilibria) are assignments such that everyplayer cannot improve his/her latency by switching to a differentlink.Definition 2: Nash Equilibrium: A feasible assignment

is a Nash equilibrium of the routinggame instance if , ,

.

Page 4: 714 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 59, …jackdreilly.github.io/papers/stackelberg.pdf · analysis and control, routing, Stackelberg game, transportation networks. I.

KRICHENE et al.: STACKELBERG ROUTING ON PARALLEL NETWORKS WITH HORIZONTAL QUEUES 717

Fig. 4. Example of Nash equilibria for a three-link network. One equilibrium(left) has one link in free-flow and one congested link. A second equilibrium(right) has three congested links.

Here denotes thesupport of . As a consequence of this definition, all links inthe support of have the same latency , and links that are notin the support have latency greater than or equal to . We willdenote by the set of Nash equilibria of the instance

. We note that a Nash equilibrium for the routing gameis a static equilibrium, we do not model dynamics of density orflow. Fig. 4 shows an example of a routing game instance andresulting Nash equilibria.While a Nash equilibrium achieves minimal individual laten-

cies, it does not minimize, in general, the system cost or totalcost defined as follows:Definition 3: The total cost of an assignment is the

total latency experienced by all players

(3)

As detailed in Section II, under the HQSF latency class, therouting game may have multiple Nash equilibria that have dif-ferent total costs. We are interested, in particular, in Nash equi-libria that have minimal cost, which are referred to as best Nashequilibria (BNE).Definition 4: Best Nash Equilibria: The set of best Nash equi-

libria is the set of equilibria that minimize the total cost, i.e.,

(4)

E. Stackelberg Routing Game

In the Stackelberg routing game, a coordinator (a central au-thority) is assumed to have control over a positive fractionof the total flow demand . We call the compliance rate. Thecoordinator wants to route the compliant flow in a way thatminimizes the system cost, while anticipating the response ofthe rest of the players, assumed to choose their routes selfishlyafter the strategy of the coordinator is revealed. We will refer tothe flow of selfish players as the non-compliant flow.More precisely, the game is played as follows:• First, the coordinator (the leader) chooses a Stackelbergstrategy, i.e., an assignment of the compliant flow(such that ).

• Then, the Stackelberg strategy of the leader is revealed,and the non-compliant players (followers) choose their

routes selfishly and form a Nash equilibrium ,induced2 by strategy . By definition, the induced equilib-rium satisfies

(5)

The total flow on the network is , thus the total costis . Note that a Stackelberg strategy mayinduce multiple Nash equilibria in general. However, we definethe assignment to be the best such equilibrium (theone with minimal total cost, which will be shown to be uniquein Section III).We will use the following notation:• is an instance of the Stackelberg routing gameplayed on a parallel network with links under flow de-mand with compliance rate . Note that the routing game

is a special case of the Stackelberg routing gamewith .

• is the set of Stackelberg strategies for theStackelberg instance .

• is the set of optimal Stackelberg strategies de-fined as

(6)

F. Main Result

We now define a candidate Stackelberg strategy, which wecall the non-compliant first strategy (NCF), and which we proveto be an optimal Stackelberg strategy. The NCF strategy corre-sponds to first computing the best Nash equilibrium of thenon-compliant flow for the routing game instance ,then finding a particular strategy that induces .Definition 5: The Non-Compliant First Strategy: Con-

sider the Stackelberg instance . Let bethe best Nash equilibrium of the non-compliant flow,

, andbe the last link in its support. Then the non-compliant firststrategy, denoted by , is by definition the Stack-elberg strategy given by

╹▔╹ ╹▔▔▔▔▔▔▔╹

(7)

where is the maximal index in such that.

In words, the NCF strategy saturates links one by one, byincreasing index starting from link , the last link used by thenon-compliant flow in the best Nash equilibrium of

2We note that a feasible flow assignment of compliant flow may fail toinduce a Nash equilibrium and therefore is not considered to be a Stack-elberg strategy.

Page 5: 714 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 59, …jackdreilly.github.io/papers/stackelberg.pdf · analysis and control, routing, Stackelberg game, transportation networks. I.

718 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 59, NO. 3, MARCH 2014

TABLE IMAIN ASSUMPTIONS AND RESULTS FOR THE STACKELBERG ROUTING GAME ON A PARALLEL NETWORK

Fig. 5. Non-compliant first (NCF) strategy and its induced equilibrium. Cir-cles show the best Nash equilibrium of the non-compliant flow :link is in free-flow, and links are congested. The Stackelbergstrategy is highlighted in blue.

. Thus it will assign to link , then to link, to link and so on, until the compliant flow is

assigned entirely (see Fig. 5). The following theorem states themain result.Theorem 1: The NCF Strategy is an Optimal Stackel-

berg Strategy: Under the class of HQSF latency functions,is an optimal Stackelberg strategy for the Stack-

elberg instance .We give a proof of Theorem 1 in Section III. We will also

show that for the class of HQSF latency functions, best Nashequilibria can be computed in polynomial time in the sizeof the network, and as a consequence, the NCF strategy canalso be computed in polynomial time. This stands in contrast toprevious results under the class of non-decreasing latency func-tions, for which computing the optimal Stackelberg strategy isNP-hard [25]. Table I summarizes the main differences betweenthe classical setting (vertical queues) and the setting studied inthis paper (horizontal queues, under the additional assumptionthat latency is single-valued in free-flow).

II. NASH EQUILIBRIA

In this section, we study Nash equilibria of the routing game.We show that under the class of HQSF latency functions, theremay exist multiple Nash equilibria that have different costs.Then we partition the set of equilibria into congested equilibriaand single-link-free-flow equilibria. Finally, we characterize thebest Nash equilibrium and show that it can be computed inquadratic time in the number of links.

A. Structure and Properties of Nash Equilibria

We first give some properties of Nash equilibria. The fol-lowing proposition is straightforward.Proposition 1: Total Cost of a Nash Equilibrium: Let

be a Nash equilibrium for the instance. Then there exists such that ,

and , . The totalcost of the equilibrium is then .Proposition 2: Let be a Nash equilib-

rium. Then , link is congested.Proof: By contradiction, if , then

, which contradicts Definition 2 of aNash equilibrium.Corollary 1: Support of a Nash Equilibrium: Let

be a Nash equilibrium and bethe last link in the support of (i.e., the one with the largestfree-flow latency). Then we have .

Proof: Since , we have by Proposition 2 that, link is congested, thus (by definition,

a congested link cannot be empty).NoEssential Uniqueness: For the HQSF latency class, the es-

sential uniqueness property3 does not hold, i.e., there may existmultiple Nash equilibria that have different costs, an example isgiven in Fig. 4.Single-Link-Free-Flow Equilibria and Congested Equilibria:

The example shows that in general, there may exist multipleNash equilibria that have different costs, different congestion

3The essential uniqueness property states that for the class of non-decreasinglatency functions, all Nash equilibria have the same total cost. See for example[4], [8], [26].

Page 6: 714 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 59, …jackdreilly.github.io/papers/stackelberg.pdf · analysis and control, routing, Stackelberg game, transportation networks. I.

KRICHENE et al.: STACKELBERG ROUTING ON PARALLEL NETWORKS WITH HORIZONTAL QUEUES 719

state vectors, and different supports. However, not every con-gestion state vector can be that of a Nash equi-librium: let be a Nash equilibrium, and let

be the index of the last link in the support of. Then by Proposition 2, we have that , , and

, . Thus we have

• Either╹▔╹

i.e., the last link inthe support is in free-flow, all other links in the supportare congested. In this case we call a single-link-free-flow equilibrium, and denote the set of such equilibriaby .

• Or╹▔╹

i.e., all links in the sup-port are congested. In this case we call a congestedequilibrium, and denote the set of such equilibria by

.

B. Existence of Single-Link-Free-Flow Equilibria

Let be a single-link-free-flow equilibrium, andlet . We have from Proposition 2that links are congested and link is infree-flow. Therefore we must have ,

. This uniquely determines the flowon the congested links:Definition 6: Congestion Flow: Let . Then

, there exists a unique flow such that. We denote this flow by and call it

-congestion flow on link . It is given by

(8)

We note that is decreasing in , since isdecreasing.Proposition 3: Single-Link-Free-Flow Equilibria:

is a single-link-free-flow equilibrium if and only ifsuch that , and

(9)

╹▔╹(10)

Illustrations of (10) and (9) are shown in Fig. 6.Next, we give a necessary and sufficient condition for the

existence of single-link-free-flow equilibria.Lemma 1: Existence of Single-Link-Free-Flow Equilibria:

Let

(11)

A single-link-free-flow equilibrium exists for the instanceif and only if .

Proof: If a single-link-free-flow equilibrium exists, then byProposition 3, it is of the form given by (10) and (9) for some .

Fig. 6. Example of a single-link-free-flow equilibrium. Link 3 is in free-flowand links 1 and 2 are congested. The common latency on all links in the supportis .

The flow on link is then given by .Therefore .We prove the converse by induction on the size of the net-

work. Let denote the property: , there ex-ists a single-link-free-flow equilibrium for the instance .For , it is clear that if , there is a

single-link-free-flow equilibrium simply given by.

Now let , assume holds and let us show . Letand consider an instance .

Case 1: If , then by the induction hypothesis ,there exists a single-link-free-flow equilibrium for theinstance . Then defined asand is clearly a single-link-free-flowequilibrium for the instance .Case 2: If then by Proposition

3, an equilibrium exists if

(12)

First, we note that since , then

Thus

which proves the second inequality in (12). To show the firstinequality, we have

Page 7: 714 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 59, …jackdreilly.github.io/papers/stackelberg.pdf · analysis and control, routing, Stackelberg game, transportation networks. I.

720 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 59, NO. 3, MARCH 2014

where the last inequality results from the fact thatand byDefinition 6 of congestion

flow. This achieves the induction.Corollary 2: The maximum demand such that the set of

Nash equilibria is non-empty is .Proof: By the previous lemma, is a lower

bound on the maximum demand. To show that it is also anupper bound, suppose that is non-empty, and let

and . Then we haveby Corollary 1, and by Definition 2 of a

Nash equilibrium, , ,therefore . We also have . Combiningthe inequalities, we have

C. Number of Equilibria

Proposition 4: An Upper Bound on the Number of Equilibria:Consider a routing game instance . For any given

, there is at most one single-link-free-flow equilib-rium and one congested equilibriumwith support . Asa consequence, by Corollary 1, the instance has at mostsingle-link-free-flow equilibria and congested equilibria.Proof: We prove the result for single-link-free-flow equi-

libria, the proof for congested equilibria is similar. Let, and assume and are single-link-

free-flow equilibria such that. We first observe that by Corollary 1, and have the samesupport , and by Proposition 2, . Since linkis in free-flow under both equilibria, we have

, and by Definition 2 of a Nash equilibrium,any link in the support of both equilibria has the same latency, i.e., , . Since the latency

in congestion is injective, we have , , therefore.

D. Best Nash Equilibrium

In order to study the inefficiency of Nash equilibria, and theimprovement of performance that we can achieve using optimalStackelberg routing, we focus our attention on best Nash equi-libria and price of stability [1] as a measure of their inefficiency.Lemma 2: Best Nash Equilibrium: For a routing game in-

stance , , the unique best Nash equilibriumis the single-link-free-flow equilibrium that has smallest support

Proof: We first show that a congested equilibrium cannotbe a best Nash equilibrium. Let be acongested equilibrium and let . By Propo-sition 1, the cost of is .We observe that restricted to is an equilib-rium for the instance , thus by Corollary 2, ,

and by Lemma 1, there exists a single-link-free-flow equi-librium for , with cost .Clearly, defined as and

, is a single-link-free-flowequilibrium for the original instance , with cost

, which provesthat is not a best Nash equilibrium. Therefore best Nashequilibria are single-link-free-flow equilibria. And since thecost of a single-link-free-flow equilibrium is simply

where , it is clear thatthe smaller the support, the lower the total cost. Uniquenessfollows from Proposition 4.Complexity of Computing the Best Nash Equilibrium:

Lemma 2 gives a simple algorithm for computing the bestNash equilibrium for any instance : simply enumerateall single-link-free-flow equilibria (there are at most suchequilibria by Proposition 4), and select the one with the smallestsupport. This is detailed in Algorithm 1.

Algorithm 1 Best Nash Equilibrium

procedure bestNEInputs: Size of the network , demandOutputs: Best Nash equilibriumfor

letif

returnreturn No-Solutionprocedure freeFlowConfigInputs: Free-flow link indexOutputs: Assignmentfor

if,

elseif,

else,

return

The congestion flow valuescan be precomputed in . There are at most calls tofreeFlowConfig, which runs in time, thus bestNEruns in time. This shows that the best Nash equilibriumcan be computed in quadratic time.

III. OPTIMAL STACKELBERG STRATEGIES

In this section, we prove our main result that NCF strategy isan optimal Stackelberg strategy (Theorem 1). Furthermore, weshow that the entire set of optimal strategies can becomputed in a simple way from the NCF strategy.Let be the best Nash equilibrium for the instance

. It represents the best Nash equilibrium of thenon-compliant flow when it is not sharing the networkwith the compliant flow. Let be the last link

Page 8: 714 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 59, …jackdreilly.github.io/papers/stackelberg.pdf · analysis and control, routing, Stackelberg game, transportation networks. I.

KRICHENE et al.: STACKELBERG ROUTING ON PARALLEL NETWORKS WITH HORIZONTAL QUEUES 721

in the support of . Let be the NCF strategy defined by (7).Then the total flow is given by

(13)

and the corresponding latencies are

╹▔╹ (14)

Fig. 5 shows the total flow on each link. Under, links are congested and have latency ,

links are in free-flow and at maximum capacity,and the remaining flow is assigned to link .We observe that for any Stackelberg strategy ,

the induced best Nash equilibrium is a single-link-free-flow equilibrium by Lemma 2, since is the bestNash equilibrium for the instance and latencies

(15)

where and.

A. Proof of Theorem 1: The NCF Strategy Is an OptimalStackelberg Strategy

Let be a Stackelberg strategy andbe the best Nash equilibrium of the non-compliant

flow, induced by . Let and be the totalflows. To prove Theorem 1, we seek to show that

.The proof is organized as follows: we first compare the sup-

ports of the induced equilibria (Lemma 3), then show that linksare more congested under assignment than

under , in the following sense: they hold less flow andhave greater latency (Lemma 4). Then we conclude by showingthe desired inequality.Lemma 3: Let and .

Then .In words, the last link in the support of has higher

free-flow latency than the last link in the support of .Proof: We first note that restricted to

is a Nash equilibrium. Then since link is infree-flow we have , and since

, we have by definition that any other link hasgreater or equal latency. In particular, ,

, thus . Thereforewe have . But

since

. Therefore

. By Lemma 1, there exists a single-link-free-flow equi-librium for the instance supported on the firstlinks. Let be such an equilibrium. The cost of this equi-librium is where is the free-flow latency ofthe last link in the support of . Thus .Since by definition is the best Nash equilibrium for theinstance and has cost , we must have

, i.e., .Lemma 4: Under assignment , the links

have greater (or equal) latency and hold less (or equal) flowthan under , i.e., ,

and .Proof: Since , we have by definition

of a Stackelberg strategy and its induced equilibrium that, ,

see (5). We also have by definition of the candidateassignment and the resulting latencies givenby (14), , is congested and

. Thus using the fact that , we have, ,

and .We have from (13) that , is in free-flow

and at maximum capacity under (i.e., and). Thus ,and . This completes the proof of

the lemma.We can now show the desired inequality. We have

(16)

where the last inequality is obtained using Lemma 4 and thefact that , . Then rear-ranging the terms we have

Then we have ,

[by Lemma 4, , and we have by(14)]. Thus

(17)

Page 9: 714 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 59, …jackdreilly.github.io/papers/stackelberg.pdf · analysis and control, routing, Stackelberg game, transportation networks. I.

722 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 59, NO. 3, MARCH 2014

and we have

But sinceand . Therefore

This completes the proof of Theorem 1.Therefore the NCF strategy is an optimal Stackelberg

strategy, and it can be computed in polynomial time since it isgenerated in linear time after computing the best Nash equi-librium , which was shown to be quadraticin .The NCF strategy is, in general, not the unique optimal Stack-

elberg strategy. In the next section, we show that any optimalStackelberg strategy can in fact be easily expressed in terms ofthe NCF strategy.

B. Set of Optimal Stackelberg Strategies

In this section, we show that the set of optimal Stackelbergstrategies can be generated from the NCF strategy.This shows in particular that the NCF strategy is robust, in asense explained below.Let be the non-compliant first strategy,

be the Nash equilibrium inducedby , and the last link in the support of theinduced equilibrium, as defined above. By definition, the NCFstrategy assigns zero compliant flow to links ,and saturates links one by one, starting from (see (7) andFig. 5).To give an example of an optimal Stackelberg strategy other

than the NCF strategy, consider a strategy defined bywhere

╹▔▔▔╹

and is such that , and (seeFig. 7). Strategy will induce , and the resultingtotal cost is minimal since

. This shows that is an optimal Stackelberg strategy.More generally, the following holds:

Fig. 7. Example of an optimal Stackelberg strategy . The circlesshow the best Nash equilibrium . The strategy is highlighted in green.

Lemma 5: Consider a Stackelberg strategy of the formwhere

╹▔╹(18)

and is such that

(19)

(20)

Then is an optimal Stackelberg strategy.Proof: We show that is a feasible assignment of

the compliant flow , and that the induced equilibrium of thefollowers is .Since by definition (18) of , we have

. We also have• , by (19). Thus

.• by (20), and .• , .

This shows that is a feasible assignment. To show that in-duces , we need to show that ,

This is true , by definition of and (5). Toconclude, we observe that .This shows that the NCF strategy is robust to perturbations:

even if the strategy is not realized exactly, it may still be op-timal if the perturbation satisfies the conditions given above.The converse of the previous lemma is true. This gives a nec-

essary and sufficient condition for optimal Stackelberg strate-gies, given in the following theorem.

Page 10: 714 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 59, …jackdreilly.github.io/papers/stackelberg.pdf · analysis and control, routing, Stackelberg game, transportation networks. I.

KRICHENE et al.: STACKELBERG ROUTING ON PARALLEL NETWORKS WITH HORIZONTAL QUEUES 723

Theorem 2: The Set of Optimal Stackelberg Strategies: Theset of optimal Stackelberg strategies is the set ofstrategies of form where is thenon-compliant first strategy, and satisfies (18), (19), and (20).

Proof: We prove the converse of Lemma 5. Letbe an optimal Stackelberg strategy,

the equilibrium of non-compliantflow induced by , the last link in the supportof , and the total flow assignment.We first show that . By optimality of both and , we

have , therefore inequalities (16) and (17)in the proof of Theorem 1 must hold with equality. In particular,to have equality in (16) we need to have

(21)

The terms in both sums are non-negative. Therefore

(22)

(23)

and to have equality in (17) we need to have

(24)

Let . From the expression (14) of thelatencies under , we have , thus from equality(24) we have . Now let . Wehave by definition of the latency functions,, thus from equality (23), . We also have from the

expression (13), . Therefore , but sinceand are both assignments of the same total flow , we also

have , which proves .Next we show that . We have from the proof of The-

orem 1 that . Assume by contradiction that . Thensince , we have by definition of the induced fol-lowers’ assignment in (5), ,

. And since , we have (inparticular for ) , i.e., link is congestedunder , thus . Finally, since , wehave . Therefore

, since , this contradicts (22).Now let . We want to show that satisfies (18), (19),

and (20).First, we have , , thus

. We also have , ,(since ), and [by (13)]. Therefore

. This proves (19).Second, we have , (since

), and (since ) thus.

Third, we have since and are assign-ments of the same compliant flow , thus

. This proves (18).Finally, we have (20) since by definition of .

IV. PRICE OF STABILITY UNDER OPTIMALSTACKELBERG ROUTING

To quantify the inefficiency of Nash equilibria, and the im-provement that can be achieved using Stackelberg routing, sev-eral metrics have been used including price of anarchy [26], [27]and price of stability [1]. We use price of stability as a metric,which is defined as the ratio between the cost of the best Nashequilibrium and the cost of the social optimum.4 Let de-note the social optimum of the instance —the social op-timum is simply the free-flow assignment that saturates linksone by one by increasing index (see Appendix B). Let be thenon-compliant first strategy , and theinduced equilibrium of the followers. The price of stability ofthe Stackelberg instance is

where is the NCF strategy, and its induced equilibrium.The improvement achieved by optimal Stackelberg routing withrespect to the Nash equilibrium can be measured usingvalue of altruism [2], defined as

This terminology refers to the improvement achieved by havinga fraction of altruistic (or compliant) players, compared to asituation where everyone is selfish.We give the expressions of price of stability and value of al-

truism in the case of a two-link network, as a function of thecompliance rate and demand .Case 1: : In this case, link 1 can accom-

modate all the non-compliant flow, thus the induced equilibriumof the followers is , and by(7) the total flow induced by isand coincides with the social optimum. Therefore, the price ofstability is one.Case 2: : Observe that this

case can only occur if . In this case, link1 cannot accommodate all the non-compliant flow, and the in-duced Nash equilibrium is then supported on bothlinks. It is equal to

, and the total flow is, with total cost [Fig. 8(b)]. The social optimum is

(see Appendix B), withtotal cost [Fig. 8(a)]. Therefore the priceof stability is

We observe that for a fixed flow demand , the priceof stability is an increasing function of . Intuitively, theinefficiency of Nash equilibria increases when the difference in

4Price of anarchy is defined as the ratio between the costs of the worst Nashequilibrium and the social optimum. For the case of non-decreasing latencyfunctions, the price of anarchy and the price of stability coincide since all Nashequilibria have the same cost by the essential uniqueness property.

Page 11: 714 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 59, …jackdreilly.github.io/papers/stackelberg.pdf · analysis and control, routing, Stackelberg game, transportation networks. I.

724 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 59, NO. 3, MARCH 2014

Fig. 8. Social optimum (a) and best Nash equilibrium (b) when the demandexceeds the capacity of the first link . The area of the shaded regionsrepresents the total costs of each assignment.

Fig. 9. Price of stability and value of altruism on a two-link network. Here weassume that . (a) Price of stability, ; (b) Price ofstability, ; (c) Value of altruism, .

free-flow latency between the links increases. And as ,the price of stability goes to 1.When the compliance rate is , the price of stability at-

tains a supremum equal to , at [Fig. 9(a)].This shows that selfish routing is most costly when the demandis slightly above critical value . This also showsthat for the general class of HQSF latencies on parallel net-works, the price of stability is unbounded, since one can de-sign an instance such that the maximal price of stability

is arbitrarily large. Under optimal Stackelberg routing, the price of stability attains a supremum equal to

at . We observe inparticular that the supremum is decreasing in , and that when

(total control), the price of stability is identically one.Therefore optimal Stackelberg routing can significantly de-

crease price of stability when . Thiscan occur for small values of the compliance rate in situationswhere the demand slightly exceeds the capacity of the first link[Fig. 9(c)].The same analysis can be done for a general network: given

the latency functions on the links, one can compute the price of

Fig. 10. Latency functions on an example highway network. Latency is in min-utes, and demand is in cars/minute.

stability as a function of the flow demand and the compliancerate , using the form of the NCF strategy together with Algo-rithm 1 to compute the BNE. Computing the price of stabilityfunction reveals critical values of demand, for which optimalStackelberg routing can lead to a significant improvement. Thisis discussed in further detail in the next section, using an ex-ample network with four links.

V. NUMERICAL RESULTS

In this section, we apply the previous results to a scenario offreeway traffic from the San Francisco Bay Area. Four parallelhighways are chosen starting in San Francisco and ending in SanJose: I-101, I-280, I-880, and I-580 (Fig. 1).We analyze the inef-ficiency of Nash equilibria, and show how optimal Stackelbergrouting (using the NCF strategy) can improve the efficiency.Fig. 10 shows the latency functions for the highway network,

assuming a triangular fundamental diagram for each highway(see Appendix A for a derivation of the latency function froma triangular fundamental diagram). Under free-flow conditions,I-101 is the fastest route available between San Francisco andSan Jose. When I-101 becomes congested, other routes repre-sent viable alternatives.We computed price of stability and value of altruism (defined

in the previous section) as a function of the demand for dif-ferent compliance rates. The results are shown in Fig. 11. Weobserve that for a fixed compliance rate, the price of stability ispiecewise continuous in the demand [Fig. 11(a)], with discon-tinuities corresponding to an increase in the cardinality of theequilibrium’s support (and a link transitioning from free-flowto congestion). If a transition exists for link , it occurs at crit-ical demand , defined to be the infimum demandsuch that is congested under the equilibrium induced by

.It can be shown that , and we

have in particular . Therefore if a linkis congested under best Nash equilibrium , op-timal Stackelberg routing can decongest if . Inparticular, when the demand is slightly above critical demand

, link can be decongested with a small compliancerate. This is illustrated by the numerical values of price of sta-bility on Fig. 11(a), where a small compliance rateachieves high value of altruism when the demand is slightlyabove the critical values. This shows that optimal Stackelberg

Page 12: 714 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 59, …jackdreilly.github.io/papers/stackelberg.pdf · analysis and control, routing, Stackelberg game, transportation networks. I.

KRICHENE et al.: STACKELBERG ROUTING ON PARALLEL NETWORKS WITH HORIZONTAL QUEUES 725

Fig. 11. Price of stability (a) and value of altruism (b) as a function of the demand for different values of compliance rate .

Fig. 12. Price of stability (a) and value of altruism (b) as a function of thecompliance rate and demand . Iso- lines are plotted for (dashed),

(dot-dashed), and (solid).

routing can achieve a significant improvement in efficiency, es-pecially when the demand is near one of the critical values

.Fig. 12 shows price of stability and value of altruism as a

function of the demand and compliance rate. We observe in particular that for a fixed value of

demand, price of stability is a piecewise constant function of. Computing this function can be useful for efficient plan-ning and control, since it informs the central coordinator of

the critical compliance rates that can achieve a strict improve-ment. For instance, if the demand on the example network is1100 cars/min, price of stability is constant for compliance rates

. Therefore if a compliance rate greater than0.46 is not feasible, the controller may prefer to implement acontrol strategy with , since further increasing thecompliance rate will not improve efficiency, and may incur ad-ditional external cost (e.g., due to incentivizing more drivers).

VI. SUMMARY AND CONCLUDING REMARKS

We introduced a new class of latency functions to model con-gestion on networks with horizontal queues, and studied theresulting Nash equilibria for non-atomic, static routing gameson parallel networks. We showed that the essential uniquenessproperty does not hold for the HQSF class of latency, and thatthe number of equilibria is at most . We also characterizedthe best Nash equilibrium.In the Stackelberg routing game, we proved that the

Non-compliant First (NCF) strategy is optimal, and that it canbe computed in polynomial time. We illustrated these resultsusing an example network for which we computed the decreasein inefficiency that can be achieved using optimal Stackelbergrouting. This example showed that when the demand is nearcritical values , optimal Stackelberg routing can achievea significant improvement in efficiency, even for small valuesof compliance rate.On the one hand, these results show that careful routing of a

small compliant population can dramatically improve the effi-ciency of the network. On the other hand, they also indicate thatfor certain demand and compliance values, Stackelberg routingcan be completely ineffective. Therefore identifying the rangeswhere optimal Stackelberg routing does improve the efficiencyof the network is crucial for effective planning and control.We believe this work offers several directions of future re-

search: the work presented here only considers parallel net-works under static assumptions (constant flow demand , andstatic equilibria) and one question is whether these equilibria arestable in the dynamic sense, and how one may steer the systemfrom one equilibrium to a better one: consider for example thecase where the players are stuck in a congested equilibrium, andassume a coordinator has control over a fraction of the flow. Can

Page 13: 714 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 59, …jackdreilly.github.io/papers/stackelberg.pdf · analysis and control, routing, Stackelberg game, transportation networks. I.

726 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 59, NO. 3, MARCH 2014

the coordinator steer the system to a single-link-free-flow equi-librium by decongesting a link? And what is the minimal com-pliance rate needed to achieve this? Another question is how ro-bust are these results? Do they hold for general network topolo-gies? We believe that some of our results extend to general net-work topologies, but we foresee interesting technical challengesin formalizing these extensions.

APPENDIX

A. HQSF Latency Function From a Triangular FundamentalDiagram of Traffic

In this section we derive one example of a HQSF latencyfunction in a traffic setting. We consider a triangular funda-mental diagram, used to model traffic flow for example in [9],[10], i.e., a piecewise affine flux function , given by

if

if

The flux function is linear in free-flow with positive slopecalled free-flow speed, affine in congestion with negative slope

, and continuous (thus). By definition, it satisfies the assumptions in Section I-B.

The latency is given by where is the lengthof link . It is then a simple function of the density

which can be expressed as two functions of flow: a constantfunction when the link is in free-flow, and a decreasingfunction when the link is congested

This defines a function that satisfies the assumptions ofDefinition 1, and thus belongs to the HQSF latency class. Fig. 2shows one example of a triangular fundamental diagram (topleft) and the corresponding latency function (top right).

B. Social Optimal Assignments

Consider an instance where the flow demanddoes not exceed the maximum capacity of the net-

work, i.e., . A social optimal assignmentis an assignment that minimizes the total cost function

, i.e., it is a solution to thefollowing Social Optimum (SO) optimization problem:

Proposition 5: is optimal for only if, .

Proof: This follows immediately from the fact the latencyon a link in congestion is always greater than the latency of thelink in free-flow .As a consequence of the previous proposition, and using the

fact that the latency is constant in free-flow ,the social optimum can be computed by solving the followingequivalent linear program:

Then since the links are ordered by increasing free-flowlatency , the social optimum is simplygiven by the assignment that saturates most efficientlinks first. Formally, ifthen the social optimal assignment is given by

REFERENCES

[1] E. Anshelevich, A. Dasgupta, J. Kleinberg, E. Tardos, T.Wexler, and T.Roughgarden, “The price of stability for network design with fair costallocation,” in Proc. 45th Annu. IEEE Symp. Foundations of ComputerScience, 2004, pp. 295–304.

[2] A. Aswani and C. Tomlin, “Game-theoretic routing of GPS-assistedvehicles for energy efficiency,” inProc. Amer. Control Conf. (ACC’11),2011, pp. 3375–3380.

[3] M. Babaioff, R. Kleinberg, and C. H. Papadimitriou, “Congestiongames with malicious players,” Games Econ. Behav., vol. 67, no. 1,pp. 22–35, 2009.

[4] M. Beckmann, C. B. McGuire, and C. B. Winsten, 1956.[5] T. Boulogne, E. Altman, H. Kameda, and O. Pourtallier, “Mixed equi-

librium for multiclass routing games,” IEEE Trans. Autom. Control,vol. 47, no. 1, pp. 58–74, Jan. 2001.

[6] Caltrans, US 101 South, Corridor System Management Plan, 2010.[7] S. Dafermos, “Traffic equilibrium and variational inequalities,”

Transp. Sci., vol. 14, no. 1, pp. 42–54, 1980.[8] S. C. Dafermos and F. T. Sparrow, “The traffic assignment problem for

a general network,” J. Res. Nat. Bureau Standards, vol. 73B, no. 2, pp.91–118, 1969.

[9] C. F. Daganzo, “The cell transmission model: A dynamic represen-tation of highway traffic consistent with the hydrodynamic theory,”Transp. Res. Part B: Methodol., vol. 28, no. 4, pp. 269–287, 1994.

[10] C. F. Daganzo, “The cell transmission model, part II: Network traffic,”Transp. Res. Part B: Methodol., vol. 29, no. 2, pp. 79–93, 1995.

[11] L. C. Evans, “Partial differential equations,” in Graduate Studies inMathematics. Providence, RI, USA: AMS, 1998.

[12] T. L. Friesz and R. Mookherjee, “Solving the dynamic network userequilibrium problem with state-dependent time shifts,” Transp. Res.Part B Methodol., vol. 40, no. 3, pp. 207–229, 2006.

[13] B. D. Greenshields, “A study of traffic capacity,” Highway Res. BoardProc., vol. 14, pp. 448–477, 1935.

[14] Y. A. Korilis, A. A. Lazar, and A. Orda, “Achieving network optimausing stackelberg routing strategies,” IEEE/ACM Trans. Networking,vol. 5, pp. 161–173, 1997.

[15] Y. A. Korilis, A. A. Lazar, and A. Orda, “Capacity allocation undernoncooperative routing,” IEEE Trans. Autom. Control, vol. 42, no. 3,pp. 309–325, Mar. 1997.

[16] E. Koutsoupias and C. Papadimitriou, “Worst-case equilibria,” in Proc.16th Annu. Conf. Theoretical Aspects of Computer Science, 1999, pp.404–413.

Page 14: 714 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 59, …jackdreilly.github.io/papers/stackelberg.pdf · analysis and control, routing, Stackelberg game, transportation networks. I.

KRICHENE et al.: STACKELBERG ROUTING ON PARALLEL NETWORKS WITH HORIZONTAL QUEUES 727

[17] J. P. Lebacque, “The Godunov scheme and what it means for first ordertraffic flow models,” in Proc. Int. Symp. Transportation and TrafficTheory, 1996, pp. 647–677.

[18] R. J. LeVeque, “Finite difference methods for ordinary and partialdifferential equations: Steady-state and time-dependent problems,” inClassics in Applied Mathematics. Philadelphia, PA, USA: SIAM,2007.

[19] M. J. Lighthill and G. B. Whitham, “On kinematic waves. II. A theoryof traffic flow on long crowded roads,” in Proc. R. Soc. London. SeriesA. Math. Phys. Sci., 1955, vol. 229, no. 1178, p. 317.

[20] H. K. Lo and W. Y. Szeto, “A cell-based variational inequality formu-lation of the dynamic user optimal assignment problem,” Transp. Res.Part B: Methodol., vol. 36, no. 5, pp. 421–443, 2002.

[21] A. Ozdaglar and R. Srikant, “Incentives and pricing in communicationnetworks,” in Algorithmic Game Theory, N. Nisan, T. Roughgarden,E. Tardos, and V. Vazirani, Eds. Cambridge, U.K.: Cambridge Univ.Press, 2007.

[22] M. Papageorgiou, J. Blosseville, and H. Hadj-Salem, “Modelling andrealtime control of traffic flow on the southern part of boulevard pe-ripherique in paris: Part I: Modelling,” Transp. Res. Part A: General,vol. 24, no. 5, pp. 345–359, 1990.

[23] M. Papageorgiou, J. M. Blosseville, and H. Hadj-Salem, “Macroscopicmodelling of traffic flow on the Boulevard Périphérique in Paris,”Transp. Res. Part B: Methodol., vol. 23, no. 1, pp. 29–47, 1989.

[24] P. I. Richards, “Shock waves on the highway,” Oper. Res., vol. 4, no.1, pp. 42–51, 1956.

[25] T. Roughgarden, “Stackelberg scheduling strategies,” in Proc. 33rdAnnu. ACM Symp. Theory of Computing, 2001, pp. 104–113, ACM.

[26] T. Roughgarden and E. Tardos, “How bad is selfish routing?,” J. ACM,vol. 49, no. 2, pp. 236–259, 2002.

[27] T. Roughgarden and E. Tardos, “Bounding the inefficiency of equilibriain nonatomic congestion games,” Games Econ. Behav., vol. 47, no. 2,pp. 389–403, 2004.

[28] D. Schmeidler, “Equilibrium points of nonatomic games,” J. Stat.Phys., vol. 7, no. 4, pp. 295–300, 1973.

[29] C. Swamy, “The effectiveness of Stackelberg strategies and tolls fornetwork congestion games,” in Proc. 18th Annu. ACM-SIAM Symp.Discrete Algorithms, 2007, pp. 1133–1142.

[30] Y. Wang, A. Messmer, and M. Papageorgiou, “Freeway network simu-lation and dynamic traffic assignment with METANET tools,” Transp.Res. Rec.: J. Transp. Res. Board, vol. 1776, no. -1, pp. 178–188, 2001.

[31] D. B.Work, S. Blandin, O. P. Tossavainen, B. Piccoli, andA.M. Bayen,“A traffic model for velocity data assimilation,” App. Math. Res. eX-press, vol. 2010, no. 1, Apr. 1, 2010.

Walid Krichene (S’12) received the M.S. degreein applied mathematics from the Ecole NationaleSuperieure des Mines de Paris, Paris, France, inSeptember 2010. He is currently pursuing the Ph.D.degree in the Department of Electrical Engineeringand Computer Sciences, University of California,Berkeley, CA, USA.From September 2010 to July 2011, he worked at

Criteo labs on designing techniques for predictingClick Through Rates using statistical learningtheory. His research focuses on network control and

optimization, applied to routing in large-scale networks. He uses game theoryand online learning theory to study convergence of population dynamics toNash equilibria; and applies online learning and optimal control to designrouting and tolling schemes which improve the efficiency of the network.

Jack D. Reilly (S’12) received the B.S. degree incivil engineering from the University of California,Los Angeles, CA, USA, in 2009 and the Mastersdegree in civil systems engineering from the Uni-versity of California (UC), Berkeley, CA, USA, in2013. He is currently pursuing the Ph.D. degree fromUC Berkeley in civil systems engineering, workingon a framework for optimal ramp metering and flowreroute algorithms applied to large-scale freewaysystems.

Saurabh Amin (M’09) received the B.Tech. degreein civil engineering from the Indian Institute of Tech-nology, Roorkee, India, the M.S. degreee in trans-portation engineering from the University of Texasat Austin, Austin, TX, USA, and the Ph.D. degreein systems engineering from the University of Cal-ifornia, Berkeley, CA, USA.He is currently an Assistant Professor in the De-

partment of Civil and Environmental Engineering,Massachusetts Institute of Technology (MIT),Cambridge, MA, USA. His research focuses on the

design and implementation of high-confidence network control algorithms forinfrastructure systems. He works on robust diagnostics and control problemsthat involve using networked systems to facilitate the monitoring and controlof large-scale critical infrastructures, including transportation, water, andenergy distribution systems. He also studies the effect of security attacksand random faults on the survivability of networked systems, and designsincentive-compatible control mechanisms to reduce network risks.

Alexandre M. Bayen (M’09) received the Engi-neering Degree in applied mathematics from theEcole Polytechnique, Palaiseau, France, in July1998, the M.S. degree in aeronautics and astronau-tics from Stanford University, Stanford, CA, USA, inJune 1999, and the Ph.D. degree in aeronautics andastronautics from Stanford University in December2003.He was a Visiting Researcher at the NASA Ames

Research Center from 2000 to 2003. BetweenJanuary 2004 and December 2004, he worked as

the Research Director of the Autonomous Navigation Laboratory, Laboratoirede Recherches Balistiques et Aerodynamiques, (Ministere de la Defense),Vernon, France, where he holds the rank of Major. He is an Associate Professorin the Department of Electrical Engineering and Computer Sciences, andthe Department of Civil and Environmental Engineering at the Universityof California, Berkeley, CA, USA. He has authored one book and over 100articles in peer-reviewed journals and conferences.Dr. Bayen is the recipient of the Ballhaus Award from Stanford University,

in 2004, of the CAREER award from the National Science Foundation, in 2009and he is a NASA Top 10 Innovators on Water Sustainability, received in 2010.He is the recipient of the Presidential Early Career Award for Scientists and En-gineers (PECASE) Award from the White House (2010). His projects “MobileCentury” and “Mobile Millennium” received the 2008 Best of ITS Award for“Best Innovative Practic,” at the ITS World Congress and a TRANNY Awardfrom the California Transportation Foundation, in 2009. “Mobile Millennium”has been featured more than 100 times in the media, including television chan-nels and radio stations (CBS, NBC, ABC, CNET, NPR, KGO, the BBC), and inthe popular press (Wall Street Journal, Washington Post, LA Times).