Survival of the strongest in a sequential truel

D. G. Ilinskiy; Ильинский Д. Г.; S. B. Izmalkov; Измалков С. Б.; A. V. Savvateev; Савватеев А. В.

doi:10.31857/S0424738824010111

Survival of the strongest in a sequential truel

Авторлар: Ilinskiy D.G.¹^,2, Izmalkov S.B.³, Savvateev A.V.⁴^,5^,6
Мекемелер:
1. Central Economics and Mathematics Institute RAS
2. Institute of Physics and Technology
3. New Economic School
4. Caucasus Mathematical Center, Adyghe State University
5. Moscow Institute of Physics and Technology
6. Central Economics and Mathematics Institute, RAS
Шығарылым: Том 60, № 1 (2024)
Беттер: 133-140
Бөлім: Mathematical analysis of economic models
URL: https://bakhtiniada.ru/0424-7388/article/view/258534
DOI: https://doi.org/10.31857/S0424738824010111
ID: 258534

Толық мәтін

Аннотация
Толық мәтін
Авторлар туралы
Әдебиет тізімі
Статистика

Аннотация

Sequential truel is a duel-like game between three players. Each players is endowed with his own marksmanship. At each turn, a player whose turn is to shoot can target any of the remaining alive opponents or abstain from shooting. The game ends when there is only one player alive or when all alive players abstained from shooting one after another. The single survivor obtains the “survivor” prize 1, while the payoff of all other players is 0. In the case the truel ends due to “inactivity”, all the players receive the payoff of 0. It is a deeply studied game with paradoxical finding that the weakest player has the highest probability of surviving in many settings, especially when the player can abstain from shooting. Here we present an explicit construction that contradicts this finding. There exists a mixed strategy subgame perfect equilibrium in which the strongest player has the highest probability of survival. This equilibrium exists for a specific order of play, in which the two stronger opponents act before the weakest one. When it exists, there are multiple subgame-perfect equilibria, including the existing stationary construction, in which two stronger opponents target each other.

Негізгі сөздер

sequential truel, subgame perfect equilibrium, survival of the weakest

Толық мәтін

1. INTRODUCTION

A truel is a duel-like competition between three players, in which the players fight for survival or for a prize by targeting (shooting at) opponents. Only one opponent can be targeted at a time. Originally, the sequential truel appeared as a brain teaser and has been part of many collections of mathematical puzzles (Phillips, 1937; Kinnaird, 1946; Gardner, 1966; Mosteller, 1987). The problem statement can be written as follows. Each of the three players is endowed with marksmanship $—$ the probability of eliminating the opponent with a shot. Three players are different, with marksmanships of $0.3$ , $0.5$ , and $1$ (or $0.5$ , $0.8$ , and $1$ ). Whom should they target? Who wins? These problems have a paradoxical answer $—$ the weakest player has the highest chance of surviving while the strongest player (a sure shot) has the lowest survival odds. There is also an extra layer to the problem $—$ the weakest player might do even better by intentionally abstaining, e.g., by shooting in the ground or in the air.

M. Shubik (Shubik, 1964, 43 $-$ 46) coined the term truel and described it as an example of a game in which the pursuit of individual goals leads to a paradoxical result. D. Kilgour (Kilgour, 1975) initiated a game-theoretic analysis of sequential truels as infinitely repeated games with and without abstentions. He considered undominated Nash equilbria in stationary strategies and showed that if the players marksmanships are different from each other, then the equilibrium is essentially unique, and for many values of the parameters the weakest player has the strongest survival odds. This happens because for two stronger opponents it is optimal to target each other, while the weakest player can simply stay aside and wait until one of the stronger opponents is eliminated. Many variations of truels have been considered: sequential truels with arbitrary duel values (Kilgour, 1978), simultaneous truels (Kilgour, 1972), truels with random order of shooting, finitely many rounds or bullets, possibilities for cooperation, target-undetectable or silent truels (Kilgour, Brams, 1997; Bossert, Brams, Kilgour, 2002). P. Amengual and R. Toral (Amengual, Toral, 2006) analyse different versions of the sequential game, and analyse them as Markov chains with three absorbing states (i.e. the possible victories of each of the players). A qualitatively robust prediction appears to follow from existing results: whenever there is a fight, the two strongest opponents shoot at each other, and, unless the marksmanships are dramatically different, the weakest player has the highest survival odds.

R. Shubik (Shubik, 1954) used a three-person duel to model ‘what sort of individual is best suited to survive, when every man acts for himself and by himself’. He pointed out immediate applications of truels to the analysis of elections, political competition, and multi-country interaction, arguing that a weaker country can benefit if it gets itself in the middle of a non-cooperative conflict between two stronger countries. Truels and $N$ -els were used for modeling decisions in intense conflict situations (Cole, Phillips, Hartman, 1977), evolutionary biology to explain persistent extensive variation in competitive skills (Archetti, 2012), dynamic targeted competition, where actions can be directed at a particular opponent (Dubovik, Parakhonyak, 2014), negative advertising, especially in political campaigning (Skaperdas, Grofman, 1995; Richman, 2020). (Toral, Amengual, 2005) consider a variant of a truel $—$ a game of opinion making or dynamic persuasion, in which players try to convince the others in their opinion on some matter and so all players are active until the consensus is reached. In all these applications the central feature is the advantage of the weakest player. Recently, in the article M. Wegener and E. Mutlu (Wegener, Mutlu, 2021) offered a model of network formation and evolution, in which players of three possible types (marksmanships) engage in truels with their neighbors. The authors assume that players target the strongest among the remaining opponents but not of the similar type, and show that the evolutionary pressures over network actually lead to strongest types surviving more often.

In this paper we challenge the existing findings regarding the survival of the weakest. We present a novel subgame-perfect equilibrium construction for a sequential truel in which the strongest player has the largest survival odds. We depart from the existing literature in three key elements: 1) we allow players to use non-stationary (and non-Markovian) strategies; 2) we consider mixed strategies; 3) we explicitly define the conditions and outcomes for a ‘peace’ scenario $—$ the game ends if all the three players abstain in a row. This equilibrium exists for a specific order of play, in which the two stronger opponents act before the weakest one. When it exists, there are multiple subgame-perfect equilibria including the existing stationary construction in which two stronger opponents target each other.

In equilbrium, the strongest opponent, player $γ$ abstains if the middle-skilled player $β$ has not targeted $γ$ previously; player $β$ abstains if the weaker player $α$ targeted $γ$ in the previous round, and randomizes between abstaining and targeting $γ$ , if $α$ shot at $β$ . If both stronger players abstain, the weaker player $α$ has to target someone and randomizes between the opponents. The player $α$ ’s incentives to target $β$ are provided by the expectation that $β$ then targets $γ$ with some positive probability, in which case $α$ ends up in a duel shooting first. In turn, randomization by $α$ is needed to provide incentives for $γ$ to abstain. In essence, the two stronger players tacitly collude to force $α$ to target someone. The strongest player benefits when $α$ targets $β$ , which can happen in a subgame-perfect non-stationary equilibrium!

2. SEQUENTIAL TRUEL

There are three players competing for being the sole survivor (winner) in a sequential duel-like game. At the beginning of the truel all players are alive. At each turn, a player whose turn is to shoot can target any of the remaining alive opponents or abstain from shooting (delope, e.g. shoot at the air or shoot at the ground). A targeted player is eliminated with probability equal to the marksmanship of the shooter. All the players observe who was the intended target and what is the outcome of the shot. The order of shooting is chosen in advance, and once and for all. No player can have two turns while some other (alive) player had none. Eliminated players do not shoot. The truel ends when all but one player was eliminated or when all alive players abstained from shooting one after another. The single survivor obtains the ‘survivor’ prize, the payoff to which is normalized to 1, while the payoff of all the other players is 0. In the case the truel ends due to ‘inactivity’, all the players receive the payoff of 0. The solution concept is subgame perfect equilibrium (SPE).

In our game-theoretic analysis we consider not only pure strategies for players that specify a particular target (or abstaining) after any history that can arise but also mixed strategies, where each player can randomize over its pure strategies and, in particular, randomly choose its target (or abstain) with some probabilities when called to act. The payoff from a mixed strategy is an expectation of the profits from pure strategies^[1].

We refer to the players by their marksmanship, $α$ , $β$ , $γ$ , and suppose that $α < β < γ$ , $γ =1.$ Let $α^{*} =1 - α$ and $β^{*} =1 - β$ be the probabilities of surviving a shot from a respective player. Denote by $s e q$ the order (sequence) of shooting. For instance, if $s e q = β γ α$ , then player $β$ shoots first, then player $γ$ shoots (if alive), then player $α$ , then again player $β$ and so on. We write $s e q (k)$ with $k =$ 0, 1 or 2 to highlight the number of successive abstentions prior to the first player in $s e q$ shooting. Finally, let $y_{i}^{s e q}$ and $z_{i}^{s e q}$ be, respectively, the lowest and the highest SPE payoffs to player $i$ if the sequence of shooting is seq. For instance, $y_{α}^{α β γ (1)}$ is the minimal SPE payoff of player $α$ for the sequence $α β γ$ for the subgame following one abstention (by player $γ) .$

Some of our parametric assumptions, such as $γ =1$ , unequal marksmanships, and 0 payoffs to the ‘peace’ outcome, are made to simplify the exposition. For example, all results (with slight modification in statements) are true for $γ$ in small neighbourhood of 1, but will require more technical details.

Some other parametric assumptions, such as three abstentions mean ‘peace’ are needed to complete specification of the game, which in turn allows for analysis of mixed-strategy equilibria. It would be interesting to know, how equilibria are affected when the number of abstentions that terminate the game or the ‘peace’ payoff vary, but this is left for future analysis.

There are six possible sequences of players, which can be divided in two groups: either $β$ acts immediately after $α$ or vise versa. Since our main goal is to present an explicit construction of equilibria in which the strongest player obtains high payoffs we will concentrate on the first case. Once we present the new equilibrium for sequence $β γ α$ in Section 6, as a corollary, we obtain a similar construction for sequence $γ β α$ , in which $α$ acts immediately after $β$ . Since we need to stimulate player $α$ to shoot in our equilibrium, it is crucial that $α$ must be the last player to shoot.

Lemma 1. The minimal and maximal SPE payoffs of players $α$ and $β$ satisfy the following inequalities

$y_{α}^{α \cdot \cdot} \geq α y_{α}^{β α} = α \frac{β^{*} α}{1 - α^{*} β^{*}},$ $y_{β}^{β} \geq β y_{β}^{α β} = β \frac{α^{*} β}{1 - α^{*} β^{*}},$ $z_{α}^{\cdot \cdot \cdot} \leq y_{α}^{α β} = \frac{α}{1 - α^{*} β^{*}},$ $z_{β}^{\cdot \cdot \cdot} \leq y_{β}^{β α} = \frac{β}{1 - α^{*} β^{*}} .$

(Dots here denote any of the appropriate order of shooting.)

Proof. The minimal SPE payoff of player $α$ (or $β$ ) cannot be lower than what the player $α$ (or $β$ ) can obtain by targeting $γ$ and, if successful, ending up in a duel shooting second. Clearly, there would be no abstentions in any duel. Similarly, the maximal possible payoff a player can get cannot be higher than that from a duel against the weakest of the opponents shooting first.■

3. PLAYER γ STRATEGIES

Lemma 2. In any SPE player $γ$ does not target $α$ if $β$ is alive. Player $γ$ ’s minimal SPE payoff for any sequence of moves in which $γ$ shoots first is bounded from below: $y_{γ}^{γ \cdot \cdot} \geq α^{*} .$

Proof. If player $γ$ shoots at another player, the target is eliminated and $γ$ ends up in a duel with the remaining player where that player shoots first. A duel with the weakest opponent is preferred. Thus, player $γ$ can guarantee payoff of $α^{*}$ by targeting $β$ .■

Can $γ$ obtain a higher payoff than $α^{*}$ in equilibrium when $γ$ is the first to act? For this to happen, $γ$ has to shoot in the air and one of the opponents has to target the other one with positive probability in the future, so that $γ$ reaches a duel shooting first. Otherwise, if players $α$ and $β$ target $γ$ or abstain each time they act, player $γ$ has to target someone to end up in a duel (shooting second) to obtain positive payoff, which is at most $α^{*}$ .

4. PLAYER β STRATEGIES

So, from here on we consider sequences of players in which $β$ shoots immediately after $α .$

Lemma 3. For all sequences of players in which player $β$ shoots immediately after $α$ , player $β$ never targets $α$ .

Proof. Suppose not, and that there exists an SPE in which $β$ targets $α$ . Consider a subgame where this happens. If $β$ eliminates $α$ , then $γ$ eliminates $β$ in the next round with probability 1. Therefore, $β$ has to expect a substantial continuation payoff $u_{β}$ following a miss: $β^{*} u_{β} \geq y_{β}^{β γ α},$ or, from Lemma 1

$u_{β} \geq \frac{β}{β^{*}} \times \frac{α^{*} β}{1 - α^{*} β^{*}} .$ (1)

Let us focus on $s e q = γ α β$ and the equilibrium in which $β$ obtains $u_{β}$ . As follows from Lemma 2, player $γ$ has to abstain with positive probability for $β$ to get a positive payoff as otherwise $γ$ eliminates $β$ . Player $γ$ then receives at least $α^{*}$ , while $α$ who moves next obtains at least $y_{α}^{α β γ}$ . Combined, the sum of payoffs of all three players has to be at least $y_{α}^{α β γ} + u_{β} + α^{*} .$ From Lemma 1 and inequality (1), this sum exceeds

$α \frac{β^{*} - α^{*} β^{*}}{1 - α^{*} β^{*}} + \frac{β}{β^{*}} \frac{α^{*} β}{1 - α^{*} β^{*}} + α^{*} = \frac{α^{*} β + α β^{*} β^{*}}{β^{*} (1 - α^{*} β^{*})} = \frac{β - α β + α - 2 α β + α β^{2}}{α + β - 2 α β - β^{2} + α β^{2}} >1,$

since $β^{2} > α β$ $—$ that is a contradiction.

Thus, we have shown that for considered sequences of players in any SPE neither $β$ nor $γ$ target $α$ . This is consistent with previous findings (Shubik, 1954; Kilgour, 1975; Toral, Amengual, 2005), yet here we have derived it without assumptions of stationarity, pure strategies, or necessity of targeting someone.

5. PLAYER α STRATEGIES

We explore first, what player $α$ can get by abstaining.

Lemma 4. For any SPE, for any subgame in which player $α$ has a chance to abstain without ending the game, we have (here $k =0$ or 1) $y_{α}^{α β γ (k)} \geq α .$

In addition, for sequences $γ α β (0)$ and $γ α β (2)$ player $γ$ targets $β$ , while for sequence $β γ α (1)$ player $β$ targets $γ$ .

Proof. Suppose player $α$ abstains. If $β$ also abstains, then $γ$ targets $β$ (who obtains payoff $0$ ), while $α$ ends up in the duel with $γ$ shooting first, getting payoff $α$ . Thus, $β$ will not abstain, and target $γ$ instead.

If $β$ eliminates $γ$ , then $α$ obtains $y_{α}^{α β} = α / (1 - α^{*} β^{*}) > α$ . If $β$ misses and $γ$ targets $β$ , then $α$ obtains payoff $α$ . Finally, if $β$ misses and $γ$ abstains, we come to the subgame in which $α$ can again abstain, and, as above, either guarantee herself at least $α$ or reach a subgame in which she can abstain, and so on. In expectation over all possible outcomes, player $α$ obtains at least $α$ in any SPE whenever she (he) can abstain.

Player $γ$ does not abstain if $α$ can abstain next round, and thus $γ$ targets $β$ for sequence $γ α β (0)$ . Indeed, since $α$ can abstain and guarantee herself payoff $α$ , by targeting $β$ , player $γ$ obtains $α^{*}$ . If $γ$ abstains, then $β$ obtains a positive payoff, in this case, what remains to $γ$ is strictly smaller than $α^{*}$ .

In turn, if player $β$ knows that $α$ is able to abstain, she will expect that $α$ abstains and then $γ$ targets $β$ . So, player $β$ should target $γ$ and not abstain in this case.■

Thus, having an option to abstain allows player $α$ to guarantee herself a decent payoff as the other players target each other. Will $α$ actually abstain in equilibrium? Suppose that $β$ and $γ$ target each other also when $α$ targets one of them (as in (Kilgour, 1975)). Then, for $α$ it is better to target $γ$ , since the duel with the weaker of the two opponents is preferred. By targeting $γ$ , player $α$ obtains payoff $α y_{α}^{β α} + α^{*} [β y_{α}^{α β} + β^{*} α],$ and by abstaining she gets $β y_{α}^{α β} + β^{*} α .$

Therefore, targeting $γ$ is better for $α$ , if

$y_{α}^{β α} > β y_{α}^{α β} + β^{*} α$ ; $\frac{β^{*} α}{1 - α^{*} β^{*}} > β \frac{α}{1 - α^{*} β^{*}} + β^{*} α$ ; $β^{*} α > α (β + β^{*} - α^{*} β^{*} β^{*})$ ;
$β^{*} >1 - α^{*} β^{*} β^{*}$ ; $β < α^{*} β^{*} β^{*} .$

We have shown the following lemma.

Lemma 5. A subgame perfect equilibrium in which neither $β$ nor $γ$ abstain exists. In this equilibrium, player $β$ targets $γ$ , player $γ$ targets $β$ , and player $α$ targets $γ$ if $β < α^{*} β^{*} β^{*}$ and abstains if $β > α^{*} β^{*} β^{*}$ .

It is trivial to check that the described strategies indeed form an SPE. Note that if $β > β^{*} β^{*}$ or $β >0.5 (3 - \sqrt{5}) \approx 0.3820$ , player $α$ abstains no matter what $α$ is.

6. NOVEL SPE CONSTRUCTION: SURVIVAL OF THE STRONGEST

Can it happen in equilibrium that players $β$ or $γ$ or both abstain? Consider sequence $β γ α .$ To abstain, player $β$ should expect that $γ$ will also abstain with high probability in the next round and that $α$ later (not necessarily immediately in round 3) will target $γ$ , hoping for a duel with $α$ in which $β$ shoots first. In turn, for player $γ$ to abstain, player $γ$ should expect that $α$ will be shooting at $β$ with positive probability, as otherwise $γ$ ’s expected payoff is strictly below $α^{*}$ . For both $β$ and $γ$ to abstain, player $α$ has to target someone!

Theorem 1 (main result). For sequence $β γ α$ and $α >0.5$ , there exists a mixed SPE in which players $β$ and $γ$ abstain with positive probabilities, while player $α$ mixes between the opponents. The equilibrium utilities $U_{i}$ of the players ( $i \in {α, β, γ}$ ) are given by the following formulas:

$U_{β} = β \frac{α^{*} β}{1 - α^{*} β^{*}}, U_{γ} = \frac{q^{*} α + q^{*} p α^{*} α^{*} β^{*}}{α + q^{*} p α^{*}}, U_{α} =1 - U_{β} - U_{γ},$

where $p = α β^{*} / [α^{*} (β - α^{*} β^{*} β^{*})], q = α^{*} β .$

Proof. Consider the following profile of strategies $—$ a candidate for the equilibrium. Player $γ$ abstains for $s e q = γ α β (1)$ (following an abstention by $β$ if it is the only one in a row) and targets $β$ in any other subgame. Player $β$ abstains in round 1 and for any subgame in which $α$ targeted and missed $γ$ in the previous round. If $α$ targeted and missed $β$ , then $β$ mixes between abstaining and targeting $γ$ . (This will provide incentives for $α$ to target $β$ .) For all other subgames, player $β$ targets $γ$ . Finally, player $α$ mixes shooting at $β$ and $γ$ for $s e q = α β γ (2),$ and abstains in all other subgames.

For the transition matrix between subgames for the proposed profile of strategies see Figure. Each decision node is marked by the acting player with the number of previous abstentions in the brackets. Only the transitions following misses are shown. Horizontal arrows point to the intended target, $↑$ stands for abstention. In figure 1 $p$ and $q$ are the probabilities with which players $β$ and $α$ target $γ$ in their mixed strategies.

Figure. Transitions in equilibrium

For the suggested profile to be an SPE, the following set of conditions has to be satisfied. Here, $U_{i}$ is the equilibrium utility of player $i$ , $p^{*} =1 - p$ , and $q^{*} =1 - q$ . By $U_{i} (j)$ we denote the utility of player $i$ using pure strategy shooting $j$ .

$U_{β} = α^{*} U_{β} + α [q \frac{β}{1 - α^{*} β^{*}} + q^{*} \times 0] = β \frac{α^{*} β}{1 - α^{*} β^{*}},$

$U_{γ} = q [α \times 0 + α^{*} U_{γ}] + q^{*} [α \times 1 + α^{*} p^{*} U_{γ} + α^{*} p β^{*} α^{*}] \geq α^{*},$

$U_{α} (γ)= α \times \frac{β^{*} α}{1 - α^{*} β^{*}} + α^{*} U_{α} = α \times 0 + α^{*} [p β \frac{α}{1 - α^{*} β^{*}} + p β^{*} α + p^{*} U_{α}] = U_{α} (β).$

Probability $q$ can be found from the condition for player $β$ to be indifferent between targeting $γ$ and abstaining:

$α β \frac{α^{*} β}{1 - α^{*} β^{*}} = α q \frac{β}{1 - α^{*} β^{*}},$ $q = α^{*} β .$

Similarly, from the indifference condition for player $α$ we can find $p$ :

$U_{α} (γ)= \frac{α β^{*}}{1 - α^{*} β^{*}} = α^{*} [p β \frac{α}{1 - α^{*} β^{*}} + p β^{*} α + p^{*} \frac{α β^{*}}{1 - α^{*} β^{*}}]$ ;

$β^{*} = α^{*} (p β + p^{*} β^{*} + p β^{*} (1 - α^{*} β^{*}))$ ; $β^{*} = α^{*} (p β + β^{*} - p β^{*} α^{*} β^{*})$ ;

$p = \frac{α β^{*}}{α^{*} (β - α^{*} β^{*} β^{*})}$ .

Finally, we need to check that $γ$ prefers abstaining from targeting $β$ . From the expression for $γ$ above we can compute

$U_{γ} = \frac{q^{*} α + q^{*} α^{*} p β^{*} α^{*}}{1 - α^{*} q - α^{*} q^{*} p^{*}} = \frac{q^{*} α + q^{*} p α^{*} α^{*} β^{*}}{α + q^{*} p α^{*}} .$

We want to show that $U_{γ} > α^{*}$ , when $α > α^{*}$ . We have

$\begin{array}{l} (U_{γ} - α^{*}) (α + q^{*} p α^{*}) = q^{*} p α^{*} α^{*} (β^{*} - 1) + α (q^{*} - α^{*}) = \\ = α (α - q) - q^{*} p α^{*} α^{*} β = α (α - α^{*} β) - p α^{*} α^{*} β (1 - α^{*} β) = \\ = \frac{α}{β - α^{*} β^{*} β^{*}} [(α - α^{*} β) (β - α^{*} β^{*} β^{*}) - (β^{*} α^{*}) (β - β α^{*} β)] . \end{array}$

If $β > α >0.5$ , then $α > α^{*}$ , $β > β^{*}$ , and $α - α^{*} β > α^{*} - α^{*} β = β^{*} α^{*}$ , $β - α^{*} β^{*} β^{*} > β - α^{*} β β$ . Therefore, the difference of products in the above expression is positive. ■

The presented equilibrium construction clearly favors the strongest player as compared to the equilibrium in Lemma 5. In this equilibrium player $γ$ had to rely on both $β$ and $α$ missing their shots to survive. In the new construction, the same scenario following $β$ targeting $γ$ occurs as well, but with probability less than one, while in a complementary event of $α$ targeting $β$ and making a shot, $γ$ wins the prize. The probability of this complimentary event is maximized in this equilibrium.

For a specific example, consider $α =0.8,$ $β =0.9$ , $γ =1$ . Then $q =0.18$ , $p \sim 0.4454$ and players payoffs are $U_{α} \sim 0.0816, U_{β} \sim 0.1653, U_{γ} \sim 0.7531.$ In the equilibrium from lemma 5, $U_{α}^{0} =1 - U_{β}^{0} - U_{γ}^{0} \sim 0.8147, U_{β}^{0} = U_{β} \sim 0.1653, U_{γ}^{0} = β^{*} α^{*} =0.02.$

Corollary 2. If $α >0.5$ and the sequence of players is $β γ α$ , then there exists a continuum of subgame perfect equilibria in which player $β$ earns the same expected payoff as in the equilibrium in Theorem 1, and payoffs of players $α$ and $γ$ vary: $U_{γ}$ takes any value from $β^{*} α^{*}$ to the value from the Theorem 1.

Proof. Consider the equilibrium construction in the Theorem. Player $β$ in the first round can instead shoot at $γ$ with some probability $r$ . The higher is $r$ the higher is $U_{α}$ (and lower $U_{γ}$ ). For $r =1$ we obtain the payoffs as in Lemma 5 with $U_{γ} = β^{*} α^{*}$ . ■

Corollary 3. For sequence $γ β α$ and sufficiently high $α$ , there exists an SPE with players $γ$ and $β$ abstaining (with probability 1), and $γ$ earning the highest expected payoff.

Proof. The equilibrium construction here is similar to the one in Theorem 1 with one correction to account for $β$ shooting immediately after $γ$ and not before. Player $γ$ is going to abstain if $α$ did not abstain and $β$ did not target $γ$ previously. Player $α$ is going to mix targets, hoping that $β$ will target $γ$ with positive probability. Differently from above, player $β$ is going to abstain with probability $1$ whenever $γ$ abstained in the previous round and $α$ did not target $β$ in the round before. If $α$ targets $β$ , $β$ survives, $γ$ abstains, then $β$ is going to randomize between abstaining and shooting at $γ$ . Similarly to Theorem 1, this will provide incentives for $α$ to shoot at $β .$ ■

Condition $α >0.5$ in Theorem 1 is a simple sufficient condition for the equilibrium we present. The exact condition on $α$ and $β$ jointly is for the expression at the end of the proof of Theorem 1 to be positive to guarantee that $U_{γ} > α^{*}$ . If $α <0.5$ , when shooting first, $γ$ can get payoff $α^{*} >0.5$ (simply shooting $β$ ) and therefore also get highest payoff among the players.

Clearly our equilibrium construction also extends for values of $γ$ less than but close to 1 (while $γ > β)$ . When $γ$ is close to 1, the payoffs of all the players are similar to those in Theorem 1, and the equilibrium is supported by probabilities $q$ and $p$ , that make players $α$ and $β$ to randomize their actions. Certainly, $q$ and $r$ should be adjusted depending on $γ$ , but as long as they are within $(0, 1)$ the equilibrium construction works.

With our assumptions of three abstentions leading to “peace” outcome and $γ =1$ similar mixed strategy equilibria do not exist for other sequences of play, in particular, when $α$ is the first player to act and there are no previous abstentions. If we consider a different assumption on abstentions, for instance, if the “peace” realizes once some player abstains for the second time following abstentions by every player, then the presented equilibrium construction exists for all sequences of play. Player $α$ randomizes over $β$ and $γ$ , and if abstains, the other two players abstain as well, forcing player $α$ to pick a target.

Corollary 4. Suppose that there is no limit on how many abstentions lead to the “peace” outcome. “Peace” is resched only if all three players abstain forever. Then, for any sequences of players and sufficiently high $α$ , there exists an SPE with players $γ$ and $β$ abstaining (with probability 1), and $γ$ earning the highest expected payoff.

Proof. The above constructions can be amended by specifying that both players $β$ and $γ$ necessarily abstain if player $α$ abstains. Clearly then, abstaining forever is not optimal for player $α .$ ■

7. CONCLUSION

Truels and $N$ -els are fascinating conflicts with numerous applications that bring in new insights with each new wave of attention to them. The theory of truelts started with the survival of the weakest observation which was and continues to be the main driver of interest in truels. It continued with abstentions as a strategic choice which the weakest player is likely to make. An abstention is a practically feasible strategy so it has to be modeled formally. This eventually lead to a cooperative element in the analysis (Bossert, Brams, Kilgour, 2002) $—$ players can potentially cooperate even if they are fighting for survival and only one survives. Our novel equilibrium construction can also be interpreted like that: two strongest players cooperate to force the weakest player to shoot hoping that one of them survives and be the first to shoot in the ensuing duel. In this paper we show: it is not necessary that the stronger opponents target each other in equilibrium and that other types of SPE equilibria exist, with the strongest player having largest odds of surviving and in which multiple people abstain from shooting with positive probability.

Using mixed strategies is another important element of our construction. This allows to break the curse of the strongest player, viz. being everyone’s target. In our construction, using a mixed strategy by player $α$ provides the incentive for the strongest player to abstain from shooting, while using a mixed strategy by player $β$ conditional on some histories of playing provides the incentive for $α$ to mix. It is easy to overlook such possibilities if the strategies are assumed to be stationary.

Many questions remain for further research. In particular, it would be valuable to characterize the whole set of achievable subgame perfect equilibrium payoffs for different values of parameters for truels. Are there any other kinds of equilibria in $N$ -els with $N >3$ players?

[1] For example, consider two possible strategies of the player $α$ : always target $β$ (if not eliminated, otherwise target $γ$ ) and always target $γ$ (if not eliminated, otherwise target $β$ ). Fix some strategies of the players $β$ and $γ$ . Payoffs in the game (expected probability of being a sole survivor) are well defined once all the strategies are specified, and let $π_{α, β}$ and $π_{α, γ}$ be the payoffs to the player $α$ from the two considered pure strategies. Suppose that the player $α$ considers a mixed strategy: flip a coin and if heads target $β$ always and if tails target $γ$ always. Then the player $α$ ’s payoff from this strategy is simply $0.5 π_{α, β} + 0.5 π_{α, γ}$ . Of course, more complex mixed strategies can be considered, e.g., those that implement randomization at any time a player acts. In the example, the player $α$ can flip a coin between two players (if available) any time it has a shot. In equilibrium, a player’s mixed strategy has to give the maximal and identical payoff to the pure strategies it randomizes over.

Әдебиет тізімі

Amengual P., Toral R. (2006). Truels, or the survival of the weakest. Computing in Science and Engineering, 8 (5), 88–95.
Archetti M. (2012). Survival of the weakest in n-person duels and the maintenance of variation under constant selection. Evolution: International Journal of Organic Evolution, 66 (3), 637–650.
Bossert W., Brams S. J., Kilgour D. M. (2002). Cooperative vs non-cooperative truels: Little agreement, but does that matter? Games and Economic Behavior, 40 (2), 185–202.
Cole S. G., Phillips J. L., Hartman E. A. (1977). Test of a model of decision processes in an intense conflict situation. Behavioral Science, 22 (3), 186–196.
Dubovik A., Parakhonyak A. (2014). Drugs, guns, and targeted competition. Games and Economic Behavior, 87, 497–507.
Gardner M. (1966). More mathematical puzzles and diversions. London: Penguin.
Kilgour D. M. (1972). The simultaneous truel. International Journal of Game Theory, 1 (5), 229–242.
Kilgour D. M. (1975). The sequential truel. International Journal of Game Theory, 4 (3), 151–174.
Kilgour D. M. (1978). Equilibrium points of infinite sequential truels. International Journal of Game Theory, 6 (3), 167–180.
Kilgour D. M., Brams S. J. (1997). The truel. Mathematics Magazine, 70 (5), 315–326.
Kinnaird C. (1946). Encyclopedia of puzzles and pastimes. N.Y.: Citadel Press.
Mosteller F. (1987). Fifty challenging problems in probability with solutions. Reading: Addison-Wesley Publishing Co.
Phillips H. (1937). Question time: An omnibus of problems for a brainy day. London: J. M. Dent & Sons.
Richman J. T. (2020). Victory by the weakest: Effects of negative advertising in N > 2 candidate campaigns. Virginia Social Science Journal, 54, 30–39.
Shubik M. (1954). Readings in game theory and related behavior. Chapter: Does the fittest necessarily survive? N.Y.: Doubleday, Garden City.
Shubik M. (1964). Game theory and related approaches to social behavior selections. N.Y.: John Wiley & Sons.
Skaperdas S., Grofman B. (1995). Modeling negative campaigning. American Political Science Review, 89 (1), 49–61.
Toral R., Amengual P. (2005). Distribution of winners in truel games. In: AIP Conference Proceedings, 779, 128–141. College Park (MD): American Institute of Physics.
Wegener M., Mutlu E. (2021). The good, the bad, the well-connected. International Journal of Game Theory, 50 (3), 759–771.

Пайдаланушының аты
Құпиясөз
Мені есте сақтау

Құпия сөзді ұмыттыңыз ба?	Тіркеу

Пайдаланушының аты
Құпиясөз
Мені есте сақтау

Құпия сөзді ұмыттыңыз ба?	Тіркеу

Том 61, № 4 (2025)

Survival of the strongest in a sequential truel

Толық мәтін

Аннотация

Негізгі сөздер

Толық мәтін

1. INTRODUCTION

2. SEQUENTIAL TRUEL

3. PLAYER γ STRATEGIES

4. PLAYER β STRATEGIES

5. PLAYER α STRATEGIES

6. NOVEL SPE CONSTRUCTION: SURVIVAL OF THE STRONGEST

7. CONCLUSION

Авторлар туралы

D. Ilinskiy

S. Izmalkov

A. Savvateev

Әдебиет тізімі