1. INTRODUCTION
A truel is a duel-like competition between three players, in which the players fight for survival or for a prize by targeting (shooting at) opponents. Only one opponent can be targeted at a time. Originally, the sequential truel appeared as a brain teaser and has been part of many collections of mathematical puzzles (Phillips, 1937; Kinnaird, 1946; Gardner, 1966; Mosteller, 1987). The problem statement can be written as follows. Each of the three players is endowed with marksmanship the probability of eliminating the opponent with a shot. Three players are different, with marksmanships of , , and (or , , and ). Whom should they target? Who wins? These problems have a paradoxical answer the weakest player has the highest chance of surviving while the strongest player (a sure shot) has the lowest survival odds. There is also an extra layer to the problem the weakest player might do even better by intentionally abstaining, e.g., by shooting in the ground or in the air.
M. Shubik (Shubik, 1964, 4346) coined the term truel and described it as an example of a game in which the pursuit of individual goals leads to a paradoxical result. D. Kilgour (Kilgour, 1975) initiated a game-theoretic analysis of sequential truels as infinitely repeated games with and without abstentions. He considered undominated Nash equilbria in stationary strategies and showed that if the players marksmanships are different from each other, then the equilibrium is essentially unique, and for many values of the parameters the weakest player has the strongest survival odds. This happens because for two stronger opponents it is optimal to target each other, while the weakest player can simply stay aside and wait until one of the stronger opponents is eliminated. Many variations of truels have been considered: sequential truels with arbitrary duel values (Kilgour, 1978), simultaneous truels (Kilgour, 1972), truels with random order of shooting, finitely many rounds or bullets, possibilities for cooperation, target-undetectable or silent truels (Kilgour, Brams, 1997; Bossert, Brams, Kilgour, 2002). P. Amengual and R. Toral (Amengual, Toral, 2006) analyse different versions of the sequential game, and analyse them as Markov chains with three absorbing states (i.e. the possible victories of each of the players). A qualitatively robust prediction appears to follow from existing results: whenever there is a fight, the two strongest opponents shoot at each other, and, unless the marksmanships are dramatically different, the weakest player has the highest survival odds.
R. Shubik (Shubik, 1954) used a three-person duel to model ‘what sort of individual is best suited to survive, when every man acts for himself and by himself’. He pointed out immediate applications of truels to the analysis of elections, political competition, and multi-country interaction, arguing that a weaker country can benefit if it gets itself in the middle of a non-cooperative conflict between two stronger countries. Truels and -els were used for modeling decisions in intense conflict situations (Cole, Phillips, Hartman, 1977), evolutionary biology to explain persistent extensive variation in competitive skills (Archetti, 2012), dynamic targeted competition, where actions can be directed at a particular opponent (Dubovik, Parakhonyak, 2014), negative advertising, especially in political campaigning (Skaperdas, Grofman, 1995; Richman, 2020). (Toral, Amengual, 2005) consider a variant of a truel a game of opinion making or dynamic persuasion, in which players try to convince the others in their opinion on some matter and so all players are active until the consensus is reached. In all these applications the central feature is the advantage of the weakest player. Recently, in the article M. Wegener and E. Mutlu (Wegener, Mutlu, 2021) offered a model of network formation and evolution, in which players of three possible types (marksmanships) engage in truels with their neighbors. The authors assume that players target the strongest among the remaining opponents but not of the similar type, and show that the evolutionary pressures over network actually lead to strongest types surviving more often.
In this paper we challenge the existing findings regarding the survival of the weakest. We present a novel subgame-perfect equilibrium construction for a sequential truel in which the strongest player has the largest survival odds. We depart from the existing literature in three key elements: 1) we allow players to use non-stationary (and non-Markovian) strategies; 2) we consider mixed strategies; 3) we explicitly define the conditions and outcomes for a ‘peace’ scenario the game ends if all the three players abstain in a row. This equilibrium exists for a specific order of play, in which the two stronger opponents act before the weakest one. When it exists, there are multiple subgame-perfect equilibria including the existing stationary construction in which two stronger opponents target each other.
In equilbrium, the strongest opponent, player abstains if the middle-skilled player has not targeted previously; player abstains if the weaker player targeted in the previous round, and randomizes between abstaining and targeting , if shot at . If both stronger players abstain, the weaker player has to target someone and randomizes between the opponents. The player ’s incentives to target are provided by the expectation that then targets with some positive probability, in which case ends up in a duel shooting first. In turn, randomization by is needed to provide incentives for to abstain. In essence, the two stronger players tacitly collude to force to target someone. The strongest player benefits when targets , which can happen in a subgame-perfect non-stationary equilibrium!
2. SEQUENTIAL TRUEL
There are three players competing for being the sole survivor (winner) in a sequential duel-like game. At the beginning of the truel all players are alive. At each turn, a player whose turn is to shoot can target any of the remaining alive opponents or abstain from shooting (delope, e.g. shoot at the air or shoot at the ground). A targeted player is eliminated with probability equal to the marksmanship of the shooter. All the players observe who was the intended target and what is the outcome of the shot. The order of shooting is chosen in advance, and once and for all. No player can have two turns while some other (alive) player had none. Eliminated players do not shoot. The truel ends when all but one player was eliminated or when all alive players abstained from shooting one after another. The single survivor obtains the ‘survivor’ prize, the payoff to which is normalized to 1, while the payoff of all the other players is 0. In the case the truel ends due to ‘inactivity’, all the players receive the payoff of 0. The solution concept is subgame perfect equilibrium (SPE).
In our game-theoretic analysis we consider not only pure strategies for players that specify a particular target (or abstaining) after any history that can arise but also mixed strategies, where each player can randomize over its pure strategies and, in particular, randomly choose its target (or abstain) with some probabilities when called to act. The payoff from a mixed strategy is an expectation of the profits from pure strategies[1].
We refer to the players by their marksmanship, , , , and suppose that , Let and be the probabilities of surviving a shot from a respective player. Denote by the order (sequence) of shooting. For instance, if , then player shoots first, then player shoots (if alive), then player , then again player and so on. We write with 0, 1 or 2 to highlight the number of successive abstentions prior to the first player in shooting. Finally, let and be, respectively, the lowest and the highest SPE payoffs to player if the sequence of shooting is seq. For instance, is the minimal SPE payoff of player for the sequence for the subgame following one abstention (by player
Some of our parametric assumptions, such as , unequal marksmanships, and 0 payoffs to the ‘peace’ outcome, are made to simplify the exposition. For example, all results (with slight modification in statements) are true for in small neighbourhood of 1, but will require more technical details.
Some other parametric assumptions, such as three abstentions mean ‘peace’ are needed to complete specification of the game, which in turn allows for analysis of mixed-strategy equilibria. It would be interesting to know, how equilibria are affected when the number of abstentions that terminate the game or the ‘peace’ payoff vary, but this is left for future analysis.
There are six possible sequences of players, which can be divided in two groups: either acts immediately after or vise versa. Since our main goal is to present an explicit construction of equilibria in which the strongest player obtains high payoffs we will concentrate on the first case. Once we present the new equilibrium for sequence in Section 6, as a corollary, we obtain a similar construction for sequence , in which acts immediately after . Since we need to stimulate player to shoot in our equilibrium, it is crucial that must be the last player to shoot.
Lemma 1. The minimal and maximal SPE payoffs of players and satisfy the following inequalities
(Dots here denote any of the appropriate order of shooting.)
Proof. The minimal SPE payoff of player (or ) cannot be lower than what the player (or ) can obtain by targeting and, if successful, ending up in a duel shooting second. Clearly, there would be no abstentions in any duel. Similarly, the maximal possible payoff a player can get cannot be higher than that from a duel against the weakest of the opponents shooting first.■
3. PLAYER γ STRATEGIES
Lemma 2. In any SPE player does not target if is alive. Player ’s minimal SPE payoff for any sequence of moves in which shoots first is bounded from below:
Proof. If player shoots at another player, the target is eliminated and ends up in a duel with the remaining player where that player shoots first. A duel with the weakest opponent is preferred. Thus, player can guarantee payoff of by targeting .■
Can obtain a higher payoff than in equilibrium when is the first to act? For this to happen, has to shoot in the air and one of the opponents has to target the other one with positive probability in the future, so that reaches a duel shooting first. Otherwise, if players and target or abstain each time they act, player has to target someone to end up in a duel (shooting second) to obtain positive payoff, which is at most .
4. PLAYER β STRATEGIES
So, from here on we consider sequences of players in which shoots immediately after
Lemma 3. For all sequences of players in which player shoots immediately after , player never targets .
Proof. Suppose not, and that there exists an SPE in which targets . Consider a subgame where this happens. If eliminates , then eliminates in the next round with probability 1. Therefore, has to expect a substantial continuation payoff following a miss: or, from Lemma 1
(1)
Let us focus on and the equilibrium in which obtains . As follows from Lemma 2, player has to abstain with positive probability for to get a positive payoff as otherwise eliminates . Player then receives at least , while who moves next obtains at least . Combined, the sum of payoffs of all three players has to be at least From Lemma 1 and inequality (1), this sum exceeds
since that is a contradiction.
Thus, we have shown that for considered sequences of players in any SPE neither nor target . This is consistent with previous findings (Shubik, 1954; Kilgour, 1975; Toral, Amengual, 2005), yet here we have derived it without assumptions of stationarity, pure strategies, or necessity of targeting someone.
5. PLAYER α STRATEGIES
We explore first, what player can get by abstaining.
Lemma 4. For any SPE, for any subgame in which player has a chance to abstain without ending the game, we have (here or 1)
In addition, for sequences and player targets , while for sequence player targets .
Proof. Suppose player abstains. If also abstains, then targets (who obtains payoff ), while ends up in the duel with shooting first, getting payoff . Thus, will not abstain, and target instead.
If eliminates , then obtains . If misses and targets , then obtains payoff . Finally, if misses and abstains, we come to the subgame in which can again abstain, and, as above, either guarantee herself at least or reach a subgame in which she can abstain, and so on. In expectation over all possible outcomes, player obtains at least in any SPE whenever she (he) can abstain.
Player does not abstain if can abstain next round, and thus targets for sequence . Indeed, since can abstain and guarantee herself payoff , by targeting , player obtains . If abstains, then obtains a positive payoff, in this case, what remains to is strictly smaller than .
In turn, if player knows that is able to abstain, she will expect that abstains and then targets . So, player should target and not abstain in this case.■
Thus, having an option to abstain allows player to guarantee herself a decent payoff as the other players target each other. Will actually abstain in equilibrium? Suppose that and target each other also when targets one of them (as in (Kilgour, 1975)). Then, for it is better to target , since the duel with the weaker of the two opponents is preferred. By targeting , player obtains payoff and by abstaining she gets
Therefore, targeting is better for , if
; ; ;
;
We have shown the following lemma.
Lemma 5. A subgame perfect equilibrium in which neither nor abstain exists. In this equilibrium, player targets , player targets , and player targets if and abstains if .
It is trivial to check that the described strategies indeed form an SPE. Note that if or , player abstains no matter what is.
6. NOVEL SPE CONSTRUCTION: SURVIVAL OF THE STRONGEST
Can it happen in equilibrium that players or or both abstain? Consider sequence To abstain, player should expect that will also abstain with high probability in the next round and that later (not necessarily immediately in round 3) will target , hoping for a duel with in which shoots first. In turn, for player to abstain, player should expect that will be shooting at with positive probability, as otherwise ’s expected payoff is strictly below . For both and to abstain, player has to target someone!
Theorem 1 (main result). For sequence and , there exists a mixed SPE in which players and abstain with positive probabilities, while player mixes between the opponents. The equilibrium utilities of the players ( ) are given by the following formulas:
where
Proof. Consider the following profile of strategies a candidate for the equilibrium. Player abstains for (following an abstention by if it is the only one in a row) and targets in any other subgame. Player abstains in round 1 and for any subgame in which targeted and missed in the previous round. If targeted and missed , then mixes between abstaining and targeting . (This will provide incentives for to target .) For all other subgames, player targets . Finally, player mixes shooting at and for and abstains in all other subgames.
For the transition matrix between subgames for the proposed profile of strategies see Figure. Each decision node is marked by the acting player with the number of previous abstentions in the brackets. Only the transitions following misses are shown. Horizontal arrows point to the intended target, stands for abstention. In figure 1 and are the probabilities with which players and target in their mixed strategies.
Figure. Transitions in equilibrium
For the suggested profile to be an SPE, the following set of conditions has to be satisfied. Here, is the equilibrium utility of player , , and . By we denote the utility of player using pure strategy shooting .
Probability can be found from the condition for player to be indifferent between targeting and abstaining:
Similarly, from the indifference condition for player we can find :
;
; ;
.
Finally, we need to check that prefers abstaining from targeting . From the expression for above we can compute
We want to show that , when . We have
If , then , , and , . Therefore, the difference of products in the above expression is positive. ■
The presented equilibrium construction clearly favors the strongest player as compared to the equilibrium in Lemma 5. In this equilibrium player had to rely on both and missing their shots to survive. In the new construction, the same scenario following targeting occurs as well, but with probability less than one, while in a complementary event of targeting and making a shot, wins the prize. The probability of this complimentary event is maximized in this equilibrium.
For a specific example, consider , . Then , and players payoffs are In the equilibrium from lemma 5,
Corollary 2. If and the sequence of players is , then there exists a continuum of subgame perfect equilibria in which player earns the same expected payoff as in the equilibrium in Theorem 1, and payoffs of players and vary: takes any value from to the value from the Theorem 1.
Proof. Consider the equilibrium construction in the Theorem. Player in the first round can instead shoot at with some probability . The higher is the higher is (and lower ). For we obtain the payoffs as in Lemma 5 with . ■
Corollary 3. For sequence and sufficiently high , there exists an SPE with players and abstaining (with probability 1), and earning the highest expected payoff.
Proof. The equilibrium construction here is similar to the one in Theorem 1 with one correction to account for shooting immediately after and not before. Player is going to abstain if did not abstain and did not target previously. Player is going to mix targets, hoping that will target with positive probability. Differently from above, player is going to abstain with probability whenever abstained in the previous round and did not target in the round before. If targets , survives, abstains, then is going to randomize between abstaining and shooting at . Similarly to Theorem 1, this will provide incentives for to shoot at ■
Condition in Theorem 1 is a simple sufficient condition for the equilibrium we present. The exact condition on and jointly is for the expression at the end of the proof of Theorem 1 to be positive to guarantee that . If , when shooting first, can get payoff (simply shooting ) and therefore also get highest payoff among the players.
Clearly our equilibrium construction also extends for values of less than but close to 1 (while . When is close to 1, the payoffs of all the players are similar to those in Theorem 1, and the equilibrium is supported by probabilities and , that make players and to randomize their actions. Certainly, and should be adjusted depending on , but as long as they are within the equilibrium construction works.
With our assumptions of three abstentions leading to “peace” outcome and similar mixed strategy equilibria do not exist for other sequences of play, in particular, when is the first player to act and there are no previous abstentions. If we consider a different assumption on abstentions, for instance, if the “peace” realizes once some player abstains for the second time following abstentions by every player, then the presented equilibrium construction exists for all sequences of play. Player randomizes over and , and if abstains, the other two players abstain as well, forcing player to pick a target.
Corollary 4. Suppose that there is no limit on how many abstentions lead to the “peace” outcome. “Peace” is resched only if all three players abstain forever. Then, for any sequences of players and sufficiently high , there exists an SPE with players and abstaining (with probability 1), and earning the highest expected payoff.
Proof. The above constructions can be amended by specifying that both players and necessarily abstain if player abstains. Clearly then, abstaining forever is not optimal for player ■
7. CONCLUSION
Truels and -els are fascinating conflicts with numerous applications that bring in new insights with each new wave of attention to them. The theory of truelts started with the survival of the weakest observation which was and continues to be the main driver of interest in truels. It continued with abstentions as a strategic choice which the weakest player is likely to make. An abstention is a practically feasible strategy so it has to be modeled formally. This eventually lead to a cooperative element in the analysis (Bossert, Brams, Kilgour, 2002) players can potentially cooperate even if they are fighting for survival and only one survives. Our novel equilibrium construction can also be interpreted like that: two strongest players cooperate to force the weakest player to shoot hoping that one of them survives and be the first to shoot in the ensuing duel. In this paper we show: it is not necessary that the stronger opponents target each other in equilibrium and that other types of SPE equilibria exist, with the strongest player having largest odds of surviving and in which multiple people abstain from shooting with positive probability.
Using mixed strategies is another important element of our construction. This allows to break the curse of the strongest player, viz. being everyone’s target. In our construction, using a mixed strategy by player provides the incentive for the strongest player to abstain from shooting, while using a mixed strategy by player conditional on some histories of playing provides the incentive for to mix. It is easy to overlook such possibilities if the strategies are assumed to be stationary.
Many questions remain for further research. In particular, it would be valuable to characterize the whole set of achievable subgame perfect equilibrium payoffs for different values of parameters for truels. Are there any other kinds of equilibria in -els with players?
[1] For example, consider two possible strategies of the player : always target (if not eliminated, otherwise target ) and always target (if not eliminated, otherwise target ). Fix some strategies of the players and . Payoffs in the game (expected probability of being a sole survivor) are well defined once all the strategies are specified, and let and be the payoffs to the player from the two considered pure strategies. Suppose that the player considers a mixed strategy: flip a coin and if heads target always and if tails target always. Then the player ’s payoff from this strategy is simply . Of course, more complex mixed strategies can be considered, e.g., those that implement randomization at any time a player acts. In the example, the player can flip a coin between two players (if available) any time it has a shot. In equilibrium, a player’s mixed strategy has to give the maximal and identical payoff to the pure strategies it randomizes over.