Emergence and suppression of cooperation by action visibility in transparent games

Anton M. Unakafov; Thomas Schultze; Igor Kagan; Sebastian Möller; Stephan Eule; Fred Wolf

doi:10.1101/314500

Abstract

Real-world agents, such as humans, animals and robots, observe each other during interactions and choose their own actions taking the partners’ ongoing behaviour into account. Yet, classical game theory assumes that players act either strictly sequentially or strictly simultaneously (without knowing the choices of each other). To account for action visibility and provide a more realistic model of interactions under time constraints, we introduce a new game-theoretic setting called transparent game, where each player has a certain probability to observe the choice of the partner before deciding on its own action. Using evolutionary simulations, we demonstrate that even a small probability of seeing the partner’s choice before one’s own decision substantially changes evolutionary successful strategies. Action visibility enhances cooperation in a Bach-or-Stravinsky game, but disrupts cooperation in a more competitive iterated Prisoner’s Dilemma. In both games, strategies based on the “Win–stay, lose–shift” and “Tit-for-tat” principles are predominant for moderate transparency, while for high transparency strategies of “Leader-Follower” type emerge. Our results have implications for studies of human and animal social behaviour, especially for the analysis of dyadic and group interactions.

One of the most interesting questions in economics, biological, and social sciences is the emergence and maintenance of cooperation. A popular framework for studying cooperation (or the lack thereof) is Game Theory, which is frequently used to model interactions between “rational” decision-makers. In particular, a model for repeated interactions is provided by iterated games; two settings were previously used [1]:

Simultaneous games: players act at the same time without having any information about the current choice of the partners. Consequently, all players must make a decision under uncertainty concerning the choices of others.
Sequential games: players act in a certain order and the player acting later in the sequence is guaranteed to see the choices of the preceding players. Here the burden of uncertainty only applies to the first player or – if there are more than two players – becomes lighter with every turn in the sequence.

Both settings place a simplifying restriction on the decisional context: either all players have no information about the choices of the partners (simultaneous game), or some players always have more information than others (sequential game). This simplification might be disadvantageous for modelling certain behaviours, since humans and animals usually act neither strictly simultaneously nor sequentially, but observe the choices of each other and adjust their actions accordingly [2]. Indeed, the visibility of the partner’s actions plays a crucial role in social interactions, both in laboratory experiments [3–6] and in natural environments [7–11].

For example, in soccer the penalty kicker must decide where to place the ball and the goalkeeper must decide whether to jump to one of the sides or to stay in the centre. Since the ball reaches the target in 0.2-0.3 s [12], the goalkeeper cannot postpone the decision until the trajectory of the ball is clear, and must make the choice while opponent is preparing the shot. Thus, a simultaneous game could be used as a crude model for such interactions (see, for instance, [13, 14]). However, in practice, both players observe each other’s behaviour and try to anticipate the direction of the kick or of the goalkeeper’s jump from subtle preparatory cues [6]. Thanks to these observations, professional goalkeepers manage to use their tiny temporal advantage and predict the direction of the shot better than chance [12–14]. The advantage of a professional goalkeeper over an amateur kicker would result in even better prediction of the shooting direction. Similar considerations might apply to a wide range of interactions in real life; however, a framework for the treatment of such cases is missing in the classical game theory.

To better predict and explain the outcomes of interactions between agents by taking the visibility factor into account, we introduce the concept of transparent games, where players can monitor actions of each other. The access to the information about choices of other players is therefore probabilistic; in particular, for a game between two players at each round three cases are possible:

Player 1 knows the choice of Player 2 before making its own choice.
Player 2 knows the choice of Player 1 before making its own choice.
Neither of players knows the choice of the partner.

Which of these cases applies depends on the reaction times of the players. If they act nearly at the same time, neither is able to use the information about partner’s action; but a player who waits before making the choice has a higher probability to see the choice of the partner. Setting a time constraint (which is always present, either explicitly or implicitly, both in natural and in experimental situations) prevents players from waiting indefinitely for the partner’s choice. Then, given the reaction time distributions for the players, one can infer the probability of Player i to see the choice of the partner before making own choice.

Transparent games provide a general framework that also includes classical game-theoretical settings: simultaneous games correspond to , while sequential games result in for a fixed order of turns in each round (Player 1 always moves first, Player 2 – second), and in for a random order of turns.

The main question is whether the probabilistic access to the information in transparent games leads to the success of same or different behavioural strategies as compared to classic games. In other words, the possibility to see the choice of the partner on some occasions, to be observed by the partner on others, or to act under mutual uncertainty, may favour behaviours qualitatively different from those that yield the best performance in games with either full unidirectional transparency (sequential games) or with no transparency (simultaneous games).

To answer this question, here we study transparent versions of two classical two-player games: the iterated Prisoner’s dilemma (iPD) [15] and the iterated Bach-or-Stravinsky game (iBoS, also known as Battle of the Sexes and as Hero) [16]. We selected iPD and iBoS because they are counted among the most interesting games where cooperation is possible (non-zero-sum games) [16, 17], and because they require two distinct types of cooperative behaviour [18, 19]. While iPD is traditionally used for studying cooperation [15], iBoS is sometimes considered as a more suitable model [20, 21]. We employ evolutionary simulations, which allow evaluating optimal strategies using principles of natural selection, and consider memory-one strategies [22,23] that take into account own and partner’s choices at the previous round of the game.

We find that even a small probability of seeing the choice of the partner before one’s own decision changes the optimal behaviour in the iPD and iBoS games. The possibility to see the partner’s choice enhances cooperation in the generally cooperative iBoS, but disrupts cooperation in the more competitive iPD. Different transparency levels also bring qualitatively different strategies to success. In particular, we show that strategies based on the “Win–stay, lose–shift” and “Tit-for-tat” principles are the most successful in both games for low and moderate transparency, while for high transparency a new class of strategies, which we term “Leader-Follower” strategies, evolves. Although frequently observed in humans and animals (see, for instance, [24]), these strategies have up to now remained beyond the scope of game-theoretical studies, but naturally emerge in our transparent games frame-work.

Results

Evolutionary simulations for transparent games

We used evolutionary simulations [23] to investigate strategies evolving in transparent versions of iPD and iBoS. Payoff matrices for these games are shown in Fig. 1. In both games, evolution results in equal mean reaction times for all players (see “Methods” section). Then the probability p_see to see the choice of the partner is equal for all players, which in a dyadic game results in p_see ≤ 0.5.

Figure 1: Payoff matrices for Prisoner’s Dilemma and Bach-or-Stravinsky game.

a In Prisoner’s Dilemma, players adopt roles of criminals suspected of committing a crime, arrested and kept in isolated rooms. Since prosecutors do not have sufficient evidence, they offer each prisoner an option to minimize the punishment by making a confession. A prisoner can either betray the other by defecting (D), or cooperate (C) with the partner by remaining silent. The maximal charge is five years in prison, and the payoff matrix represents the number of years deducted from it (for instance, if both players cooperate (CC, upper left), each gets a two-year sentence, because 3 years of prison time have been deducted). b In Bach-or-Stravinsky game two people are choosing between Bach and Stravinsky music concerts. Player 1 prefers Bach, Player 2 – Stravinsky; yet, both prefer going to the concert together. To make the game symmetric we convert musical tastes to the behavioural descriptions: insisting (I) on own preference or accommodating (A) the preference of the partner. Here cooperation is achieved when players choose different actions: either (I, A) or (A, I). Importantly, in the classical version of these games it is assumed that the players cannot communicate.

We studied an infinite population of players using the methods described in [22, 23]. The population consists of “species” of players, each defined by a strategy vector s_i and frequency x_i(t) in the population with . A strategy determines the probability of a player to choose one of two actions, A₁ and A₂ (corresponding to cooperation and defection in iPD or to insisting and accommodating in iBoS, respectively). For each species i the strategy is represented by a vector , where k enumerates the 12 different situations in which the player can be when making the choice. These depend on the outcome of the previous round, whether or not the player can see the current choice of the partner, and what the choice is if it is visible. The thus represent the conditional probabilities to select action A₁, specifically

are probabilities to select A₁ without seeing partner’s choice, given that in the previous round the joint choice of the player and the partner was A₁A₁, A₁A₂, A₂A₁, and A₂A₂ respectively;
are probabilities to select A₁, seeing partner selecting A₁ and given the outcome of the previous round (as before).
are probabilities to select A₁, seeing partner selecting A₂ and given the outcome of the previous round.

Probabilities to select A₂ are represented by , respectively. To ensure numerical stability of the simulations, it is common to introduce a minimal possible error ε in the strategies such that , with ε = 0.001, see [22, 23]. The fact that players cannot have pure strategies and are prone to errors is also closely related to the “trembling hand” effect [22]. Note that in iPD no rational player would cooperate seeing that partner defects; thus we simplify iPD strategies by setting .

For every value of p_see = 0.0, 0.1,…,0.5 we performed 80 runs of evolutionary simulations tracing 10⁹ generations in each run. We began each run of simulations with five species having equal initial frequencies x₁(1) = … = x₅(1) = 0.2 and random strategies s_i. The frequency of the strategies x_i(t) evolved in time according to the replicator dynamics equation (see “Methods” section). If x_i(t) dropped below 0.001, the species was assumed to die out. On average every 100 generations new species with random strategies emerged in the population. Details of our simulations can be found in the “Methods” section.

Since the strategies in the evolutionary simulations were generated randomly, convergence to the theoretical optimum may take many generations and the observed successful strategies may deviate from the optimum. Therefore, we provide a coarse-grained description of strategies using the following notation: symbol 0 for , symbol 1 for , symbol ^* is used as a wildcard character to denote an arbitrary probability.

Let us exemplify this notation for the well-known strategies in the iPD. For instance, the Generous tit-for-tat (GTFT) strategy is encoded by (1a1b;1^***;0000), where 0.1 < a, b < 0.9. Indeed, GTFT cooperates with cooperators and forgives defectors. To satisfy the first property, the probability to cooperate after the partner cooperated in previous round should be rather high, say above 0.9, thus the corresponding entries of the strategy are encoded by 1. To satisfy the second property, probability to cooperate after partner defected should be somewhere between zero and one with the optimal value 1/3 [22]. Since evolving towards this optimum may take long, we allow a broad range of values for and , for instance [0.1, 0.9]. We leave arbitrary since for low values of p_see these entries have little influence on the strategy performance, meaning that their evolution towards optimal values may take especially long. Finally, as stated above, no sensible agent would cooperate in the iPD if aware that the partner is defecting, leading us to encode to by 0. Further we omit these predefined zero entries when referring to the iPD strategies. Thus, we encode GTFT by (1a1b;1^***), where 0.1 < a, b < 0.9. The Always Defect strategy (AllD) is encoded by (0000;^****), meaning that the probability to cooperate when not seeing partner’s choice is below 0.1, and behaviour when seeing partner’s choice in not specified. Win – stay, lose – shift (WSLS) is encoded by (1001;1^***), and Firm-but-fair (FbF) by (101b;1^***), where 0.1 < b < 0.9.

Transparency suppresses cooperation in Prisoner’s Dilemma

Simulation results for the transparent iPD are presented in Table 1. Most of the effective strategies were known from earlier studies on non-transparent games, but for high transparency (p_see → 0.5) a new previously unknown strategy emerged. We dub this strategy “Leader-Follower” (L-F); theoretically it is represented by s = (1, 1, 1, 1; 0, 0, 0, 0), that is the player cooperates when it does not see the choice of the partner and defects otherwise. In the simultaneous iPD (p_see = 0) L-F behaves as unconditional cooperator and is easily beaten, but it becomes predominant for p_see = 0.5. In the latter case, when two L-F players meet, the player acting first (the Leader) makes a “self-sacrificing” decision to cooperate, while the second player (the Follower) sees this and defects (note that for the next round the roles of the individuals may switch, thus ensuring a certain balance when reaping the benefits of exploiting a sacrificial first move). We classified as L-F all strategies with profile (^*11^*; ^*00^*) since behaviour after mutual cooperation or mutual defection is only relevant when L-F is playing against another strategy, and success for different types of behaviour depends on the composition of the population. For instance, (111^*; 000^*) is optimal in a cooperative population, while (^*110; ^*000) is more robust against defectors. Note that L-F did not emerge for sequential iPD in [25–27], since in these studies, players were bound to the same strategy regardless of whether they made their choice before or after the partner. In contrast, transparent games allow different sub-strategies (s₁, …, s₄) and (s₅, …, s₈) for these situations.

View this table:

Table 1: Frequencies of stable strategies in the iterated Prisoner’s Dilemma for different transparency levels.

The frequencies were computed over 10⁹ generations in 80 runs; strategies were counted as stable if they survived for more than 100 generations after they emerged in the population. The frequency of the most successful strategy for each p_see value is shown in bold.

Similar to the classic simultaneous iPD, WSLS was predominant in the transparent iPD for low and moderate p_see, which is reflected by the clearly visible WSLS profiles in the final strategies of the population (Fig. 2). Note that GTFT, another successful strategy in the simultaneous iPD, disappeared completely for p_see > 0. For p_see ≥ 0.4, the game resembled the sequential iPD and the results changed accordingly. Similar to the sequential iPD [25–27], the frequency of WSLS waned, the FbF strategy emerged, cooperation became less frequent and took longer to establish itself (Fig. 3a). For p_see = 0.5 the population was taken over either by L-F, WSLS or (rarely) by FbF, which is reflected by the mixed profile in Fig. 2. Pairwise comparison of strategies in iPD (Supplementary Fig. 1) helps to explain the superiority of WSLS for p_see < 0.5, the disappearance of GTFT for p_see > 0.0, and the drastic increase of L-F frequency for p_see = 0.5.

Figure 2: iPD strategies present in the final population.

Strategies are taken for the 10⁹-th generation and averaged over all runs. CC, CD, DC and DD stand for the outcomes “both players co-operate”, “self cooperated, partner defected”, “self defected, partner cooperated” and “both defected”, respectively. a Strategy entries s₁, …, s₄ are close to (1001) for p_see = 0.1, …, 0.3 demonstrating the dominance of WSLS. Deviations from this pattern for p_see = 0.0 and p_see = 0.4 indicate the presence of the GTFT (1a1b) and FbF (101b) strategies, respectively. The almost uniform profile for p_see = 0.5 is caused by mixture of WSLS and Leader-Follower strategies; as neither has an absolute majority in the population, s₁, …, s₄ are quite low. b Entries s₅, …, s₈ are irrelevant for p_see = 0.0 and indicate the same WSLS-like pattern for p_see = 0.1, …, 0.4. Note that s₆, s₇ > 0, which means that in transparent settings WSLS-players tend to cooperate seeing that the partner is cooperating even when this is against the WSLS principle. The abrupt decrease of reciprocal cooperation for p_see = 0.5 indicates the rise of Leader-Follower strategy.

Figure 3: Fraction of runs for which cooperation was established a in iPD and b in iBoS.

We assumed that cooperation was established in the population if the average payoff was above 0.9 3 for iPD and above 0.95 3.5 for iBoS (90% and 95% of maximal possible value). The threshold for iPD was proposed in [22], while for iBoS we set a higher threshold due to the less competitive nature of this game.

For p_see ≤ 0.3 cooperation evolved relatively quickly thanks to the predominance of WSLS. Fig. 3a shows that further increase of p_see apparently undermined cooperation in iPD, this is why in the realistic iPD-prototype a face-to-face interrogation would be used. However, Leader-Follower is in a sense a cooperative strategy for iPD: it alternates between cooperation and defection instead of using a synchronized cooperation.

Cooperation emergence in the transparent Bach-or-Stravinsky game

Our simulations revealed that four memory-one strategies are most effective in iBoS for various levels of transparency. In contrast to iPD there exist only few studies of iBoS strategies, therefore we describe the observed strategies in detail.

Turn-taker aims to enter a fair coordination regime, where players alternate between IA (Player 1 insists and Player 2 accommodates) and AI (Player 1 accommodates and Player 2 insists) states. In the simultaneous iBoS, this strategy takes the form (q, 0, 1, q), where q = 5/8 guarantees maximal reward in a non-coordinated play against a partner with the same strategy for the payoff matrix in Fig. 1b. We classify as Turn-takers all strategies encoded by (^*01^*;^*0^**;^**1^*). Turn-taking was shown to be successful in the simultaneous iBoS for a finite population of agents with pure strategies (i.e., having 0 or 1 entries only, with no account for mistakes) and a memory spanning three previous rounds [19].
Challenger takes the form (1, 1, 0, 1) in the simultaneous iBoS. When two players with this strategy meet, they initiate a “challenge”: both insist until one of the players makes a mistake (that is, accommodates). Then, the player making the mistake (looser) submits and continues accommodating, while the winner continues insisting. This period of unfair coordination beneficial for the winner ends when the next mistake of either player (the winner accommodating or the loser insisting) triggers a new “challenge”. This strategy is encoded by (11b^*;^****;^*1^**) and has two variants: Challenger “obeys the rules” and does not initiate the challenge after losing (b ≤ 0.1), while Aggressive Challenger may switch to insisting (0.1 < b ≤ 1/3). Challenging strategies were theoretically predicted to be successful in simultaneous iBoS [28, 29].
The Leader-Follower (L-F) strategy s = (1, 1, 1, 1; 0, 0, 0, 0; 1, 1, 1, 1) was not considered previously. In a game between two players with this strategy, the faster player insists and the slower player accommodates. In simultaneous game, this strategy lapses into inefficient stubborn insisting since all players consider themselves leaders, but in transparent settings with high p_see this strategy provides an effective and fair cooperation (because of the, on average, equal reaction times). When the whole population adopts an L-F strategy, most entries of the strategy vector become irrelevant since (i) only IA and AI states are visited and (ii) the faster player never accommodates. Therefore, we classify all strategies encoded by (^*11^*;^*00^*;^****) as L-F.
Challenging Leader-Follower is simply a hybrid of Challenger and L-F strategies encoded by (11b^*;0c0^*;^*1^**), where b > 1/3, c ≤ 1/3.

The results of the simulations are presented in Table 2. The entries of the averaged over all runs strategy established in the population (Fig. 4) show considerably different profiles for various values of p_see. Challengers, Turn-takers, and Leader-Followers succeeded for low, medium and high probabilities to see partner’s choice, respectively.

View this table:

Table 2: Frequencies of stable strategies in the Bach-or-Stravinsky game for different transparency levels.

Figure 4: iBoS strategies present in the final population.

Strategies entries a s₁, …, s₄, b s₅, …, s₈ and c s₉, …, s₁₂ are taken for the 10⁹-th generation and averaged over all runs. II, IA, AI and AA stand for the outcomes “both players insisted”, “self insisted, partner accommodated”, “self accommodated, partner insisted” and “both accommodated”, respectively. The decrease of the s₂/s₃ and s₁₀/s₁₁ ratios reflects the transition of the dominant strategy from challenging to turn-taking for p_see = 0.1, …, 0.4. For p_see = 0.5 the triumph of the Leader-Follower strategy is indicated by s₂ = s₃ = 1 and s₆ = s₇ = 0.

To provide additional insight into the results of the iBoS simulations, we studied how various strategies perform against each other (Supplementary Fig. 2). As with the iPD, this analysis helps to understand why different strategies were successful at different transparency levels.

In contrast to iPD, for iBoS high visibility results in a more effective cooperation, which is consistent with the notion that cooperation in the iBoS game rests on effective coordination (rather than trust in the good intentions of the partner). Indeed, for p_see ≥ 0.3 non-cooperative Challengers no longer constituted the majority of the population. The break of cooperation at p_see = 0.4 was caused by a transition from turn-taking to leader-following. Note that for p_see = 0.5 cooperation thrives and is established much faster than for lower transparency (Fig. 3b) thanks to the Leader-Follower strategy.

Discussion

In this paper, we introduced the concept of transparent games which integrates the visibility of the partner’s actions into a game-theoretic settings. Specifically, we considered iterated dyadic games where players have probabilistic access to the information about the partner’s choice in the current round. When reaction times for both players are equal on average, the probability p_see of accessing this information can vary from p_see = 0.0 (corresponding to the canonical simultaneous games) to p_see = 0.5 (corresponding to sequential games with random order of choices).

The value of p_see strongly affects the evolutionary success of strategies. In particular, for the iterated Prisoner’s Dilemma (iPD) we have shown that for p_see > 0 the Generous tit-for-tat strategy is unsuccessful and Win–stay, lose–shift becomes an unquestionable evolutionary winner. For p_see = 0.5, a new strategy, Leader-Follower triumphs. In the Bach-or-Stravinsky game (iBoS) even moderate p_see helps to establish cooperative turn-taking, while high p_see again brings the Leader-Follower strategy to success.

Despite the clear differences between the two games, predominant strategies evolving in iPD and iBoS for various levels of transparency have some striking similarities. First of all, in both games, Leader-Follower appears to be the most successful strategy for high p_see. This can be explained as follows: in a group where the behaviour of each agent is visible to the others and can be correctly interpreted, group actions hinge upon agents initiating these actions. The exact role of the initiators can vary: in some cases, these agents reap special benefits (for instance, dominant male baboons despotically initiate group movements to the foraging locations that are beneficial for themselves [30]), but in other cases they also carry the burden. Accordingly, in our study, Leaders enjoy maximal payoffs in the transparent iBoS game, but have to sacrifice their own payoff for the mutual success in the transparent iPD. Although counter-intuitive at first glance, the cooperativeness of Leaders in the L-F strategy corresponds to the behaviour of individuals that agree to do a necessary but risky or unpleasant job without immediate benefit. Examples include volunteering in human societies and acting as sentries in animal groups – watching out for predators while conspecifics forage for food [31, 32], see [33] for further examples. Note, however, that it is still debated how altruistic sentinel behaviour actually is [31, 32, 34, 35]. Such situations are formalized in game theory by a Volunteer’s Dilemma [33, 36, 37], but here we emphasize the aspect of visibility: the L-F strategy becomes dominant only when the probability that one of the players sees the choice of the other is close to one (that is for p_see close to 0.5). Thus self-sacrificing behaviour is only useful when others can interpret and utilize it, which is the case both for sentry animals and for human volunteers. Our results for the transparent iPD demonstrate that altruistic behaviour for the sake of the species success may evolve in a population even without direct reciprocity.

For low and moderate values of p_see the similarities of the two games are less obvious. However, the Challenger strategy in iBoS follows the same principle of “Win – stay, lose – shift” as the pre-dominant strategy WSLS in iPD, but with modified definitions of “win” and “lose”. For Challenger winning is associated with any outcome better than the minimal payoff corresponding to the mutual accommodation. Indeed, Challenger accommodates until mutual accommodation takes place and then switches to insisting. Such behaviour is described as “modest WSLS” in [29, 38] and is in-line with the interpretation of the “Win – stay, lose – shift” principle observed in animals [39].

The third successful principle in the transparent iPD is “Tit-for-tat”, embodied in Generous tit-for-tat (GTFT) and Firm-but-fair (FbF) strategies. This principle also works in both games since turn-taking in iBoS is nothing else but giving tit for tat. In particular, the FbF strategy, which occurs frequently in iPD for p_see ≥ 0.4, is partially based on taking turns and is similar to the Turn-Taker strategy in iBoS. The same holds to a lesser extent for the GTFT strategy.

The success of specific strategies for different levels of p_see makes sense if we understand p_see as a species’ ability to signal intentions and to interpret these signals when trying to coordinate (or compete). The higher p_see, the better (more probable) is the explicit coordination. This could mean that a high ability to explicitly coordinate actions leads to coordination based on observing the leader’s behaviour. In contrast, moderate coordination ability results in some form of turn-taking, while low ability leads to simple strategies of WSLS-type. In fact, an agent utilizing the WSLS principle does not even need to comprehend the existence of the second player, since WSLS “embodies an almost reflex-like response to the pay-off” [22]). The ability to cooperate may also depend on the circumstances, for example, on the physical visibility of partner’s actions. In a relatively clear situation, following the leader can be the best strategy. Moderate uncertainty requires some (implicit) rules of reciprocity embodied in turn-taking. High uncertainty makes coordination difficult or even impossible, and may result in a seemingly irrational “challenging behaviour” as we have shown for the transparent BoS. However, when players can succeed without coordination (which was the case in iPD), high uncertainty about the other players’ actions does not cause a problem.

By taking the visibility of agent’s actions into account, transparent games can provide a simple explanation for certain biological, sociological and psychological phenomena. Here, we illustrate the potential of this approach with two examples. The first concerns authoritarianism, a personality trait that manifests as uncritical acceptance of authority and is often associated with conformity. The most prominent example of how it manifests is the Milgram experiment [40]. In a series of studies presented as learning experiments, participants were tasked with punishing mistakes of another participant with increasingly painful electric shocks, under the premise of helping to learn more effectively. Some participants were willing to essentially electrocute the learner (who was a confidant of the experimenter) by applying shocks of up to 400 V. In one particular version of this experiment, where participants were urged to continue applying electro-shocks by a perceived authority figure (i.e., the experimenter), the proportion of participants who were willing to go to maximal voltage rose to about two thirds. Importantly, this conformity with authority occurred in a similar fashion across gender and ages, suggesting that is may be a universal human trait. Most people wonder why so many individuals show uncritical obedience to authorities, especially when considering how it can lead to unethical behaviour. The transparent iBoS results hint towards a provocative answer: a disposition for conformity might provide an evolutionary advantage because it allows for effective coordination. Thus, the sometimes extreme conformity observed in social psychology [41, 42], might – at least partially – rest in the evolutionary superiority of a Leader-Follower strategy.

Another application of transparent games is related to the burgeoning experimental research of social interactions, including the emergent field of social neuroscience that seeks to uncover the neural basis of social signalling and decision-making using neuroimaging and electrophysiology in humans and animals [43–46]. So far, most studies have focused on sequential [47, 48] or simultaneous games [49]. One of the main challenges in this field is extending these studies to direct real-time interactions that would entail a broad spectrum of dynamic competitive and cooperative behaviours. In line with this, several recent studies also considered direct social interactions in humans and non-human primates [3–5, 50–55] during dyadic games where players can monitor actions and outcomes of each other. Transparent games allow modelling the players’ access to social cues, which is essential for the analysis of experimental data in the studies of this kind [21]. This might be especially useful when behaviour is explicitly compared between “simultaneous” and “transparent” game settings, as in [3, 5, 50, 55]. In particular, the enhanced cooperation in the transparent iBoS for high p_see provides a theoretical explanation for the empirical observations in [5], where humans playing an iBoS-type game demonstrated a higher level of cooperation and a fairer payoff distribution when they were able to observe the actions of the partner while making their own choice. In view of the argument that true cooperation should benefit from enhanced communication [21], the transparent iBoS can in certain cases be a more suitable model for studying cooperation than the iPD (see also [56,57] for a discussion of studying cooperation by means of iBoS-type games).

In summary, transparent games provide a theoretically attractive link between classical concepts of simultaneous and sequential games, as well as a computational tool for modelling real-world interactions. We thus expect that the transparent games framework can help to establish a deeper understanding of social behaviour in humans and animals.

Methods

Transparent games between two players

In this study, we focus on iterated two-player two-action games: in every round both players choose one of two possible actions and get a payoff depending on the mutual choice according to the payoff matrix (Fig. 1). A new game setting, transparent game, is defined by a payoff matrix and probabilities of Player i to see the choice of the other player, . Note that , and is the probability that neither of players knows the choice of the partner because they act sufficiently close in time so that neither players can infer the other’s action prior to making their own choice. The probabilities can be computed from the distributions of reaction times for the two players, as shown in Supplementary Fig. 3 for reaction times modelled by exponentially modified Gaussian distribution [58, 59]. In this figure, reaction times for both players have the same mean, which results in symmetric distribution of reaction time difference (Supplementary Fig. 3b) and . Here we focus only on this case since for both games considered in this study, unequal reaction times provide a strong advantage to one of the players (see below). However, in general

To illustrate how transparent, simultaneous and sequential games differ, let us consider three setups for an iterated Prisoner’s Dilemma (iPD):

If prisoners write their statements and put them into envelopes, this case is described by simultaneous iPD.
If prisoners are questioned in the same room in a random or pre-defined order, this case is described by sequential iPD.
Finally, in a case of a face-to-face interrogation where prisoners are allowed to answer the questions of prosecutors in any order (or even to talk simultaneously) the transparent iPD comes into play. Here prisoners are able to monitor each other and interpret inclinations of the partner in order to adjust their own choice accordingly.

While the transparent setting can be used both in zero-sum and non-zero-sum games, here we concentrate on the latter class where players can cooperate to increase their joint payoff. For the purposes of this work, we define cooperation simply as joint actions towards mutually beneficially outcomes. In various areas more specific definitions of cooperation are used (see, for example, [7,21] for a discussion of cooperation in animals). We consider the transparent versions of two classical games, the iPD and the iterated Bach-or-Stravinsky game (iBoS). We have selected iPD and iBoS as representatives of two distinct types of symmetric non-zero-sum games [18, 19]: maximal joint payoff is awarded when players select the same action (cooperate) in iPD, but different actions in iBoS (one insists, and the other accommodates). The games of iPD type are known as synchronization games; other examples of synchronization games include Stag Hunt and Game of Chicken [19]. Games with two optimal mutual choices are called alternation games [18,19]; as one of these choices is more beneficial for Player 1, and the other for Player 2, to achieve fair cooperation players should alternate between these two states.

Another important difference between the two considered games is that in iBoS it is better to act before the partner, while in iPD – after the partner. Indeed, in iPD defection is less beneficial if it can be discovered by the opponent. Meanwhile in iBoS the player acting first has good chances to get the maximal payoff of 4 by insisting: when the second player knows that the partner insists, it is better to accommodate and get a payoff of 3, than to insist and get 2. Therefore, the optimal behaviour in iPD is to wait as long as possible, while in iBoS a player should react as quickly as possible. Consequently, evolution in these games favours species with marginal mean reaction times: maximal allowed reaction time in iPD and minimal allowed reaction time in iBoS. Species with different behaviour are easily invaded. Therefore we assumed in all simulations that the mean reaction times are constant, that is is the same for all species and all players have equal chances to see the choices of each other.

Evolutionary simulations for transparent games

For our evolutionary simulations we adopt the methods described in [22, 23]. Consider an infinite population of players evolving in generations. For any generation t = 1, 2, … the population consists of n(t) “species” defined by their strategies and their frequencies x_i(t) in the population, . Besides, the probability of a player from species i to see the choice of a partner from species j is given by (in our case for all species i and j, but in this section we use the general notation).

Consider a player from species i playing an infinitely long iterated game against a player from species j. Since both players use memory-one strategies, this game can be formalized as a Markov chain with states being the mutual choices of the two players and a transition matrix M given by where the matrices M₀, M₁ and M₂ describe the cases when neither player sees the choice of the partner, Player 1 sees the choice of the partner before making own choice, and Player 2 sees the choice of the partner, respectively. These matrices are given by

The gain of species i when playing against species j is given by the expected payoff E_ij, defined by where P_ij are the entries of the payoff matrix (P₁₁ = 3, P₁₂ = 0, P₂₁ = 5, P₂₂ = 1 for iPD and P₁₁ = 2, P₁₂ = 4, P₂₁ = 3, P₂₂ = 1 for iBoS, see Fig. 1), and y₁, y₂, y₃, y₄ represent the probabilities of getting to the states associated with the corresponding payoffs by playing s_i against s_j. This vector is computed as a unique left-hand eigenvector of matrix M associated with eigenvalue one [23]:

The evolutionary success of species i is encoded by its fitness f_i(t): if species i has higher fitness than the average fitness of the population then x_i(t) increases with time, otherwise x_i(t) decreases and the species is dying out. This evolutionary process is formalized by the replicator dynamics equation, which in discrete time takes the form

The fitness f_i(t) is computed as the expected payoff for a player of species i when playing with a random player from the current population: where E_ij is given by (2).

Each run of simulations starts with five species having equal initial frequencies: n(1) = 5, x₁(1) =… = x₅(1) = 0.2. Following [22], probabilities with k = 1, …, 12 for these species are randomly drawn from the distribution with U-shaped probability density: for y ∈ (0, 1). Additionally, we require , where ε = 0.001 accounts for the minimal possible error in the strategies [22]. The frequencies of strategies x_i(t) change according to the replicator dynamics equation (3). If x_i(t) < ∊, the species is assumed to die out and is removed from the population; we follow [22,23] in taking E = 0.001. Occasionally (every 100 generations on average), new species emerge in the population. The strategies for the new species are drawn from (4) and the initial frequencies are set to x_i(t₀) = 1.1∊ [22].

Evolutionary dynamics of two strategies

To provide an example of evolutionary dynamics and introduce some useful notation, we consider a population consisting of two species playing iPD with strategies: s₁ = (1−ε, ε, ε, 1−ε; 1−ε, ε, ε, 1−ε), s₂ = (ε, ε, ε, ε; ε, ε, ε, ε) (recall that for iPD ) and initial conditions x₁(1) = x₂(1) = 0.5. That is, the first species plays WSLS, and the second uses AllD. We set . Note that since and , it holds . Given p_see we can compute a transition matrix of the game using (1) and then calculate the expected payoffs for all possible pairs of players ij using (2). For instance, for p_see = 0 we have

This means that a player from the WSLS-species on average gets a payoff E₁₁ = 2.995 when playing against a conspecific partner, and only E₁₂ = 0.504, when playing against an AllD-player. The fitness for each species is given by

Since f₂(t) > f₁(t) for any 0 < x₁(t), x₂(t) < 1, the AllD-players take over the whole population after several generations. Dynamics of the species frequencies x_i(t) computed using (3) shows that this is indeed the case (Fig. 5a). Note that since E₂₁ > E₁₁ and E₂₂ > E₁₂, AllD is garanteed to win over WSLS for any initial frequency of WSLS-players x₁(1). In this case one says that AllD dominates WSLS and can invade it for any x₁(1).

As we increase p_see, the population dynamics changes. While for p_see = 0.2 AllD still takes over the population, for p_see = 0.4 WSLS wins (Fig. 5a). This can be explained by computing the expected payoff for p_see = 0.4:

Figure 5: Evolutionary dynamics of iPD-population consisting of species with WSLS and AllD strategies.

a Initially, both species have the same frequency, but after 40 generations the fraction of WSLS-players x₁(t) converges to 0 for probabilities to see partner’s choice p_see = 0.0, 0.2 and to 1 for p_see = 0.4, 0.5. b This is due to the decrease of the invasion threshold h₁ for WSLS: while h₁ = 1 for p_see = 0 (AllD dominates WSLS and the fraction of WSLS-players unconditionally decreases), AllD and WSLS are bistable for p_see > 0 and WSLS wins whenever x₁(t) > h₁. Arrows indicate whether frequency x₁(t) of WSLS increases or decreases. Note that h₁ = 0.5 for p_see = 1/3. A detailed quantitative discussion of this fact is beyond the scope of this paper and will be provided elsewhere.

Hence f₁(t) > f₂(t) for 0 ≤ x₂(t) ≤ 0.5 ≤ x₁(t) ≤ 0, which explains the observed dynamics. Note that here E₁₁ > E₂₁, while E₁₂ < E₂₂, that is a conspecific partner wins more than a partner from a different species when playing against WSLS- and AllD-players alike. In this case one says that WSLS and AllD are bistable and there is an unstable equilibrium fraction of WSLS players given by

We call h_i an invasion threshold for species i, since it takes over the whole population for x_i(t) > h_i, but dies out for x_i(t) < h_i. To illustrate this concept, we plot in Fig. 5b the invasion threshold h₁ for WSLS species playing against AllD as a function of p_see.

One more possible type of two-species dynamics is coexistence, which takes place when E₁₁ < E₂₁, E₁₂ > E₂₂, that is when playing against a player from any of the species is less beneficial for a conspecific partner than for a partner from a different species. In this case the fraction of a species given by (5) corresponds to a stable equilibrium meaning that the frequency of the first species x₁(t) increases for x₁(t) < h₁, but decreases for x₁(t) > h₁. We refer to [23] for more details.

Data availability

The empirical datasets generated during the current study and the source code used for this are available from the corresponding author on reasonable request.

Contributions

A.U. conceived the original idea and performed simulations with the help and advice of S.E. and F.W.; T.S., I.K. and S.M. contributed to the interpretation of the results. All authors contributed to writing and revision of the manuscript.

Competing interests

The authors declare no competing financial interests.

Acknowledgements

We acknowledge funding from the Ministry for Science and Education of Lower Saxony and the Volks-wagen Foundation through the program “Niedersächsisches Vorab”. Additional support was provided by the Leibniz Association through funding for the Leibniz ScienceCampus Primate Cognition and the Max Planck Society.

References

[1].↵
Axelrod, R. On six advances in cooperation theory. Analyse & Kritik 22, 130–151 (2000).
OpenUrl
[2].↵
Dugatkin, L., Mesterton-Gibbonsand, M. & Houston, A. Beyond the Prisoner’s dilemma: Toward models to discriminate among mechanisms of cooperation in nature. Trends in ecology & evolution 7, 202–205 (1992).
OpenUrl
[3].↵
Brosnan, S., Wilson, B. J. & Beran, M. Old world monkeys are more similar to humans than new world monkeys when playing a coordination game. Proceedings of the Royal Society of London B: Biological Sciences 279, 1522–1530 (2012).
OpenUrl CrossRef PubMed
[4].
Duguid, S., Wyman, E., Bullinger, A., Herfurth-Majstorovic, K. & Tomasello, M. Coordination strategies of chimpanzees and human children in a stag hunt game. Proceedings of the Royal Society of London B: Biological Sciences 281, 20141973 (2014).
OpenUrl CrossRef PubMed
[5].↵
Hawkins, R. & Goldstone, R. The formation of social conventions in real-time environments. PlOS ONE 11, e0151670 (2016).
OpenUrl
[6].↵
Vaziri-Pashkam, M., Cormiea, S. & Nakayama, K. Predicting actions from subtle preparatory movements. Cognition 168, 65–75 (2017).
OpenUrl
[7].↵
Silk, J. The strategic dynamics of cooperation in primate groups. Advances in the Study of Behavior 37, 1–41 (2007).
OpenUrl
[8].
Sueur, C. & Petit, O. Signals use by leaders in macaca tonkeana and macaca mulatta: group-mate recruitment and behaviour monitoring. Animal cognition 13, 239–248 (2010).
OpenUrl CrossRef PubMed Web of Science
[9].
King, A. & Sueur, C. Where next? Group coordination and collective decision making by primates. International Journal of Primatology 32, 1245–1267 (2011).
OpenUrl
[10].
Fichtel, C., Zucchini, W. & Hilgartner, R. Out of sight but not out of mind? behavioral coordination in red-tailed sportive lemurs (lepilemur ruficaudatus). International journal of primatology 32, 1383–1396 (2011).
OpenUrl PubMed
[11].↵
Strandburg-Peshkin, A., Papageorgiou, D., Crofoot, M. & Farine, D. Inferring influence and leadership in moving animal groups. Philosophical Transactions B: Biological Sciences 373, 2170006 (2017).
OpenUrl
[12].↵
Bar-Eli, M., Azar, O., Ritov, I., Keidar-Levin, Y. & Schein, G. Action bias among elite soccer goalkeepers: The case of penalty kicks. Journal of economic psychology 28, 606–621 (2007).
OpenUrl CrossRef
[13].↵
Chiappori, P.-A., Levitt, S. & Groseclose, T. Testing mixed-strategy equilibria when players are heterogeneous: The case of penalty kicks in soccer. American Economic Review 92, 1138–1151 (2002).
OpenUrl CrossRef Web of Science
[14].↵
Palacios-Huerta, I. Professionals play minimax. The Review of Economic Studies 70, 395–415 (2003).
OpenUrl CrossRef
[15].↵
Axelrod, R. & Hamilton, W. The evolution of cooperation. Science 211, 13–90 (1981).
OpenUrl
[16].↵
Rapoport, A. Exploiter, Leader, Hero, and Martyr: the four archetypes of the 2×2 game. Systems Research and Behavioral Science 12, 81–84 (1967).
OpenUrl
[17].↵
Colman, A. Game theory and its applications: In the social and biological sciences. 2nd ed. (London: Routledge, 2005).
[18].↵
Helbing, D., Schönhof, M., Stark, H.-U. & Hołyst, J. How individuals learn to take turns: Emergence of alternating cooperation in a congestion game and the prisoner’s dilemma. Advances in Complex Systems 8, 87–116 (2005).
OpenUrl CrossRef Web of Science
[19].↵
Colman, A. & Browning, L. Evolution of cooperative turn-taking. Evolutionary Ecology Research 11, 949–963 (2009).
OpenUrl Web of Science
[20].↵
Noë, R. A veto game played by baboons: a challenge to the use of the Prisoner’s Dilemma as a paradigm for reciprocity and cooperation. Animal Behaviour 39, 78–90 (1990).
OpenUrl CrossRef Web of Science
[21].↵
Noë, R. Cooperation experiments: coordination through communication versus acting apart together. Animal Behaviour 71, 1–18 (2006).
OpenUrl CrossRef Web of Science
[22].↵
Nowak, M. & Sigmund, K. A strategy of win-stay, lose-shift that outperforms tit-for-tat in the Prisoner’s Dilemma game. Nature 364, 56–58 (1993).
OpenUrl CrossRef PubMed Web of Science
[23].↵
Nowak, M. Evolutionary dynamics (Harvard University Press, 2006).
[24].↵
Tomasello, M., Melis, A., Tennie, C., Wyman, E. & Herrmann, E. Two key steps in the evolution of human cooperation: The interdependence hypothesis. Current anthropology 53, 673–692 (2012).
OpenUrl CrossRef Web of Science
[25].↵
Nowak, M. & Sigmund, K. The alternating prisoner’s dilemma. Journal of theoretical Biology 168, 219–226 (1994).
OpenUrl CrossRef Web of Science
[26].
Frean, M. The prisoner’s dilemma without synchrony. Proceedings of the Royal Society of London B: Biological Sciences 257, 75–79 (1994).
OpenUrl CrossRef
[27].↵
Zagorsky, B., Reiter, J., Chatterjee, K. & Nowak, M. Forgiver triumphs in alternating Prisoner’s Dilemma. PlOS ONE 8, e80814 (2013).
OpenUrl CrossRef PubMed
[28].↵
Friedman, D. Evolutionary games in economics. Econometrica: Journal of the Econometric Society 637–666 (1991).
[29].↵
Posch, M., Pichler, A. & Sigmund, K. The efficiency of adapting aspiration levels. Proceedings of the Royal Society of London B: Biological Sciences 266, 1427–1435 (1999).
OpenUrl CrossRef Web of Science
[30].↵
King, A., Douglas, C., Huchard, E., Isaac, N. & Cowlishaw, G. Dominance and affiliation mediate despotism in a social primate. Current Biology 18, 1833–1838 (2008).
OpenUrl CrossRef PubMed
[31].↵
Ridley, A., Raihani, N. & Bell, M. Experimental evidence that sentinel behaviour is affected by risk. Biology letters 445–448 (2010).
[32].↵
Bednekoff, P. Sentinel behavior: a review and prospectus. Advances in the study of behavior 47, 115–146 (2015).
OpenUrl
[33].↵
Archetti, M. The volunteer’s dilemma and the optimal size of a social group. Journal of Theoretical Biology 261, 475–480 (2009).
OpenUrl CrossRef PubMed Web of Science
[34].↵
Bednekoff, P. Mutualism among safe, selfish sentinels: a dynamic game. The American Naturalist 150, 373–392 (1997).
OpenUrl CrossRef PubMed Web of Science
[35].↵
Clutton-Brock, T. et al. Selfish sentinels in cooperative mammals. Science 284, 1640–1644 (1999).
OpenUrl Abstract/FREE Full Text
[36].↵
Diekmann, A. Volunteer’s dilemma. Journal of conflict resolution 29, 605–610 (1985).
OpenUrl CrossRef Web of Science
[37].↵
Archetti, M. A strategy to increase cooperation in the volunteer’s dilemma: reducing vigilance improves alarm calls. Evolution 65, 885–892 (2011).
OpenUrl CrossRef PubMed Web of Science
[38].↵
Posch, M. Win–Stay, Lose–Shift Strategies for Repeated Games—Memory Length, Aspiration Levels and Noise. Journal of theoretical biology 198, 183–195 (1999).
OpenUrl CrossRef PubMed
[39].↵
Clements, K. & Stephens, D. Testing models of non-kin cooperation: mutualism and the prisoner’s dilemma. Animal Behaviour 50, 527–535 (1995).
OpenUrl CrossRef Web of Science
[40].↵
Milgram, S. Behavioral study of obedience. The Journal of abnormal and social psychology 67, 371 (1963).
OpenUrl CrossRef Web of Science
[41].↵
Asch, S. Effects of group pressure upon the modification and distortion of judgments. Groups, leadership, and men 222–236 (1951).
[42].↵
Deutsch, M. & Gerard, H. A study of normative and informational social influences upon individual judgment. The journal of abnormal and social psychology 51, 629 (1955).
OpenUrl CrossRef Web of Science
[43].↵
Chang, S. An emerging field of primate social neurophysiology: Current developments. eNeuro 4, ENEURO-0295-17 (2017).
[44].
Ruff, C. & Fehr, E. The neurobiology of rewards and values in social decision making. Nature Reviews Neuroscience 15, 549 (2014).
[45].
Platt, M., Seyfarth, R. & Cheney, D. Adaptations for social cognition in the primate brain. Phil. Trans. R. Soc. B 371, 20150096 (2016).
OpenUrl CrossRef PubMed
[46].↵
Tremblay, S., Sharika, K. & Platt, M. Social decision-making and the brain: A comparative perspective. Trends in cognitive sciences 21, 265–276 (2017).
OpenUrl
[47].↵
Ballesta, S. & Duhamel, J.-R. Rudimentary empathy in macaques’ social decision-making. Proceedings of the National Academy of Sciences 112, 15516–15521 (2015).
OpenUrl Abstract/FREE Full Text
[48].↵
Báez-Mendoza, R. & Schultz, W. Performance error-related activity in monkey striatum during social interactions. Scientific Reports 6, 37199 (2016).
[49].↵
Haroush, K. & Williams, Z. Neuronal prediction of opponent’s behavior during cooperative social interchange in primates. Cell 160, 1233–1245 (2015).
OpenUrl CrossRef PubMed
[50].↵
Bullinger, A., Wyman, E., Melis, A. & Tomasello, M. Coordination of chimpanzees (pan troglodytes) in a stag hunt game. International Journal of Primatology 32, 1296–1310 (2011).
OpenUrl
[51].
Brosnan, S. et al. Responses to the assurance game in monkeys, apes, and humans using equivalent procedures. Proceedings of the National Academy of Sciences 108, 3442–3447 (2011).
OpenUrl Abstract/FREE Full Text
[52].
Visco-Comandini, F. et al. Do non-human primates cooperate? evidences of motor coordination during a joint action task in macaque monkeys. Cortex 70, 115–127 (2015).
OpenUrl CrossRef PubMed
[53].
Sánchez-Amaro, A., Duguid, S., Call, J. & Tomasello, M. Chimpanzees coordinate in a snowdrift game. Animal Behaviour 116, 61–74 (2016). URL http://www.sciencedirect.com/science/article/pii/S0003347216300045.
OpenUrl
[54].
Sánchez-Amaro, A., Duguid, S., Call, J. & Tomasello, M. Chimpanzees, bonobos and children successfully coordinate in conflict situations. In Proc. R. Soc. B, vol. 284, 20170259 (The Royal Society, 2017).
OpenUrl CrossRef PubMed
[55].↵
Brosnan, S. et al. Human and monkey responses in a symmetric game of conflict with asymmetric equilibria. Journal of Economic Behavior & Organization 142, 293–306 (2017).
OpenUrl
[56].↵
King, A., Johnson, D. & Van Vugt, M. The origins and evolution of leadership. Current biology 19, R911–R916 (2009).
OpenUrl CrossRef PubMed Web of Science
[57].↵
Devaine, M., Hollard, G. & Daunizeau, J. Theory of mind: did evolution fool us? PloS One 9, e87619 (2014).
OpenUrl CrossRef
[58].↵
Luce, R. Response times: Their role in inferring elementary mental organization (Oxford University Press, 1986).
[59].↵
Ratcliff, R. Methods for dealing with reaction time outliers. Psychological bulletin 114, 510 (1993).
OpenUrl CrossRef PubMed Web of Science

View the discussion thread.

Posted May 07, 2018.

Download PDF

Citation Tools

Subject Area

Evolutionary Biology

Subject Areas

All Articles

Animal Behavior and Cognition (5204)
Biochemistry (11718)
Bioengineering (8724)
Bioinformatics (29132)
Biophysics (14937)
Cancer Biology (12052)
Cell Biology (17362)
Clinical Trials (138)
Developmental Biology (9407)
Ecology (14146)
Epidemiology (2067)
Evolutionary Biology (18270)
Genetics (12223)
Genomics (16768)
Immunology (11844)
Microbiology (28016)
Molecular Biology (11560)
Neuroscience (60841)
Paleontology (450)
Pathology (1864)
Pharmacology and Toxicology (3231)
Physiology (4940)
Plant Biology (10405)
Scientific Communication and Education (1681)
Synthetic Biology (2878)
Systems Biology (7333)
Zoology (1642)

[1] [1].↵
Axelrod, R. On six advances in cooperation theory. Analyse & Kritik 22, 130–151 (2000).
OpenUrl

[2] [2].↵
Dugatkin, L., Mesterton-Gibbonsand, M. & Houston, A. Beyond the Prisoner’s dilemma: Toward models to discriminate among mechanisms of cooperation in nature. Trends in ecology & evolution 7, 202–205 (1992).
OpenUrl

[3] [3].↵
Brosnan, S., Wilson, B. J. & Beran, M. Old world monkeys are more similar to humans than new world monkeys when playing a coordination game. Proceedings of the Royal Society of London B: Biological Sciences 279, 1522–1530 (2012).
OpenUrl CrossRef PubMed

[4] [4].
Duguid, S., Wyman, E., Bullinger, A., Herfurth-Majstorovic, K. & Tomasello, M. Coordination strategies of chimpanzees and human children in a stag hunt game. Proceedings of the Royal Society of London B: Biological Sciences 281, 20141973 (2014).
OpenUrl CrossRef PubMed

[5] [5].↵
Hawkins, R. & Goldstone, R. The formation of social conventions in real-time environments. PlOS ONE 11, e0151670 (2016).
OpenUrl

[6] [6].↵
Vaziri-Pashkam, M., Cormiea, S. & Nakayama, K. Predicting actions from subtle preparatory movements. Cognition 168, 65–75 (2017).
OpenUrl

[7] [7].↵
Silk, J. The strategic dynamics of cooperation in primate groups. Advances in the Study of Behavior 37, 1–41 (2007).
OpenUrl

[8] [8].
Sueur, C. & Petit, O. Signals use by leaders in macaca tonkeana and macaca mulatta: group-mate recruitment and behaviour monitoring. Animal cognition 13, 239–248 (2010).
OpenUrl CrossRef PubMed Web of Science

[9] [9].
King, A. & Sueur, C. Where next? Group coordination and collective decision making by primates. International Journal of Primatology 32, 1245–1267 (2011).
OpenUrl

[10] [10].
Fichtel, C., Zucchini, W. & Hilgartner, R. Out of sight but not out of mind? behavioral coordination in red-tailed sportive lemurs (lepilemur ruficaudatus). International journal of primatology 32, 1383–1396 (2011).
OpenUrl PubMed

[11] [11].↵
Strandburg-Peshkin, A., Papageorgiou, D., Crofoot, M. & Farine, D. Inferring influence and leadership in moving animal groups. Philosophical Transactions B: Biological Sciences 373, 2170006 (2017).
OpenUrl

[12] [12].↵
Bar-Eli, M., Azar, O., Ritov, I., Keidar-Levin, Y. & Schein, G. Action bias among elite soccer goalkeepers: The case of penalty kicks. Journal of economic psychology 28, 606–621 (2007).
OpenUrl CrossRef

[13] [13].↵
Chiappori, P.-A., Levitt, S. & Groseclose, T. Testing mixed-strategy equilibria when players are heterogeneous: The case of penalty kicks in soccer. American Economic Review 92, 1138–1151 (2002).
OpenUrl CrossRef Web of Science

[14] [14].↵
Palacios-Huerta, I. Professionals play minimax. The Review of Economic Studies 70, 395–415 (2003).
OpenUrl CrossRef

[15] [15].↵
Axelrod, R. & Hamilton, W. The evolution of cooperation. Science 211, 13–90 (1981).
OpenUrl

[16] [16].↵
Rapoport, A. Exploiter, Leader, Hero, and Martyr: the four archetypes of the 2×2 game. Systems Research and Behavioral Science 12, 81–84 (1967).
OpenUrl

[17] [17].↵
Colman, A. Game theory and its applications: In the social and biological sciences. 2nd ed. (London: Routledge, 2005).

[18] [18].↵
Helbing, D., Schönhof, M., Stark, H.-U. & Hołyst, J. How individuals learn to take turns: Emergence of alternating cooperation in a congestion game and the prisoner’s dilemma. Advances in Complex Systems 8, 87–116 (2005).
OpenUrl CrossRef Web of Science

[19] [19].↵
Colman, A. & Browning, L. Evolution of cooperative turn-taking. Evolutionary Ecology Research 11, 949–963 (2009).
OpenUrl Web of Science

[20] [20].↵
Noë, R. A veto game played by baboons: a challenge to the use of the Prisoner’s Dilemma as a paradigm for reciprocity and cooperation. Animal Behaviour 39, 78–90 (1990).
OpenUrl CrossRef Web of Science

[21] [21].↵
Noë, R. Cooperation experiments: coordination through communication versus acting apart together. Animal Behaviour 71, 1–18 (2006).
OpenUrl CrossRef Web of Science

[22] [22].↵
Nowak, M. & Sigmund, K. A strategy of win-stay, lose-shift that outperforms tit-for-tat in the Prisoner’s Dilemma game. Nature 364, 56–58 (1993).
OpenUrl CrossRef PubMed Web of Science

[23] [23].↵
Nowak, M. Evolutionary dynamics (Harvard University Press, 2006).

[24] [24].↵
Tomasello, M., Melis, A., Tennie, C., Wyman, E. & Herrmann, E. Two key steps in the evolution of human cooperation: The interdependence hypothesis. Current anthropology 53, 673–692 (2012).
OpenUrl CrossRef Web of Science

[25] [25].↵
Nowak, M. & Sigmund, K. The alternating prisoner’s dilemma. Journal of theoretical Biology 168, 219–226 (1994).
OpenUrl CrossRef Web of Science

[26] [26].
Frean, M. The prisoner’s dilemma without synchrony. Proceedings of the Royal Society of London B: Biological Sciences 257, 75–79 (1994).
OpenUrl CrossRef

[27] [27].↵
Zagorsky, B., Reiter, J., Chatterjee, K. & Nowak, M. Forgiver triumphs in alternating Prisoner’s Dilemma. PlOS ONE 8, e80814 (2013).
OpenUrl CrossRef PubMed

[28] [28].↵
Friedman, D. Evolutionary games in economics. Econometrica: Journal of the Econometric Society 637–666 (1991).

[29] [29].↵
Posch, M., Pichler, A. & Sigmund, K. The efficiency of adapting aspiration levels. Proceedings of the Royal Society of London B: Biological Sciences 266, 1427–1435 (1999).
OpenUrl CrossRef Web of Science

[30] [30].↵
King, A., Douglas, C., Huchard, E., Isaac, N. & Cowlishaw, G. Dominance and affiliation mediate despotism in a social primate. Current Biology 18, 1833–1838 (2008).
OpenUrl CrossRef PubMed

[31] [31].↵
Ridley, A., Raihani, N. & Bell, M. Experimental evidence that sentinel behaviour is affected by risk. Biology letters 445–448 (2010).

[32] [32].↵
Bednekoff, P. Sentinel behavior: a review and prospectus. Advances in the study of behavior 47, 115–146 (2015).
OpenUrl

[33] [33].↵
Archetti, M. The volunteer’s dilemma and the optimal size of a social group. Journal of Theoretical Biology 261, 475–480 (2009).
OpenUrl CrossRef PubMed Web of Science

[34] [34].↵
Bednekoff, P. Mutualism among safe, selfish sentinels: a dynamic game. The American Naturalist 150, 373–392 (1997).
OpenUrl CrossRef PubMed Web of Science

[35] [35].↵
Clutton-Brock, T. et al. Selfish sentinels in cooperative mammals. Science 284, 1640–1644 (1999).
OpenUrl Abstract/FREE Full Text

[36] [36].↵
Diekmann, A. Volunteer’s dilemma. Journal of conflict resolution 29, 605–610 (1985).
OpenUrl CrossRef Web of Science

[37] [37].↵
Archetti, M. A strategy to increase cooperation in the volunteer’s dilemma: reducing vigilance improves alarm calls. Evolution 65, 885–892 (2011).
OpenUrl CrossRef PubMed Web of Science

[38] [38].↵
Posch, M. Win–Stay, Lose–Shift Strategies for Repeated Games—Memory Length, Aspiration Levels and Noise. Journal of theoretical biology 198, 183–195 (1999).
OpenUrl CrossRef PubMed

[39] [39].↵
Clements, K. & Stephens, D. Testing models of non-kin cooperation: mutualism and the prisoner’s dilemma. Animal Behaviour 50, 527–535 (1995).
OpenUrl CrossRef Web of Science

[40] [40].↵
Milgram, S. Behavioral study of obedience. The Journal of abnormal and social psychology 67, 371 (1963).
OpenUrl CrossRef Web of Science

[41] [41].↵
Asch, S. Effects of group pressure upon the modification and distortion of judgments. Groups, leadership, and men 222–236 (1951).

[42] [42].↵
Deutsch, M. & Gerard, H. A study of normative and informational social influences upon individual judgment. The journal of abnormal and social psychology 51, 629 (1955).
OpenUrl CrossRef Web of Science

[43] [43].↵
Chang, S. An emerging field of primate social neurophysiology: Current developments. eNeuro 4, ENEURO-0295-17 (2017).

[44] [44].
Ruff, C. & Fehr, E. The neurobiology of rewards and values in social decision making. Nature Reviews Neuroscience 15, 549 (2014).

[45] [45].
Platt, M., Seyfarth, R. & Cheney, D. Adaptations for social cognition in the primate brain. Phil. Trans. R. Soc. B 371, 20150096 (2016).
OpenUrl CrossRef PubMed

[46] [46].↵
Tremblay, S., Sharika, K. & Platt, M. Social decision-making and the brain: A comparative perspective. Trends in cognitive sciences 21, 265–276 (2017).
OpenUrl

[47] [47].↵
Ballesta, S. & Duhamel, J.-R. Rudimentary empathy in macaques’ social decision-making. Proceedings of the National Academy of Sciences 112, 15516–15521 (2015).
OpenUrl Abstract/FREE Full Text

[48] [48].↵
Báez-Mendoza, R. & Schultz, W. Performance error-related activity in monkey striatum during social interactions. Scientific Reports 6, 37199 (2016).

[49] [49].↵
Haroush, K. & Williams, Z. Neuronal prediction of opponent’s behavior during cooperative social interchange in primates. Cell 160, 1233–1245 (2015).
OpenUrl CrossRef PubMed

[50] [50].↵
Bullinger, A., Wyman, E., Melis, A. & Tomasello, M. Coordination of chimpanzees (pan troglodytes) in a stag hunt game. International Journal of Primatology 32, 1296–1310 (2011).
OpenUrl

[51] [51].
Brosnan, S. et al. Responses to the assurance game in monkeys, apes, and humans using equivalent procedures. Proceedings of the National Academy of Sciences 108, 3442–3447 (2011).
OpenUrl Abstract/FREE Full Text

[52] [52].
Visco-Comandini, F. et al. Do non-human primates cooperate? evidences of motor coordination during a joint action task in macaque monkeys. Cortex 70, 115–127 (2015).
OpenUrl CrossRef PubMed

[53] [53].
Sánchez-Amaro, A., Duguid, S., Call, J. & Tomasello, M. Chimpanzees coordinate in a snowdrift game. Animal Behaviour 116, 61–74 (2016). URL http://www.sciencedirect.com/science/article/pii/S0003347216300045.
OpenUrl

[54] [54].
Sánchez-Amaro, A., Duguid, S., Call, J. & Tomasello, M. Chimpanzees, bonobos and children successfully coordinate in conflict situations. In Proc. R. Soc. B, vol. 284, 20170259 (The Royal Society, 2017).
OpenUrl CrossRef PubMed

[55] [55].↵
Brosnan, S. et al. Human and monkey responses in a symmetric game of conflict with asymmetric equilibria. Journal of Economic Behavior & Organization 142, 293–306 (2017).
OpenUrl

[56] [56].↵
King, A., Johnson, D. & Van Vugt, M. The origins and evolution of leadership. Current biology 19, R911–R916 (2009).
OpenUrl CrossRef PubMed Web of Science

[57] [57].↵
Devaine, M., Hollard, G. & Daunizeau, J. Theory of mind: did evolution fool us? PloS One 9, e87619 (2014).
OpenUrl CrossRef

[58] [58].↵
Luce, R. Response times: Their role in inferring elementary mental organization (Oxford University Press, 1986).

[59] [59].↵
Ratcliff, R. Methods for dealing with reaction time outliers. Psychological bulletin 114, 510 (1993).
OpenUrl CrossRef PubMed Web of Science

Emergence and suppression of cooperation by action visibility in transparent games

Abstract

Results

Evolutionary simulations for transparent games

Transparency suppresses cooperation in Prisoner’s Dilemma

Cooperation emergence in the transparent Bach-or-Stravinsky game

Discussion

Methods

Transparent games between two players

Evolutionary simulations for transparent games

Evolutionary dynamics of two strategies

Data availability

Contributions

Competing interests

Acknowledgements

References

Citation Manager Formats

Subject Area