In misspecified environments, should an economic agent act rationally towards op- timizing some goal? If so, what should that goal be? Recent work has focused on the goal of bidirectional consistency of beliefs and actions, in effect finding a Nash equilibrium of an imaginary game in which one player chooses actions and another player chooses beliefs. In general, such outcomes, known as Berk-Nash equilibria, maximize neither log-likelihood nor objective payoffs over the combined space of beliefs and actions. We suggest an alternative: a solution concept and associated learning algorithm by which economic agents maximize a goal function that is a convex combination of log-likelihood (accuracy) and objective payoffs. This selects models that are Pareto efficient and favored by evolutionary forces. That is, in a society of individuals following different models, if models leading to high payoffs and accuracy replicate themselves or are imitated more than less successful models, then society evolves towards maximizing our goal function. One implication is that individuals who play Berk-Nash equilibrium in such societies will go extinct unless they happen to be successful in terms of our goal function.
Massari, F., Newton, J. (2026). Rational beliefs when the truth is not an option. INTERNATIONAL JOURNAL OF GAME THEORY, 55(1), 1-26 [10.1007/s00182-025-00976-w].
Rational beliefs when the truth is not an option
Filippo Massari
;Jonathan Newton
2026
Abstract
In misspecified environments, should an economic agent act rationally towards op- timizing some goal? If so, what should that goal be? Recent work has focused on the goal of bidirectional consistency of beliefs and actions, in effect finding a Nash equilibrium of an imaginary game in which one player chooses actions and another player chooses beliefs. In general, such outcomes, known as Berk-Nash equilibria, maximize neither log-likelihood nor objective payoffs over the combined space of beliefs and actions. We suggest an alternative: a solution concept and associated learning algorithm by which economic agents maximize a goal function that is a convex combination of log-likelihood (accuracy) and objective payoffs. This selects models that are Pareto efficient and favored by evolutionary forces. That is, in a society of individuals following different models, if models leading to high payoffs and accuracy replicate themselves or are imitated more than less successful models, then society evolves towards maximizing our goal function. One implication is that individuals who play Berk-Nash equilibrium in such societies will go extinct unless they happen to be successful in terms of our goal function.| File | Dimensione | Formato | |
|---|---|---|---|
|
WSEAL-3.pdf
embargo fino al 11/02/2027
Tipo:
Postprint / Author's Accepted Manuscript (AAM) - versione accettata per la pubblicazione dopo la peer-review
Licenza:
Licenza per accesso libero gratuito
Dimensione
240.52 kB
Formato
Adobe PDF
|
240.52 kB | Adobe PDF | Visualizza/Apri Contatta l'autore |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


