Peter Auer, Nicol{\`{o}} Cesa-Bianchi, , and Claudio Gentile. Adaptive and self-confident on-line learning algorithms. \emph{JCSS}, 64\penalty0 (1):\penalty0 48--75, 2002{\natexlab{a}}. (A preliminary version has appeared in Proc. 13th Ann. Conf. Computational Learning Theory.). Peter Auer, Nicol\`o Cesa-Bianchi, Yoav Freund, and Robert E. Schapire. The nonstochastic multiarmed bandit problem. \emph{SIAM Journal on Computing}, 32\penalty0 (1):\penalty0 48--77, 2002{\natexlab{b}}. Nicol\`o Cesa-Bianchi, Yoav Freund, David P. Helmbold, David Haussler, Robert E. Schapire, and Manfred K. Warmuth. How to use expert advice. In \emph{STOC}, pages 382--391, 1993. Also, {\it Journal of the Association for Computing Machinery}, 44(3): 427-485 (1997). Nicol{\`{o}} Cesa-Bianchi and G\'{a}bor Lugosi. Potential-based algorithms in on-line prediction and game theory. \emph{Machine Learning}, 51\penalty0 (3):\penalty0 239--261, 2003. Nicol\`o Cesa-Bianchi, G\'{a}bor Lugosi, and Gilles Stoltz. Regret minimization under partial monitoring. unpublished manuscript, 2004. Nicol\`o Cesa-Bianchi, Yishay Mansour, and Gilles Stoltz. Improved second-order bounds for prediction with expert advice. In \emph{COLT}, 2005. D. Foster and R. Vohra. Calibrated learning and correlated equilibrium. \emph{Games and Economic Behavior}, 21:\penalty0 40--55, 1997. D. Foster and R. Vohra. Asymptotic calibration. \emph{Biometrika}, 85:\penalty0 379--390, 1998. D. Foster and R. Vohra. Regret in the on-line decision problem. \emph{Games and Economic Behavior}, 29:\penalty0 7--36, 1999. Dean P. Foster and Rakesh V. Vohra. A randomization rule for selecting forecasts. \emph{Operations Research}, 41\penalty0 (4):\penalty0 704--709, July--August 1993. Y. Freund, R. Schapire, Y. Singer, and M. Warmuth. Using and combining predictors that specialize. In \emph{Proceedings of the 29th Annual Symposium on Theory of Computing}, pages 334--343, 1997. Yoav Freund and Robert E. Schapire. A decision-theoretic generalization of on-line learning and an application to boosting. In \emph{Euro-COLT}, pages 23--37. Springer-Verlag, 1995. Also, JCSS 55(1): 119-139 (1997). Yoav Freund and Robert E. Schapire. Adaptive game playing using multiplicative weights. \emph{Games and Economic Behavior}, 29:\penalty0 79--103, 1999. (A preliminary version appeared in the Proceedings of the Ninth Annual Conference on Computational Learning Theory, pages 325--332, 1996.). \bibitem[Hart and Mas-Colell(2000)]{HM} S. Hart and A. Mas-Colell. A simple adaptive procedure leading to correlated equilibrium. \emph{Econometrica}, 68:\penalty0 1127--1150, 2000. S. Hart and A. Mas-Colell. A reinforcement procedure leading to correlated equilibrium. In Wilhelm Neuefeind Gerard Debreu and Walter Trockel, editors, \emph{Economic Essays}, pages 181--200. Springer, 2001. E. Lehrer. A wide range no-regret theorem. \emph{Games and Economic Behavior}, 42:\penalty0 101--115, 2003. Gilles Stoltz and G\'{a}bor Lugosi. Internal regret in on-line portfolio selection. In \emph{COLT}, 2003. To appear in Machine Learning Journal. Gilles Stoltz and G\'{a}bor Lugosi. Learning correlated equilibria in games with compact sets of strategies. submitted to Games and Economic Behavior, 2004.