1. Home
  2. Research
  3. Centers & Labs
  4. RIKEN Center for Advanced Intelligence Project
  5. Generic Technology Research Group

RIKEN Center for Advanced Intelligence Project Sequential Decision Making Team

Team leader: Shinji Ito (Ph.D.)

Research Summary

Shinji It

The Sequential Decision Making Team works to develop algorithms and theories for making rational decisions in a sequential manner in the face of forecast uncertainty and environmental fluctuations. In recent years, along with the evolution of information technology, there has been a demand for technology to make rational decisions based on the large amount of data being generated in real-time in today's world. To meet this challenge, we promote research related to online learning, bandit problems, and reinforcement learning, aiming to understand effective decision-making algorithms in a fluctuating environment and to construct and extend theoretical systems that support such algorithms.

Main Research Fields

  • Informatics

Related Research Fields

  • Engineering
  • Mathematical & Physical Sciences
  • Theory of informatics
  • Mathematical informatics
  • Intelligent informatics

Keywords

  • Sequential decision-making
  • Online learning
  • Bandit problems
  • Reinforcement learning
  • Learning theory

Selected Publications

  • 1. S. Ito and K. Takemura:
    "An Exploration-by-Optimization Approach to Best of Both Worlds in Linear Bandits"
    Advances in Neural Information and Processing Systems 36 (NeurIPS), to appear (2023).
  • 2. S. Ito, D. Hatano, H. Sumita, K. Takemura, T. Fukunaga, N. Kakimura, and K.-I. Kawarabayashi:
    "Bandit Task Assignment with Unknown Processing Time“
    Advances in Neural Information and Processing Systems 36 (NeurIPS), to appear (2023).
  • 3. T. Tsuchiya, S. Ito, and J. Honda:
    "Stability-penalty-adaptive follow-the-regularized-leader: Sparsity, game-dependency, and best-of-both-worlds“
    Advances in Neural Information and Processing Systems 36 (NeurIPS), to appear (2023).
  • 4. S. Ito and K. Takemura:
    "Best-of-Three-Worlds Linear Bandit Algorithm with Variance-Adaptive Regret Bounds"
    Proceedings of 36th Conference on Learning Theory (COLT), pp. 2653-2677 (2023).
  • 5. T. Tsuchiya, S. Ito, and J. Honda:
    "Further Adaptive Best-of-Both-Worlds Algorithm for Combinatorial Semi-Bandits"
    Proceedings of The 26th International Conference on Artificial Intelligence and Statistics (AISTATS), pp. 8117-8144 (2023).
  • 6. J. Honda, S. Ito, and T. Tsuchiya:
    "Follow-the-Perturbed-Leader Achieves Best-of-Both-Worlds for Bandit Problems"
    Proceedings of The 34th International Conference on Algorithmic Learning Theory (ALT), pp. 726-754 (2023).
  • 7. T. Tsuchiya, S. Ito, and J. Honda:
    "Best-of-Both-Worlds Algorithms for Partial Monitoring"
    Proceedings of The 34th International Conference on Algorithmic Learning Theory (ALT), pp. 1484-1515 (2023).
  • 8. S. Ito, T. Tsuchiya, and J. Honda:
    "Nearly Optimal Best-of-Both-Worlds Algorithms for Online Learning with Feedback Graphs"
    Advances in Neural Information and Processing Systems 35 (NeurIPS), pp. 28631-28643 (2022).
  • 9. S. Ito:
    "Revisiting Online Submodular Minimization: Gap-Dependent Regret Bounds, Best of Both Worlds and Adversarial Robustness"
    Proceedings of the 39th International Conference on Machine Learning (ICML), pp. 9678-9694 (2022).
  • 10. S. Ito, T. Tsuchiya, and J. Honda:
    "Adversarially Robust Multi-Armed Bandit Algorithm with Variance-Dependent Regret Bounds"
    Proceedings of 36th Conference on Learning Theory (COLT), pp. 1421-1422 (2022).

Lab Members

Principal investigator

Shinji Ito
Team leader

Core members

Junya Honda
Visiting Scientist
Taira Tsuchiya
Visiting Scientist
Junpei Komiyama
Visiting Scientist
Kazushi Tsutsui
Visiting Scientist

Contact Information

Nihonbashi 1-chome Mitsui Building, 15th floor,
1-4-1 Nihonbashi, Chuo-ku, Tokyo
103-0027, Japan
Email:shinji.ito.hh [at] riken.jp

Top