ACTIVITIES THIS WEEK

Robust Markov Decision Process

Speaker: Mai Anh Tiến (Singapore Management University)

Time: 9:30, Thursday, August 5, 2021

Venue: Room 302, Building A5, Institute of Mathematics

Online link:

meet.google.com/qio-vuvf-mro

Abstract: Markov decision processes (MDPs) are widely used in planning, reinforcement learning, and imitation learning. Policies in an MDP are sensitive to the state transition probabilities, and estimates of these probabilities may be inaccurate, so an MDP framework that can handle such uncertainty is both relevant and important. In this talk, I will present a robust version of the MDP model in which optimal policies are required to be robust to ambiguity in the underlying transition probabilities. I will show that essential properties of the non-robust MDP model also hold in our setting, which makes the robust MDP problem tractable. I will also show how our framework and results can be integrated into different algorithmic schemes, including value iteration and (modified) policy iteration, leading to new robust reinforcement learning and imitation learning algorithms that handle uncertainty. Analyses of computational complexity and error propagation under conventional uncertainty settings are also provided.
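
As a rough illustration of the kind of scheme the abstract mentions, the following is a minimal sketch of robust value iteration. It is not the speaker's method: it assumes finite state and action spaces and a finite set of candidate transition distributions for each state-action pair (a rectangular uncertainty set), and the function name, argument layout, and default parameters are all hypothetical.

```python
def robust_value_iteration(rewards, candidate_models, gamma=0.9,
                           tol=1e-8, max_iter=10_000):
    """Worst-case value iteration on a finite MDP (illustrative sketch).

    rewards[s][a]          : immediate reward for action a in state s
    candidate_models[s][a] : list of candidate next-state distributions,
                             each a list of probabilities over all states
    Returns the list of (approximate) robust state values.
    """
    n_states = len(rewards)
    values = [0.0] * n_states
    for _ in range(max_iter):
        new_values = []
        for s in range(n_states):
            action_values = []
            for a in range(len(rewards[s])):
                # The adversary picks the candidate distribution that
                # minimizes the expected value of the next state.
                worst = min(
                    sum(p * v for p, v in zip(dist, values))
                    for dist in candidate_models[s][a]
                )
                action_values.append(rewards[s][a] + gamma * worst)
            # The agent picks the action with the best worst-case value.
            new_values.append(max(action_values))
        gap = max(abs(x - y) for x, y in zip(new_values, values))
        values = new_values
        if gap < tol:
            break
    return values
```

With a single candidate distribution per state-action pair, this reduces to ordinary value iteration; adding further candidates can only lower the resulting values, since the inner minimum is taken over a larger set.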


