Publications
You can also find my articles on my Google Scholar profile.
* denotes equal contribution.
- Qiwei Di, Jiafan He, Dongruo Zhou, Quanquan Gu. Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path. ICML 2023.
- Qiwei Di*, Tao Jin*, Yue Wu, Heyang Zhao, Farzad Farnoud, Quanquan Gu. Variance-Aware Regret Bounds for Stochastic Contextual Dueling Bandits. ICLR 2024.
- Qiwei Di, Heyang Zhao, Jiafan He, Quanquan Gu. Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning. ICLR 2024.
- Yue Wu, Tao Jin*, Qiwei Di*, Hao Lou, Farzad Farnoud, Quanquan Gu. Borda Regret Minimization for Generalized Linear Dueling Bandits. ICML 2024.
- Qiwei Di, Jiafan He, Quanquan Gu. Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback. Preprint.
- Binshuai Wang, Qiwei Di, Ming Yin, Mengdi Wang, Quanquan Gu, Peng Wei. Relative-Translation Invariant Wasserstein Distance. Preprint.
- Runjia Li, Qiwei Di, Quanquan Gu. Unified Convergence Analysis for Score-Based Diffusion Models with Deterministic Samplers. Preprint.
 
