Publications
You can also find my articles on my Google Scholar profile.
* means Equal Contribution.
- Qiwei Di, Jiafan He, Dongruo Zhou, Quanquan Gu. Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path. ICML2023.
- Qiwei Di*, Tao Jin*, Yue Wu, Heyang Zhao, Farzad Farnoud, Quanquan Gu. Variance-Aware Regret Bounds for Stochastic Contextual Dueling Bandits. ICLR2024
- Qiwei Di, Heyang Zhao, Jiafan He, Quanquan Gu. Pessimistic nonlinear least-squares value iteration for offline reinforcement learning. ICLR2024
- Yue Wu, Tao Jin*, Qiwei Di*, Hao Lou, Farzad Farnoud, Quanquan Gu. Borda Regret Minimization for Generalized Linear Dueling Bandits. ICML2024
- Qiwei Di, Jiafan He, Quanquan Gu. Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback. Preprint.
- Binshuai Wang, Qiwei Di, Ming Yin, Mengdi Wang, Quanquan Gu, Peng Wei. Relative-Translation Invariant Wasserstein Distance. Preprint.
- Runjia Li, Qiwei Di, Quanquan Gu. Unified Convergence Analysis for Score-Based Diffusion Models with Deterministic Samplers . Preprint.