My current interests lie at the intersection of LLM reasoning, reinforcement learning, and constrained optimization. Broadly, I work on LLM alignment, constrained reinforcement learning, and bandits. Please refer to my recent publications for further details.

🔥 News

2026.01: 🎉🎉 One paper has been accepted by ICLR 2026.
2026.01: 🎉🎉 Invited to give a talk at Washington State University on Score Matching. (See Blogs for more details.)
2025.06: 🎉🎉 Invited to give a talk at SIGMETRICS 2025.
2025.05: 🎉🎉 One paper has been accepted by ICML 2025.

📝 Selected Publications

ICLR 2026

Keep the Best, Forget the Rest: Reliable Alignment with Order-Aware Preference Optimization Jiahui Zhu, Yuanjie Shi, Xiyue Peng, Xin Liu, Yan Yan, Honghao Wei

ICML 2025

Online Constrained Markov Decision Processes Jiahui Zhu, Kihyun Yu, Dabeen Lee, Xin Liu, Honghao Wei

📖 Education

2023.08 - now, Washington State University
2021.08 - 2023.01, Boston University

💬 Invited Talks

2026.01, GenAI Seminar, Washington State University
2025.06, SIGMETRICS 2025

Jiahui ZHU
