news

Jul 07, 2025 Our Gemini 2.5 Technical Report is released ArXiv.
May 15, 2025 I will serve as an Area Chair for EMNLP 2025.
May 15, 2025 Our T-REG: Preference Optimization with Token-Level Reward Regularization is accepted by ACL 2025.
Jan 22, 2025 Our Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy is accepted by ICLR 2025.
Sep 18, 2024 Our WPO: Enhancing RLHF with Weighted Preference Optimization is accepted by EMNLP 2024.