publications
2025
- DLM-One: Diffusion Language Models for One-Step Sequence Generation. arXiv preprint arXiv:2506.00290, 2025
- Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities. arXiv preprint arXiv:2507.06261, 2025
2024
- Preprint. Introducing Gemini 2.0: our new AI model for the agentic era. 2024
- ICLR 2025. Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models. arXiv preprint arXiv:2409.11219, 2024
- ICLR 2025. Statistical Advantages of Perturbing Cosine Router in Sparse Mixture of Experts. arXiv preprint arXiv:2405.14131, 2024
- ACL 2025. T-REG: Preference Optimization with Token-Level Reward Regularization. arXiv preprint arXiv:2412.02685, 2024
- ICLR 2025. Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy. arXiv preprint arXiv:2410.09102, 2024
- EMNLP 2024. WPO: Enhancing RLHF with Weighted Preference Optimization. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
- NAACL 2024. LanguageFlow: Advancing Diffusion Language Generation with Probabilistic Flows. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
- ICML 2024. Sliced Wasserstein with random-path projecting directions. Proceedings of the ICML, 2024
- ICML 2024. Switchable Decision: Dynamic Neural Generation Networks. Proceedings of the ICML, 2024
- Preference-grounded token-level guidance for language model fine-tuning. Advances in Neural Information Processing Systems, 2024
2023
- CVPR 2023. FlowGrad: Controlling the Output of Generative ODEs with Gradients. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
- Preprint. AutoML-GPT: Automatic Machine Learning with GPT. arXiv preprint arXiv:2305.02499, 2023
- ICML 2023. POUF: Prompt-oriented unsupervised fine-tuning for large pre-trained models. In International Conference on Machine Learning, 2023
- ICLR 2023. Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-Oriented Dialogue Systems. arXiv preprint arXiv:2302.10342, 2023
- Preprint. A prototype-oriented clustering for domain shift with source privacy. arXiv preprint arXiv:2302.03807, 2023
2022
- EMNLP 2022. Passage-Mask: A Learnable Regularization Strategy for Retriever-Reader Models. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
- NeurIPS 2022. A unified framework for alternating offline model training and policy learning. Advances in Neural Information Processing Systems, 2022
- ICML 2022. Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning. In International Conference on Machine Learning, 2022
- NAACL 2022. ALLSH: Active Learning Guided by Local Sensitivity and Hardness. arXiv preprint arXiv:2205.04980, 2022
2021
- Preprint. Crossformer: Transformer with Alternated Cross-Layer Guidance. 2021
- EMNLP 2021. Learning from uneven training data: Unlabeled, single label, and multiple labels. arXiv e-prints, 2021
- Preprint. Capturing label distribution: A case study in NLI. arXiv preprint arXiv:2102.06859, 2021
- ICLR 2021. Contextual dropout: An efficient sample-dependent dropout module. arXiv preprint arXiv:2103.04181, 2021
- Preprint. FuseDream: Training-free text-to-image generation with improved CLIP+GAN space optimization. arXiv preprint arXiv:2112.01573, 2021
- ACL 2021. Knowing more about questions can help: Improving calibration in question answering. arXiv preprint arXiv:2106.01494, 2021
- NeurIPS 2021. A prototype-oriented framework for unsupervised domain adaptation. Advances in Neural Information Processing Systems, 2021
- ICML 2021. Bayesian attention belief networks. In International Conference on Machine Learning, 2021
- NeurIPS 2021. Alignment attention by matching key and query distributions. Advances in Neural Information Processing Systems, 2021
2020
- NeurIPS 2020. Bayesian attention modules. Advances in Neural Information Processing Systems, 2020