1. Optimization of Deep Learning & LLM
DNT: A Deeply Normalized Transformer That Can Be Trained by Momentum SGD
Xianbiao Qi*, Marco Chen, Wenjie Xiao, Jiaquan Ye, Yelin He, Chun-Guang Li, Zhouchen Lin
ICLR 2026
Taming Transformer Without Using Learning Rate Warmup
Xianbiao Qi*, Yelin He, Jiaquan Ye, Chun-Guang Li, Bojia Zi, Xili Dai, Qin Zou, Rong Xiao
ICLR 2025
LipsFormer: Introducing Lipschitz Continuity to Vision Transformers
Xianbiao Qi*, Jianan Wang, Yihao Chen, Yukai Shi, Lei Zhang
ICLR 2023
SimpleGPT: Improving GPT via a Simple Normalization Strategy
Xianbiao Qi*, Marco Chen, Yelin He, Jiaquan Ye, Rong Xiao
Submitted to ICML 2026
Delving into Muon and Beyond: Deep Analysis and Extensions
Xianbiao Qi*, Marco Chen, Yelin He, Jiaquan Ye, Rong Xiao
Submitted to ICML 2026
2. AIGC (Image / Video / 3D Generation)
Refaçade: Editing Object with Given Reference Texture
Youze Huang, Penghui Ruan, Bojia Zi, Xianbiao Qi*, Jianan Wang, Rong Xiao
CVPR 2026
CTRL&SHIFT: High-quality Geometry-Aware Object Manipulation in Visual Generation
Penghui Ruan, Bojia Zi, Xianbiao Qi*, et al.
ICLR 2026
Senorita-2M: A High-Quality Instruction-based Dataset for General Video Editing
Bojia Zi, Penghui Ruan, Marco Chen, Xianbiao Qi*, et al.
NeurIPS 2025
MiniMax-Remover: Taming Bad Noise Helps Video Object Removal
Bojia Zi, Weixuan Peng, Xianbiao Qi*, Jianan Wang, Shihao Zhao, Rong Xiao, Kam-Fai Wong
NeurIPS 2025
BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation
Shaozhe Hao, Xuantong Liu, Xianbiao Qi*, Shihao Zhao, Bojia Zi, Rong Xiao, Kai Han, Kwan-Yee K. Wong
ICLR 2025
Elucidating the Design Space of Language Models for Image Generation
Xuantong Liu, Shaozhe Hao, Xianbiao Qi*, Tianyang Hu, Jun Wang, Rong Xiao, Yuan Yao
ICML 2025
CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility
Bojia Zi, Shihao Zhao, Xianbiao Qi*, et al.
AAAI 2025
DreamTime: An Improved Optimization Strategy for Text-to-3D Content Creation
Yukun Huang, Jianan Wang, Yukai Shi, Xianbiao Qi, Zheng-Jun Zha, Lei Zhang
ICLR 2024
TOSS: High-quality Text-guided Novel View Synthesis from a Single Image
Yukai Shi, Jianan Wang, He Cao, Boshi Tang, Xianbiao Qi, et al.
ICLR 2024
DreamWaltz: Make a Scene with Complex 3D Animatable Avatars
Yukun Huang, Jianan Wang, Ailing Zeng, He Cao, Xianbiao Qi, Yukai Shi, Zheng-Jun Zha, Lei Zhang
NeurIPS 2023
3. General Computer Vision
DAB-DETR: Dynamic Anchor Boxes Are Better Queries for DETR
Shilong Liu, Feng Li, Hao Zhang, Xiao Yang, Xianbiao Qi, Hang Su, Jun Zhu, Lei Zhang
ICLR 2022
DeepCrack: Learning Hierarchical Convolutional Features for Crack Detection
Qin Zou, Zheng Zhang, Qingquan Li, Xianbiao Qi, Qian Wang, Song Wang
IEEE TIP 2018
4. OCR & Document Understanding
MASTER: Multi-Aspect Non-local Network for Scene Text Recognition
Ning Lu, Wenwen Yu, Xianbiao Qi*, Yihao Chen, Ping Gong, Rong Xiao, Xiang Bai
Pattern Recognition 2021
PICK: Processing Key Information Extraction from Documents Using Improved Graph Learning
Wenwen Yu, Ning Lu, Xianbiao Qi*, Ping Gong, Rong Xiao
ICPR 2021
EMU: Effective Multi-Hot Encoding for Lightweight Scene Text Recognition
Bingcong Li, Xin Tang, Xianbiao Qi*, Yihao Chen, Chun-Guang Li, Rong Xiao
IEEE TCSVT 2022
5. LLM & MLLM
DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training
Yihao Chen, Xianbiao Qi*, Jianan Wang, Lei Zhang
CVPR 2023
Delta-LoRA: Fine-Tuning High-Rank Parameters with the Delta of Low-Rank Matrices
Bojia Zi, Xianbiao Qi*, Lingzhi Wang, Jianan Wang, Kam-Fai Wong, Lei Zhang
arXiv 2023
Multi-Modal Representation Learning via Semi-Supervised Rate Reduction for Generalized Category Discovery
Wei He, Xianghan Meng, Zhiyuan Huang, Xianbiao Qi, Rong Xiao, Chun-Guang Li
CVPR 2026