About Me

Hello! ๐Ÿ‘ Iโ€™m a third-year undergraduate student in UESTCโ€™s โ€œEverest Projectโ€ Computer Top-Talent Experimental Class (2023โ€“2027), majoring in Computer Science. Now, Iโ€™m interested in LLM SFT/RL and reasoning, image/video generation, and unified multimodal models. Earlier, I explored AI for smart grids and remote-sensing image fusion. You can find my CV here.

โญ I am eager to discuss potential collaborations and am actively seeking research internship opportunities(industry/academia),including onsite roles.I โ€˜m also seeking for 2027 fall PHD position. Please feel free to contact me via email: [huang_rui@std.uestc.edu.cn],[paulafixamiworali@gmail.com] or WeChat: huangrui_dby if you are interested.I warmly welcome your message and look forward to connecting!

๐Ÿ”ฅ News

  • 2026.02: ย ๐ŸŽ‰ Two papers accepted to CVPR 2026.

  • 2025.11: ย ๐ŸŽ‰ One paper accepted to ๐—œ๐—˜๐—˜๐—˜ ๐—ง๐—ฟ๐—ฎ๐—ป๐˜€๐—ฎ๐—ฐ๐˜๐—ถ๐—ผ๐—ป๐˜€ ๐—ผ๐—ป ๐—œ๐—ป๐—ฑ๐˜‚๐˜€๐˜๐—ฟ๐—ถ๐—ฎ๐—น ๐—œ๐—ป๐—ณ๐—ผ๐—ฟ๐—บ๐—ฎ๐˜๐—ถ๐—ฐ๐˜€(๐—ฆ๐—–๐—œ ๐—ค๐Ÿญ), which was completed during my first research internship on AI for smart grid (Oct. 2024) and underwent over a year of review.

  • 2025.11: ย ๐ŸŽ‰ Released P1, a series of models that achieved ๐—ด๐—ผ๐—น๐—ฑ ๐—บ๐—ฒ๐—ฑ๐—ฎ๐—น-๐—น๐—ฒ๐˜ƒ๐—ฒ๐—น ๐—ฝ๐—ฒ๐—ฟ๐—ณ๐—ผ๐—ฟ๐—บ๐—ฎ๐—ป๐—ฐ๐—ฒ in the Physics Olympiad (IPhO), ๐—ฏ๐—ฒ๐—ฎ๐˜๐—ถ๐—ป๐—ด ๐—ฎ๐—น๐—น ๐˜๐—ต๐—ฒ ๐—บ๐—ผ๐—ฑ๐—ฒ๐—น๐˜€, including Gemini 2.5 Pro, GPT-5 and Grok 4. Here is the blog

  • 2025.11: ย ๐ŸŽ‰ Released UniVA: Universal Video Agent โ€” an open-source next-generation video generalist! UniVA features: 1) ๐Ÿค– Unified Agentic System 2) ๐ŸŽฌ Powerful Creation. Try the online demo now! Check the paper. UniVA has been reported by AK and ๆ–ฐๆ™บๅ…ƒ.

  • 2025.10: ย ๐Ÿ† Awarded the National Scholarship of 2025.

  • 2025.07: ย ๐ŸŽ‰ Invited to server as a reviewer for AAAI 2026.

  • 2025.06: ย ๐Ÿ† Awarded the SenseTime Scholarship 2025! 30 undergraduate students nationwide.

Show more news โ–ผ
  • 2025.06: ย ๐ŸŽ‰ One paper accepted to IEEE Transactions on Industrial Informatics(SCI Q1), completed during my first research internship in AI for smart grid.

  • 2025.03: ย ๐ŸŽ‰ Released CoT Image. The first to introduce CoT-style strategies into image generation, it has attracted widespread attention in the community and received over 800+ stars!!

  • 2024.12: ย ๐ŸŽ‰ One paper accepted to AAAI 2025 and selected for Oral Presentation.

  • 2024.10: ย ๐Ÿ† Awarded the National Scholarship of 2024 and the Gratitude Scholarship for Modern Scientists.

  • 2024.09: ย ๐ŸŽ‰ One paper accepted to IEEE Transactions on Sustainable Energy(SCI Q1), completed during my first research internship in AI for smart grid.

  • 2024.08: ย ๐ŸŽ‰ Delighted to be going to Cambridge as a visiting student.

  • 2023.12: ย ๐ŸŒฑ The starting point of my academic journey!

๐Ÿ“ Selected Publications ๏ผ† Preprints [Full List]

CVPR 2026
D2C

๐Ÿ”ฅDiffusion Dataset Condensation: Training Your Diffusion Model Faster with Less Data

Rui Huang* , Shitong Shao* , Zikai Zhou, Pukun Zhao, Hangyu Guo, Tian Ye, Lichen Bai, Shuo Yang, Zeke Xieโ€ 

[PDF] [Github]

TL;DR: Proposed DยฒC: Diffusion Dataset Condensation for diffusion models, enabling 100ร— faster training with 0.8%โ€“4% data via sample selection and semantic enhancement; trained on hundreds of A800/H100 GPUs.

CVPR 2026
D2C

๐Ÿ”ฅBeyond Fixed Formulas: Data-Driven Linear Predictor for Efficient Diffusion Models

Zhirong Shen* , Rui Huang* , Jiacheng Liu, Chang Zou, Peiliang Cai, Shikang Zheng, Zhengyi Shi, Liang Feng, Linfeng Zhangโ€ 

[[PDF]] [Github]

TL;DR: The paper introduces LยฒP, a learnable linear predictor that accelerates image and video generation in diffusion models by 7.14ร—, outperforming TaylorSeer and FoCa. It requires minimal data (50 samples) and converges in 20 seconds.

AAAI 2025 Oral
D2C

๐Ÿ”ฅWavelet-Assisted Multi-Frequency Attention Network for Pansharpening

Jie Huang*, Rui Huang*, Jinghao Xu, Siran Pen, Yule Duan, Liangjian Dengโ€ 

[PDF] [Github]

TL;DR: Proposed WFANet for image fusion, combining wavelet transformation with attention, achieving SOTA on multiple datasets.

Under Review
D2C

๐Ÿ”ฅCan We Generate Images with CoT? Letโ€™s Verify and Reinforce Image Generation Step by Step

Ziyu Guo*, Renrui Zhang*โ€ , Chengzhuo Tong*, Zhizheng Zhao*, Rui Huang, Haoquan Zhang, Manyuan Zhang, Jiaming Liu, Shanghang Zhang, Peng Gao, Hongsheng Li, Pheng-Ann Heng

[PDF] [Github๐ŸŒŸ800+]

TL;DR: Proposed CoT-Image with step-wise reasoning and novel reward models (PARM/PARM++), improving autoregressive image generation by 24% via test-time verification and preference alignment.

Tech Report
D2C

๐Ÿ”ฅP1: Mastering Physics Olympiads with Reinforcement Learning

Jiacheng Chen*, Qianjia Cheng*, Fangchen Yu* โ€ฆโ€ฆ.. Rui Huang โ€ฆโ€ฆ. Lei Baiโ€ , Yu Chengโ€ , Ning Dingโ€ , Bowen Zhouโ€ , Peng Yeโ€ , Ganqu Cuiโ€ 

[PDF] [HomePage] [้‡ๅญไฝ]

TL;DR: A series of models that achieved ๐—ด๐—ผ๐—น๐—ฑ ๐—บ๐—ฒ๐—ฑ๐—ฎ๐—น-๐—น๐—ฒ๐˜ƒ๐—ฒ๐—น ๐—ฝ๐—ฒ๐—ฟ๐—ณ๐—ผ๐—ฟ๐—บ๐—ฎ๐—ป๐—ฐ๐—ฒ in the Physics Olympiad (IPhO), beating all the models, including Gemini 2.5 Pro, GPT-5 and Grok 4.

Tech Report
D2C

๐Ÿ”ฅP1-VL:Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Yun Luo*โœ‰โ€ , Futing Wang*, Qianjia Cheng* โ€ฆโ€ฆ.. Rui Huang โ€ฆโ€ฆ. Lei Baiโœ‰, Yu Chengโœ‰, Ning Dingโœ‰, Bowen Zhouโœ‰, Peng Yeโœ‰, Ganqu Cuiโœ‰โ€ 

[PDF] [HomePage]

TL;DR: An open-source physics VLM (+ PhysicsMinions verifier) that delivers ๐—ด๐—ผ๐—น๐—ฑ-๐—บ๐—ฒ๐—ฑ๐—ฎ๐—น-๐—น๐—ฒ๐˜ƒ๐—ฒ๐—น results on HiPhO (12 gold + 1 silver): #3 as a single model (beating Gemini 2.5 Pro, GPT-5, Grok-4) and #2 with agents (surpassing GPT-5.2, only behind Gemini-3-Pro).

Tech Report
D2C

๐Ÿ”ฅUniVA: Universal Video Agents towards Next-Generation Video Intelligence

Zhengyang Liang*, Daoan Zhang*, Huichi Zhou, Rui Huang, Bobo Li, Shengqiong Wu, Yuechen Zhang, Xiaohan Wang, Jiebo Luo, Lizi Liao, Hao Feiโ€ 

[PDF] [HomePage] [ๆ–ฐๆ™บๅ…ƒ]

TL;DR: UniVA unifies understanding/segmentation/editing/generation into traceable multi-step video workflows via Planโ€“Act agents, multi-level memory, and modular tools, plus UniVA-Bench.

IEEE Transactions on Industrial Informatics
D2C

Complementary Online Learning Network for Probabilistic Load Forecasting Against Extreme Weather

Rui Huang, Pengfei Zhao, Di Caoโ€ , Weihao Hu, Qi Huang, Zhe Chen

[PDF] [Github]

TL;DR: Proposed the Complementary Online Learning Network (COLNet) with a Weather-aware gating mechanism for high precision probabilistic and point forecasting under extreme weather.

๐Ÿ“– Educations

  • 2023.09 - 2027.06, โ€œEverest Projectโ€ Computer Top Talent Experimental Class, University of Electronic Science and Technology of China

๐Ÿ† Honors

  • [2025] SenseTime Scholarship (award rate <0.1%; 30 undergrads China-wide)
  • [2025] National Scholarship (top recipient in the college)
  • [2024] National Scholarship (top recipient in the college)
  • [2024] Gratitude Scholarship for Modern Scientists (top 10 in school)
  • [2024] Excellence Scholarship โ€” School of Computer Science, UESTC
  • [2024] First-Class Scholarship for Outstanding Students

๐Ÿ… Competition Awards

  • [2025] National Gold Award โ€” National College Student Career Planning Competition๐Ÿ… (award rate <0.1%; first from UESTC to receive this award)
  • [2025] National Silver Award โ€” China International College Studentsโ€™ Innovation Competition๐Ÿฅˆ
  • [2025] Provincial Special Prize โ€” โ€œChallenge Cupโ€ National College Studentsโ€™ Extracurricular Academic and Technological Works Competition๐Ÿ…
  • [2024] National Third Prize โ€” Five-Minute Research Presentation (5MRP) Competition๐Ÿฅ‰
  • [2023] Provincial First Prize โ€” Huawei ICT Competition๐Ÿ… (rank 2rd in province)

๐Ÿ’ฌ Talks ๏ผ† Reports

๐ŸŽ“ Services

  • Reviewer for AAAI 2026
  • Co-Founder of the UESTC AI Club