Emoji Undergraduate student in software engineering

My name is Yufan Zhou (Chinese: 周雨凡). I am an undergraduate student at Harbin Institute of Technology, advised by Prof. Weigang Zhang and Shuhui Wang. I have also done a research internship and visited students in Professor Huan Wang in WU, Yan-Pei Cao in VAST, Xihui Liu in MMLab@HKU and Xingang Pan in MMLab@NTU before.

I am broadly interested in various software development technologies and diffusion theory, particularly in T2V (Text-to-Video), T2I (Text-to-Image), personalization generation, and projects involving procedural planning (e.g., Video Prediction).

Initially, my interest was in various diffusion works, aspiring to use generative skills to create anything. Currently, I focus more on image generation, such as personalization and data or dataset augmentation. My goal is to generate images in various styles and reduce the cost of data acquisition. I'm eager to connect with anyone who shares this vision for AI or appreciates the same research approach.

Recently, my interest has expanded to 3D generation and world-building, which I believe is the ultimate goal of diffusion and generative subjects. By enabling arbitrary editing styles of images, everyone can access resources and 3D worlds. Additionally, incorporating physical conditions into diffusion can make the generated results appear more realistic and logical. I am optimistic that this can be achieved soon, given that image generation began only a few years ago.

In addressing these issues, my focus is on what I do and aim to achieve. I believe that with logic, mathematics, and the creation of various architectures, one can accomplish anything.

You can find my CV here: Yufan Zhou’s Curriculum Vitae.

🔥 News

2025.08: 🎉 The paper “OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion” has been accepted at SIGGRAPH ASIA 2025.
2025.07： 🤗 Started a Research Internship at MMLab@NTU, focusing on 3D generation & Reconstruction.
2025.03: 🤗 Started a Research Internship at VAST & MMLab@HKU, focusing on 3D part generation.
2025.01: 🎉 The paper “Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional Videos” has been accepted as a poster at ICLR 2025.
2024.10: 🎉 Awarded the National Scholarship by the Ministry of Education of China.
2024.07: 🎉 Received the National Third Prize in the Computer Design Contest with the Virtual Digital Agent project.
2024.06: 🤗 Started a Research Internship at ENCODE Lab, Westlake University, focusing on Image Generation with Diffusion Models.
2024.05: 🎉 Named a Finalist in the Mathematical Contest in Modeling with MM-LSTM method.
2024.04: 🎉 Awarded the Huawei Smart Base Scholarship.
2023.12: 🎉 Won the National Third Prize in the Physics Experiment Competition with Unity 3D project.

📝 Publications

SIGGRAPH ASIA 2025

OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion

Yunhan Yang*, Yufan Zhou*, Yuan-Chen Guo, Zi-Xin Zou, Yukun Huang, Ying-Tian Liu, Hao Xu, Ding Liang, Yan-Pei Cao, Xihui Liu

Proposes a part-aware 3D generation framework using structure planning and spatial flow modeling.
Achieves top performance with precise control over part granularity and localization.
Welcome to our Webpage

Preprint 2025

FreeBlend: Advancing Concept Blending with Staged Feedback-Driven Interpolation Diffusion

Yufan Zhou*, Haoyu Shen*, Huan Wang

Proposes a novel feedback-driven latent interpolation approach for concept blending in three-stage image generation with unCLIP-based image conditions.
Demonstrates superior performance on both visual results and multiple benchmarks through extensive experiments.
Welcome to our Webpage

ICLR 2025 Poster

Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional Videos

Yufan Zhou, Zhaobo Qi, Lingshuai Lin, Junqi Jing, Tingting Chai, Beichen Zhang, Shuhui Wang, Weigang Zhang

Focuses on goal-directed planning using visual observations and proposes a latent space temporal interpolation module.
Implements masking strategies and task-adaptive proximity loss achieving SOTA performance.

💼 Internships

May 2024 - Nov 2024: Research Intern, ENCODE LAB, WestLake University (Advised by Prof. Huan Wang)
Mar 2025 - Jun 2025: Research Intern, MMLab@HKU & VAST (Advised by Prof. Xihui Liu and Yan-Pei Cao)
Jul 2025 - Oct 2025: Research Intern, MMLab@NTU (Advised by Prof. Xingang Pan)

🎁 Academic Projects

I have conducted several surveys in my field to provide detailed assistance to those interested in my work.

💻 Software Projects

I enjoy creating software for fun and exploring new technologies, such as Vue, Spring Boot, Android, and other tech stacks. I have completed several projects using these technologies, which have enhanced my technical skills, project management abilities, and experience in requirement analysis. Additionally, I hope my library can assist others in learning and applying code for their needs and research.

MM-LSTM
LLM agent + RAG + Virtual Digital Person
Simple Qt Browser Framework
Android Accessibility Service
Wholesale And Retail System (Vue + Springboot)
Medicine WeChat Mini Program
Teacher Student Mutual Selection System

📗 Course Projects

I enjoy acquiring knowledge in computer science and related fields. During my courses, I have built several projects to enhance my learning experience. By writing code and documenting information in markdown, I aim to teach myself and future learners more effectively.

🛠 Tech Stack & Latest Blogs

Current Learning

Deepening knowledge in Machine Learning and AI.
Exploring advanced web patterns and software management techniques.
Improving academic skills in Diffusion and AIGC.

Latest Blog Posts

$Language$