Yi Wang
I was born in the Guangdong-Hong Kong-Macau Greater Bay Area in China. My research focuses on `Reinforcement Learning`, `Computer Vision`, and `Robotics`. I serve as a reviewer for conferences including ICLR 2026.
I support girls in AI

Education
  • Sun Yat-sen University
    Ph.D. candidate (supervised by Prof. Yulan Guo)
    Sep. 2025 - Present
  • Sun Yat-sen University
    B.Eng. (supervised by Prof. Yulan Guo)
    Sep. 2021 - Jul. 2025
Honors & Awards
  • President's Scholarship
    2025 - 2026
  • Academic Paper Track, National Undergraduate Innovation Annual Meeting
    2024
  • National Scholarship, China
    2024
  • Academic Excellence Scholarship, First Prize, SYSU
    2023, 2024
News
2024
I created my first personal website!
Nov 26
Selected Publications
MangoBench: A Benchmark for Multi-Agent Goal-Conditioned Offline Reinforcement Learning

Yi Wang, Ningze Zhong, Zhiheng Fu, Longguang Wang, Ye Zhang, Yulan Guo

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2026

MangoBench is the first benchmark tailored for goal-conditioned offline MARL. It covers 3 environments, 4 agent types, and 47 tasks, and is designed to assess joint-control locomotion, synchronous and asynchronous bimanual manipulation, and robustness to high-dimensional inputs.

Leveraging Suboptimal and Noisy Trajectories for Goal-Conditional Offline RL

Ningze Zhong, Yi Wang, Bo Wu

ICLR 2026 RSI Workshop

This paper demonstrates that imperfect trajectories in offline goal-conditioned reinforcement learning (OGCRL), typically discarded as harmful, can be leveraged as a valuable source of exploration, enhancing state-space coverage and improving policy learning, especially in complex environments.

Tangram-Splatting: Optimizing 3D Gaussian Splatting Through Tangram-inspired Shape Priors

Yi Wang*, Ningze Zhong*, Minglin Chen, Longguang Wang, Yulan Guo (* equal contribution)

ACM Multimedia (ACM MM) 2024

This study introduces Tangram-Splatting, a novel 3D scene reconstruction method inspired by the tangram puzzle. This method optimizes 3D Gaussian Splatting by diversifying Gaussian functions, achieving a 62.4% reduction in memory overhead while maintaining competitive PSNR performance.

CASIT: Collective Intelligent Agent System for Internet of Things

Ningze Zhong*, Yi Wang*, et al. (* equal contribution)

IEEE Internet of Things Journal 2023

This article introduces CASIT, a pioneering collective intelligent agent system for IoT, leveraging multiple LLM-based agents with Memory and Summary Mechanisms to collaboratively solve complex tasks and optimize information transmission.
