avatar

Heng Dong

Ph.D. Student
Tsinghua University
drdhxi (at) gmail.com


About Me

I am a Ph.D. Student advised by Prof. Chongjie Zhang and Prof. Yi Wu at the Institute for Interdisciplinary Information Sciences (IIIS), Tsinghua University, led by Prof. Andrew Yao.

The goal of my research is to endow agents with superhuman intelligence, which I believe can be achieved through learning from interactions using modern models. My biography is here.

Research Interests

Education

News

Publications

    Most of my research is about robotics and agents. Some papers are highlighted.

  1. ICML 2025
    TL;DR We propose a novel LLM-based Actor-Critic framework that enhances LLMs' decision-making through long-term action evaluations and efficient policy improvements.

  2. arXiv 2025
    Xinyi Yang, Liang Zeng, Heng Dong, Chao Yu, Xiaoran Wu, Huazhong Yang, Yu Wang, Milind Tambe Tonghan Wang
    TL;DR We train LLMs to generate better agent explanations using RL with feedback from generative models, outperforming standard RLHF methods.

  3. AAMAS 2025
    TL;DR We show that diffusion models can reconstruct global states in decentralized partially observable multiagent systems, with approximation errors leading to deviations that can be bounded for convergence to the true state.

  4. ICLR 2024
    TL;DR We propose to design multi-cellular robots in a coarse-to-fine manner and leverage hyperbolic embeddings for implementation.

  5. ICML 2023
    TL;DR We exploit the structure of the design space in robot design problems with symmetry characteristics and generate robots with high performance more efficiently.

  6. NeurIPS 2022
    TL;DR A muscle synergy inspired RL framework that groups actuators into synchronised modules for efficient control of high-DoF robots, demonstrating superior efficiency and generalizability on complex morphologies like Humanoids++ and UNIMALs.

  7. arXiv 2021
    TL;DR We identify that fragile cooperation in sequential social dilemmas stems from second-order conflicts in incentive mechanisms, and propose a homophily-based learning framework that achieves stable cooperation in public goods and tragedy of the commons scenarios.

  8. ICLR 2021
    TL;DR A decomposed policy gradient method that outperforms state-of-the-art multi-agent RL algorithms by enabling efficient off-policy learning and better credit assignment.

  9. ICML 2020
    TL;DR A role-based MARL framework where agents autonomously learn specialized roles, improving performance on StarCraft II micromanagement.

Services

Conference Reviewer

Teaching Assistant


Powered by Jekyll and Minimal Light theme. Also some ideas borrowed from Jon Barron and Junyu Zhang