Skip to content

Commit

Permalink
wz
Browse files Browse the repository at this point in the history
  • Loading branch information
HeyuanMingong committed Oct 10, 2024
1 parent 3c895fc commit 6a15f1b
Show file tree
Hide file tree
Showing 2 changed files with 3 additions and 3 deletions.
4 changes: 2 additions & 2 deletions index.html
Original file line number Diff line number Diff line change
Expand Up @@ -31,7 +31,7 @@ <h1>Zhi Wang</h1>
Department of Control Science and Intelligent Engineering<br /></p>
<p>22 Hankou Road, Gulou District, Nanjing, China <br />
E-mail: <a href="mailto:[email protected]">[email protected]</a></p>
<p>Reinforcement Learning, Robot Learning, Meta / Offline / Multi-Agent RL </p>
<p>Reinforcement Learning, Robot Learning, In-context / Offline / Multi-Agent RL </p>
<p><a href="https://scholar.google.com/citations?user=cRXlxYcAAAAJ&hl=en"><img src="scholar.png" alt="alt text" width="30px" height="30px" /></a> &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;
<a href="https://github.com/HeyuanMingong"><img src="github.png" alt="alt text" width="30px" height="30px" /></a> &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;
<a href="https://sme.nju.edu.cn/wz1/list.htm"><img src="nju.png" alt="alt text" width="30px" height="30px" /></a> </p>
Expand All @@ -49,7 +49,7 @@ <h2>About me</h2>
<br />
Specifically, I work on how learning algorithms can scale RL agents to (i) dynamic environments, (ii) offline settings, and (iii) multi-agent systems, allowing them to autonomously adapt to (i) non-stationary task distributions, (ii) non-interactive scenarios, and (iii) cooperative or competitive task assignments in real-world domains.
<br />
This includes a wide range of topics such as (i) meta-RL, lifelong/continual RL, in-context RL, (ii) offline RL, large models for RL, and (iii) multi-agent RL.
This includes a wide range of topics such as (i) in-context RL, meta-RL, lifelong/continual RL, (ii) offline RL, large models for RL, and (iii) multi-agent RL.
</p>

<p>
Expand Down
2 changes: 1 addition & 1 deletion publication.html
Original file line number Diff line number Diff line change
Expand Up @@ -35,7 +35,7 @@ <h3>Preprints</h3>
<h3>Conferences</h3>
<ol>

<li><p>Zhi Wang, Li Zhang, Wenhao Wu, Yuanheng Zhu, Dongbin Zhao, and Chunlin Chen, "Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement," in <i>Advances of Neural Information Processing Systems (NeurIPS)</i>, 2024.
<li><p><b>Zhi Wang</b>, Li Zhang, Wenhao Wu, Yuanheng Zhu, Dongbin Zhao, and Chunlin Chen, "Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement," in <i>Advances of Neural Information Processing Systems (NeurIPS)</i>, 2024. [<a href="https://github.com/NJU-RL/Meta-DT">code</a>]
</p></li>

<li><p>Zican Hu, Zongzhang Zhang, Huaxiong Li, Chunlin Chen, Hongyu Ding, and <b>Zhi Wang*</b>, "<a href="https://arxiv.org/abs/2312.04819">Attention-Guided Contrastive Role Representations for Multi-Agent Reinforcement Learning</a>," in <i>Proceedings of International Conference on Learning Representations (ICLR)</i>, 2024. [<a href="https://github.com/NJU-RL/ACORM">code</a>]
Expand Down

0 comments on commit 6a15f1b

Please sign in to comment.