Wei Yang

Viterbi School of Engineering, USC

Los Angeles, USA

My name is Wei Yang. I am currently a Ph.D. candidate at the Viterbi School of Engineering, University of Southern California (USC), where I am advised by Prof. Jesse Thomason and Prof. Xuezhe Ma. I am a member of the GLAMOR Lab. I received my B.S. degree from Huazhong University of Science and Technology.

🔬 Research Interests

My research interests primarily lie in Agentic LLMs, Efficient Multi-Agent LLM Systems, Multi-Agent RL, and Generative & Multimodal Ranking:

Agentic LLM Systems
- Long-term Memory
- Meta-Thinking Collaboration
- Role Coordination
- Reliable LLM Judgement
Efficient MAS LLMs Post-training
- LLM Reasoning Compression
- Cost-Aware & Budgeted Collaboration
- Efficient MARL Algorithms
Generative & Multimodal Ranking
- User Lifelong Preference Reasoning
- Preference Alignment & Optimization
- Cross-Modal Fusion & Alignment

🗞️ News

Date	Update
May 17, 2026	🎉 One paper is accepted by KDD 2026.
Apr 30, 2026	🎉 One paper is accepted by ICML 2026. Honored to be nominated as a Gold Reviewer.
Apr 02, 2026	🎉 One paper is accepted by SIGIR 2026 Oral.
Mar 13, 2026	🎉 Our team is selected as a Finalist for the 2026 Qualcomm Innovation Fellowship: link
Mar 06, 2026	🎤 Give a Talk at NICE AI on Human-Multi-Agent Collaboration: post
Feb 25, 2026	🎉 One paper is accepted by TMLR 2026.
Feb 20, 2026	🔥 We release TemporalBench Benchmark for LLM Agents on Contextual Time Series: article
Jan 26, 2026	🎉 One paper is accepted by MLSys 2026 Oral.
Jan 26, 2026	🎉 One paper is accepted by ICLR 2026.
Jan 17, 2026	🎉 One paper is accepted by ICASSP 2026 Oral.
Dec 16, 2025	🔥 We release Towards Evolutionary Agentic LLMs with MARL Survey: article
Nov 09, 2025	📰 Interviewed by MIT Technology Review: article - Meta-Policy Multi-Agent Collaboration
Nov 07, 2025	🎉 One paper is accepted by AAAI 2026.
Sep 16, 2025	🔥 We release Reasoning and Agentic LLMs for Time Series Survey: article
Sep 10, 2025	🎉 One paper is accepted by NeurIPS 2025.
Jul 05, 2025	🎉 One paper is accepted by ACM MM 2025.
Jul 03, 2025	🎉 One paper is accepted by RecSys 2025.
Apr 05, 2025	🎉 One paper is accepted by SIGIR 2025 Oral.
Jan 15, 2025	🎉🎉 Two papers are accepted by WWW 2025 Oral.
Jul 01, 2024	🎉🎉 Two papers are accepted by ACM MM 2024.
Apr 05, 2024	🎉 One paper is accepted by IJCAI 2024.

selected publications

KDD 2026

TemporalBench: A Benchmark for Evaluating LLM-Based Agents on Contextual and Event-Informed Time Series Tasks

Muyan Weng, Defu Cao, Wei Yang, and 2 more authors

arXiv preprint arXiv:2602.13272, 2026

@article{weng2026temporalbench,
  title = {TemporalBench: A Benchmark for Evaluating LLM-Based Agents on Contextual and Event-Informed Time Series Tasks},
  author = {Weng, Muyan and Cao, Defu and Yang, Wei and Sharma, Yashaswi and Liu, Yan},
  journal = {arXiv preprint arXiv:2602.13272},
  year = {2026},
}

ICML 2026

"Someone Hid It": Query-Agnostic Black-Box Attacks on LLM-Based Retrieval

Jiate Li, Defu Cao, Li Li, and 8 more authors

arXiv preprint arXiv:2602.00364, 2026

@article{li2026someonehidit,
  title = {"Someone Hid It": Query-Agnostic Black-Box Attacks on LLM-Based Retrieval},
  author = {Li, Jiate and Cao, Defu and Li, Li and Yang, Wei and Qin, Yuehan and Yu, Chenxiao and Yang, Tiannuo and Rossi, Ryan A. and Liu, Yan and Hu, Xiyang and Zhao, Yue},
  journal = {arXiv preprint arXiv:2602.00364},
  year = {2026},
}

SIGIR 2026

TimeMM: Time-as-Operator Spectral Filtering for Dynamic Multimodal Recommendation

Wei Yang, Rui Zhong, Zihan Lin, and 4 more authors

arXiv preprint arXiv:2604.26247, 2026

@article{yang2026timemm,
  title = {TimeMM: Time-as-Operator Spectral Filtering for Dynamic Multimodal Recommendation},
  author = {Yang, Wei and Zhong, Rui and Lin, Zihan and Wang, Xiaodan and Chen, Cheng and Ren, Huan and Hu, Yao},
  journal = {arXiv preprint arXiv:2604.26247},
  year = {2026},
}

arXiv

PRAISE: Prefix-Based Rollout Reuse in Agentic Search Training

Erhan Zhang, Yiqun Chen, Zechun Niu, and 6 more authors

arXiv preprint arXiv:2604.03675, 2026

@article{zhang2026praise,
  title = {PRAISE: Prefix-Based Rollout Reuse in Agentic Search Training},
  author = {Zhang, Erhan and Chen, Yiqun and Niu, Zechun and Yang, Wei and Wei, Xiaochi and Gao, Yan and Wu, Yi and Hu, Yao and Mao, Jiaxin},
  journal = {arXiv preprint arXiv:2604.03675},
  year = {2026},
}

ICLR 2026

Adaptive Collaboration with Humans: Metacognitive Policy Optimization for Multi-Agent LLMs with Continual Learning

Wei Yang, Defu Cao, Jiacheng Pang, and 2 more authors

In The Fourteenth International Conference on Learning Representations, 2026

@inproceedings{yang2026adaptive,
  title = {Adaptive Collaboration with Humans: Metacognitive Policy Optimization for Multi-Agent LLMs with Continual Learning},
  author = {Yang, Wei and Cao, Defu and Pang, Jiacheng and Weng, Muyan and Liu, Yan},
  booktitle = {The Fourteenth International Conference on Learning Representations},
  year = {2026},
}

TMLR 2026

TS-Reasoner: Domain-oriented time series inference agents for reasoning and automated analysis

Wen Ye^*, Wei Yang^*, Defu Cao, and 4 more authors

Transactions on Machine Learning Research, 2026

@article{ye2024domain,
  title = {TS-Reasoner: Domain-oriented time series inference agents for reasoning and automated analysis},
  author = {Ye, Wen and Yang, Wei and Cao, Defu and Zhang, Yizhou and Tang, Lumingyuan and Cai, Jie and Liu, Yan},
  journal = {Transactions on Machine Learning Research},
  year = {2026},
}

arXiv

Auditing Multi-Agent LLM Reasoning Trees Outperforms Majority Vote and LLM-as-Judge

Wei Yang^*, Shixuan Li^*, Heng Ping, and 3 more authors

arXiv preprint arXiv:2602.09341, 2026

@article{yang2026auditing,
  title = {Auditing Multi-Agent LLM Reasoning Trees Outperforms Majority Vote and LLM-as-Judge},
  author = {Yang, Wei and Li, Shixuan and Ping, Heng and Zhang, Peiyu and Bogdan, Paul and Thomason, Jesse},
  journal = {arXiv preprint arXiv:2602.09341},
  year = {2026},
}

arXiv

Self-Compression of Chain-of-Thought via Multi-Agent Reinforcement Learning

Yiqun Chen^*, Jinyuan Feng^*, Wei Yang^*, and 9 more authors

arXiv preprint arXiv:2601.21919, 2026

@article{chen2026self,
  title = {Self-Compression of Chain-of-Thought via Multi-Agent Reinforcement Learning},
  author = {Chen, Yiqun and Feng, Jinyuan and Yang, Wei and Zhong, Meizhi and Shi, Zhengliang and Li, Rui and Wei, Xiaochi and Gao, Yan and Wu, Yi and Hu, Yao and Pu, Zhiqiang and Mao, Jiaxin},
  journal = {arXiv preprint arXiv:2601.21919},
  year = {2026},
}

arXiv

De-conflating Preference and Qualification: Constrained Dual-Perspective Reasoning for Job Recommendation with Large Language Models

Bryce Kan^*, Wei Yang^*, Emily Nguyen, and 4 more authors

arXiv preprint arXiv:2602.03097, 2026

@article{kan2026jobrec,
  title = {De-conflating Preference and Qualification: Constrained Dual-Perspective Reasoning for Job Recommendation with Large Language Models},
  author = {Kan, Bryce and Yang, Wei and Nguyen, Emily and Yi, Ganghui and Yi, Bowen and Yu, Chenxiao and Liu, Yan},
  journal = {arXiv preprint arXiv:2602.03097},
  year = {2026},
}

SSRN

Toward Evolutionary Intelligence: LLM-based Agentic Systems with Multi-Agent Reinforcement Learning

Wei Yang, Muyan Weng, Jiacheng Pang, and 8 more authors

Available at SSRN 5819182, 2025

@article{yang2025toward,
  title = {Toward Evolutionary Intelligence: LLM-based Agentic Systems with Multi-Agent Reinforcement Learning},
  author = {Yang, Wei and Weng, Muyan and Pang, Jiacheng and Cao, Defu and Ping, Heng and Zhang, Peiyu and Li, Shixuan and Zhao, Yue and Yang, Qiang and Wang, Mengdi and others},
  journal = {Available at SSRN 5819182},
  year = {2025},
}

MLSys 2026

VeriMoA: A Mixture-of-Agents Framework for Spec-to-HDL Generation

Heng Ping, Arijit Bhattacharjee, Peiyu Zhang, and 8 more authors

arXiv preprint arXiv:2510.27617, 2025

@article{ping2025verimoa,
  title = {VeriMoA: A Mixture-of-Agents Framework for Spec-to-HDL Generation},
  author = {Ping, Heng and Bhattacharjee, Arijit and Zhang, Peiyu and Li, Shixuan and Yang, Wei and Cheng, Anzhe and Zhang, Xiaole and Thomason, Jesse and Jannesari, Ali and Ahmed, Nesreen and others},
  journal = {arXiv preprint arXiv:2510.27617},
  year = {2025},
}

arXiv

Maestro: Learning to Collaborate via Conditional Listwise Policy Optimization for Multi-Agent LLMs

Wei Yang, Jiacheng Pang, Shixuan Li, and 3 more authors

arXiv preprint arXiv:2511.06134, 2025

@article{yang2025maestro,
  title = {Maestro: Learning to Collaborate via Conditional Listwise Policy Optimization for Multi-Agent LLMs},
  author = {Yang, Wei and Pang, Jiacheng and Li, Shixuan and Bogdan, Paul and Tu, Stephen and Thomason, Jesse},
  journal = {arXiv preprint arXiv:2511.06134},
  year = {2025},
}

WWW 2025

TourRank: Utilizing Large Language Models for Document Ranking with a Tournament-Inspired Strategy

Yiqun Chen, Qi Liu, Yi Zhang, and 6 more authors

In Proceedings of The Web Conference (WWW) 2025, 2025

@inproceedings{chen2025tourrank,
  title = {TourRank: Utilizing Large Language Models for Document Ranking with a Tournament-Inspired Strategy},
  author = {Chen, Yiqun and Liu, Qi and Zhang, Yi and Sun, Weiwei and Ma, Xinyu and Yang, Wei and Shi, Daiting and Mao, Jiaxin and Yin, Dawei},
  booktitle = {Proceedings of The Web Conference (WWW) 2025},
  pages = {1638--1652},
  year = {2025},
}

arXiv

A Survey of Reasoning and Agentic Systems in Time Series with Large Language Models

Ching Chang, Yidan Shi, Defu Cao, and 8 more authors

arXiv preprint arXiv:2509.11575, 2025

@article{chang2025survey,
  title = {A Survey of Reasoning and Agentic Systems in Time Series with Large Language Models},
  author = {Chang, Ching and Shi, Yidan and Cao, Defu and Yang, Wei and Hwang, Jeehyun and Wang, Haixin and Pang, Jiacheng and Wang, Wei and Liu, Yan and Peng, Wen-Chih and others},
  journal = {arXiv preprint arXiv:2509.11575},
  year = {2025},
}

AAAI 2026

Learning to Deliberate: Meta-Policy Collaboration for Agentic LLMs with Multi-Agent Reinforcement Learning

Wei Yang and Jesse Thomason

arXiv preprint arXiv:2509.03817, 2025

@article{yang2025learning,
  title = {Learning to Deliberate: Meta-Policy Collaboration for Agentic LLMs with Multi-Agent Reinforcement Learning},
  author = {Yang, Wei and Thomason, Jesse},
  journal = {arXiv preprint arXiv:2509.03817},
  year = {2025},
}

NeurIPS 2025

Structured Spectral Reasoning for Frequency-Adaptive Multimodal Recommendation

Wei Yang, Rui Zhong, Yiqun Chen, and 2 more authors

arXiv preprint arXiv:2512.01372, 2025

@article{yang2025structured,
  title = {Structured Spectral Reasoning for Frequency-Adaptive Multimodal Recommendation},
  author = {Yang, Wei and Zhong, Rui and Chen, Yiqun and Lu, Chi and Jiang, Peng},
  journal = {arXiv preprint arXiv:2512.01372},
  year = {2025},
}

ACM MM 2025

FITMM: Adaptive Frequency-Aware Multimodal Recommendation via Information-Theoretic Representation Learning

Wei Yang, Rui Zhong, Yiqun Chen, and 4 more authors

In Proceedings of the 33rd ACM International Conference on Multimedia (ACM MM 2025), 2025

@inproceedings{yang2025fitmm,
  title = {FITMM: Adaptive Frequency-Aware Multimodal Recommendation via Information-Theoretic Representation Learning},
  author = {Yang, Wei and Zhong, Rui and Chen, Yiqun and Li, Shixuan and Ping, Heng and Lu, Chi and Jiang, Peng},
  booktitle = {Proceedings of the 33rd ACM International Conference on Multimedia (ACM MM 2025)},
  pages = {6193--6202},
  year = {2025},
}

WWW 2025

Hierarchical Sequence ID Representation of Large Language Models for Large-Scale Recommendation Systems

Rui Zhao, Rui Zhong, Haoran Zheng, and 5 more authors

In Companion Proceedings of The Web Conference (WWW) 2025, 2025

@inproceedings{zhao2025hierarchical,
  title = {Hierarchical Sequence ID Representation of Large Language Models for Large-Scale Recommendation Systems},
  author = {Zhao, Rui and Zhong, Rui and Zheng, Haoran and Yang, Wei and Lu, Chi and Jin, Beihong and Jiang, Peng and Gai, Kun},
  booktitle = {Companion Proceedings of The Web Conference (WWW) 2025},
  pages = {641--650},
  year = {2025},
}

SIGIR 2025

Hierarchical Tree Search-Based User Lifelong Behavior Modeling on Large Language Model

Yu Xia, Rui Zhong, Hao Gu, and 4 more authors

In Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2025), 2025

@inproceedings{xia2025hierarchical,
  title = {Hierarchical Tree Search-Based User Lifelong Behavior Modeling on Large Language Model},
  author = {Xia, Yu and Zhong, Rui and Gu, Hao and Yang, Wei and Lu, Chi and Jiang, Peng and Gai, Kun},
  booktitle = {Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2025)},
  pages = {1758--1767},
  year = {2025},
}

RecSys 2025

R4ec: A Reasoning, Reflection, and Refinement Framework for Recommendation Systems

Hao Gu, Rui Zhong, Yu Xia, and 4 more authors

In Proceedings of the Nineteenth ACM Conference on Recommender Systems (RecSys 2025), 2025

@inproceedings{gu2025r4ec,
  title = {R4ec: A Reasoning, Reflection, and Refinement Framework for Recommendation Systems},
  author = {Gu, Hao and Zhong, Rui and Xia, Yu and Yang, Wei and Lu, Chi and Jiang, Peng and Gai, Kun},
  booktitle = {Proceedings of the Nineteenth ACM Conference on Recommender Systems (RecSys 2025)},
  pages = {411--421},
  year = {2025},
}

ACM MM 2024

Enhancing Transformer-Based Semantic Matching for Few-Shot Learning through Weakly Contrastive Pre-Training

Wei Yang, Tengfei Huo, and Zhiqiang Liu

In Proceedings of the 32nd ACM International Conference on Multimedia (ACM MM 2024), 2024

@inproceedings{yang2024enhancing,
  title = {Enhancing Transformer-Based Semantic Matching for Few-Shot Learning through Weakly Contrastive Pre-Training},
  author = {Yang, Wei and Huo, Tengfei and Liu, Zhiqiang},
  booktitle = {Proceedings of the 32nd ACM International Conference on Multimedia (ACM MM 2024)},
  pages = {10611--10620},
  year = {2024},
}

IJCAI 2024

PTDE: Personalized Training with Distilled Execution for Multi-Agent Reinforcement Learning

Yiqun Chen, Hangyu Mao, Jiaxin Mao, and 5 more authors

arXiv preprint arXiv:2210.08872, 2024

@article{chen2022ptde,
  title = {PTDE: Personalized Training with Distilled Execution for Multi-Agent Reinforcement Learning},
  author = {Chen, Yiqun and Mao, Hangyu and Mao, Jiaxin and Wu, Shiguang and Zhang, Tianle and Zhang, Bin and Yang, Wei and Chang, Hongxing},
  journal = {arXiv preprint arXiv:2210.08872},
  year = {2024},
}

ACM MM 2024

Multimodal-Aware Multi-Intention Learning for Recommendation

Wei Yang and Qingchen Yang

In Proceedings of the 32nd ACM International Conference on Multimedia (ACM MM 2024), 2024

@inproceedings{yang2024multimodal,
  title = {Multimodal-Aware Multi-Intention Learning for Recommendation},
  author = {Yang, Wei and Yang, Qingchen},
  booktitle = {Proceedings of the 32nd ACM International Conference on Multimedia (ACM MM 2024)},
  pages = {5663--5672},
  year = {2024},
}

IJCNN 2022

Commander-Soldiers Reinforcement Learning for Cooperative Multi-Agent Systems

Yiqun Chen, Wei Yang, Tianle Zhang, and 2 more authors

In 2022 International Joint Conference on Neural Networks (IJCNN 2022), 2022

@inproceedings{chen2022commander,
  title = {Commander-Soldiers Reinforcement Learning for Cooperative Multi-Agent Systems},
  author = {Chen, Yiqun and Yang, Wei and Zhang, Tianle and Wu, Shiguang and Chang, Hongxing},
  booktitle = {2022 International Joint Conference on Neural Networks (IJCNN 2022)},
  pages = {1--7},
  year = {2022},
  organization = {IEEE},
}