Wei Yang

Viterbi School of Engineering, USC

Los Angeles, USA

My name is Wei Yang. Currently, I am pursuing my Ph.D. at the Viterbi School of Engineering, University of Southern California (USC), under the guidance of Prof. Jesse Thomason. I am a member of the GLAMOR Lab. I received my B.S. degree from Huazhong University of Science and Technology.

🔬 Research Interests

My research interests primarily lie in Agentic LLMs, Efficient Multi-Agent LLM Systems, Multi-Agent RL, and Generative & Multimodal Ranking:

  • Agentic LLM Systems
    • Long-term Memory
    • Meta-Thinking Collaboration
    • Role Coordination
    • Reliable LLM Judgement
  • Efficient Post-training for Multi-Agent LLM Systems
    • LLM Reasoning Compression
    • Cost-Aware & Budgeted Collaboration
    • Efficient MARL Algorithms
  • Generative & Multimodal Ranking
    • User Lifelong Preference Reasoning
    • Preference Alignment & Optimization
    • Cross-Modal Fusion & Alignment

🗞️ News

Date Update
Mar 13, 2026 🎉 Our team was selected as a Finalist for the 2026 Qualcomm Innovation Fellowship: link
Mar 06, 2026 🎤 Gave a talk at NICE AI on Human-Multi-Agent Collaboration: post
Feb 25, 2026 🎉 One paper is accepted by TMLR 2026.
Feb 20, 2026 🔥 We release TemporalBench Benchmark for LLM Agents on Contextual Time Series: article
Jan 26, 2026 🎉 One paper is accepted by MLSys 2026.
Jan 26, 2026 🎉 One paper is accepted by ICLR 2026.
Jan 17, 2026 🎉 One paper is accepted by ICASSP 2026.
Dec 16, 2025 🔥 We release Towards Evolutionary Agentic LLMs with MARL Survey: article
Nov 09, 2025 📰 Interviewed by MIT Technology Review: article - Meta-Policy Multi-Agent Collaboration
Nov 07, 2025 🎉 One paper is accepted by AAAI 2026.
Sep 16, 2025 🔥 We release Reasoning and Agentic LLMs for Time Series Survey: article
Sep 10, 2025 🎉 One paper is accepted by NeurIPS 2025.
Jul 05, 2025 🎉 One paper is accepted by ACM MM 2025.
Jul 03, 2025 🎉 One paper is accepted by RecSys 2025.
Apr 05, 2025 🎉 One paper is accepted by SIGIR 2025.
Jan 15, 2025 🎉🎉 Two papers are accepted by WWW 2025.
Jul 01, 2024 🎉🎉 Two papers are accepted by ACM MM 2024.
Apr 05, 2024 🎉 One paper is accepted by IJCAI 2024.


Selected Publications

  1. ICLR 2026
    Adaptive Collaboration with Humans: Metacognitive Policy Optimization for Multi-Agent LLMs with Continual Learning
    Wei Yang, Defu Cao, Jiacheng Pang, and 2 more authors
    In The Fourteenth International Conference on Learning Representations, 2026
  2. TMLR 2026
    TS-Reasoner: Domain-oriented time series inference agents for reasoning and automated analysis
    Wen Ye*, Wei Yang*, Defu Cao, and 4 more authors
    Transactions on Machine Learning Research, 2026
  3. arXiv
    TemporalBench: A Benchmark for Evaluating LLM-Based Agents on Contextual and Event-Informed Time Series Tasks
    Muyan Weng, Defu Cao, Wei Yang, and 2 more authors
    arXiv preprint arXiv:2602.13272, 2026
  4. arXiv
    Auditing Multi-Agent LLM Reasoning Trees Outperforms Majority Vote and LLM-as-Judge
    Wei Yang*, Shixuan Li*, Heng Ping, and 3 more authors
    arXiv preprint arXiv:2602.09341, 2026
  5. arXiv
    "Someone Hid It": Query-Agnostic Black-Box Attacks on LLM-Based Retrieval
    Jiate Li, Defu Cao, Li Li, and 8 more authors
    arXiv preprint arXiv:2602.00364, 2026
  6. arXiv
    Self-Compression of Chain-of-Thought via Multi-Agent Reinforcement Learning
    Yiqun Chen*, Jinyuan Feng*, Wei Yang*, and 9 more authors
    arXiv preprint arXiv:2601.21919, 2026
  7. arXiv
    De-conflating Preference and Qualification: Constrained Dual-Perspective Reasoning for Job Recommendation with Large Language Models
    Bryce Kan*, Wei Yang*, Emily Nguyen, and 4 more authors
    arXiv preprint arXiv:2602.03097, 2026
  8. SSRN
    Toward Evolutionary Intelligence: LLM-based Agentic Systems with Multi-Agent Reinforcement Learning
    Wei Yang, Muyan Weng, Jiacheng Pang, and 8 more authors
    Available at SSRN 5819182, 2025
  9. MLSys 2026
    VeriMoA: A Mixture-of-Agents Framework for Spec-to-HDL Generation
    Heng Ping, Arijit Bhattacharjee, Peiyu Zhang, and 8 more authors
    arXiv preprint arXiv:2510.27617, 2025
  10. arXiv
    Maestro: Learning to Collaborate via Conditional Listwise Policy Optimization for Multi-Agent LLMs
    Wei Yang, Jiacheng Pang, Shixuan Li, and 3 more authors
    arXiv preprint arXiv:2511.06134, 2025
  11. WWW 2025
    TourRank: Utilizing Large Language Models for Document Ranking with a Tournament-Inspired Strategy
    Yiqun Chen, Qi Liu, Yi Zhang, and 6 more authors
    In Proceedings of The Web Conference (WWW) 2025, 2025
  12. arXiv
    A Survey of Reasoning and Agentic Systems in Time Series with Large Language Models
    Ching Chang, Yidan Shi, Defu Cao, and 8 more authors
    arXiv preprint arXiv:2509.11575, 2025
  13. AAAI 2026
    Learning to Deliberate: Meta-Policy Collaboration for Agentic LLMs with Multi-Agent Reinforcement Learning
    Wei Yang and Jesse Thomason
    arXiv preprint arXiv:2509.03817, 2025
  14. NeurIPS 2025
    Structured Spectral Reasoning for Frequency-Adaptive Multimodal Recommendation
    Wei Yang, Rui Zhong, Yiqun Chen, and 2 more authors
    arXiv preprint arXiv:2512.01372, 2025
  15. ACM MM 2025
    FITMM: Adaptive Frequency-Aware Multimodal Recommendation via Information-Theoretic Representation Learning
    Wei Yang, Rui Zhong, Yiqun Chen, and 4 more authors
    In Proceedings of the 33rd ACM International Conference on Multimedia (ACM MM 2025), 2025
  16. WWW 2025
    Hierarchical Sequence ID Representation of Large Language Models for Large-Scale Recommendation Systems
    Rui Zhao, Rui Zhong, Haoran Zheng, and 5 more authors
    In Companion Proceedings of The Web Conference (WWW) 2025, 2025
  17. SIGIR 2025
    Hierarchical Tree Search-Based User Lifelong Behavior Modeling on Large Language Model
    Yu Xia, Rui Zhong, Hao Gu, and 4 more authors
    In Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2025), 2025
  18. RecSys 2025
    R4ec: A Reasoning, Reflection, and Refinement Framework for Recommendation Systems
    Hao Gu, Rui Zhong, Yu Xia, and 4 more authors
    In Proceedings of the Nineteenth ACM Conference on Recommender Systems (RecSys 2025), 2025
  19. ACM MM 2024
    Enhancing Transformer-Based Semantic Matching for Few-Shot Learning through Weakly Contrastive Pre-Training
    Wei Yang, Tengfei Huo, and Zhiqiang Liu
    In Proceedings of the 32nd ACM International Conference on Multimedia (ACM MM 2024), 2024
  20. IJCAI 2024
    PTDE: Personalized Training with Distilled Execution for Multi-Agent Reinforcement Learning
    Yiqun Chen, Hangyu Mao, Jiaxin Mao, and 5 more authors
    arXiv preprint arXiv:2210.08872, 2024
  21. ACM MM 2024
    Multimodal-Aware Multi-Intention Learning for Recommendation
    Wei Yang and Qingchen Yang
    In Proceedings of the 32nd ACM International Conference on Multimedia (ACM MM 2024), 2024
  22. IJCNN 2022
    Commander-Soldiers Reinforcement Learning for Cooperative Multi-Agent Systems
    Yiqun Chen, Wei Yang, Tianle Zhang, and 2 more authors
    In 2022 International Joint Conference on Neural Networks (IJCNN 2022), 2022