Full Paper List
* Co-first author, † Corresponding author LLM & Operations Research & Optimization| OR-Space: A Full-Lifecycle Workspace Benchmark for Industrial Optimization Agents Chenyu Zhou, Xinyun Lu, Jiangyue Zhao, Jianghao Lin†, Dongdong Ge, Yinyu Ye Arxiv Preprint. |
| OSDN: Improving Delta Rule with Provable Online Preconditioning in Linear Attention Chenyu Zhou, Hongpei Li, Yuerou Liu, Jianghao Lin†, Dongdong Ge, Yinyu Ye Arxiv Preprint. |
| From Soliloquy to Agora: Memory-Enhanced LLM Agents with Decentralized Debate for Optimization Modeling Jianghao Lin*, Zi Ling*, Chenyu Zhou, Tianyi Xu, Ruoqing Jiang, Zizhuo Wang, Dongdong Ge Arxiv Preprint. |
| InvEvolve: Evolving White-Box Inventory Policies via Large Language Models with Performance Guarantees Chenyu Huang*, Jianghao Lin*†, Zhengyang Tang*, Bo Jiang, Ruoqing Jiang†, Benyou Wang, Lai We† Arxiv Preprint. |
| StepORLM: A Self-Evolving Framework With Generative Process Supervision For Operations Research Language Models Chenyu Zhou, Tianyi Xu, Jianghao Lin†, Dongdong Ge ICLR 2026. |
| FMIP: Joint Continuous-Integer Flow For Mixed-Integer Linear Programming Hongpei Li, Hui Yuan, Han Zhang, Jianghao Lin†, Dongdong Ge, Mengdi Wang, Yinyu Ye ICLR 2026. |
| Can LLMs Think Like Consumers? Benchmarking Crowd-Level Reaction Reconstruction with ConsumerSimBench Tianyu Wang, Jiajun Li, Jianghao Lin† Arxiv Preprint. |
| SMMBench: A Benchmark for Source-Distributed Multimodal Agent Memory Huacan Chai, Yukai Wang, Yingxuan Yang, Dan Peng, Yuanyi Song, Zhihui Fu, Weiwen Liu†, Jianghao Lin†, Jun Wang†, Weinan Zhang† Arxiv Preprint. |
| Turing Test on Screen: A Benchmark for Mobile GUI Agent Humanization Jiachen Zhu, Lingyu Yang, Rong Shan, Congmin Zheng, Zeyu Zheng, Weiwen Liu, Yong Yu, Weinan Zhang, Jianghao Lin† KDD 2026. |
| PhotoBench: Beyond Visual Matching Towards Personalized Intent-Driven Photo Retrieval Tianyi Xu, Rong Shan, Junjie Wu, Jiadeng Huang, Teng Wang, Jiachen Zhu, Wenteng Chen, Minxin Tu, Quantao Dou, Zhaoxiang Wang, Changwang Zhang†, Weinan Zhang†, Jun Wang†, Jianghao Lin† KDD 2026. |
| InfoDeepSeek: Benchmarking Agentic Information Seeking for Retrieval-Augmented Generation Yunjia Xi, Jianghao Lin†, Menghui Zhu, Yongzhao Xiao, Zhuoying Ou, Jiaqi Liu, Tong Wan, Bo Chen, Weiwen Liu, Yasheng Wang, Ruiming Tang, Weinan Zhang, Yong Yu Arxiv Preprint. |
| CodeApex: A Bilingual Programming Evaluation Benchmark for Large Language Models Lingyue Fu, Huacan Chai, Shuang Luo, Kounianhua Du, Weiming Zhang, Longteng Fan, Jiayi Lei, Renting Rui, Jianghao Lin, Yuchen Fang, Yifan Liu, Jingkuan Wang, Siyuan Qi, Kangning Zhang, Weinan Zhang, Yong Yu Arxiv Preprint. |
| DynaTree: Dynamic Agentic Retrieval Tree for Time-Sensitive News Retrieval Siyuan Qi, Xinyuan Wang, Yingxuan Yang, Haochuan Guo, Jianghao Lin, Weiwen Liu, Yong Yu, Weinan Zhang KDD 2026. |
| Anticipate and Learn: Unleashing Idle-Time Compute in Proactive Agents Haoyi Hu, Qirong Lyu, Xianghan Kong, Weiwen Liu, Jianghao Lin, Zixuan Guo, Yan Xu, Yasheng Wang, Weinan Zhang, Yong Yu Arxiv Preprint. |
| Skills on the Fly: Test-Time Adaptive Skill Synthesis for LLM Agents Jingxing Wang, Chenyu Zhou, Zhihui Fu, Jun Wang†, Weiwen Liu†, Weinan Zhang†, Jianghao Lin† Arxiv Preprint. |
| MMSkills: Towards Multimodal Skills for General Visual Agents Kangning Zhang, Shuai Shao, Qingyao Li, Jianghao Lin, Lingyue Fu, Shijian Wang, Yuan Lu, Wenjiang Jiao, Weiwen Liu, Weinan Zhang, Yong Yu Arxiv Preprint. |
| SkillMAS: Skill Co-Evolution with LLM-based Multi-Agent System Shuai Pan, Yixiang Liu, Jiaye Gao, Te Gao, Weiwen Liu†, Jianghao Lin†, Zhihui Fu, Jun Wang†, Weinan Zhang†, Yong Yu Arxiv Preprint. |
| Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering Chenyu Zhou, Huacan Chai, Wenteng Chen, Zihan Guo, Rong Shan, Yuanyi Song, Tianyi Xu, Yingxuan Yang, Aofan Yu, Weiming Zhang, Congming Zheng, Jiachen Zhu, Zeyu Zheng, Zhuosheng Zhang, Xingyu Lou, Changwang Zhang, Zhihui Fu, Jun Wang†, Weiwen Liu†, Jianghao Lin†, Weinan Zhang† Arxiv Preprint. |
| Holos: A Web-Scale LLM-Based Multi-Agent System for the Agentic Web Xiaohang Nie, Zihan Guo, Zicai Cui, Jiachi Yang, Zeyi Chen, Leheyi De, Yu Zhang, Junwei Liao, Bo Huang, Yingxuan Yang, Zhi Han, Zimian Peng, Linyao Chen, Wenzheng Tom Tang, Zongkai Liu, Tao Zhou, Botao Amber Hu, Shuyang Tang, Jianghao Lin, Weiwen Liu, Muning Wen, Yuanjian Zhou, Weinan Zhang Arxiv Preprint. |
| SkillProbe: Security Auditing for Emerging Agent Skill Marketplaces via Multi-Agent Collaboration Zihan Guo, Zhiyu Chen, Xiaohang Nie, Jianghao Lin, Yuanjian Zhou, Weinan Zhang Arxiv Preprint. |
| OSCAR: Optimization-Steered Agentic Planning for Composed Image Retrieval Teng Wang, Rong Shan, Jianghao Lin†, Junjie Wu, Tianyi Xu, Jianping Zhang, Wenteng Chen, Changwang Zhang, Zhaoxiang Wang, Weinan Zhang, Jun Wang† Arxiv Preprint. |
| A Survey of LLM-based Deep Search Agents: Paradigm, Optimization, Evaluation, and Challenges Yunjia Xi, Jianghao Lin†, Yongzhao Xiao, Zheli Zhou, Rong Shan, Te Gao, Jiachen Zhu, Weiwen Liu, Yong Yu, Weinan Zhang ACL 2026. |
| Evolutionary Perspectives on the Evaluation of LLM-Based AI Agents: A Comprehensive Survey Jiachen Zhu, Menghui Zhu, Renting Rui, Rong Shan, Congmin Zheng, Bo Chen, Yunjia Xi, Jianghao Lin, Weiwen Liu, Ruiming Tang, Yong Yu, Weinan Zhang Frontiers of Computer Science (FCS). |
| Superplatforms Have to Attack AI Agents Jianghao Lin, Jiachen Zhu, Zheli Zhou, Yunjia Xi, Weiwen Liu, Yong Yu, Weinan Zhang Arxiv Preprint. |
| The Real Barrier to LLM Agent Usability is Agentic ROI Weiwen Liu, Jiarui Qin, Xu Huang, Xingshan Zeng, Yunjia Xi, Jianghao Lin, Chuhan Wu, Yasheng Wang, Lifeng Shang, Ruiming Tang, Defu Lian, Yong Yu, Weinan Zhang Arxiv Preprint. |
| A Survey of AI Agent Protocols Yingxuan Yang, Huacan Chai, Yuanyi Song, Siyuan Qi, Muning Wen, Ning Li, Junwei Liao, Haoyi Hu, Jianghao Lin†, Gaowei Chang, Weiwen Liu, Ying Wen, Yong Yu, Weinan Zhang Arxiv Preprint. |
| Agentic Information Retrieval Weinan Zhang, Junwei Liao, Ning Li, Kounianhua Du, Jianghao Lin† Arxiv Preprint. |
| Sell It Before You Make It: Revolutionizing E-Commerce with Personalized AI-Generated Items Jianghao Lin, Peng Du, Jiaqi Liu, Weite Li, Yong Yu, Weinan Zhang, Yang Cao KDD 2026. |
| DiffCold: A Diffusion-based Generative Model for Cold-Start Item Recommendation Kangning Zhang, Jianghao Lin†, Yingjie Qin, Weinan Zhang†, Yong Yu ECML-PKDD 2026. |
| Diffusion Models for Recommender Systems: From Content Distribution To Content Creation Jianghao Lin, Yang Cao, Yong Yu, Weinan Zhang KDD 2025. |
| A Survey on Diffusion Models for Recommender Systems Jianghao Lin, Jiaqi Liu, Jiachen Zhu, Yunjia Xi, Chengkai Liu, Yangtian Zhang, Yong Yu, Weinan Zhang Arxiv Preprint. |
| Contexting as Recommendation: Evolutionary Collaborative Filtering for Context Engineering Jiachen Zhu, Zhuoying Ou, Congmin Zheng, Yuxiang Chen, Zeyu Zheng, Rong Shan, Lingyu Yang, Lionel Z. Wang, Weiwen Liu, Yong Yu, Weinan Zhang, Jianghao Lin† Arxiv Preprint. |
| Hölder Policy Optimisation Yuxiang Chen, Dingli Liang, Yihang Chen, Ziqin Gong, Chenyang Le, Zhaokai Wang, Jiachen Zhu, Lingyu Yang, Jianghao Lin, Weinan Zhang, Jun Wang Arxiv Preprint. |
| MoE-SpAc: Efficient MoE Inference Based on Speculative Activation Utility in Heterogeneous Edge Scenarios Shuhuai Li, Jianghao Lin†, Dongdong Ge, Yinyu Ye Arxiv Preprint. |
| A Survey of Process Reward Models: From Outcome Signals to Process Supervisions for Large Language Models Congming Zheng, Jiachen Zhu, Zhuoying Ou, Yuxiang Chen, Kangning Zhang, Rong Shan, Zeyu Zheng, Mengyue Yang, Jianghao Lin†, Yong Yu, Weinan Zhang† ACL 2026. |
| ToolPRM: Fine-Grained Inference Scaling of Structured Outputs for Function Calling Jianghao Lin, Yuanyuan Shi, Xin Peng, Renjie Ding, Hairui Wang, Yuxuan Peng, Bizhe Bai, Weixi Song, Fengshuo Bai, Huacan Chai, Weinan Zhang, Fei Huang, Ying Wen ACL 2026. |
| PARL-MT: Learning to Call Functions in Multi-Turn Conversation with Progress Awareness Huacan Chai, Zijie Cao, Maolin Ran, Yingxuan Yang, Jianghao Lin, Pengxin Guo, Hairui Wang, Renjie Ding, Ziyu Wan, Muning Wen, Weiwen Liu, Weinan Zhang, Fei Huang, Ying Wen ACL 2026 Findings. |
| CoLD: Counterfactually-Guided Length Debiasing for Process Reward Models Congmin Zheng, Jiachen Zhu, Jianghao Lin, Xinyi Dai, Yong Yu, Weinan Zhang, Mengyue Yang Arxiv Preprint. |
| Retrieval-Augmented Process Reward Model for Generalizable Mathematical Reasoning Jiachen Zhu, Congmin Zheng, Jianghao Lin, Kounianhua Du, Ying Wen, Yong Yu, Jun Wang, Weinan Zhang ACL 2025 Findings. |
| Modular Representation Compression: Adapting LLM Representations for Efficient and Effective Recommendation Yunjia Xi, Menghui Zhu, Jianghao Lin†, Bo Chen, Ruiming Tang, Yong Yu, Weinan Zhang SIGIR 2026. |
| MuonRec: Shifting the Optimizer Paradigm Beyond Adam in Scalable Generative Recommendation Rong Shan, Aofan Yu, Bo Chen, Kuo Cai, Qiang Luo, Ruiming Tang, Han Li, Weiwen Liu, Weinan Zhang, Jianghao Lin† Arxiv Preprint. |
| How Can Recommender Systems Benefit from Large Language Models: A Survey Jianghao Lin, Xinyi Dai, Yunjia Xi, Weiwen Liu, Bo Chen, Hao Zhang, Yong Liu, Chuhan Wu, Xiangyang Li, Chenxu Zhu, Huifeng Guo, Yong Yu, Ruiming Tang, Weinan Zhang ACM Transactions on Information Systems (TOIS). |
| LIBER: Lifelong User Behavior Modeling Based on Large Language Models Rong Shan, Chenxu Zhu, Shigang Quan, Bo Chen, Jianghao Lin†, Xiaoling Cai, Hong Zhu, Xiangyang Li, Yunjia Xi, Weinan Zhang, Ruiming Tang† ECML-PKDD 2026. |
| Lifelong Personalized Low-Rank Adaptation of Large Language Models for Recommendation Jiachen Zhu, Jianghao Lin, Xinyi Dai, Bo Chen, Rong Shan, Jieming Zhu, Ruiming Tang, Yong Yu, Weinan Zhang Arxiv Preprint. |
| Generative Representational Learning of Foundation Models for Recommendation Zheli Zhou, Chenxu Zhu, Jianghao Lin†, Bo Chen, Ruiming Tang, Weinan Zhang†, Yong Yu DASFAA 2026. |
| An Automatic Graph Construction Framework based on Large Language Models for Recommendation Rong Shan, Jianghao Lin†, Chenxu Zhu, Bo Chen, Menghui Zhu, Kangning Zhang, Jieming Zhu, Ruiming Tang, Yong Yu, Weinan Zhang KDD 2025. |
| Efficiency Unleashed: Inference Acceleration for LLM-based Recommender Systems with Speculative Decoding Yunjia Xi, Hangyu Wang, Bo Chen, Jianghao Lin†, Menghui Zhu, Weiwen Liu, Ruiming Tang, Zhewei Wei, Weinan Zhang, Yong Yu SIGIR 2025. |
| Action First: Leveraging Preference-Aware Actions for More Effective Decision-Making in Interactive Recommender Systems Renting Rui, Yunjia Xi, Weiwen Liu, Jianghao Lin, Bo Chen, Ruiming Tang, Weinan Zhang, Yong Yu SIGIR 2025. |
| Full-Stack Optimized Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation Rong Shan, Jiachen Zhu, Jianghao Lin, Chenxu Zhu, Bo Chen, Ruiming Tang, Yong Yu, Weinan Zhang ACM Transactions on Recommender Systems (TORS). |
| Efficient and Deployable Knowledge Infusion for Open-World Recommendations via Large Language Models Yunjia Xi, Weiwen Liu, Jianghao Lin, Muyan Weng, Xiaoling Cai, Hong Zhu, Jieming Zhu, Bo Chen, Ruiming Tang, Yong Yu, Weinan Zhang ACM Transactions on Recommender Systems (TORS). |
| An Efficient Approximation Framework for LLM-Enhanced Recommendation Huacan Chai, Menghui Zhu, Jianghao Lin, Yunjia Xi, Weinan Zhang, Yong Yu ICIC 2025. |
| DisCo: Towards Harmonious Disentanglement and Collaboration between Tabular and Semantic Space for Recommendation Kounianhua Du, Jizheng Chen, Jianghao Lin, Yunjia Xi, Hangyu Wang, Xinyi Dai, Bo Chen, Ruiming Tang, Weinan Zhang KDD 2024. |
| ReLLa: Retrieval-enhanced Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation Jianghao Lin, Rong Shan, Chenxu Zhu, Kounianhua Du, Bo Chen, Shigang Quan, Ruiming Tang, Yong Yu, Weinan Zhang WWW 2024. |
| ClickPrompt: CTR Models are Strong Prompt Generators for Adapting Language Models to CTR Prediction Jianghao Lin, Bo Chen, Hangyu Wang, Yunjia Xi, Yanru Qu, Xinyi Dai, Kangning Zhang, Ruiming Tang, Yong Yu, Weinan Zhang WWW 2024. |
| MemoCRS: Memory-enhanced Sequential Conversational Recommender Systems with Large Language Models Yunjia Xi, Weiwen Liu, Jianghao Lin, Bo Chen, Ruiming Tang, Weinan Zhang, Yong Yu CIKM 2024. |
| ELCoRec: Enhance Language Understanding with Co-Propagation of Numerical and Categorical Features for Recommendation Jizheng Chen, Kounianhua Du, Jianghao Lin, Bo Chen, Ruiming Tang, Weinan Zhang CIKM 2024. |
| FLIP: Towards Fine-grained Alignment between ID-based Models and Pretrained Language Models for CTR Prediction Hangyu Wang*, Jianghao Lin*, Xiangyang Li, Bo Chen, Chenxu Zhu, Ruiming Tang, Weinan Zhang, Yong Yu RecSys 2024. |
| Towards Open-World Recommendation with Knowledge Augmentation from Large Language Models Yunjia Xi, Weiwen Liu, Jianghao Lin, Jieming Zhu, Bo Chen, Ruiming Tang, Weinan Zhang, Rui Zhang, Yong Yu RecSys 2024. DLP-RecSys 2023 (Best Paper Award). |
| Play to Your Strengths: Collaborative Intelligence of Conventional Recommender Models and Large Language Models Yunjia Xi, Weiwen Liu, Jianghao Lin, Chuhan Wu, Bo Chen, Ruiming Tang, Weinan Zhang, Yong Yu CCIR 2024. |
| Towards Efficient and Effective Unlearning of Large Language Models for Recommendation Hangyu Wang*, Jianghao Lin*, Bo Chen, Yang Yang, Ruiming Tang, Weinan Zhang, Yong Yu Frontiers of Computer Science (FCS). |
| Large Language Models Make Sample-Efficient Recommender Systems Jianghao Lin, Xinyi Dai, Rong Shan, Bo Chen, Ruiming Tang, Yong Yu, Weinan Zhang Frontiers of Computer Science (FCS). |
| MOTOR: Learning ID-free Item Representation with Token Crossing for Embedding-based Multimodal Recommendation Kangning Zhang, Jiarui Jin, Yingjie Qin, Ruilong Su, Jianghao Lin†, Yong Yu, Weinan Zhang† ECML-PKDD 2026. |
| DLF: Enhancing Explicit-Implicit Interaction via Dynamic Low-Order-Aware Fusion for CTR Prediction Kefan Wang, Hao Wang, Wei Guo, Yong Liu, Jianghao Lin, Defu Lian, Enhong Chen SIGIR 2025. |
| Unleashing the Potential of Multi-Channel Fusion in Retrieval for Personalized Recommendations Junjie Huang, Jiarui Qin, Jianghao Lin, Ziming Feng, Yong Yu, Weinan Zhang WWW 2025. |
| A Comprehensive Survey on Retrieval Methods in Recommender Systems Junjie Huang, Jizheng Chen, Jianghao Lin, Jiarui Qin, Ziming Feng, Weinan Zhang, Yong Yu ACM Transactions on Information Systems (TOIS). |
| Beyond Positive History: Re-ranking with List-level Hybrid Feedback Muyan Weng, Yunjia Xi, Weiwen Liu, Bo Chen, Jianghao Lin, Ruiming Tang, Weinan Zhang, Yong Yu Arxiv Preprint. |
| M-scan: A Multi-Scenario Causal-driven Adaptive Network for Recommendation Jiachen Zhu, Yichao Wang, Jianghao Lin, Jiarui Qin, Ruiming Tang, Weinan Zhang, Yong Yu WWW 2024. |
| Retrieval-Oriented Knowledge for Click-Through Rate Prediction Huanshuo Liu, Bo Chen, Menghui Zhu, Jianghao Lin, Jiarui Qin, Yang Yang, Hao Zhang, Ruiming Tang CIKM 2024. |
| Behavior-Dependent Linear Recurrent Units for Efficient Sequential Recommendation Chengkai Liu, Jianghao Lin, Jianling Wang, Hanzhou Liu, James Caverlee CIKM 2024. |
| Mamba4Rec: Towards Efficient Sequential Recommendation with Selective State Space Models Chengkai Liu, Jianghao Lin, Hanzhou Liu, Jianling Wang, James Caverlee RelKD-KDD 2024 (Best Paper Award). |
| Invariant Graph Contrastive Learning for Mitigating Neighborhood Bias in Graph Neural Network based Recommender Systems Zhenyu Mu, Jianghao Lin, Xiaoyu Zhu, Weinan Zhang, Yong Yu ICANN 2024. |
| MAP: A Model-agnostic Pretraining Framework for Click-through Rate Prediction Jianghao Lin, Yanru Qu, Wei Guo, Xinyi Dai, Ruiming Tang, Yong Yu, Weinan Zhang KDD 2023. |
| A Bird's-eye View of Reranking: from List Level to Page Level Yunjia Xi*, Jianghao Lin*, Weiwen Liu, Xinyi Dai, Weinan Zhang, Rui Zhang, Ruiming Tang, Yong Yu WSDM 2023. |
| Position: Academic Conferences are Potentially Facing Denominator Gaming Caused by Fully Automated Scientific Agents Rong Shan, Te Gao, Hang Zheng, Yunjia Xi, Jiachen Zhu, Zeyu Zheng, Yong Yu, Weinan Zhang, Jianghao Lin† ICML 2026 (Position Track). |
| Stop DDoS Attacking the Research Community with AI-Generated Survey Papers Jianghao Lin, Rong Shan, Jiachen Zhu, Yunjia Xi, Yong Yu, Weinan Zhang NeurIPS 2025 (Position Track). |
| A Retrieval-Enhanced Click Model for Web Search Yao Li, Jianghao Lin, Weiwen Liu, Weinan Zhang APWeb 2025. |
| An F-shape Click Model for Information Retrieval on Multi-block Mobile Pages Lingyue Fu*, Jianghao Lin*, Weiwen Liu, Ruiming Tang, Weinan Zhang, Rui Zhang, Yong Yu WSDM 2023. |
| Adversarially Trained Environment Models Are Effective Policy Evaluators and Improvers - An Application to Information Retrieval Yao Li, Yifan Liu, Xinyi Dai, Jianghao Lin, Hang Lai, Yunfei Liu, Yong Yu DAI 2023. |
| A Graph-Enhanced Click Model for Web Search Jianghao Lin, Weiwen Liu, Xinyi Dai, Weinan Zhang, Shuai Li, Ruiming Tang, Xiuqiang He, Jianye Hao, Jun Wang, Yong Yu SIGIR 2021. |
| An Adversarial Imitation Click Model for Information Retrieval Xinyi Dai, Jianghao Lin, Weinan Zhang, Shuai Li, Weiwen Liu, Ruiming Tang, Xiuqiang He, Jianye Hao, Jun Wang, Yong Yu WWW 2021. |
| LLM4CD: Leveraging Large Language Models for Open-World Knowledge Augmented Cognitive Diagnosis Weiming Zhang, Lingyue Fu, Qingyao Li, Kounianhua Du, Jianghao Lin, Jingwei Yu, Wei Xia, Weinan Zhang, Ruiming Tang, Yong Yu CIKM 2025. |
| AdvKT: An Adversarial Multi-Step Training Framework for Knowledge Tracing Lingyue Fu, Ting Long, Jianghao Lin, Wei Xia, Xinyi Dai, Ruiming Tang, Yasheng Wang, Weinan Zhang, Yong Yu ECML-PKDD 2025. |
| SINKT: A Structure-Aware Inductive Knowledge Tracing Model with Large Language Model Lingyue Fu, Hao Guan, Kounianhua Du, Jianghao Lin, Wei Xia, Weinan Zhang, Ruiming Tang, Yasheng Wang, Yong Yu CIKM 2024. |
| Sample-Efficient Deep Reinforcement Learning of Mobile Manipulation for 6-DOF Trajectory Following Yifan Zhou, Qiyu Feng, Yixuan Zhou, Jianghao Lin, Zhe Liu, Hesheng Wang IEEE Transactions on Automation Science and Engineering (T-ASE). |
| Learning Ball-Balancing Robot through Deep Reinforcement Learning Yifan Zhou, Jianghao Lin, Shuai Wang, Chong Zhang ICCCR 2021. |