Prof. Hongzhi Li

Biography

I am a Distinguished Professor with Tenure and Doctoral Supervisor at the Institute of AI for Engineering, Tongji University. I received my Ph.D. in Computer Science from Columbia University.

Previously, I served as Principal Researcher and Principal Architect at Microsoft Research and Microsoft Search & AI Division (US Headquarters); Principal Applied Science Manager and Head of GenAI Group at Microsoft AI Asia.

My research pursues the long-term goal of building scalable, knowledge-rich AI systems that can perceive, reason, and act across multimodal, real-world environments. My current research centers on autonomous AI agent systems — the next paradigm for intelligent systems. Key directions include multi-agent evolution and collaboration, enabling teams of specialized agents to coordinate, adapt, and improve through interaction; agent memory and knowledge-base construction, designing mechanisms for agents to acquire, organize, retrieve, and forget knowledge dynamically; reliable and efficient agent reasoning that ensures agents plan and act robustly under real-world constraints; and agent-environment interaction, grounding agents in tools, APIs, and physical or digital environments for end-to-end task completion.

This direction builds naturally on my recent work on large-scale AI systems and LLM trustworthiness. I led the deployment of LLMs into Bing's web-scale recommendation system (200B+ pages), developing quality-aware ranking, LLM-based candidate generation, and user-preference analytics (KDD 2025, RecSys 2024). I have also investigated core capabilities that underpin reliable agents: benchmarking in-context forgetting (ICF-Bench, ICLR 2026), improving RAG robustness against spurious features (ACL 2026), evaluating long-form narrative consistency (ACL 2026), and proposing efficient reasoning methods including self-compression (ConPress, ICML 2026) and trajectory fusion (TrajFusion, ACL 2026).

These efforts are rooted in over a decade of research on multimodal content understanding and knowledge extraction. My earlier work at Columbia University and Microsoft Research established foundations for automatically constructing event-centric knowledge bases from text, images, and video — through visual pattern mining with deep networks (PatternNet, ICMR 2018 Best Paper Poster Award), cross-media event extraction and coreference resolution (ACM Multimedia, EMNLP, NAACL), multimodal emotion reasoning (MEmoR, ACM MM 2020), object detection (CVPR 2020), and scalable visual instance mining.

Reviewer and editorial board member for ACM MM, ICME, IJCAI, IEEE TMM, IEEE TCSVT, TPAMI, JVCI, JVIS, and other venues.

Education

Ph.D. in Computer Science

Columbia University, USA

M.S. in Computer Science

Columbia University, USA

B.S. in Computer Science

Zhejiang University, China

Selected Publications

For a complete list, please visit my Google Scholar profile.

2026

Quantifying and Improving the Robustness of Retrieval-Augmented Language Models Against Spurious Features in Grounding Data

Shiping Yang, Jie Wu, Wenbiao Ding, Ning Wu, Shining Liang, Ming Gong, Hongzhi Li, Hengyuan Zhang, Angel X. Chang, Dongmei Zhang

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL), pp. 33479–33499, 2026

Lost in Stories: Consistency Bugs in Long Story Generation by LLMs

Junjie Li, Xinrui Guo, Yuhao Wu, Roy Ka-Wei Lee, Hongzhi Li, Yutao Xie

Findings of the Association for Computational Linguistics: ACL, pp. 8400–8428, 2026

Beyond Rejection Sampling: Trajectory Fusion for Scaling Mathematical Reasoning

Jie Deng, Hanshuang Tong, Jun Li, Shining Liang, Ning Wu, Hongzhi Li, Yutao Xie

Findings of the Association for Computational Linguistics: ACL, pp. 7943–7959, 2026

ConPress: Learning Efficient Reasoning from Multi-Question Contextual Pressure

Jie Deng, Shining Liang, Jun Li, Hongzhi Li, Yutao Xie

International Conference on Machine Learning (ICML), 2026

Do LLMs Forget What They Should? Evaluating In-Context Forgetting in Large Language Models

Yuli Qian, Zechuan Yang, Wenbiao Ding, Hongzhi Li, Yutao Xie

International Conference on Learning Representations (ICLR), 2026

2025

Towards Web-scale Recommendations with LLMs: From Quality-aware Ranking to Candidate Generation

Jay Shah, Iman Barjasteh, Ankur Barapatre, Rana Forsati, Gang Luo, Fei Wu, Yuchen Fang, Xia Deng, Hongzhi Li, et al.

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2025

2024

Analyzing User Preferences and Quality Improvement on Bing's WebPage Recommendation Experience with Large Language Models

Jay Shah, Gang Luo, Jing Liu, Ankur Barapatre, Fei Wu, Changhe Wang, Hongzhi Li

Proceedings of the 18th ACM Conference on Recommender Systems (RecSys), pp. 751–754, 2024

WebReco: A Comprehensive Overview of an Industrial-Scale Webpage Recommendation System at Bing

Jay Shah, Iman Barjasteh, Ankur Barapatre, Changhe Wang, Gang Luo, Rana Forsati, Jay Chu, Hongzhi Li, et al.

2024

Leveraging LLMs to Enhance a Web-Scale Webpage Recommendation System

Iman Barjasteh, Jay Shah, Ankur Barapatre, Rana Forsati, Gang Luo, Fei Wu, Hongzhi Li, et al.

2024

2020

Rethinking Classification and Localization for Object Detection

Yue Wu, Yinpeng Chen, Lu Yuan, Zicheng Liu, Lijuan Wang, Hongzhi Li, Yun Fu

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020

MEmoR: A Dataset for Multimodal Emotion Reasoning in Videos

Guangyao Shen, Xin Wang, Xuguang Duan, Hongzhi Li, Wenwu Zhu

ACM International Conference on Multimedia (ACM MM), 2020

2019

Multi-Modal Deep Analysis for Multimedia

Wenwu Zhu, Xin Wang, Hongzhi Li

IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2019

2018

PatternNet: Visual Pattern Mining with Deep Neural Network

Hongzhi Li, Joseph G. Ellis, Lei Zhang, Shih-Fu Chang

ACM International Conference on Multimedia Retrieval (ICMR), 2018 Best Poster Award

Automatic Visual Pattern Mining from Categorical Image Dataset

Hongzhi Li, Joseph G. Ellis, Lei Zhang, Shih-Fu Chang

International Journal of Multimedia Information Retrieval (IJMIR), 2018

2017

Improving Event Extraction via Multimodal Integration

Tongtao Zhang, Spencer Whitehead, Hanwang Zhang, Hongzhi Li, Joseph G. Ellis, Lifu Huang, Wei Liu, Heng Ji, Shih-Fu Chang

ACM International Conference on Multimedia (ACM MM), 2017

2016

Event Specific Multimodal Pattern Mining for Knowledge Base Construction

Hongzhi Li, Joseph G. Ellis, Heng Ji, Shih-Fu Chang

ACM International Conference on Multimedia (ACM MM), 2016

Placing Broadcast News Videos in their Social Media Context Using Hashtags

Joseph G. Ellis, Svebor Karaman, Hongzhi Li, Hong Bin Shim, Shih-Fu Chang

ACM International Conference on Multimedia (ACM MM), 2016

Cross-media Event Extraction and Recommendation

Di Lu, Clare R. Voss, Fangbo Tao, Xiang Ren, Rachel Guan, Rostyslav Korolov, Tongtao Zhang, Dongang Wang, Hongzhi Li, et al.

North American Chapter of the Association for Computational Linguistics (NAACL), 2016

Watching What and How Politicians Discuss Various Topics: A Large-Scale Video Analytics UI

Emily Song, Joseph G. Ellis, Hongzhi Li, Shih-Fu Chang

ACM International Conference on Multimedia Retrieval (ICMR), 2016

2015

Cross-document Event Coreference Resolution based on Cross-media Features

Tongtao Zhang, Hongzhi Li, Heng Ji, Shih-Fu Chang

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2015

2014

Scalable Visual Instance Mining with Threads of Features

Wei Zhang, Hongzhi Li, Chong-Wah Ngo, Shih-Fu Chang

ACM International Conference on Multimedia (ACM MM), 2014

2013

News Rover: Exploring Topical Structures and Serendipity in Heterogeneous Multimedia News Sources

Hongzhi Li*, Brendan Jou*, Joseph G. Ellis*, Daniel Morozoff*, Shih-Fu Chang

ACM International Conference on Multimedia (ACM MM), 2013

Structured Exploration of Who, What, When, and Where in Heterogeneous Multimedia News Sources

Brendan Jou*, Hongzhi Li*, Joseph G. Ellis*, Daniel Morozoff*, Shih-Fu Chang

ACM International Conference on Multimedia (ACM MM), 2013

mPano: Cloud-Based Mobile Panorama View from Single Picture

Hongzhi Li, Wenwu Zhu

SPIE Optics & Photonics — Applications of Digital Image Processing XXXVI, 2013

News Rover

Brendan Jou*, Hongzhi Li*, Joseph G. Ellis*, Daniel Morozoff*, Shih-Fu Chang

Greater New York Multimedia & Vision (GNYMV) Workshop, 2013 Best Demo Award

Joint Social and Content Recommendation for User-Generated Videos in Online Social Network

Zhi Wang, Lifeng Sun, Wenwu Zhu, Shiqiang Yang, Hongzhi Li, Dapeng Oliver Wu

IEEE Transactions on Multimedia (TMM), 2013

2012

A Novel Large-Scale Digital Forensics Service Platform for Internet Videos

Hao Yin, Wen Hui, Hongzhi Li, Chuang Lin, Wenwu Zhu

IEEE Transactions on Multimedia (TMM), vol. 14, no. 1, pp. 178–186, 2012

2011

Real-time 3D Applications on Handheld Devices: Challenges and Trend

Wenwu Zhu, Dan Miao, Hongzhi Li

IEEE COMSOC MMTC E-Letter, Vol. 6, No. 6, 2011

2010

Melog

Hongzhi Li, Xian-Sheng Hua, Xijia Liu

ACM International Conference on Multimedia (ACM MM), 2010

Melog: Mobile Experience Sharing through Automatic Multimedia Blogging

Hongzhi Li, Xian-Sheng Hua

ACM Multimedia Workshop — Mobile Cloud Media Computing, 2010

News

Biography

Education

Selected Publications

Awards & Honors

Professional Service

Program Committee

Journal Reviewer

Contact

Email

Affiliation

Google Scholar