Prof. Hongzhi Li

Distinguished Professor

Tongji University

News

2026 Three papers accepted at ACL 2026:
  • Quantifying and Improving the Robustness of Retrieval-Augmented Language Models Against Spurious Features in Grounding Data
  • Lost in Stories: Consistency Bugs in Long Story Generation by LLMs
  • Beyond Rejection Sampling: Trajectory Fusion for Scaling Mathematical Reasoning
2026 One paper accepted at ICLR 2026: "Do LLMs Forget What They Should? Evaluating In-Context Forgetting in Large Language Models."
May 2026 Joined Tongji University as Distinguished Tenured Professor.
2026 We are hiring! Two Postdoctoral Researcher positions available in large-scale AI systems, generative AI, and multimodal analysis. Get in touch if interested.

Biography

I am a Distinguished Professor with Tenure and a Doctoral Supervisor at the Institute of Engineering Intelligence, Tongji University. I received my Ph.D. in Computer Science from Columbia University.

My research interests span the broad area of machine intelligence, including multimodal content analysis, knowledge extraction and representation, pattern recognition, and cloud-based computing. My current research focuses on large-scale AI algorithms and systems, including generative artificial intelligence, large-scale recommendation systems, and large-scale GPU cluster management and optimization.

Previously, I served as Principal Researcher and Principal Architect at Microsoft Research and the Microsoft Search & AI Division (US Headquarters), and as Principal Applied Science Manager and Head of the GenAI Group at Microsoft AI Asia.

Education

Ph.D. in Computer Science
Columbia University, USA
M.S. in Computer Science
Columbia University, USA
B.S. in Computer Science
Zhejiang University, China

Research Interests

Deep Learning for Visual Intelligence

Developing advanced deep neural network models for visual understanding, including object detection, pattern mining, and scene analysis in multimedia data.

Multimodal Content Analysis

Integrating and analyzing information across multiple modalities — text, images, video, and audio — for comprehensive content understanding.

Knowledge Extraction & Representation

Building systems that automatically extract, structure, and represent knowledge from heterogeneous multimedia sources.

Pattern Recognition

Designing algorithms for discovering and recognizing visual patterns at scale, enabling efficient mining across large-scale image and video datasets.

Cloud-Based Computing

Leveraging cloud computing platforms to deploy scalable AI solutions for real-time visual intelligence and multimedia processing.

Multimedia News Exploration

Developing systems for structured exploration and serendipitous discovery in heterogeneous multimedia news sources.

Selected Publications

For a complete list, please visit my Google Scholar profile.

2025
Towards Web-scale Recommendations with LLMs: From Quality-aware Ranking to Candidate Generation
Jay Shah, Iman Barjasteh, Ankur Barapatre, Rana Forsati, Gang Luo, Fei Wu, Yuchen Fang, Xia Deng, Hongzhi Li, et al.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2025
2024
Analyzing User Preferences and Quality Improvement on Bing's WebPage Recommendation Experience with Large Language Models
Jay Shah, Gang Luo, Jing Liu, Ankur Barapatre, Fei Wu, Changhe Wang, Hongzhi Li
Proceedings of the 18th ACM Conference on Recommender Systems (RecSys), pp. 751–754, 2024
WebReco: A Comprehensive Overview of an Industrial-Scale Webpage Recommendation System at Bing
Jay Shah, Iman Barjasteh, Ankur Barapatre, Changhe Wang, Gang Luo, Rana Forsati, Jay Chu, Hongzhi Li, et al.
2024
Leveraging LLMs to Enhance a Web-Scale Webpage Recommendation System
Iman Barjasteh, Jay Shah, Ankur Barapatre, Rana Forsati, Gang Luo, Fei Wu, Hongzhi Li, et al.
2024
2020
Rethinking Classification and Localization for Object Detection
Yue Wu, Yinpeng Chen, Lu Yuan, Zicheng Liu, Lijuan Wang, Hongzhi Li, Yun Fu
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020
MEmoR: A Dataset for Multimodal Emotion Reasoning in Videos
Guangyao Shen, Xin Wang, Xuguang Duan, Hongzhi Li, Wenwu Zhu
ACM International Conference on Multimedia (ACM MM), 2020
2019
Multi-Modal Deep Analysis for Multimedia
Wenwu Zhu, Xin Wang, Hongzhi Li
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2019
2018
PatternNet: Visual Pattern Mining with Deep Neural Network
Hongzhi Li, Joseph G. Ellis, Lei Zhang, Shih-Fu Chang
ACM International Conference on Multimedia Retrieval (ICMR), 2018 (Best Poster Award)
Automatic Visual Pattern Mining from Categorical Image Dataset
Hongzhi Li, Joseph G. Ellis, Lei Zhang, Shih-Fu Chang
International Journal of Multimedia Information Retrieval (IJMIR), 2018
2017
Improving Event Extraction via Multimodal Integration
Tongtao Zhang, Spencer Whitehead, Hanwang Zhang, Hongzhi Li, Joseph G. Ellis, Lifu Huang, Wei Liu, Heng Ji, Shih-Fu Chang
ACM International Conference on Multimedia (ACM MM), 2017
2016
Event Specific Multimodal Pattern Mining for Knowledge Base Construction
Hongzhi Li, Joseph G. Ellis, Heng Ji, Shih-Fu Chang
ACM International Conference on Multimedia (ACM MM), 2016
Placing Broadcast News Videos in their Social Media Context Using Hashtags
Joseph G. Ellis, Svebor Karaman, Hongzhi Li, Hong Bin Shim, Shih-Fu Chang
ACM International Conference on Multimedia (ACM MM), 2016
Cross-media Event Extraction and Recommendation
Di Lu, Clare R. Voss, Fangbo Tao, Xiang Ren, Rachel Guan, Rostyslav Korolov, Tongtao Zhang, Dongang Wang, Hongzhi Li, et al.
North American Chapter of the Association for Computational Linguistics (NAACL), 2016
Watching What and How Politicians Discuss Various Topics: A Large-Scale Video Analytics UI
Emily Song, Joseph G. Ellis, Hongzhi Li, Shih-Fu Chang
ACM International Conference on Multimedia Retrieval (ICMR), 2016
2015
Cross-document Event Coreference Resolution based on Cross-media Features
Tongtao Zhang, Hongzhi Li, Heng Ji, Shih-Fu Chang
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2015
2014
Scalable Visual Instance Mining with Threads of Features
Wei Zhang, Hongzhi Li, Chong-Wah Ngo, Shih-Fu Chang
ACM International Conference on Multimedia (ACM MM), 2014
2013
News Rover: Exploring Topical Structures and Serendipity in Heterogeneous Multimedia News Sources
Hongzhi Li*, Brendan Jou*, Joseph G. Ellis*, Daniel Morozoff*, Shih-Fu Chang
ACM International Conference on Multimedia (ACM MM), 2013
Structured Exploration of Who, What, When, and Where in Heterogeneous Multimedia News Sources
Brendan Jou*, Hongzhi Li*, Joseph G. Ellis*, Daniel Morozoff*, Shih-Fu Chang
ACM International Conference on Multimedia (ACM MM), 2013
mPano: Cloud-Based Mobile Panorama View from Single Picture
Hongzhi Li, Wenwu Zhu
SPIE Optics & Photonics — Applications of Digital Image Processing XXXVI, 2013
News Rover
Brendan Jou*, Hongzhi Li*, Joseph G. Ellis*, Daniel Morozoff*, Shih-Fu Chang
Greater New York Multimedia & Vision (GNYMV) Workshop, 2013 (Best Demo Award)
Joint Social and Content Recommendation for User-Generated Videos in Online Social Network
Zhi Wang, Lifeng Sun, Wenwu Zhu, Shiqiang Yang, Hongzhi Li, Dapeng Oliver Wu
IEEE Transactions on Multimedia (TMM), 2013
2012
A Novel Large-Scale Digital Forensics Service Platform for Internet Videos
Hao Yin, Wen Hui, Hongzhi Li, Chuang Lin, Wenwu Zhu
IEEE Transactions on Multimedia (TMM), vol. 14, no. 1, pp. 178–186, 2012
2011
Real-time 3D Applications on Handheld Devices: Challenges and Trend
Wenwu Zhu, Dan Miao, Hongzhi Li
IEEE COMSOC MMTC E-Letter, Vol. 6, No. 6, 2011
2010
Melog
Hongzhi Li, Xian-Sheng Hua, Xijia Liu
ACM International Conference on Multimedia (ACM MM), 2010
Melog: Mobile Experience Sharing through Automatic Multimedia Blogging
Hongzhi Li, Xian-Sheng Hua
ACM Multimedia Workshop — Mobile Cloud Media Computing, 2010

Awards & Honors

Grand Challenge Winner (1st Place)
ACM Multimedia 2012
Best Demo Award
Greater New York Multimedia & Vision (GNYMV) Workshop, 2013
Best Poster Award
ACM International Conference on Multimedia Retrieval (ICMR), 2018

Professional Service

Program Committee

  • ACM International Conference on Multimedia (ACM MM)
  • IEEE International Conference on Multimedia and Expo (ICME)
  • International Joint Conference on Artificial Intelligence (IJCAI)

Journal Reviewer

  • IEEE Transactions on Multimedia (TMM)
  • IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)
  • IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
  • Journal of Visual Communication and Image Representation (JVCI)
  • Journal of Visualization (JVIS)

Contact

Email

Affiliation

Tongji University

Google Scholar

View Profile