Xuanhe Zhou (周煊赫)
[Google Scholar] [Github]
Tenure-Track Assistant Professor
Dep. of Computer Science, Shanghai Jiao Tong University (SJTU / Jiaotong)
Email: zhouxh@cs.sjtu.edu.cn
I got my Ph.D. degree from CS of Tsinghua, advised by Prof. Guoliang Li.
My research interest lies in intelligent database system and data-centric AI.
We have published over 30 papers in top-tier data management conferences and journals, with over one thousand of citations in Google Scholar and one best paper award (e.g., VLDB CCF-A).
We have been maintaining a most-starred repository of AI✖️DB paper list [link] at Github.
I am the AI SIG member of openGauss community [link].
I am actively seeking for strong and self-motivated 1-2 PhD students (Spring/Fall 2026), master students, and undergrad interns. Our Team owns GPU cards sufficient for experiments.
News
Nov 4, 2024 | BMTools (with 2.9k stars at Github) is accepted by CSUR 🎉
August 20, 2024 | OpenAI adopts BIRD-SQL to show their finetuning service [news] 🎉
August, 2024 | I will join SJTU, Dep. of Computer Science, as Tenure-Track Assistant Professor 🎉
July 24, 2024 | Our D-Bot project is Now Sponsored by Azure AI 🎉
June 9, 2024 I My Two Papers (VLDB, ICDE) are selected into 2024 Highly-Cited List (2019-2023) 🎉
July 24, 2023 | Our FEBench paper is awarded the [website] 🎉
October 20, 2022 | 2022 Microsoft Research Asia Fellow [news] 🎉
Representative Projects
👉 D-Bot: LLM-Based Database Diagnosis System (with modelBest)
An LLM-based administrator that can acquire maintenance experience from textual sources, and provide reasonable, well-founded, in-time optimization advice for cloud instances.
👉 OpenMLDB: A Real-Time Feature Management System (with 4paradigm)
An open-source machine learning system that computes consistent features for training and inference.
👉 DBMind: A Self-Driving Database Management Platform (with openGauss)
Full-process autonomous database operation and maintenance capabilities, e.g., anomaly detection, root cause analysis, slow SQL optimization, index recommendation, fault self-repair, and etc.
Peer-Reviewed Publications
🟡 self prediction 🟢 self optimization 🔴 self configuration 🔵 data structure ⚫ data framework 🟤 others
(*indicates equal contribution)
2024
(SIGMOD) Robustness of Updatable Learning-based Index Advisors. [paper] [code] 🔴
Yihang Zheng, Chen Lin, Xian Lyu, Xuanhe Zhou, Guoliang Li, Tianqing Wang.
(VLDB) D-Bot: Database Diagnosis System using Large Language Models. [paper] [code] 🟢 BenchCouncil Top 100 Open Achievements
Xuanhe Zhou, Guoliang Li, Zhaoyan Sun, Zhiyuan Liu, Weize Chen, Jianming Wu, Jiesi Liu, Ruohang Feng, Guoyang Zeng.
(VLDB) Breaking It Down: An In-depth Study of Index Advisors [EA&B]. [code] [pypi] 🔴 Direct Accept with Shepherding
Wei Zhou*, Chen Lin*, Xuanhe Zhou*, Guoliang Li.
(VLDB Demo) Chat2Data: An Interactive Data Analysis System with RAG, Vector Databases and LLMs. [paper] [code] 🟤
Xinyang Zhao, Xuanhe Zhou, Guoliang Li.
(VLDB Tutorial) LLM for Data Management. [slides] [repo] 🟤
Guoliang Li, Xuanhe Zhou, Xinyang Zhao.
(ICDE) TRAP: Tailored Robustness Assessment for Index Advisors via Adversarial Perturbation. [paper] [code] 🔴
Wei Zhou, Chen Lin, Xuanhe Zhou, Guoliang Li.
(TKDE) Automatic Index Tuning: A Survey. [paper] 🔴
Yang Wu*, Xuanhe Zhou*, Yong Zhang, Guoliang Li
(CSUR) Tool Learning with Foundation Models. [paper] [repo] 🟤 2.9k stars
2023
(SIGMOD) Grep: A Graph Learning Based Database Partitioning System. [paper] [code] 🔴
Xuanhe Zhou, Guoliang Li, Wei Guo, Luyang Liu.
(VLDB Industry) FEBench: A Benchmark for Real-Time Relational Data Feature Extraction. [paper] [code] 🟤 Best Industry Paper Runnerup Award
Xuanhe Zhou*, Cheng Chen*, Kunyi Li, Bingsheng He, Mian Lu, Qiaosheng Liu, Wei Huang, Guoliang Li, Zhao Zheng, Yuqqiang Chen.
(VLDB Demo) A Learned Query Rewrite System. [demo] 🟢
Xuanhe Zhou, Guoliang Li, Jianming Wu, Jiesi Liu, Zhaoyan Sun, Xinning Zhang.
(VLDB) Learned Index: A Comprehensive Experimental Evaluation. [paper] [code] 🔵
Zhaoyan Sun, Xuanhe Zhou, Guoliang Li.
(ICDE) DBAugur: An Adversarial-based Trend Forecasting System for Diversified Workloads. [paper] [code] 🟡
Yuanning Gao, Xiuqi Huang, Xuanhe Zhou, Xiaofeng Gao, Guoliang Li.
(NeurIPS) Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs. [paper] [code] 🖐🏻 Spotlight
Led by Damo Academy (Binyuan Hui, Yongbin Li, et al) and University of Hong Kong (Jinyang Li)
(TKDE) Automatic Database Knob Tuning: A Survey. [paper] [code] 🔴
Xinyang Zhao*, Xuanhe Zhou*, Guoliang Li.
(DSE) DB-GPT: Large Language Model Meets Database. [paper] [code] 🟢 🔴
Xuanhe Zhou*, Zhaoyan Sun*, Guoliang Li.
2022
(SIGMOD) LearnedSQLGen: Constraint-Aware SQL Generation using Reinforcement Learning. [paper] [code] 🟤
Lixi Zhang, Chengliang Chai, Xuanhe Zhou, Guoliang Li.
(VLDB) A Learned Query Rewrite System using Monte Carlo Tree Search. [paper] [code] 🟢
Xuanhe Zhou, Guoliang Li, Chengliang Chai, Jianhua Feng.
(ICDE) AutoIndex: An Incremental Index Management System for Dynamic Workloads. [paper] [code] 🔴
Xuanhe Zhou, Luyang Liu, Wenbo Li, Lianyuan Jin, Tianqing Wang, Shifu Li.
(ICDE) Adaptive Code Learning for Spark Configuration Tuning. [paper] [code] 🔴
Chen Lin, Junqing Zhuang, Jiadong Feng, Hui Li, Xuanhe Zhou, Guoliang Li.
(ICDE Tutorial) Machine Learning for Data Management: A System View. [paper] [slide]
Guoliang Li, Xuanhe Zhou.
2021
(VLDB Industry) openGauss: An Autonomous Database System. [paper] [code] ⚫ Over 2.7k stars
Guoliang Li, Xuanhe Zhou (first student author), Ji Sun, Xiang Yu, Yue Han, Lianyuan Jin, Wenbo Li, Tianqing Wang, Shifu Li.
(VLDB Demo) DBMind: A Self-Driving Platform in openGauss. [paper] [code] ⚫
Xuanhe Zhou, Lianyuan Jin, Ji Sun, Xinyang Zhao, Xiang Yu, Shifu Li, Tianqing Wang, et al.
(SIGMOD Tutorial) AI Meets Database: AI4DB and DB4AI. [paper] [slide]
Guoliang Li, Xuanhe Zhou, Lei Cao, Chengliang Chai.
(VLDB Tutorial) Machine Learning for Databases. [paper]
Guoliang Li, Xuanhe Zhou, Lei Cao.
(Journal of Software) Survey of Data Management Techniques for Supporting Artificial Intelligence. [paper] 🟤
Guoliang Li, Xuanhe Zhou.
2020
(VLDB) Query Performance Prediction for Concurrent Queries using Graph Embedding. [code] [paper] 🟡
Xuanhe Zhou, Ji Sun, Guoliang Li, Jianhua Feng.
(TKDE) Database meets artificial intelligence: A survey. [paper]
Xuanhe Zhou, Chengliang Chai, Guoliang Li, Ji Sun.
(Chinese Journal of Computers) Overview of database technology based on machine learning. [paper]
Guoliang, Li, Xuanhe Zhou, Sun Ji, Yu Xiang, Yuan Haitao, Liu Jiabin, and Han Yue.
2019
(VLDB) QTune: A Query-Aware Database Tuning System with Deep Reinforcement Learning. [paper] 🔴
Guoliang Li, Xuanhe Zhou, Shifu Li, Bo Gao.
(Data Eng.) Xuanyuan: An AI-Native Database. [paper] ⚫
Guoliang Li, Xuanhe Zhou, Sihao Li.
(Journal of Computer) A Survey of Machine-Learning-based Database Techniques. [paper]
Guoliang Li, Xuanhe Zhou, Ji Sun, Xiang Yu, Haitao Yuan, Jiabin Li, Yue Han.
Honors and Awards
2024 - Distinguished Doctoral Dissertation Award of Tsinghua (清华优秀博士学位论文)
2023 - VLDB Best Industry Paper RunnerUp Award (first author)
2023 - Top 100 Open Source Achievements by Benchcouncil
2022 - Outstanding Scholarship of Tsinghua University (清华特奖)
2022 - ByteDance Fellowship (10 Ph.D students)
2022 - MSRA Fellowship (12 Asian Ph.D students)
2021 - Zhongshimo Fellowship (钟士模奖学金)
2021 - Apple Scholars in AI/ML Nomination
2023, 2017 - National Scholarship
Activities
2021.10 - Tutorial of ML for Databases, AIMLSystems Conference. [website]
2021.08 - Invited Talk, The LADSIOS Workshop, VLDB Conference. [website]
2021.06 - SIGMOD Onsite Volunteer. [website]
Services
PC Member - ICDE 2025, DBML'23 (ICDE workshop), AIDB'23 (VLDB workshop);
Journal Reviewer - TKDE, VLDB Journal, ACM CSUR
Datasets
https://github.com/TsinghuaDatabaseGroup/datasets (Public archive)
Teaching
2019-2022 Database Systems (THU/30240262), teaching assistant
Online Tutorial for Basic functions: https://thu-db.github.io/dbs-tutorial/
Online Tutorial for Advanced functions: https://thu-db.github.io/dbtrain-tutorial/
Last updated