Xuanhe Zhou (周煊赫)

[Google Scholar] [Github]

Tenure-Track Assistant Professor

Dept. of Computer Science, Shanghai Jiao Tong University (SJTU / Jiaotong)

Email: zhouxh@cs.sjtu.edu.cn

I received my Ph.D. in Computer Science from Tsinghua University, advised by Prof. Guoliang Li.

My research interests lie in intelligent database systems and data-centric AI.

We have published over 30 papers in top-tier data management conferences and journals, with over one thousand citations on Google Scholar and one best paper award (e.g., VLDB, CCF-A).

We maintain one of the most-starred AI✖️DB paper-list repositories [link] on GitHub.

I am a member of the AI SIG of the openGauss community [link].

I am actively seeking 1-2 strong, self-motivated PhD students (Spring/Fall 2026), as well as master's students and undergraduate interns. Our team has sufficient GPU resources for experiments.

News

August 20, 2024 | OpenAI adopts BIRD-SQL to showcase its fine-tuning service [news] 🎉

August 2024 | I will join the Dept. of Computer Science at SJTU as a Tenure-Track Assistant Professor 🎉

July 24, 2024 | Our D-Bot project is now sponsored by Azure AI 🎉

June 9, 2024 | Two of my papers (VLDB, ICDE) were selected for the 2024 Highly-Cited List (2019-2023) 🎉

July 24, 2023 | Our FEBench paper received the VLDB Best Industry Paper Runner-up Award [website] 🎉

October 20, 2022 | Selected as a 2022 Microsoft Research Asia Fellow [news] 🎉

Representative Projects

👉 D-Bot: LLM-Based Database Diagnosis System (with modelBest)

An LLM-based database administrator that acquires maintenance experience from textual sources and provides reasonable, well-founded, timely optimization advice for cloud instances.

👉 OpenMLDB: A Real-Time Feature Management System (with 4paradigm)

An open-source machine learning system that computes consistent features for training and inference.

👉 DBMind: A Self-Driving Database Management Platform (with openGauss)

A platform offering full-process autonomous database operation and maintenance capabilities, e.g., anomaly detection, root cause analysis, slow SQL optimization, index recommendation, and fault self-repair.

Peer-Reviewed Publications

🟡 self prediction 🟢 self optimization 🔴 self configuration 🔵 data structure data framework 🟤 others

(* indicates equal contribution)

2024

(SIGMOD) Robustness of Updatable Learning-based Index Advisors. [paper] [code] 🔴

Yihang Zheng, Chen Lin, Xian Lyu, Xuanhe Zhou, Guoliang Li, Tianqing Wang.

(VLDB) D-Bot: Database Diagnosis System using Large Language Models. [paper] [code] 🟢 BenchCouncil Top 100 Open Achievements

Xuanhe Zhou, Guoliang Li, Zhaoyan Sun, Zhiyuan Liu, Weize Chen, Jianming Wu, Jiesi Liu, Ruohang Feng, Guoyang Zeng.

(VLDB) Breaking It Down: An In-depth Study of Index Advisors [EA&B]. [code] [pypi] 🔴 Direct Accept with Shepherding (2/~1400)

Wei Zhou*, Chen Lin*, Xuanhe Zhou*, Guoliang Li.

(VLDB Demo) Chat2Data: An Interactive Data Analysis System with RAG, Vector Databases and LLMs. [paper] [code] 🟤

Xinyang Zhao, Xuanhe Zhou, Guoliang Li.

(VLDB Tutorial) LLM for Data Management. [slides] [repo] 🟤

Guoliang Li, Xuanhe Zhou, Xinyang Zhao.

(ICDE) TRAP: Tailored Robustness Assessment for Index Advisors via Adversarial Perturbation. [paper] [code] 🔴

Wei Zhou, Chen Lin, Xuanhe Zhou, Guoliang Li.

(TKDE) Automatic Index Tuning: A Survey. [paper] 🔴

Yang Wu*, Xuanhe Zhou*, Yong Zhang, Guoliang Li.

2023

(SIGMOD) Grep: A Graph Learning Based Database Partitioning System. [paper] [code] 🔴

Xuanhe Zhou, Guoliang Li, Wei Guo, Luyang Liu.

(VLDB Industry) FEBench: A Benchmark for Real-Time Relational Data Feature Extraction. [paper] [code] 🟤 Best Industry Paper Runner-up Award

Xuanhe Zhou*, Cheng Chen*, Kunyi Li, Bingsheng He, Mian Lu, Qiaosheng Liu, Wei Huang, Guoliang Li, Zhao Zheng, Yuqiang Chen.

(VLDB Demo) A Learned Query Rewrite System. [demo] 🟢

Xuanhe Zhou, Guoliang Li, Jianming Wu, Jiesi Liu, Zhaoyan Sun, Xinning Zhang.

(VLDB) Learned Index: A Comprehensive Experimental Evaluation. [paper] [code] 🔵

Zhaoyan Sun, Xuanhe Zhou, Guoliang Li.

(ICDE) DBAugur: An Adversarial-based Trend Forecasting System for Diversified Workloads. [paper] [code] 🟡

Yuanning Gao, Xiuqi Huang, Xuanhe Zhou, Xiaofeng Gao, Guoliang Li.

(NeurIPS) Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs. [paper] [code] 🖐🏻 Spotlight

Led by Damo Academy (Binyuan Hui, Yongbin Li, et al.) and University of Hong Kong (Jinyang Li).

(TKDE) Automatic Database Knob Tuning: A Survey. [paper] [code] 🔴

Xinyang Zhao*, Xuanhe Zhou*, Guoliang Li.

(DSE) DB-GPT: Large Language Model Meets Database. [paper] [code] 🟢 🔴

Xuanhe Zhou*, Zhaoyan Sun*, Guoliang Li.

2022

(SIGMOD) LearnedSQLGen: Constraint-Aware SQL Generation using Reinforcement Learning. [paper] [code] 🟤

Lixi Zhang, Chengliang Chai, Xuanhe Zhou, Guoliang Li.

(VLDB) A Learned Query Rewrite System using Monte Carlo Tree Search. [paper] [code] 🟢

Xuanhe Zhou, Guoliang Li, Chengliang Chai, Jianhua Feng.

(ICDE) AutoIndex: An Incremental Index Management System for Dynamic Workloads. [paper] [code] 🔴

Xuanhe Zhou, Luyang Liu, Wenbo Li, Lianyuan Jin, Tianqing Wang, Shifu Li.

(ICDE) Adaptive Code Learning for Spark Configuration Tuning. [paper] [code] 🔴

Chen Lin, Junqing Zhuang, Jiadong Feng, Hui Li, Xuanhe Zhou, Guoliang Li.

(ICDE Tutorial) Machine Learning for Data Management: A System View. [paper] [slide]

Guoliang Li, Xuanhe Zhou.

2021

(VLDB Industry) openGauss: An Autonomous Database System. [paper] [code] Over 2.7k stars

Guoliang Li, Xuanhe Zhou (first student author), Ji Sun, Xiang Yu, Yue Han, Lianyuan Jin, Wenbo Li, Tianqing Wang, Shifu Li.

(VLDB Demo) DBMind: A Self-Driving Platform in openGauss. [paper] [code]

Xuanhe Zhou, Lianyuan Jin, Ji Sun, Xinyang Zhao, Xiang Yu, Shifu Li, Tianqing Wang, et al.

(SIGMOD Tutorial) AI Meets Database: AI4DB and DB4AI. [paper] [slide]

Guoliang Li, Xuanhe Zhou, Lei Cao, Chengliang Chai.

(VLDB Tutorial) Machine Learning for Databases. [paper]

Guoliang Li, Xuanhe Zhou, Lei Cao.

(Journal of Software) Survey of Data Management Techniques for Supporting Artificial Intelligence. [paper] 🟤

Guoliang Li, Xuanhe Zhou.

2020

(VLDB) Query Performance Prediction for Concurrent Queries using Graph Embedding. [code] [paper] 🟡

Xuanhe Zhou, Ji Sun, Guoliang Li, Jianhua Feng.

(TKDE) Database meets artificial intelligence: A survey. [paper]

Xuanhe Zhou, Chengliang Chai, Guoliang Li, Ji Sun.

(Chinese Journal of Computers) Overview of database technology based on machine learning. [paper]

Guoliang Li, Xuanhe Zhou, Ji Sun, Xiang Yu, Haitao Yuan, Jiabin Liu, Yue Han.

2019

(VLDB) QTune: A Query-Aware Database Tuning System with Deep Reinforcement Learning. [paper] 🔴

Guoliang Li, Xuanhe Zhou, Shifu Li, Bo Gao.

(Data Eng.) Xuanyuan: An AI-Native Database. [paper]

Guoliang Li, Xuanhe Zhou, Sihao Li.

(Journal of Computer) A Survey of Machine-Learning-based Database Techniques. [paper]

Guoliang Li, Xuanhe Zhou, Ji Sun, Xiang Yu, Haitao Yuan, Jiabin Liu, Yue Han.

Honors and Awards

2024 - Distinguished Doctoral Dissertation Award of Tsinghua (清华优秀博士学位论文)

2023 - VLDB Best Industry Paper Runner-up Award (first author)

2023 - Top 100 Open Source Achievements by BenchCouncil

2022 - Outstanding Scholarship of Tsinghua University (清华特奖)

2022 - ByteDance Fellowship (10 Ph.D students)

2022 - MSRA Fellowship (12 Asian Ph.D students)

2021 - Zhongshimo Fellowship (钟士模奖学金)

2021 - Apple Scholars in AI/ML Nomination

2023, 2017 - National Scholarship

Activities

2021.10 - Tutorial of ML for Databases, AIMLSystems Conference. [website]

2021.08 - Invited Talk, The LADSIOS Workshop, VLDB Conference. [website]

2021.06 - SIGMOD Onsite Volunteer. [website]

Services

PC Member - ICDE 2025, DBML'23 (ICDE workshop), AIDB'23 (VLDB workshop)

Journal Reviewer - TKDE, VLDB Journal, ACM CSUR

Datasets

https://github.com/TsinghuaDatabaseGroup/datasets (Public archive)

Teaching

2019-2022 Database Systems (THU/30240262), teaching assistant
