Xuanhe Zhou (周煊赫)

Advisor: Prof. Guoliang Li

Tsinghua Database Group

Email: zhouxuan19@mails.tsinghua.edu.cn

[CV] [Google Scholar] [Github] [Related Works]

Brief Biography

I got my Ph.D. degree from CS of Tsinghua, mentored by Prof. Guoliang Li. My research interest lies in intelligent database optimization and data governance for AI (committed to resolving data challenges on the way to AGI🌟). I have published papers in top-tier database conferences and journals, with over 1000 citations in Google Scholar.

I am actively looking for strong and self-motivated interns interested in building intelligent database systems or data4llm. Please send me or Prof. Li an email with your CV if interested.

News

June 9, 2024 I My Two Papers (VLDB, ICDE) are selected into 2024 Highly-Cited List (2019-2023) 🎉

July 24, 2024 | Our D-Bot project is Now Sponsored by Azure AI 🎉

Under Maintenance Projects

👉 D-Bot: LLM-Based Database Diagnosis System (modelBest & tsinghua)

An LLM-based administrator that can acquire maintenance experience from textual sources, and provide reasonable, well-founded, in-time optimization advice for cloud instances.

👉 OpenMLDB: A Real-Time Feature Management System (4paradigm & tsinghua & nus)

An open-source machine learning system that computes consistent features for training and inference.

👉 DBMind: A Self-Driving Database Management Platform (openGauss & tsinghua)

Full-process autonomous database operation and maintenance capabilities, e.g., anomaly detection, root cause analysis, slow SQL optimization, index recommendation, fault self-repair, and etc.

Peer-Reviewed Publications

🟡 self monitoring 🟢 self optimization 🔴 self configuration 🔵 data structure data framework 🟤 data others 🖐🏻 AI techniques

(*indicates equal contribution)

2024

(SIGMOD) Robustness of Updatable Learning-based Index Advisors. [paper] [code] 🔴

Yihang Zheng, Chen Lin, Xian Lyu, Xuanhe Zhou, Guoliang Li, Tianqing Wang.

(VLDB) D-Bot: Database Diagnosis System using Large Language Models. [paper] [code] 🟢 BenchCouncil Top 100 Open Achievements

Xuanhe Zhou, Guoliang Li, Zhaoyan Sun, Zhiyuan Liu, Weize Chen, Jianming Wu, Jiesi Liu, Ruohang Feng, Guoyang Zeng.

(VLDB) Breaking It Down: An In-depth Study of Index Advisors [EA&B]. [code] [pypi] 🔴 Direct Accept with Shepherding

Wei Zhou*, Chen Lin*, Xuanhe Zhou*, Guoliang Li.

(VLDB Demo) Chat2Data: An Interactive Data Analysis System with RAG, Vector Databases and LLMs. [paper] [code] 🟤

Xinyang Zhao, Xuanhe Zhou, Guoliang Li.

(VLDB Tutorial) LLM for Data Management. [paper] [slides] 🟤

Guoliang Li, Xuanhe Zhou, Xinyang Zhao.

(ICDE) TRAP: Tailored Robustness Assessment for Index Advisors via Adversarial Perturbation. [paper] [code] 🔴

Wei Zhou, Chen Lin, Xuanhe Zhou, Guoliang Li.

(TKDE) Automatic Index Tuning: A Survey. [paper] 🔴

Yang Wu*, Xuanhe Zhou*, Yong Zhang, Guoliang Li

2023

(SIGMOD) Grep: A Graph Learning Based Database Partitioning System. [paper] [code] 🔴

Xuanhe Zhou, Guoliang Li, Wei Guo, Luyang Liu.

(VLDB Industry) FEBench: A Benchmark for Real-Time Relational Data Feature Extraction. [paper] [code] 🖐🏻 Best Industry Paper Runnerup Award

Xuanhe Zhou*, Cheng Chen*, Kunyi Li, Bingsheng He, Mian Lu, Qiaosheng Liu, Wei Huang, Guoliang Li, Zhao Zheng, Yuqqiang Chen.

(VLDB Demo) A Learned Query Rewrite System. [demo] 🟢

Xuanhe Zhou, Guoliang Li, Jianming Wu, Jiesi Liu, Zhaoyan Sun, Xinning Zhang.

(VLDB) Learned Index: A Comprehensive Experimental Evaluation. [paper] [code] 🔵

Zhaoyan Sun, Xuanhe Zhou, Guoliang Li.

(ICDE) DBAugur: An Adversarial-based Trend Forecasting System for Diversified Workloads. [paper] [code] 🟡

Yuanning Gao, Xiuqi Huang, Xuanhe Zhou, Xiaofeng Gao, Guoliang Li.

(NeurIPS) Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs. [paper] [code] 🖐🏻 Spotlight

Led by Damo Academy (Binyuan Hui, Yongbin Li, et al) and University of Hong Kong (Jinyang Li)

(TKDE) Automatic Database Knob Tuning: A Survey. [paper] [code] 🔴

Xinyang Zhao*, Xuanhe Zhou*, Guoliang Li.

(DSE) DB-GPT: Large Language Model Meets Database. [paper] [code] 🟢 🔴

Xuanhe Zhou*, Zhaoyan Sun*, Guoliang Li.

2022

(SIGMOD) LearnedSQLGen: Constraint-Aware SQL Generation using Reinforcement Learning. [paper] [code] 🟤

Lixi Zhang, Chengliang Chai, Xuanhe Zhou, Guoliang Li.

(VLDB) A Learned Query Rewrite System using Monte Carlo Tree Search. [paper] [code] 🟢

Xuanhe Zhou, Guoliang Li, Chengliang Chai, Jianhua Feng.

(ICDE) AutoIndex: An Incremental Index Management System for Dynamic Workloads. [paper] [code] 🔴

Xuanhe Zhou, Luyang Liu, Wenbo Li, Lianyuan Jin, Tianqing Wang, Shifu Li.

(ICDE) Adaptive Code Learning for Spark Configuration Tuning. [paper] [code] 🔴

Chen Lin, Junqing Zhuang, Jiadong Feng, Hui Li, Xuanhe Zhou, Guoliang Li.

(ICDE Tutorial) Machine Learning for Data Management: A System View. [paper] [slide]

Guoliang Li, Xuanhe Zhou.

2021

(VLDB Industry) openGauss: An Autonomous Database System. [paper] [code] Over 2.7k stars

Guoliang Li, Xuanhe Zhou (first student author), Ji Sun, Xiang Yu, Yue Han, Lianyuan Jin, Wenbo Li, Tianqing Wang, Shifu Li.

(VLDB Demo) DBMind: A Self-Driving Platform in openGauss. [paper] [code]

Xuanhe Zhou, Lianyuan Jin, Ji Sun, Xinyang Zhao, Xiang Yu, Shifu Li, Tianqing Wang, et al.

(SIGMOD Tutorial) AI Meets Database: AI4DB and DB4AI. [paper] [slide]

Guoliang Li, Xuanhe Zhou, Lei Cao, Chengliang Chai.

(VLDB Tutorial) Machine Learning for Databases. [paper]

Guoliang Li, Xuanhe Zhou, Lei Cao.

(Journal of Software) Survey of Data Management Techniques for Supporting Artificial Intelligence. [paper] 🖐🏻

Guoliang Li, Xuanhe Zhou.

2020

(VLDB) Query Performance Prediction for Concurrent Queries using Graph Embedding. [code] [paper] 🟡

Xuanhe Zhou, Ji Sun, Guoliang Li, Jianhua Feng.

(TKDE) Database meets artificial intelligence: A survey. [paper]

Xuanhe Zhou, Chengliang Chai, Guoliang Li, Ji Sun.

(Chinese Journal of Computers) Overview of database technology based on machine learning. [paper]

Guoliang, Li, Xuanhe Zhou, Sun Ji, Yu Xiang, Yuan Haitao, Liu Jiabin, and Han Yue.

2019

(VLDB) QTune: A Query-Aware Database Tuning System with Deep Reinforcement Learning. [paper] 🔴

Guoliang Li, Xuanhe Zhou, Shifu Li, Bo Gao.

(Data Eng.) Xuanyuan: An AI-Native Database. [paper]

Guoliang Li, Xuanhe Zhou, Sihao Li.

(Journal of Computer) A Survey of Machine-Learning-based Database Techniques. [paper]

Guoliang Li, Xuanhe Zhou, Ji Sun, Xiang Yu, Haitao Yuan, Jiabin Li, Yue Han.

Honors and Awards

2024 - Distinguished Doctoral Dissertation Award of Tsinghua (清华优秀博士学位论文)

2023 - VLDB Best Industry Paper RunnerUp Award (first author)

2023 - Top 100 Open Source Achievements by Benchcouncil

2022 - Outstanding Scholarship of Tsinghua University (清华特奖)

2022 - ByteDance Fellowship (10 Ph.D students)

2022 - MSRA Fellowship (12 Asian Ph.D students)

2021 - Zhongshimo Fellowship (钟士模奖学金)

2021 - Apple Scholars in AI/ML Nomination

2023, 2017 - National Scholarship

Activities

2021.10 - Tutorial of ML for Databases, AIMLSystems Conference. [website]

2021.08 - Invited Talk, The LADSIOS Workshop, VLDB Conference. [website]

2021.06 - SIGMOD Onsite Volunteer. [website]

Services

PC Member - ICDE 2025, DBML'23 (ICDE workshop), AIDB'23 (VLDB workshop);

Journal Reviewer - TKDE, VLDB Journal, JCST

Open Datasets

https://github.com/TsinghuaDatabaseGroup/datasets (Public archive)

Teaching Assistant

2019-2022 Database Systems (THU/30240262)

Online tutorials on building a basic relational database are available!!

Basic functions: https://thu-db.github.io/dbs-tutorial/

Advanced functions (by wenbo, haowen): https://thu-db.github.io/dbtrain-tutorial/

Last updated