I am now a third-year PhD candidate at the School of Computer Science, Fudan University. My Ph.D. thesis is jointly supervised by Prof. Peng Wang at Fudan Univ. and Prof. Themis Palpanas at Université Paris Cité. I enjoy working in the DSM Group of Fudan Univ. and the diNo Research group of UPC at the same time. Before that, I received my bachelor’s degree in computer science from Fudan University in 2021.

I have published papers in top database and data mining conferences and journals, including SIGMOD, VLDB, VLDB Journal and TKDE. I also serve as a reviewer for peer-reviewed journals including TKDE. I’m honored to have been invited to publish a survey paper in the prestigious IEEE Data Engineering Bulletin, special issue on High-Dimensional Similarity Search: from Time Series Management Systems to Vector Databases.

My research interests lie in the management and analysis of high-dimensional data, including:

1) High-dimensional vector indexing for approximate nearest neighbor search (ANNS)

2) Data series/Time series similarity search and analyses

3) Other important issues on high-dimensional data management (e.g., storage, compression)

4) High-dimensional indexes for AI applications (e.g., Retrieval-Augmented Generation (RAG))

Feel free to contact me at [zeyuwang21@m.fudan.edu.cn] if you are interested in my research.

Office address:

2205 Songhu Road, No.2 Interdisciplinary Research Building, E4009
Shanghai
China

You are welcome to subscribe to my Medium to discuss exciting ideas in data management!

My Chinese tech blog records my comments on academic papers and advanced techniques from BEFORE 2024.

My MBTI is INTJ.


News

  • 2024.4 🎉🎉🎉 Our new work CIVET, a state-of-the-art time series subsequence matching index, has been accepted to VLDB 2024!
  • 2024.3 Opened my Medium channel. I’m sharing my comments and thoughts on the frontier of data management here. Careful reading, deep thinking and active discussion (positive or negative) show respect for researchers.
  • 2023.9 Thrilled to share my survey comparing time series and vector similarity search in the IEEE Data Engineering Bulletin, Special Issue, Sep. 2023! paper
  • 2023.6 Very excited to participate in SIGMOD’23 in Seattle, USA! A cool experience exploring the great city with new friends in the database community!

Publications

  • Zeyu Wang, Qitong Wang, Xiaoxing Cheng, Peng Wang, Themis Palpanas, and Wei Wang. Query Hardness Measurement and Unbiased Workload Generation for Graph-Based ANN Index Evaluation. PVLDB (under revision)

  • Zeyu Wang, Qitong Wang, Peng Wang, Themis Palpanas, and Wei Wang. DumpyOS: A Data-Adaptive Multi-ary Index for Scalable Data Series Similarity Search. VLDB Journal (under revision)

  • Haoran Xiong, Hang Zhang, Zeyu Wang, Zhenying He, Peng Wang, and X. Sean Wang. CIVET: Exploring Compact Index for Variable-Length Subsequence Matching on Time Series. PVLDB 2024 (Accepted) (code)

  • Zeyu Wang, Haoran Xiong, Zhenying He, Peng Wang, and Wei Wang. Distance Comparison Operators for Approximate Nearest Neighbor Search: Exploration and Benchmark. arXiv preprint arXiv:2403.13491 Mar. 2024. (PDF, code)

  • Zeyu Wang, Zhenying He, Peng Wang, Yang Wang, and Wei Wang. Static and Streaming Discovery of Maximal Linear Representation Between Time Series. IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 36, no. 1, pp. 401-415, Jan. 2024. (website, code)

  • Zeyu Wang, Peng Wang, Themis Palpanas, and Wei Wang. Graph- and Tree-based Indexes for High-dimensional Vector Similarity Search: Analyses, Comparisons, and Future Directions. IEEE Data Engineering Bulletin 47(3), Sep. 2023. (PDF)

  • Zeyu Wang, Qitong Wang, Peng Wang, Themis Palpanas, and Wei Wang. Dumpy: A Compact and Adaptive Index for Large Data Series Collections. Proceedings of the ACM on Management of Data (PACMMOD) 1(1), 2023, presented at the ACM SIGMOD/PODS International Conference on Management of Data, Seattle, WA, USA, June 2023. (PDF, slides, video, code)

  • Hanbo Zhang, Peng Wang, Zicheng Fang, Zeyu Wang, and Wei Wang. ELIS++: A Shapelet Learning Approach for Accurate and Efficient Time Series Classification. World Wide Web (WWWJ) 24, 511–539, March 2021. (website)

Activities

Talks

  • A Revisit and the New Progress of Graph-based High-dimensional Vector Search. In DataWhale, online, May 2023. video (in Mandarin)

Teaching Assistant

  • Big Data Mining, Spring 2022, with Prof. Peng Wang
  • Database System Implementation, Fall 2022, with Prof. Peng Wang and Prof. Wei Wang

Internships

  • Research Engineer at Zilliz (2022.8 - 2023.1)
    • Worked in the research group led by Dr. Xiaomeng Yi on learned graph-based ANN indexes and graph indexes in streaming settings.
  • Big Data Engineer at Construct Tech (2020.1 - 2021.1)
    • Led the data team in building a big data platform to support intelligent analyses, recommendation, and social matching.

Awards

  • Top-50 team in the OceanBase Database Competition in 2022 (from 2023, also the National College Student Computer System Ability Competition in China).
  • Outstanding Graduate of Fudan University in 2021
  • Gold Award Team at iGEM (International Genetically Engineered Machine Competition) in 2019, Boston, USA. Presented a model of how a biological therapy takes effect for lactose intolerance at the Giant Jamboree.
  • National Second Prize in the Undergraduate Mathematical Contest in Modeling in China, 2019