Welcome to Jia Yu’s homepage

Jia is a PhD student in the Computer Science department, School of Computing, Informatics, and Decision Systems Engineering (CIDSE), Arizona State University, where he is also a member of the Data Systems Lab. Jia’s research focuses on database systems and geospatial data management. In particular, he has worked on distributed data management systems, database indexing, and data visualization. He is the main contributor to several open-source research projects such as GeoSpark, a cluster computing framework for processing big spatial data.

I am glad to review papers on database systems and geospatial data management!


  • 08/15/2019: I will teach the graduate class CSE 511 Data Processing at Scale this fall semester. This course covers the design, deployment, and use of state-of-the-art data processing systems, which provide scalable access to data.
  • 08/10/2019: A research paper, “Accelerating Spatial Data Visualization Dashboards via a Materialized Sampling Approach”, has been accepted to IEEE ICDE 2020. It was one of the few papers accepted directly without revision. Please see my publication list!
  • 07/17/2019: Gave a talk at Microsoft Research about “Designing Succinct Secondary Indexes by Exploiting Column Correlations” (video)
  • 06/06/2019: We will give a talk about “Geospatial Data Management in Apache Spark” at ApacheCon 2019 North America in Las Vegas. Let’s celebrate the 20th year of the Apache Software Foundation!
  • 06/06/2019: A research paper and a demo paper about “Scalable Microscopic Road Network Traffic Simulator in Apache Spark” have been accepted to MDM 2019 and SSTD 2019. I mentored this work.
  • 06/03/2019: I will be a Research Intern at Microsoft Research (database group) this summer! My mentor is Umar Farooq Minhas. I will work on a realistic design of updatable learned indices. My previous work includes Hippo, a lightweight sparse index (implemented in the PostgreSQL kernel; VLDB 2016, demo ICDE 2017), and Hermit, a lightweight learned secondary index that leverages column correlations (SIGMOD 2019, demo VLDB 2019).
  • 05/14/2019: Received the ASU Ira A. Fulton Schools of Engineering “Engineering Graduate Fellowship” for the 2018‐2019 academic year.
  • 05/10/2019: A research paper and a demo paper about “Succinct Learned Secondary Indexes by Exploiting Column Correlations” have been accepted to SIGMOD 2019 and VLDB 2019. This is part of my 2018 summer internship work at IBM - Almaden.
  • 04/11/2019: Presented two demo papers and one tutorial at IEEE ICDE 2019, supported by a $1875 ICDE 2019 NSF Student Travel Grant. We talked about geospatial data management in Apache Spark and geographical knowledge graph management. Our tutorial website is now online.