GeoSpark is a cluster computing system for processing large-scale spatial data. GeoSpark extends Apache Spark / SparkSQL to efficiently load, process, and analyze large-scale spatial data across machines.
Research paper: Geoinformatica Journal 2019, MDM 2019, SSDBM 2018
Demo and short paper: ICDE 2019, SSTD 2019, ICDE 2016, SIGSPATIAL 2015 (short)
Tutorial: ICDE 2019
Collaborators: Zongsi Zhang, Zishan Fu, Mohamed Sarwat (Arizona State University)
Highlight: GeoSpark has > 200K overall website visits and > 10K monthly downloads. Users and contributors include Facebook, Apple, Uber, MoBike, and numerous startups