distributed databases

GeoSpark and Geospatial Data Management in Apache Spark

CSE 511 Data Processing at Scale

Instructor, Graduate level, Computer Science, Arizona State University

Demonstrating GeoSparkSim: A Scalable Microscopic Road Network Traffic Simulator Based on Apache Spark

Road network traffic data has been widely studied by researchers and practitioners in different areas such as urban planning, traffic prediction and spatial-temporal databases. The existing urban traffic simulators suffer from two critical issues (1) …

Building a Large-Scale Microscopic Road Network Traffic Simulator in Apache Spark

Building Microscopic Road Network Traffic Simulators in Apache Spark

Road network traffic data has been widely studied by researchers and practitioners in different areas such as urban planning, traffic prediction and spatial-temporal databases. For instance, researchers use such data to evaluate the impact of road …

GeoSparkSim

GeoSparkSim is a scalable traffic simulator which extends Apache Spark to generate large-scale road network traffic datasets with microscopic traffic simulation.

GeoSparkViz in Action: A Data System with built-in support for Geospatial Visualization

Visualizing data on maps is deemed a powerful tool for data scientists to make sense of geospatial data. The geospatial map visualization (abbr. MapViz) process first loads the designated geospatial data, processes the data and then applies the map …

Geospatial Data Management in Apache Spark - A Tutorial

Geospatial Data Management in Apache Spark: A Tutorial

This paper introduces GeoSpark an in-memory cluster computing framework for processing large-scale spatial data. GeoSpark consists of three layers: Apache Spark Layer, Spatial RDD Layer and Spatial Query Processing Layer. Apache Spark Layer provides …

Spatial Data Management in Apache Spark: the GeoSpark Perspective and Beyond

The paper presents the details of designing and developing GEOSPARK, which extends the core engine of Apache Spark and SparkSQL to support spatial data types, indexes, and geometrical operations at scale. The paper also gives a detailed analysis of …