distributed databases

Spatial Data Management in Apache Spark - The GeoSpark Perspective and Beyond

ASU Online Master of Computer Science - Data Systems

Designer, Graduate level, Computer Science, Coursera (over 10000 learners)

GeoSparkViz: a scalable geospatial data visualization framework in the Apache Spark ecosystem

Data Visualization allows users to summarize, analyze and reason about data. A map visualization tool frst loads the designated geospatial data, processes the data and then applies the map visualization efect. Guaranteeing detailed and accurate …

Deploy Distributed Database Course Project on Vocareum

Geospatial Visual Analytics belongs to Database Systems

SRC: Geospatial Visual Analytics Belongs to Database Systems: the BABYLON approach

The paper presents Babylon a large-scale Geospatial Visual analytics (GeoViz) system that performs the spatial data preparation and map visualization phases in the same distributed cluster.

Interactive and Scalable Exploration of Geospatial Data

Hippo in Action: Scalable Indexing of a Billion New York City Taxi Trips and Beyond

The paper demonstrates Hippo a lightweight database indexing scheme that significantly reduces the storage and maintenance overhead without compromising much on the query execution performance. Hippo stores disk page ranges instead of tuple pointers …

GeoSparkViz

GeoSparkViz is a large-scale geospatial map visualization framework. GeoSparkViz extends Apache Spark to provide native support for general cartographic design.

Data affinity for computation on Spark and Mesos