Analyzing LiDAR and SAR Data with Capella Space and TileDB

Originally published: Apr 28, 2022

Table Of Contents:

TileDB at a Glance

Why TileDB for LiDAR

Why TileDB for SAR

The Holy Grail: Data fusion and ML

Data and Notebooks

This blog post summarizes a recent webinar that I hosted together with Jason Brown (Remote Sensing Image Scientist and Community Enablement Engineer at Capella Space) and Norman Barker (VP of Geospatial at TileDB). We shared our experience analyzing SAR data using TileDB Cloud, speeding up data exploration and maximizing collaboration among researchers and citizen scientists alike. SAR is an exciting technology that can produce high-quality earth imaging even in the presence of clouds or at night, and Capella Space is an amazing company that produces high-quality SAR imagery products.

Here is the full webinar recording:

Special thanks go to Vicky Liau and Margriet Groenendijk from the TileDB team, who contributed immensely to the technical materials presented in the webinar. Also, a big thanks to Hobu, Inc. for providing us with the open LAS dataset we used in the demos.

TileDB at a Glance

TileDB, Inc. is building a universal database, which can store all types of data in a canonical format (dense or sparse multi-dimensional arrays) and analyze it with a growing set of language APIs and open-source tools. It has two offerings:

  1. TileDB Embedded: An open-source C++ library that serves as the storage engine. It implements a cloud-native multi-dimensional array data format and offers superb performance in terms of IO and compression. It comes with a variety of APIs, including C, C++, C#, Python, R, Java, and Go.

  2. TileDB Cloud: A full-fledged data management platform, which offers authentication, access control, logging, hosted Jupyter notebooks, scalable serverless compute, and a growing ecosystem for sharing data and code publicly and privately.

Due to its universality, TileDB can efficiently manage both LiDAR (sparse point clouds) and SAR (dense images) data, in a single solution and with a unified API.
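
To make the dense-versus-sparse distinction concrete, here is a minimal plain-Python sketch of the two array models. It does not use the TileDB API; the DenseArray and SparseArray classes and the sample values are hypothetical and exist only for illustration. A dense array stores a value in every cell (like a SAR image), a sparse array stores only the non-empty cells together with their coordinates (like a LiDAR point cloud), and both can answer the same kind of rectangular range query.

```python
from dataclasses import dataclass, field


@dataclass
class DenseArray:
    """Hypothetical dense model: every cell holds a value (e.g. a SAR pixel)."""
    cells: list  # list of rows, each a list of floats

    def query(self, rows, cols):
        """Return {(row, col): value} for half-open ranges rows and cols."""
        return {
            (r, c): self.cells[r][c]
            for r in range(*rows)
            for c in range(*cols)
        }


@dataclass
class SparseArray:
    """Hypothetical sparse model: only non-empty cells are stored (e.g. LiDAR returns)."""
    points: dict = field(default_factory=dict)  # {(row, col): value}

    def query(self, rows, cols):
        """Same range query as DenseArray, but only stored cells can match."""
        return {
            (r, c): v
            for (r, c), v in self.points.items()
            if rows[0] <= r < rows[1] and cols[0] <= c < cols[1]
        }


# Toy data: a 3x3 "image" and two "points" on the same grid.
sar = DenseArray(cells=[[0.1, 0.2, 0.3], [0.4, 0.5, 0.6], [0.7, 0.8, 0.9]])
lidar = SparseArray(points={(0, 1): 12.5, (2, 2): 30.0})

# The same 2x2 window is queried against both models.
dense_hit = sar.query((0, 2), (0, 2))     # four cells, one per pixel
sparse_hit = lidar.query((0, 2), (0, 2))  # only the one stored point in range
```

The point of the sketch is the shared query interface: once both kinds of data are addressed by coordinates, the same slicing logic applies to images and point clouds.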

Why TileDB for LiDAR

Why TileDB for SAR

The Holy Grail: Data fusion and ML

Before TileDB, SAR and LiDAR data were treated as two very different kinds of data: users would typically store SAR images as TIFF files and LiDAR point clouds as LAS/LAZ files. Yet fusing SAR with LiDAR data opens up many opportunities for extracting valuable insights, such as producing higher-quality point clouds colored with the help of SAR imaging, or creating training datasets by superimposing labeled LiDAR data on SAR images, for object detection and classification in future imagery acquisitions. To get there, data scientists would have to manually wrangle the different file formats and data models, build a colossal infrastructure for scalable compute, and integrate with state-of-the-art machine learning tools and other analytics software.
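
As a rough illustration of the first fusion idea, coloring a point cloud with SAR intensity, the sketch below looks up the image pixel that contains each point. It is plain Python rather than TileDB code, and everything in it is an assumption made for the example: the colorize_points helper, the origin and pixel-size georeferencing, and the toy values. It also assumes both datasets already share one coordinate reference system, which in practice is a step of its own.

```python
import math


def colorize_points(points, image, origin_x, origin_y, pixel_size):
    """Attach the value of the containing image pixel to each point.

    points: iterable of (x, y, z) in the same CRS as the image
    image: list of rows; row 0 is the top (northernmost) row
    origin_x, origin_y: coordinates of the image's top-left corner
    pixel_size: ground size of one square pixel
    Points that fall outside the image are skipped.
    """
    n_rows, n_cols = len(image), len(image[0])
    fused = []
    for x, y, z in points:
        # x grows to the right (columns), y grows upward (so rows count down).
        col = math.floor((x - origin_x) / pixel_size)
        row = math.floor((origin_y - y) / pixel_size)
        if 0 <= row < n_rows and 0 <= col < n_cols:
            fused.append((x, y, z, image[row][col]))
    return fused


# Toy 2x2 "SAR image" covering x in [0, 2) and y in (0, 2].
sar_image = [[0.1, 0.2],
             [0.3, 0.4]]

# Toy "LiDAR points"; the last one lies outside the image footprint.
lidar_points = [(0.5, 1.5, 10.0), (1.5, 0.5, 12.0), (5.0, 5.0, 9.0)]

fused = colorize_points(lidar_points, sar_image,
                        origin_x=0.0, origin_y=2.0, pixel_size=1.0)
```

Each fused tuple carries the original x, y, z plus a SAR value, which is the shape of record a colored point cloud or a labeled training sample would need.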

Watch the webinar video to get a taste of the power TileDB Cloud grants you in your SAR and LiDAR analysis.

Data and Notebooks

The example notebooks are publicly available on TileDB Cloud and can be accessed here. You can freely download the notebooks, or simply run them directly inside the TileDB Cloud platform with a click of a button (you can sign up and contact us for free credits). Some of the notebooks make use of data from the Capella Space Open Data program, so you’ll need to request access directly from Capella Space (or contact us if you need help with that).

A few final remarks:

  • Sign up for the Capella Space SAR Open Data Program to access the Capella Space data.

  • Learn more about TileDB’s vision for universal data management, described in detail in this webinar.

  • Check out this TileDB Embedded webinar if you are interested in the internal mechanics that allow TileDB to store data universally.

  • We are hiring! If you liked what you saw and you feel that you are a good fit, please apply today.

  • Please follow us on Twitter or participate in our forum. We would like to hear from you so that we can get better.

Last but not least, a huge thank you to our awesome team for all the amazing work!


Meet the authors

Stavros Papadopoulos

Founder and CEO, TileDB