TileDB Embedded is the only storage engine architected around dense and sparse multi-dimensional arrays, which can model any data, such as dataframes, genomic variants, images, video and many more. It is open source under the MIT License. See TileDB Embedded use cases.
TileDB Embedded delivers superb performance for storing and accessing both dense and sparse arrays. Slice data from any backend in an out-of-core fashion, with parallel I/O to speed up operations and built-in compression and encryption.
TileDB Embedded provides rapid updates by embracing immutability, which makes it ideal for cloud object stores. Immutability also enables data versioning and time traveling, which are important for data science and for applications that require auditability.
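To make the link between immutability and time traveling concrete, here is a minimal toy model in plain Python. It is a sketch under stated assumptions, not TileDB's implementation or API: the names `Fragment`, `ToyVersionedArray`, `write`, and `read` are hypothetical, and integer timestamps stand in for real write times. Each update is appended as a new immutable batch, and a read replays only the batches written at or before the requested time.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple

Coord = Tuple[int, int]


@dataclass(frozen=True)
class Fragment:
    """One immutable batch of cell writes, stamped with its write time."""
    timestamp: int
    cells: Tuple[Tuple[Coord, int], ...]


class ToyVersionedArray:
    """Hypothetical append-only 2D array; illustration only, not a TileDB class."""

    def __init__(self) -> None:
        self._fragments: List[Fragment] = []

    def write(self, timestamp: int, cells: Dict[Coord, int]) -> None:
        # An update never modifies earlier fragments; it only appends a new one.
        self._fragments.append(Fragment(timestamp, tuple(cells.items())))

    def read(self, at: Optional[int] = None) -> Dict[Coord, int]:
        # Replay fragments in timestamp order, stopping at those newer than `at`.
        view: Dict[Coord, int] = {}
        for frag in sorted(self._fragments, key=lambda f: f.timestamp):
            if at is not None and frag.timestamp > at:
                break
            view.update(dict(frag.cells))
        return view


arr = ToyVersionedArray()
arr.write(1, {(0, 0): 10, (0, 1): 20})  # initial version
arr.write(2, {(0, 1): 99})              # later update to a single cell

latest = arr.read()       # all fragments applied
as_of_1 = arr.read(at=1)  # the array as it looked at time 1
```

Because earlier batches are never rewritten, the older version stays readable after the update, which is the property that versioning and auditing rely on.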
TileDB Embedded is built in C++ and exposes APIs in multiple languages, including C, C++, Python, R, Java, and Go. It also integrates with distributed compute frameworks such as Spark and Dask, SQL engines (MariaDB and PrestoDB), and geospatial libraries (PDAL and GDAL).
TileDB Embedded is built with cloud object store challenges in mind, such as eventual consistency. It supports AWS S3, Google Cloud Storage, and Azure Blob Storage. Any TileDB array works identically whether it is stored locally or in the cloud.
TileDB Embedded is a deep technology, and there is a lot more to explore. Get started with our docs and check back for updates.
We invite you to join our Slack channel. Bring your questions, get answers, and become part of the TileDB community.