vortex

Vortex

Vortex is an extensible, state-of-the-art format for columnar data. It includes specifications & tools for manipulating possibly-compressed arrays in-memory, on-disk (file format), and over-the-wire (IPC format). Vortex is built around the latest research from the database community.

In-memory

Vortex in-memory arrays support:

On-disk

Vortex ships with an extensible file format supporting:

Over-the-wire

Vortex defines a work-in-progress IPC format for sending possibly compressed arrays over the wire.

Extensibility

Vortex is designed to be incredibly extensible. Almost all reader and writer logic is extensible at compile-time by providing various implementations of Rust traits, and encodings and layouts are extensible at runtime with dynamically loaded libraries or WebAssembly kernels.

Please reach out to us if you’d like to extend Vortex with your own encodings, layouts, or other functionality.

Concepts

It can be useful to view Vortex as an ecosystem of building blocks rather than a singular specific format. Almost everything in Vortex is extensible, enabling it to be used for both general-purpose columnar data processing, and niche embedded use-cases where specific encodings and performance characteristics are required.

This section of the documentation covers the core concepts of Vortex and how they fit together.

---
maxdepth: 3
includehidden:
caption: Concepts
---

Arrays <concepts/arrays>
Layouts <concepts/layouts>
Data Types <concepts/dtypes>
Compute <concepts/compute>

Quickstarts

Vortex is currently available for both Python and Rust. To get started, we recommend the language-specific quickstarts.

---
maxdepth: 1
includehidden:
caption: Quickstarts
---

Python <quickstart/python>
Rust <quickstart/rust>

API Documentation

Vortex is primarily written in Rust, and the Rust API is the most complete API for Vortex. The Vortex Python bindings provide a more usable Python interface to the Vortex Rust library.

---
maxdepth: 2
caption: API Documentation
---

Python API <api/python/index>
Rust API <https://docs.rs/vortex>

User Guides

These user guides provide end-to-end overviews of how to use or extend Vortex in your own projects. We intend to extend this collection of guides to cover more use-cases in the future.

---
maxdepth: 1
caption: User Guides
---

guides/python-integrations
guides/writing-an-encoding

Specifications

Vortex currently defines two serialization formats: a file format and an IPC format. The file format is designed for efficient access to Vortex Layouts on disk, while the IPC format is designed to send possibly compressed Vortex Arrays over the wire.

---
maxdepth: 3
includehidden:
caption: Specifications
---

specs/file-format
specs/ipc-format
specs/dtype-format
---
hidden:
caption: Project Links
---

references
Spiral <https://spiraldb.com>
GitHub <https://github.com/spiraldb/vortex>
PyPI <https://pypi.org/project/vortex-array>
Crates <https://crates.io/crates/vortex>