Sail by lakehq

Drop-in Apache Spark replacement written in Rust, unifying batch processing, stream processing, and compute-intensive AI workloads.

arrowbig-datapysparkrustsparksqldatafusionpythonartificial-intelligencedata-engineeringdistributed-computingmachine-learning
Verdict 65/100 health $4.13/mo cheapest, hetzner 2/5 setup difficulty Last release 2 months ago

Self-host Sail on hetzner CAX11 for $4.13/mo.

Health score
65 /100
6-dim composite
Self-hosts from
$4.13 /mo
hetzner · CAX11
Difficulty
2 /5
Docker + read README
GitHub stars
2.4k
131 forks

About Sail

From the project's README at github.com/lakehq/sail. Lightly cleaned for readability; for the full source see the upstream repo.

[](https://github.com/lakehq/sail/actions) [](https://app.codecov.io/gh/lakehq/sail) [](https://pypi.org/project/pysail/) [](https://www.launchpass.com/lakesail-community/free)

Sail is a drop-in Apache Spark replacement written in Rust, unifying batch processing, stream processing, and compute-intensive AI workloads on a distributed, multimodal compute engine. Compatible with the Spark Connect protocol, supporting the Spark SQL and DataFrame API with no code rewrites required. 100% Rust-native with no JVM overhead, delivering memory safety, instant startup, and predictable performance. ~4× faster (up to 8× in specific workloads) than Spark and 94% cheaper on infrastructure costs. See derived TPC-H benchmarks. Proven on ClickBench, outperforming Spark, popular Spark accelerators, Databricks, and Snowflake. Documentation

The documentation of the latest Sail version can be found here. Installation Quick Start

Sail is available as a Python package on PyPI. You can install it along with PySpark in your Python e

Health score breakdown

6-dimension composite. See methodology for formula and weights.

activity
68
maturity
89
community
79
security
70
sustainability
65
adoption
29

Adoption signals

Real-world usage data, pulled from each registry. The bigger the numbers, the more battle-tested the project.

SignalValueSource
GitHub stars 2.4k github.com/lakehq/sail
GitHub forks 131 github.com/lakehq/sail
CRATES downloads (last month) 5 sail

Release & maintenance

Is this project actively maintained, or about to die? Check the recency of last commit and last release.

Project age2.5 yearssince Dec 2023
Last commit2 months agoMay 4, 2026
Releases shipped21last: 2 months ago

Self-hosting cost across providers

Detected requirements: 4GB RAM, 40GB disk minimum. Cheapest plan per provider that meets the requirement.

ProviderPlanSpecsMonthly
hetzner CAX11 2c · 4GB · 40GB $4.13 USD Deploy →
vultr VC2 1c · 1GB · 25GB $5 USD Deploy →
linode Nanode 1GB 1c · 1GB · 25GB $5.12 USD Deploy →
digitalocean Basic Regular 1GB 1c · 1GB · 25GB $6 USD Deploy →

What people say on Hacker News

Ready to self-host Sail?

Spin up a hetzner CAX11 (4GB RAM, 40GB disk) for $4.13/mo and follow the project's official install docs.

Data last refreshed Jun 21, 2026.

Similar open-source projects

Projects in our directory that replace the same SaaS or share topics with Sail.

Frequently asked questions

Last verified . Health scores and costs are computed from public data.