deltalake0.21.0
Published
Native Delta Lake Python binding based on delta-rs with Pandas integration
pip install deltalake
Package Downloads
Project URLs
Requires Python
>=3.8
Dependencies
- pyarrow
>=16
- pandas
; extra == "pandas"
- azure-storage-blob
==12.20.0; extra == "devel"
- packaging
>=20; extra == "devel"
- pytest
; extra == "devel"
- pytest-mock
; extra == "devel"
- pytest-cov
; extra == "devel"
- pytest-timeout
; extra == "devel"
- sphinx
<=4.5; extra == "devel"
- sphinx-rtd-theme
; extra == "devel"
- toml
; extra == "devel"
- wheel
; extra == "devel"
- pytest-benchmark
; extra == "devel"
- mypy
==1.10.1; extra == "devel"
- ruff
==0.5.2; extra == "devel"
- pyspark
; extra == "pyspark"
- delta-spark
; extra == "pyspark"
- numpy
==1.22.2; extra == "pyspark"
Deltalake-python
Native Delta Lake Python binding based on delta-rs with Pandas integration.
Example
from deltalake import DeltaTable
dt = DeltaTable("../rust/tests/data/delta-0.2.0")
dt.version()
3
dt.files()
['part-00000-cb6b150b-30b8-4662-ad28-ff32ddab96d2-c000.snappy.parquet',
'part-00000-7c2deba3-1994-4fb8-bc07-d46c948aa415-c000.snappy.parquet',
'part-00001-c373a5bd-85f0-4758-815e-7eb62007a15c-c000.snappy.parquet']
See the user guide for more examples.
Installation
pip install deltalake
NOTE: official binary wheels are linked against openssl statically for remote objection store communication. Please file Github issue to request for critical openssl upgrade.
Build custom wheels
Sometimes you may wish to build custom wheels. Maybe you want to try out some unreleased features. Or maybe you want to tweak the optimization of the Rust code.
To compile the package, you will need the Rust compiler and maturin:
curl https://sh.rustup.rs -sSf | sh -s
pip install maturin
Then you can build wheels for your own platform like so:
maturin build --release --out wheels
For a build that is optimized for the system you are on (but sacrificing portability):
RUSTFLAGS="-C target-cpu=native" maturin build --release --out wheels
Cross compilation
The above command only works for your current platform. To create wheels for other
platforms, you'll need to cross compile. Cross compilation requires installing
two additional components: to cross compile Rust code, you will need to install
the target with rustup
; to cross compile the Python bindings, you will need
to install ziglang
.
The following example is for manylinux2014. Other targets will require different
Rust target
and Python compatibility
tags.
rustup target add x86_64-unknown-linux-gnu
pip install ziglang
Then you can build the wheel with:
maturin build --release --zig \
--target x86_64-unknown-linux-gnu \
--compatibility manylinux2014 \
--out wheels
If you expect to only run on more modern system, you can set a newer target-cpu
flag to Rust and use a newer compatibility tag for Linux. For example, here
we set compatibility with CPUs newer than Haswell (2013) and Linux OS with
glibc version of at least 2.24:
RUSTFLAGS="-C target-cpu=haswell" maturin build --release --zig \
--target x86_64-unknown-linux-gnu \
--compatibility manylinux_2_24 \
--out wheels
See note about RUSTFLAGS
from the arrow-rs readme.