fsspec 2024.12.0
File-system specification
pip install fsspec
Requires Python >=3.8
Dependencies
- adlfs; extra == "abfs"
- adlfs; extra == "adl"
- pyarrow>=1; extra == "arrow"
- dask; extra == "dask"
- distributed; extra == "dask"
- pre-commit; extra == "dev"
- ruff; extra == "dev"
- numpydoc; extra == "doc"
- sphinx; extra == "doc"
- sphinx-design; extra == "doc"
- sphinx-rtd-theme; extra == "doc"
- yarl; extra == "doc"
- dropbox; extra == "dropbox"
- dropboxdrivefs; extra == "dropbox"
- requests; extra == "dropbox"
- adlfs; extra == "full"
- aiohttp!=4.0.0a0,!=4.0.0a1; extra == "full"
- dask; extra == "full"
- distributed; extra == "full"
- dropbox; extra == "full"
- dropboxdrivefs; extra == "full"
- fusepy; extra == "full"
- gcsfs; extra == "full"
- libarchive-c; extra == "full"
- ocifs; extra == "full"
- panel; extra == "full"
- paramiko; extra == "full"
- pyarrow>=1; extra == "full"
- pygit2; extra == "full"
- requests; extra == "full"
- s3fs; extra == "full"
- smbprotocol; extra == "full"
- tqdm; extra == "full"
- fusepy; extra == "fuse"
- gcsfs; extra == "gcs"
- pygit2; extra == "git"
- requests; extra == "github"
- gcsfs; extra == "gs"
- panel; extra == "gui"
- pyarrow>=1; extra == "hdfs"
- aiohttp!=4.0.0a0,!=4.0.0a1; extra == "http"
- libarchive-c; extra == "libarchive"
- ocifs; extra == "oci"
- s3fs; extra == "s3"
- paramiko; extra == "sftp"
- smbprotocol; extra == "smb"
- paramiko; extra == "ssh"
- aiohttp!=4.0.0a0,!=4.0.0a1; extra == "test"
- numpy; extra == "test"
- pytest; extra == "test"
- pytest-asyncio!=0.22.0; extra == "test"
- pytest-benchmark; extra == "test"
- pytest-cov; extra == "test"
- pytest-mock; extra == "test"
- pytest-recording; extra == "test"
- pytest-rerunfailures; extra == "test"
- requests; extra == "test"
- aiobotocore<3.0.0,>=2.5.4; extra == "test-downstream"
- dask-expr; extra == "test-downstream"
- dask[dataframe,test]; extra == "test-downstream"
- moto[server]<5,>4; extra == "test-downstream"
- pytest-timeout; extra == "test-downstream"
- xarray; extra == "test-downstream"
- adlfs; extra == "test-full"
- aiohttp!=4.0.0a0,!=4.0.0a1; extra == "test-full"
- cloudpickle; extra == "test-full"
- dask; extra == "test-full"
- distributed; extra == "test-full"
- dropbox; extra == "test-full"
- dropboxdrivefs; extra == "test-full"
- fastparquet; extra == "test-full"
- fusepy; extra == "test-full"
- gcsfs; extra == "test-full"
- jinja2; extra == "test-full"
- kerchunk; extra == "test-full"
- libarchive-c; extra == "test-full"
- lz4; extra == "test-full"
- notebook; extra == "test-full"
- numpy; extra == "test-full"
- ocifs; extra == "test-full"
- pandas; extra == "test-full"
- panel; extra == "test-full"
- paramiko; extra == "test-full"
- pyarrow; extra == "test-full"
- pyarrow>=1; extra == "test-full"
- pyftpdlib; extra == "test-full"
- pygit2; extra == "test-full"
- pytest; extra == "test-full"
- pytest-asyncio!=0.22.0; extra == "test-full"
- pytest-benchmark; extra == "test-full"
- pytest-cov; extra == "test-full"
- pytest-mock; extra == "test-full"
- pytest-recording; extra == "test-full"
- pytest-rerunfailures; extra == "test-full"
- python-snappy; extra == "test-full"
- requests; extra == "test-full"
- smbprotocol; extra == "test-full"
- tqdm; extra == "test-full"
- urllib3; extra == "test-full"
- zarr; extra == "test-full"
- zstandard; extra == "test-full"
- tqdm; extra == "tqdm"
filesystem_spec
A specification for pythonic filesystems.
Install
pip install fsspec installs the base fsspec package. Some optional features need additional dependencies, which are selected with an extra: for example, pip install fsspec[ssh] installs the dependencies for the ssh backend.
Use pip install fsspec[full] to install all known extra dependencies.
An up-to-date package is also available from the conda-forge channel:
conda install -c conda-forge fsspec
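Because each extra only pulls in ordinary Python packages, one quick way to see whether a backend's dependencies are present is to check that the corresponding modules can be found. The sketch below is illustrative and not part of fsspec: EXTRA_MODULES and missing_modules are hypothetical names, and the table covers only a handful of extras, taken from the dependency list above on the assumption that each package's import name matches its distribution name.

```python
from importlib.util import find_spec

# Hypothetical, partial table: extra name -> top-level modules it should provide.
# Assumes the import name equals the distribution name for these packages.
EXTRA_MODULES = {
    "ssh": ["paramiko"],
    "sftp": ["paramiko"],
    "s3": ["s3fs"],
    "gcs": ["gcsfs"],
    "http": ["aiohttp"],
}


def missing_modules(extra, extra_modules=EXTRA_MODULES):
    """Return the modules for `extra` that cannot be found in this environment."""
    return [name for name in extra_modules[extra] if find_spec(name) is None]


if __name__ == "__main__":
    for extra in sorted(EXTRA_MODULES):
        missing = missing_modules(extra)
        status = "ok" if not missing else "missing " + ", ".join(missing)
        print(f"fsspec[{extra}]: {status}")
```

An empty list means every module listed for that extra was found; anything else suggests that pip install fsspec[<extra>] has not been run in the current environment.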
Purpose
To produce a template or specification for a file-system interface that specific implementations should follow, so that applications making use of them can rely on common behaviour and need not worry about the internal implementation decisions of any given backend. Many such implementations are included in this package, or in sister projects such as s3fs and gcsfs.
In addition, if this is well-designed, then additional functionality, such as a key-value store or FUSE mounting of the file-system implementation, may be available for all implementations "for free".
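As a rough illustration of this idea, the sketch below writes a helper against the generic interface only and runs it on the in-memory backend that ships with fsspec. The helper name count_bytes and the path /fsspec-readme-demo are made up for this example; any other backend could in principle be substituted for "memory", although that is not shown here.

```python
import fsspec


def count_bytes(fs, path):
    """Sum the sizes of all files under `path` using only generic filesystem calls."""
    return sum(len(fs.cat_file(p)) for p in fs.find(path))


# The in-memory backend needs no extra dependencies or credentials.
fs = fsspec.filesystem("memory")
fs.pipe("/fsspec-readme-demo/a.txt", b"hello")
fs.pipe("/fsspec-readme-demo/b.txt", b"world!")

total = count_bytes(fs, "/fsspec-readme-demo")

# The same filesystem viewed as a key-value store: keys are paths relative
# to the root, values are the file contents as bytes.
mapper = fs.get_mapper("/fsspec-readme-demo")
keys = sorted(mapper)

print(total, keys, mapper["a.txt"])
```

Nothing in count_bytes refers to the memory backend, which is the point of the specification: the helper relies only on methods that every implementation is expected to provide. The mapping view is an example of the functionality that comes "for free" on top of that interface.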
Documentation
Please refer to the documentation on Read the Docs (RTD).
Develop
fsspec uses GitHub Actions for CI. Environment files can be found in the "ci/" directory. Note that the main environment is called "py38", but the installed Python version is expected to be adjustable at CI runtime. For local use, pick a version that suits you.
# For a new environment (mamba / conda).
mamba create -n fsspec -c conda-forge python=3.9 -y
conda activate fsspec
# Standard dev install with docs and tests.
pip install -e ".[dev,doc,test]"
# Full tests except for downstream
pip install s3fs
pip uninstall s3fs
pip install -e ".[dev,doc,test_full]"
pip install s3fs --no-deps
pytest -v
# Downstream tests.
sh install_s3fs.sh
# Windows powershell.
install_s3fs.sh
Testing
Tests can be run in the dev environment, if activated, via pytest fsspec.
The full fsspec suite requires a system-level docker, docker-compose, and fuse installation. If only making changes to one backend implementation, it is not generally necessary to run all tests locally.
Contributors are expected to ensure that any change to fsspec does not cause issues or regressions, either for other fsspec-related packages such as gcsfs and s3fs or for downstream users of fsspec. The "downstream" CI run and corresponding environment file run a set of tests from the dask test suite, and very minimal tests against pandas and zarr from the test_downstream.py module in this repo.
Code Formatting
fsspec uses Black to ensure a consistent code format throughout the project. Run black fsspec from the root of the filesystem_spec repository to auto-format your code. Additionally, many editors have plugins that will apply black as you edit files. black is included in the tox environments.
Optionally, you may wish to set up pre-commit hooks to automatically run black when you make a git commit. Run pre-commit install --install-hooks from the root of the filesystem_spec repository to set up pre-commit hooks. black will now be run before you commit, reformatting any changed files. You can format without committing via pre-commit run or skip these checks with git commit --no-verify.