docling2.66.0
docling2.66.0
Published
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications.
pip install docling
Package Downloads
Project URLs
Requires Python
<4.0,>=3.9
Dependencies
- pydantic
<3.0.0,>=2.0.0 - docling-core
[chunking]<3.0.0,>=2.50.1 - docling-parse
<5.0.0,>=4.7.0 - docling-ibm-models
<4,>=3.9.1 - filetype
<2.0.0,>=1.2.0 - pypdfium2
!=4.30.1,<5.0.0,>=4.30.0 - pydantic-settings
<3.0.0,>=2.3.0 - huggingface_hub
<1,>=0.23 - requests
<3.0.0,>=2.32.2 - ocrmac
<2.0.0,>=1.0.0; sys_platform == "darwin" - rapidocr
<4.0.0,>=3.3 - certifi
>=2024.7.4 - rtree
<2.0.0,>=1.3.0 - typer
<0.20.0,>=0.12.5 - python-docx
<2.0.0,>=1.1.2 - python-pptx
<2.0.0,>=1.0.2 - beautifulsoup4
<5.0.0,>=4.12.3 - pandas
<3.0.0,>=2.1.4 - marko
<3.0.0,>=2.1.2 - openpyxl
<4.0.0,>=3.1.5 - lxml
<7.0.0,>=4.0.0 - pillow
<12.0.0,>=10.0.0 - tqdm
<5.0.0,>=4.65.0 - pluggy
<2.0.0,>=1.0.0 - pylatexenc
<3.0,>=2.10 - scipy
<2.0.0,>=1.6.0 - accelerate
<2,>=1.0.0 - polyfactory
>=2.22.2 - easyocr
<2.0,>=1.7; extra == "easyocr" - tesserocr
<3.0.0,>=2.7.1; extra == "tesserocr" - ocrmac
<2.0.0,>=1.0.0; sys_platform == "darwin" and extra == "ocrmac" - transformers
<5.0.0,>=4.46.0; extra == "vlm" - accelerate
<2.0.0,>=1.2.1; extra == "vlm" - mlx-vlm
<1.0.0,>=0.3.0; (python_version >= "3.10" and python_version < "3.14" and sys_platform == "darwin" and platform_machine == "arm64") and extra == "vlm" - vllm
<1.0.0,>=0.10.0; (python_version >= "3.10" and python_version < "3.14" and sys_platform == "linux" and platform_machine == "x86_64") and extra == "vlm" - qwen-vl-utils
>=0.0.11; extra == "vlm" - rapidocr
<4.0.0,>=3.3; extra == "rapidocr" - onnxruntime
<2.0.0,>=1.7.0; python_version < "3.14" and extra == "rapidocr" - mlx-whisper
>=0.4.3; (python_version >= "3.10" and python_version < "3.14" and sys_platform == "darwin" and platform_machine == "arm64") and extra == "asr" - openai-whisper
>=20250625; python_version < "3.14" and extra == "asr"
Docling
Docling simplifies document processing, parsing diverse formats โ including advanced PDF understanding โ and providing seamless integrations with the gen AI ecosystem.
Features
- ๐๏ธ Parsing of multiple document formats incl. PDF, DOCX, PPTX, XLSX, HTML, WAV, MP3, VTT, images (PNG, TIFF, JPEG, ...), and more
- ๐ Advanced PDF understanding incl. page layout, reading order, table structure, code, formulas, image classification, and more