Capture / Import
Point your phone or webcam at the page and shoot — hands-free with voice control, no flatbed, no app-switching. Already have files? Import your PDFs and images straight in.
The powerful, fast & free book scanner
From your phone camera to a curated, searchable PDF. At your desk, in the library, or wherever you are. No flatbed, no bulky rig: just you, your laptop and your phone.
Linux and Windows versions coming soon…
You don't have to endure endless flatbed scanning sessions either.
Curved pages, skew, perspective, uneven illumination, and dozens of other issues that make you go « eewww… » when you look at your documents.
Aglaïa turns the camera and the laptop you already have into a proper scanner.
All automatic. Fully customizable.
The following examples were all run with default settings. No fiddling needed.
Whether you're scanning books, spiral-bound reports, or loose sheets of paper, there's a specially curated pipeline for you.
Isolate the text content, straighten it, and remove perspective and page-curvature distortion. Make your text pop with advanced binarization techniques.
Fast and coarse or slow and precise; local (Apple Vision, Surya, PaddleOCR-VL) or cloud-based (Mistral OCR) — Aglaïa takes care of your needs.
A clean, searchable PDF for your digital archives or printouts? A well-structured Markdown file to feed your favorite LLM workflows? Aglaïa gives you what you need.
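To make the binarization step mentioned above more concrete, here is a minimal sketch of one classic technique, Otsu's global threshold, written with NumPy only. It is an illustration under stated assumptions, not Aglaïa's actual algorithm (the page does not say which methods it uses); the function names `otsu_threshold` and `binarize` and the synthetic test page are hypothetical.

```python
import numpy as np

def otsu_threshold(gray: np.ndarray) -> int:
    """Return Otsu's threshold for an 8-bit grayscale image (illustrative only)."""
    hist = np.bincount(gray.ravel(), minlength=256).astype(np.float64)
    prob = hist / hist.sum()
    levels = np.arange(256)
    omega = np.cumsum(prob)            # probability of the "dark" class up to each level
    mu = np.cumsum(prob * levels)      # cumulative mean intensity up to each level
    mu_total = mu[-1]
    denom = omega * (1.0 - omega)
    with np.errstate(divide="ignore", invalid="ignore"):
        sigma_b = (mu_total * omega - mu) ** 2 / denom  # between-class variance
    sigma_b[denom == 0] = 0.0          # levels where one class is empty
    return int(np.argmax(sigma_b))

def binarize(gray: np.ndarray) -> np.ndarray:
    """Map pixels brighter than the threshold to white (255), the rest to black (0)."""
    t = otsu_threshold(gray)
    return np.where(gray > t, 255, 0).astype(np.uint8)

# Hypothetical stand-in for a scanned page: light paper with a dark "glyph" block.
page = np.full((32, 32), 200, dtype=np.uint8)
page[8:24, 12:20] = 40
threshold = otsu_threshold(page)
binary = binarize(page)
```

A single global threshold like this tends to struggle with uneven illumination, which is why real scanning pipelines often prefer locally adaptive methods; the sketch only shows the basic idea of separating ink from paper.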
Put your phone on your desk, your book on a chair, et voilà !
With a typical throughput of 2–4 pages processed per second and under 2 seconds of latency per scan, you are unlikely to turn pages fast enough to overwhelm Aglaïa.
Smart replay — no iterative distortion.
Deskew, keystone correction and dewarp each resample the image. Chain them naïvely and every pass softens and warps the pixels a little more — glyph edges crumble.
Aglaïa keeps track of the operations and at the end composes the perfect blend of geometric and morphological transforms to give you the sharpest result.
Every stage is persisted, inspectable and replayable.
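The idea of composing transforms instead of chaining resampling passes can be sketched with plain matrix algebra. The example below is a simplified illustration, not Aglaïa's implementation: it assumes deskew and keystone correction can each be written as a 3x3 homogeneous matrix, and the helper names (`rotation_about`, `keystone`, `apply_homography`) and parameter values are hypothetical. A non-linear page dewarp does not fit in a single matrix and would require composing coordinate maps instead.

```python
import numpy as np

def rotation_about(deg: float, cx: float, cy: float) -> np.ndarray:
    """3x3 homogeneous matrix rotating by `deg` degrees about the point (cx, cy)."""
    t = np.deg2rad(deg)
    c, s = np.cos(t), np.sin(t)
    return np.array([
        [c, -s, cx - c * cx + s * cy],
        [s,  c, cy - s * cx - c * cy],
        [0.0, 0.0, 1.0],
    ])

def keystone(kx: float, ky: float) -> np.ndarray:
    """3x3 projective matrix with a simple perspective term (illustrative only)."""
    return np.array([
        [1.0, 0.0, 0.0],
        [0.0, 1.0, 0.0],
        [kx,  ky,  1.0],
    ])

def apply_homography(h: np.ndarray, pts: np.ndarray) -> np.ndarray:
    """Apply a 3x3 homography to an (N, 2) array of points."""
    homog = np.hstack([pts, np.ones((len(pts), 1))]) @ h.T
    return homog[:, :2] / homog[:, 2:3]

# Hypothetical page corners, in pixels.
corners = np.array([[0.0, 0.0], [1000.0, 0.0], [1000.0, 1400.0], [0.0, 1400.0]])

h_deskew = rotation_about(1.5, 500.0, 700.0)
h_keystone = keystone(1e-5, 2e-5)

# Naive chain: two separate mappings (for an image, two resampling passes).
two_pass = apply_homography(h_keystone, apply_homography(h_deskew, corners))

# Composed: multiply the matrices once, then map (or resample) a single time.
h_total = h_keystone @ h_deskew
one_pass = apply_homography(h_total, corners)
```

Both routes send the corners to the same place, but with the composed matrix the image itself would only be interpolated once, for example with a single call to a warp function such as OpenCV's `cv2.warpPerspective`.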





Unleash the computer-vision and data-science nerd within you.
Run the end-to-end pipeline from the command line.
The task at hand gets specific? Build your own processing elements in Python.
Have a favorite OCR provider? Need to export to cloud storage? Write your own adapters.
# clone, then sync with the extras you want
git clone https://github.com/yb85/aglaia.git && cd aglaia
uv sync --extra gui --extra macos # macOS GUI: Vision, Speech, MLX dewarp
# add any of:
# --extra jbig2 # smallest searchable PDFs (JBIG2 encoder, needs Rust)
# --extra cloud # Mistral cloud OCR
# --extra cuda # NVIDIA GPU dewarp (Linux)
uv run python aglaia.py ~/scans/my-book

# add the Aglaïa tap, then install the app
brew tap yb85/aglaia https://github.com/yb85/aglaia
brew install --cask aglaia

Free, source-available*, and yours to extend.