ht-docker-ai

Author	SHA1	Message	Date
Juergen Kunz	3780105c6f	feat(vision): add Qwen3-VL vision model support with Dockerfile and tests; improve invoice OCR conversion and prompts; simplify extraction flow by removing consensus voting	2026-01-18 03:35:05 +00:00
Juergen Kunz	7652a2df52	feat(tests): add Ministral 3 vision tests and improve invoice extraction pipeline to use Ollama chat schema, sanitization, and multi-page support	2026-01-18 02:53:24 +00:00
Juergen Kunz	f0d88fcbe0	feat(paddleocr-vl): add structured HTML output and table parsing for PaddleOCR-VL, update API, tests, and README	2026-01-18 00:11:17 +00:00
Juergen Kunz	5a311dca2d	fix(docker): standardize Dockerfile and entrypoint filenames; add GPU-specific Dockerfiles and update build and test references	2026-01-17 23:13:47 +00:00
Juergen Kunz	30c73b24c1	feat(tests): use Qwen2.5 (Ollama) for invoice extraction tests and add helpers for model management; normalize dates and coerce numeric fields	2026-01-17 21:50:09 +00:00
Juergen Kunz	80e6866442	feat(paddleocr-vl): add PaddleOCR-VL full pipeline Docker image and API server, plus integration tests and docker helpers	2026-01-17 20:22:23 +00:00
Juergen Kunz	0482c35b69	feat(paddleocr-vl): add PaddleOCR-VL GPU Dockerfile, pin vllm, update CPU image deps, and improve entrypoint and tests	2026-01-17 16:57:26 +00:00
Juergen Kunz	15ac1fcf67	update	2026-01-16 16:21:44 +00:00
Juergen Kunz	82358b2d5d	feat(invoices): add hybrid OCR + vision invoice/document parsing with PaddleOCR, consensus voting, and prompt/test refactors	2026-01-16 14:24:37 +00:00
Juergen Kunz	bec379e9ca	feat(paddleocr): add PaddleOCR OCR service (Docker images, server, tests, docs) and CI workflows	2026-01-16 13:23:01 +00:00
Juergen Kunz	379b5c19eb	feat(ocr): add PaddleOCR GPU Docker image and FastAPI OCR server with entrypoint; implement OCR endpoints and consensus extraction testing	2026-01-16 10:22:15 +00:00
Juergen Kunz	3dc1881d8b	update	2026-01-16 03:58:39 +00:00

12 Commits