feat(invoices): add hybrid OCR + vision invoice/document parsing with PaddleOCR, consensus voting, and prompt/test refactors

2026-01-16 14:24:37 +00:00
parent acded2a165
commit 82358b2d5d
4 changed files with 380 additions and 109 deletions
--- a/changelog.md
+++ b/changelog.md
@@ -1,5 +1,14 @@
 # Changelog

+## 2026-01-16 - 1.4.0 - feat(invoices)
+add hybrid OCR + vision invoice/document parsing with PaddleOCR, consensus voting, and prompt/test refactors
+
+- Add hybrid pipeline documentation and examples (PaddleOCR + MiniCPM-V) and architecture diagram in recipes/document.md
+- Integrate PaddleOCR: new OCR extraction functions and OCR-only prompt flow in test/test.node.ts
+- Add consensus voting and parallel-pass optimization to improve reliability (multiple passes, hashing, and majority voting)
+- Refactor prompts and tests: introduce /nothink token, OCR truncation limits, separate visual and OCR-only prompts, and improved prompt building in test/test.invoices.ts
+- Update image conversion defaults (200 DPI, filename change) and add TypeScript helper functions for extraction and consensus handling
+
 ## 2026-01-16 - 1.3.0 - feat(paddleocr)
 add PaddleOCR OCR service (Docker images, server, tests, docs) and CI workflows