feat(paddleocr-vl): add structured HTML output and table parsing for PaddleOCR-VL, update API, tests, and README

2026-01-18 00:11:17 +00:00
parent 0d8a1ebac2
commit f0d88fcbe0
4 changed files with 486 additions and 82 deletions
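
The changelog entry in this commit says that markdown tables are converted to proper <table> elements. The code itself is not part of this excerpt, so the following is only a minimal sketch of that kind of conversion, not the implementation shipped here. The function name `markdown_table_to_html` is hypothetical, and the sketch assumes a simple pipe-delimited table whose first row is the header and whose cells contain no escaped pipes.

```python
from html import escape


def markdown_table_to_html(markdown: str) -> str:
    """Convert a simple pipe-delimited markdown table to an HTML <table>.

    Hypothetical helper for illustration only; it is not the repository's
    parse_markdown_table(). Returns an empty string if no table rows are found.
    """
    rows = []
    for line in markdown.strip().splitlines():
        line = line.strip()
        if not line.startswith("|"):
            continue  # ignore anything that is not a table row
        cells = [cell.strip() for cell in line.strip("|").split("|")]
        # Skip the header separator row, e.g. |---|:---:|
        if all(cell and set(cell) <= {"-", ":"} for cell in cells):
            continue
        rows.append(cells)

    if not rows:
        return ""

    header, body = rows[0], rows[1:]
    head = "".join(f"<th>{escape(cell)}</th>" for cell in header)
    lines = ["<table>", f"  <thead><tr>{head}</tr></thead>", "  <tbody>"]
    for row in body:
        tds = "".join(f"<td>{escape(cell)}</td>" for cell in row)
        lines.append(f"    <tr>{tds}</tr>")
    lines += ["  </tbody>", "</table>"]
    return "\n".join(lines)


if __name__ == "__main__":
    sample = "| Item | Price |\n|------|------:|\n| Pen | 1.50 |"
    print(markdown_table_to_html(sample))
```

Cell text is passed through `html.escape` so that characters such as `&` or `<` in OCR output cannot break the generated markup. Column alignment markers and multi-line cells are deliberately ignored in this sketch.
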
--- a/changelog.md
+++ b/changelog.md
@@ -1,5 +1,14 @@
 # Changelog

+## 2026-01-18 - 1.8.0 - feat(paddleocr-vl)
+add structured HTML output and table parsing for PaddleOCR-VL, update API, tests, and README
+
+- Add result_to_html(), parse_markdown_table(), and parse_paddleocr_table() to emit semantic HTML and convert OCR/markdown tables to proper <table> elements
+- Enhance result_to_markdown() with positional/type hints (header/footer/title/table/figure) to improve downstream LLM processing
+- Expose 'html' in supported formats and handle output_format='html' in parse endpoints and CLI flow
+- Update tests to request HTML output and extract invoice fields from structured HTML (test/test.invoices.paddleocr-vl.ts)
+- Refresh README with usage, new images/tags, architecture notes, and troubleshooting for the updated pipeline
+
 ## 2026-01-17 - 1.7.1 - fix(docker)
 standardize Dockerfile and entrypoint filenames; add GPU-specific Dockerfiles and update build and test references