fix(core): Improve PDF XML extraction, embedding, and format detection; update loadPdf/exportPdf error handling; add new validator implementations and enhance IPdf metadata.

This commit is contained in:
2025-04-04 12:14:41 +00:00
parent 68fd50fd4c
commit 5d43c1ce4e
15 changed files with 1957 additions and 418 deletions

View File

@ -1,5 +1,15 @@
# Changelog
## 2025-04-04 - 4.1.6 - fix(core)
Improve PDF XML extraction, embedding, and format detection; update loadPdf/exportPdf error handling; add new validator implementations and enhance IPdf metadata.
- Update loadPdf to capture extraction result details including detected format and improve error messaging
- Enhance TextXMLExtractor with a chunked approach using both UTF-8 and Latin-1 decoding for reliable text extraction
- Refactor PDFEmbedder to return a structured PDFEmbedResult with proper filename normalization and robust error handling
- Extend format detection logic by adding quickFormatCheck, isUBLFormat, isXRechnungFormat, isCIIFormat, isZUGFERDV1Format, and FatturaPA checks
- Introduce new validator classes (UBLValidator, XRechnungValidator, FatturaPAValidator) and a generic fallback validator in ValidatorFactory
- Update IPdf interface to include embedded XML metadata (format, filename, description) for better traceability
## 2025-04-03 - 4.1.5 - fix(core)
No uncommitted changes detected in the repository. The project files and functionality remain unchanged.