feat(providers): Add vision and document processing capabilities to providers

2025-02-03 15:26:00 +01:00
parent e82c510094
commit eda8ce36df
9 changed files with 212 additions and 6 deletions
--- a/changelog.md
+++ b/changelog.md
@@ -1,5 +1,13 @@
 # Changelog

+## 2025-02-03 - 0.1.0 - feat(providers)
+Add vision and document processing capabilities to providers
+
+- OpenAI and Ollama providers now support vision tasks using GPT-4 Vision and Llava models, respectively.
+- Document processing has been implemented for OpenAI and Ollama providers, converting PDFs to images for analysis.
+- Introduced abstract methods for vision and document processing in the MultiModalModel class.
+- Updated the readme file with examples for vision and document processing.
+
 ## 2025-02-03 - 0.0.19 - fix(core)
 Enhanced chat streaming and error handling across providers