smartai/readme.hints.md

# SmartAI Project Hints
## Architecture (v1.0.0 - Vercel AI SDK rewrite)
The package is a **provider registry** built on the Vercel AI SDK (`ai` v6). The core export returns a `LanguageModelV3` from `@ai-sdk/provider`. Specialized capabilities are in subpath exports.
### Core Entry (`ts/`)
- `getModel(options)` → returns `LanguageModelV3` for any supported provider
- Providers: anthropic, openai, google, groq, mistral, xai, perplexity, ollama
- Anthropic prompt caching via `wrapLanguageModel` middleware (enabled by default)
- Custom Ollama provider implementing `LanguageModelV3` directly (for think, num_ctx support)
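The provider-registry idea behind `getModel()` can be sketched roughly as below. The names (`ProviderName`, `ModelFactory`, `getModelSketch`) and the factory signature are illustrative assumptions, not the package's actual API; real factories would wrap the corresponding `@ai-sdk/*` package (plus the caching middleware for Anthropic).

```typescript
// Hypothetical sketch of a provider registry; not the real smartai API.
type ProviderName =
  | 'anthropic' | 'openai' | 'google' | 'groq'
  | 'mistral' | 'xai' | 'perplexity' | 'ollama';

// Stand-in for the LanguageModelV3 interface from @ai-sdk/provider.
interface LanguageModelLike {
  provider: ProviderName;
  modelId: string;
}

type ModelFactory = (modelId: string) => LanguageModelLike;

// One factory per supported provider.
const registry = new Map<ProviderName, ModelFactory>(
  (['anthropic', 'openai', 'google', 'groq',
    'mistral', 'xai', 'perplexity', 'ollama'] as ProviderName[])
    .map((p): [ProviderName, ModelFactory] =>
      [p, (modelId) => ({ provider: p, modelId })]),
);

function getModelSketch(provider: ProviderName, modelId: string): LanguageModelLike {
  const factory = registry.get(provider);
  if (!factory) throw new Error(`Unsupported provider: ${provider}`);
  return factory(modelId);
}
```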
### Subpath Exports
- `@push.rocks/smartai/vision` → `analyzeImage()` using `generateText` with image content
- `@push.rocks/smartai/audio` → `textToSpeech()` using OpenAI SDK directly
- `@push.rocks/smartai/image` → `generateImage()`, `editImage()` using OpenAI SDK directly
- `@push.rocks/smartai/document` → `analyzeDocuments()` using SmartPdf + `generateText`
- `@push.rocks/smartai/research` → `research()` using `@anthropic-ai/sdk` web_search tool
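For reference, the image-bearing message that a vision call passes to `generateText` looks roughly like this. The part field names follow the LanguageModelV3 conventions noted under Important Notes (`mediaType`, not `mimeType`); treat the exact shape as an assumption, not the package's confirmed wire format.

```typescript
// Illustrative message shape for a vision-style generateText call.
// Exact part fields are an assumption based on the V3 notes in this file.
const imageBytes = new Uint8Array([0x89, 0x50, 0x4e, 0x47]); // fake PNG header
const filePart = {
  type: 'file' as const,
  data: imageBytes,
  mediaType: 'image/png', // V3 name; V2 called this `mimeType`
};
const message = {
  role: 'user' as const,
  content: [
    { type: 'text' as const, text: 'Describe this image.' },
    filePart,
  ],
};
```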
## Dependencies
- `ai` ^6.0.116 — Vercel AI SDK core
- `@ai-sdk/*` — Provider packages (anthropic, openai, google, groq, mistral, xai, perplexity)
- `@ai-sdk/provider` ^3.0.8 — LanguageModelV3 types
- `@anthropic-ai/sdk` ^0.78.0 — Direct SDK for research (web search tool)
- `openai` ^6.25.0 — Direct SDK for audio TTS and image generation/editing
- `@push.rocks/smartpdf` ^4.1.3 — PDF to PNG conversion for document analysis
## Build
- `pnpm build` → `tsbuild tsfolders --allowimplicitany`
- Compiles: ts/, ts_vision/, ts_audio/, ts_image/, ts_document/, ts_research/
## Important Notes
- LanguageModelV3 uses `unified`/`raw` in FinishReason (not `type`/`rawType`)
- LanguageModelV3 system messages have `content: string` (not array)
- LanguageModelV3 file parts use `mediaType` (not `mimeType`)
- LanguageModelV3FunctionTool uses `inputSchema` (not `parameters`)
- Ollama `think` param goes at request body top level, not inside `options`
- Qwen models get default temperature 0.55 in the custom Ollama provider
- `qenv.getEnvVarOnDemand()` returns a Promise — must be awaited in tests
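The V3 field renames above can be captured as local type sketches. These mirror only the names listed in the notes; the real interfaces in `@ai-sdk/provider` carry more fields than shown here.

```typescript
// Local sketches of the V3 shapes noted above -- not the full upstream types.
interface SystemMessageV3 {
  role: 'system';
  content: string; // plain string in V3, not an array of parts
}

interface FilePartV3 {
  type: 'file';
  data: Uint8Array;
  mediaType: string; // V3 name; previously `mimeType`
}

interface FunctionToolV3 {
  type: 'function';
  name: string;
  inputSchema: Record<string, unknown>; // V3 name; previously `parameters`
}

const sys: SystemMessageV3 = { role: 'system', content: 'You are terse.' };
const part: FilePartV3 = {
  type: 'file',
  data: new Uint8Array(),
  mediaType: 'application/pdf',
};
const tool: FunctionToolV3 = {
  type: 'function',
  name: 'lookup',
  inputSchema: { type: 'object', properties: {} },
};
```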
## Testing
```bash
pnpm test # all tests
tstest test/test.smartai.ts --verbose # core tests
tstest test/test.ollama.ts --verbose # ollama provider tests (mocked, no API needed)
```