docs(readme): remove stale runtime references
+3
-4
@@ -3,7 +3,7 @@
 ## Project Overview
 
 ModelGrid is a root-level daemon that manages GPU infrastructure, Docker, and AI model containers
-(Ollama, vLLM, TGI) with an OpenAI-compatible API interface.
+(vLLM, TGI) with an OpenAI-compatible API interface.
 
 ## Architecture
 
@@ -84,13 +84,12 @@ ts/
 
 ### Greenlit Model System
 
-- Only pre-approved models can be auto-pulled for security
-- Greenlist fetched from remote URL (configurable)
+- Only catalog-listed models can be auto-deployed on demand
+- Catalog fetched from a remote URL (configurable)
 - VRAM requirements checked before loading
 
 ### Container Types
 
-- **Ollama**: Easy to use, native API converted to OpenAI format
 - **vLLM**: High performance, natively OpenAI-compatible
 - **TGI**: HuggingFace Text Generation Inference
 
+5
-6
@@ -26,9 +26,9 @@
 ┌─────────────────────────────────────────────────────────────────┐
 │ Container Runtime │
 │ ┌──────────┐ ┌──────────┐ ┌──────────┐ ┌──────────┐ │
-│ │ Ollama │ │ vLLM │ │ TGI │ │ Custom │ │
-│ │Container │ │Container │ │Container │ │Container │ │
-│ └──────────┘ └──────────┘ └──────────┘ └──────────┘ │
+│ │ vLLM │ │ TGI │ │ Custom │ │
+│ │Container │ │Container │ │Container │ │
+│ └──────────┘ └──────────┘ └──────────┘ │
 └─────────────────────────────────────────────────────────────────┘
 ```
 
@@ -116,8 +116,7 @@ modelgrid/
 │ │ │ └── embeddings.ts # /v1/embeddings
 │ │ └── middleware/ # Request processing
 │ │ ├── auth.ts # API key validation
-│ │ ├── sanity.ts # Request validation
-│ │ └── proxy.ts # Container proxy
+│ │ └── sanity.ts # Request validation
 │ ├── models/ # Model management
 │ │ ├── index.ts
 │ │ ├── registry.ts # Model registry
@@ -177,7 +176,7 @@ modelgrid config init # Initialize configuration
 
 ## Greenlit Model System
 
-Models are controlled via a remote greenlist to prevent arbitrary downloads:
+Models are resolved through a remote catalog so deployments come from an explicit allowlist:
 
 ```json
 {
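Note on the catalog wording: the updated README text says that only catalog-listed models can be auto-deployed and that VRAM requirements are checked before loading. A minimal TypeScript sketch of that gate might look like the following. The `CatalogEntry` shape, the `canAutoDeploy` function, and the model ids are all hypothetical, chosen for illustration only; the real catalog schema is truncated in this diff and ModelGrid's actual code may differ.

```typescript
// Hypothetical catalog entry; the real catalog schema is not shown in this diff.
interface CatalogEntry {
  id: string;                  // model identifier as requested by a client
  container: "vllm" | "tgi";   // container types still listed in the README
  minVramGb: number;           // assumed field: VRAM needed to load the model
}

type DeployDecision =
  | { ok: true; entry: CatalogEntry }
  | { ok: false; reason: string };

// Allow auto-deployment only for catalog-listed models that fit in free VRAM.
function canAutoDeploy(
  modelId: string,
  catalog: CatalogEntry[],
  freeVramGb: number,
): DeployDecision {
  const entry = catalog.find((e) => e.id === modelId);
  if (entry === undefined) {
    return { ok: false, reason: `model "${modelId}" is not in the catalog` };
  }
  if (entry.minVramGb > freeVramGb) {
    return {
      ok: false,
      reason: `model "${modelId}" needs ${entry.minVramGb} GB VRAM, only ${freeVramGb} GB free`,
    };
  }
  return { ok: true, entry };
}

// Illustrative data only; these ids do not come from the real catalog.
const exampleCatalog: CatalogEntry[] = [
  { id: "example/small-model", container: "vllm", minVramGb: 8 },
  { id: "example/large-model", container: "tgi", minVramGb: 48 },
];

const decision = canAutoDeploy("example/small-model", exampleCatalog, 24);
console.log(decision.ok ? `deploy via ${decision.entry.container}` : decision.reason);
```

Returning a reason string rather than a bare boolean keeps the rejection cause available for an API error response, which seems useful given the OpenAI-compatible interface described above.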