fix: Runtime gates + unit tests; benchmark GPU analysis

Core:
- Run preflight passes probe/framework to audio_runtime_compatibility
- STT model_type gate extended (vibevoice, audio)
- MLX 0.30.x compat: catch Exception in whisper_tokenizer
- Embedding-gate unit tests (3 tests)
- Removed get_encoding duplication (-45 LOC)

Benchmark:
- GPU analysis section in reports
This commit is contained in:
The BROKE Cluster Team
2026-02-07 23:38:34 +01:00
parent 10a7648f8d
commit 7f10187bee
13 changed files with 609 additions and 113 deletions
+1
View File
@@ -28,6 +28,7 @@
- **Resumable Downloads** - Interrupted clone/pull operations continue automatically
- **Safe Vision Batch Processing** - Automatic chunking prevents Metal OOM crashes
- **Workspace Path Support** - Run/show/server/list commands work with local directories
- **Updated [SECURITY.md](SECURITY.md)** - Clarifies mlx-knife vs. upstream library behavior; important for offline/air-gapped use
### Core Functionality
- **List & Manage Models**: Browse your HuggingFace cache with MLX-specific filtering