mirror of
https://github.com/cloudstack-llc/mlx-knife.git
synced 2026-06-30 20:48:03 -04:00
Release 2.0.4-beta.10: Audio PyPI fix (tiktoken workaround complete)
Audio/Whisper works with pip install - no Git workaround needed. See CHANGELOG.md for details. Tested: 647 passed, 11 skipped (Python 3.10-3.12)
This commit is contained in:
@@ -4,9 +4,9 @@
|
||||
<img src="https://github.com/mzau/mlx-knife/raw/main/mlxk-demo.gif" alt="MLX Knife Demo" width="900">
|
||||
</p>
|
||||
|
||||
**Current Version: 2.0.4-beta.9** (Stable: 2.0.3)
|
||||
**Current Version: 2.0.4-beta.10** (Stable: 2.0.3)
|
||||
|
||||
[](https://github.com/mzau/mlx-knife/releases)
|
||||
[](https://github.com/mzau/mlx-knife/releases)
|
||||
[](https://www.apache.org/licenses/LICENSE-2.0)
|
||||
[](https://www.python.org/downloads/)
|
||||
[](https://support.apple.com/en-us/HT211814)
|
||||
@@ -17,6 +17,8 @@
|
||||
|
||||
## Features
|
||||
|
||||
> **⚠️ Beta.9 Audio Bug:** If you installed `mlx-knife[audio]==2.0.4b9` from PyPI, audio transcription fails with "Processor not found". Upgrade to beta.10: `pip install mlx-knife[all]==2.0.4b10`
|
||||
|
||||
### What's New in 2.0.4 (Coming Soon - Currently Beta)
|
||||
- **Audio Transcription (STT)** - Whisper speech-to-text (`--audio` flag, `pip install mlx-knife[audio]`)
|
||||
- **Vision Models with EXIF Metadata** - Image analysis + automatic GPS/date/camera extraction visible to the model
|
||||
@@ -94,15 +96,15 @@ mlxk --version # → mlxk 2.0.3
|
||||
|
||||
**Requirements:** macOS Apple Silicon, Python 3.9-3.12
|
||||
|
||||
### 2. PyPI Beta (2.0.4-beta.9 - Text + Vision + Audio)
|
||||
### 2. PyPI Beta (2.0.4-beta.10 - Text + Vision + Audio)
|
||||
|
||||
```bash
|
||||
pip install mlx-knife[all]==2.0.4b9
|
||||
mlxk --version # → mlxk 2.0.4b9
|
||||
pip install mlx-knife[all]==2.0.4b10
|
||||
mlxk --version # → mlxk 2.0.4b10
|
||||
```
|
||||
|
||||
**Requirements:** macOS Apple Silicon, Python 3.10-3.12
|
||||
**Features:** Audio STT (Whisper), Vision with EXIF metadata, tiktoken workaround bundled
|
||||
**Features:** Audio STT (Whisper), Vision with EXIF metadata, full tiktoken workaround
|
||||
|
||||
### 3. Developer Installation
|
||||
|
||||
@@ -111,7 +113,7 @@ git clone https://github.com/mzau/mlx-knife.git
|
||||
cd mlx-knife
|
||||
pip install -e ".[all,dev,test]"
|
||||
|
||||
mlxk --version # → mlxk 2.0.4b9
|
||||
mlxk --version # → mlxk 2.0.4b10
|
||||
pytest -v
|
||||
```
|
||||
|
||||
@@ -449,18 +451,14 @@ mlxk convert <model> <output> --repair-index
|
||||
|
||||
### Audio Transcription (Speech-to-Text)
|
||||
|
||||
> **🎙️ New in beta.9:** Professional STT via dedicated Whisper models (mlx-audio backend). Backward compatible with Gemma-3n multimodal audio (mlx-vlm).
|
||||
> **🎙️ New in beta.9/10:** Professional STT via dedicated Whisper models (mlx-audio backend). Beta.10 fixes PyPI install (no Git workaround needed). Backward compatible with Gemma-3n multimodal audio (mlx-vlm).
|
||||
|
||||
**Requirements:**
|
||||
- **Python 3.10+** (mlx-audio dependency)
|
||||
- **Installation:** (mlx-audio 0.3.1 PyPI has regression - install from Git):
|
||||
```bash
|
||||
pip install -e "git+https://github.com/Blaizzy/mlx-audio.git@9349644#egg=mlx-audio"
|
||||
pip install tiktoken
|
||||
```
|
||||
- **Installation:** `pip install mlx-knife[audio]` (tiktoken workaround bundled)
|
||||
- **No system dependencies:** MP3/WAV decoding via embedded libsndfile (no ffmpeg or Homebrew required)
|
||||
|
||||
**✅ Recommended Models** (mlx-knife v2.0.4-beta.9):
|
||||
**✅ Recommended Models** (mlx-knife v2.0.4-beta.10):
|
||||
|
||||
| Model | Backend | Size | Duration | Notes |
|
||||
|-------|---------|------|----------|-------|
|
||||
@@ -1241,7 +1239,7 @@ Apache License 2.0 — see `LICENSE` (root) and `mlxk2/NOTICE`.
|
||||
|
||||
<p align="center">
|
||||
<b>Made with ❤️ by The BROKE team <img src="broke-logo.png" alt="BROKE Logo" width="30" align="middle"></b><br>
|
||||
<i>Version 2.0.4-beta.9 | February 2026</i><br>
|
||||
<i>Version 2.0.4-beta.10 | February 2026</i><br>
|
||||
<a href="https://github.com/mzau/broke-nchat">💬 Web UI: nChat - lightweight chat interface</a> •
|
||||
<a href="https://github.com/mzau/broke-cluster">🔮 Multi-node: BROKE Cluster</a>
|
||||
</p>
|
||||
|
||||
Reference in New Issue
Block a user