mirror of
https://github.com/cloudstack-llc/mlx-knife.git
synced 2026-06-30 20:48:03 -04:00
Release MLX Knife 1.1.0-beta3 - Critical Cache Management Fixes
Three major bug fixes for production readiness: - Issue #21: Fix crash on fresh installations (empty cache directory) - Issue #22: Suppress urllib3 LibreSSL warnings on macOS Python 3.9 - Issue #23: Fix double rm execution bug - models now deleted in single command Test improvements: - 140/140 tests passing (up from 137) - Added real integration tests for lock cleanup - Fixed unit tests broken by path corrections All known cache management issues resolved for stable release.
This commit is contained in:
@@ -8,9 +8,10 @@ A lightweight, ollama-like CLI for managing and running MLX models on Apple Sili
|
||||
|
||||
> **Note**: MLX Knife is designed as a command-line interface tool only. While some internal functions are accessible via Python imports, only CLI usage is officially supported.
|
||||
|
||||
**Current Version**: 1.1.0-beta2 (August 2025)
|
||||
- **Issue #19**: Fixed server response truncation - large context models work at full capacity
|
||||
- **Issue #20**: Clean output - End-Tokens no longer visible in non-streaming mode
|
||||
**Current Version**: 1.1.0-beta3 (August 2025)
|
||||
- **Issue #21**: Fixed empty cache directory crash - `mlxk list` now works on fresh installations
|
||||
- **Issue #22**: Suppressed urllib3 LibreSSL warnings on macOS Python 3.9
|
||||
- **Issue #23**: Fixed double execution requirement in `mlxk rm` command with enhanced lock cleanup
|
||||
|
||||
[](https://github.com/mzau/mlx-knife/releases)
|
||||
[](https://opensource.org/licenses/MIT)
|
||||
@@ -18,7 +19,7 @@ A lightweight, ollama-like CLI for managing and running MLX models on Apple Sili
|
||||
[](https://www.python.org/downloads/)
|
||||
[](https://support.apple.com/en-us/HT211814)
|
||||
[](https://github.com/ml-explore/mlx)
|
||||
[](#testing)
|
||||
[](#testing)
|
||||
|
||||
## Features
|
||||
|
||||
@@ -202,10 +203,17 @@ mlxk run <model> --no-chat-template # Raw completion mode
|
||||
|
||||
#### `rm` - Remove Models
|
||||
```bash
|
||||
mlxk rm <model> # Delete a model
|
||||
mlxk rm <model> --force # Skip confirmation
|
||||
mlxk rm <model> # Delete model with cache cleanup confirmation
|
||||
mlxk rm <model>@<hash> # Delete specific version (removes entire model)
|
||||
mlxk rm <model> --force # Skip confirmations, auto-cleanup cache files
|
||||
```
|
||||
|
||||
**Features:**
|
||||
- Removes entire model directory (not just snapshots)
|
||||
- Cleans up orphaned HuggingFace lock files
|
||||
- Handles corrupted models gracefully
|
||||
- Smart prompting (only asks about cache cleanup if needed)
|
||||
|
||||
#### `health` - Check Integrity
|
||||
```bash
|
||||
mlxk health # Check all models
|
||||
@@ -327,6 +335,6 @@ Copyright (c) 2025 The BROKE team 🦫
|
||||
|
||||
<p align="center">
|
||||
<b>Made with ❤️ by The BROKE team <img src="broke-logo.png" alt="BROKE Logo" width="30" style="vertical-align: middle;"></b><br>
|
||||
<i>Version 1.1.0-beta2 | August 2025</i><br>
|
||||
<i>Version 1.1.0-beta3 | August 2025</i><br>
|
||||
<a href="https://github.com/mzau/broke-cluster">🔮 Next: BROKE Cluster for multi-node deployments</a>
|
||||
</p>
|
||||
|
||||
Reference in New Issue
Block a user