mirror of
https://github.com/cloudstack-llc/mlx-knife.git
synced 2026-06-30 20:48:03 -04:00
Release MLX Knife 1.0.4 - Issue #14 Chat Self-Conversation Fix & Web UI Overhaul
Fix Issue #14: Interactive chat self-conversation bug resolved - Added context-sensitive chat stop tokens (\nHuman:, \nAssistant:, \nYou:, \nUser:) - Smart priority system: native model stop tokens first, chat tokens as fallback - Affects both `mlxk run` and `mlxk server` modes with backward compatibility Web UI complete transformation (simple_chat.html): - 🦫 Beaver branding replaces 🔪 knife emoji - Model and chat history persistence across browser sessions - Smart model switching with option to keep or clear chat history Testing infrastructure enhancements: - Automated server testing with RAM-aware model filtering - 15 new regression tests across 7+ MLX models validating Issue #14 fix - Comprehensive TESTING.md guide for server-based testing All 114 tests passing
This commit is contained in:
@@ -8,7 +8,7 @@ A lightweight, ollama-like CLI for managing and running MLX models on Apple Sili
|
||||
|
||||
> **Note**: MLX Knife is designed as a command-line interface tool only. While some internal functions are accessible via Python imports, only CLI usage is officially supported.
|
||||
|
||||
**Current Version**: 1.0.3 (August 2025)
|
||||
**Current Version**: 1.0.4 (August 2025)
|
||||
|
||||
[](https://github.com/mzau/mlx-knife/releases)
|
||||
[](https://opensource.org/licenses/MIT)
|
||||
@@ -325,6 +325,6 @@ Copyright (c) 2025 The BROKE team 🦫
|
||||
|
||||
<p align="center">
|
||||
<b>Made with ❤️ by The BROKE team <img src="broke-logo.png" alt="BROKE Logo" width="30" style="vertical-align: middle;"></b><br>
|
||||
<i>Version 1.0.3 | August 2025</i><br>
|
||||
<i>Version 1.0.4 | August 2025</i><br>
|
||||
<a href="https://github.com/mzau/broke-cluster">🔮 Next: BROKE Cluster for multi-node deployments</a>
|
||||
</p>
|
||||
|
||||
Reference in New Issue
Block a user