mirror of https://github.com/cloudstack-llc/mlx-knife.git synced 2026-07-01 20:44:14 -04:00

Files

T

The BROKE Cluster Team 86f669dc82 Release 2.0.4-beta.1: Vision + Pipes + Memory

- Vision Support (Issue #45): CLI + Server with OpenAI-compatible image API, EXIF metadata
- Unix Pipes (ADR-014): stdin support, isatty detection, SIGPIPE handling
- Memory-Aware Loading (ADR-016): Pre-load checks with >70% RAM warnings
- Python 3.9-3.14: Full compatibility verified (476-485 tests passing)
- Fixed: --log-json regression (Issue #44), Vision multimodal history filtering

See CHANGELOG.md for complete details.

2025-12-16 19:35:30 +01:00

31 KiB

Raw Blame History

MLX-Knife 2.0 JSON API Specification

Specification Version: 0.1.6 Status: Alpha - Subject to change Target: MLX-Knife 2.0.4-beta.1

Based on GitHub Issue #8 - Comprehensive JSON output support for all commands

Motivation

MLX Knife is promoted as a "scriptable" tool, but formatted terminal output makes automation difficult. JSON output enables robust scripting integration and broke-cluster compatibility.

Health Check Concepts (0.1.5)

MLX Knife distinguishes between two levels of model validation:

Integrity Check (`health` field)

Purpose: Verify that downloaded model files are complete and uncorrupted
Scope: File-level validation only
Checks:
- Required files present (config.json, weights, tokenizer files)
- No Git LFS pointers instead of actual files
- JSON files are valid JSON
States: "healthy" | "unhealthy"
Always included: In all modelObject instances

Runtime Compatibility Check (`runtime_compatible` field)

Purpose: Verify that model can be executed with mlx-lm
Scope: Framework and model architecture validation
Checks:
- Framework is MLX (GGUF/PyTorch models fail)
- Model architecture supported by current mlx-lm version
- Respects MODEL_REMAPPING (e.g., mistral → llama)
States: true | false
Always included: In all modelObject instances

Gate Logic & Reason Field

Runtime compatibility check requires integrity check first
If integrity check fails (health: "unhealthy"), runtime check is skipped (runtime_compatible: false)
reason field describes the first problem found:
- Integrity problems take precedence
- Runtime problems only shown if files are healthy
- null when both checks pass (health: "healthy" AND runtime_compatible: true)

Example Scenarios

Healthy MLX Model (Compatible):

/* Illustrative snippet - not a complete response */
{
  "health": "healthy",
  "runtime_compatible": true,
  "reason": null
}

GGUF Model (Files OK, Not Executable):

/* Illustrative snippet - not a complete response */
{
  "health": "healthy",
  "runtime_compatible": false,
  "reason": "Framework GGUF not executable with mlx-lm (requires MLX)"
}

Unsupported Architecture:

/* Illustrative snippet - not a complete response */
{
  "health": "healthy",
  "runtime_compatible": false,
  "reason": "Model architecture 'qwen3_next' requires mlx-lm >= 0.28.0 (current: 0.27.1)"
}

Incomplete Download (Runtime Check Skipped):

/* Illustrative snippet - not a complete response */
{
  "health": "unhealthy",
  "runtime_compatible": false,
  "reason": "config.json missing"
}

CLI Usage

All commands require the --json flag for JSON output:

mlxk2 list --json                      # JSON output
mlxk2 list                             # Human-readable output

Version Reporting

CLI version (human):
- mlxk2 --version
CLI version (JSON):
- mlxk2 --version --json

JSON output example:

{
  "status": "success",
  "command": "version",
  "data": {
    "cli_version": "2.0.4-beta.1",
    "json_api_spec_version": "0.1.6",
    "system": {
      "memory_total_bytes": 137438953472
    }
  },
  "error": null
}

Notes:

Regular command responses (e.g., list, show) do not include a separate protocol tag; the spec version is reported by the version command in data.json_api_spec_version.
system object is null on non-macOS platforms where sysctl hw.memsize is unavailable (0.1.6+).

Commands Overview

All commands support consistent JSON output with standardized error handling and exit codes.

Core Schema Pattern

{
  "status": "success" | "error",
  "command": "list" | "show" | "health" | "pull" | "rm" | "clone" | "version" | "push" | "run" | "server",
  "data": { /* command-specific data */ },
  "error": null | { "type": "string", "message": "string" }
}

Common Model Object

All commands that return model information use the same minimal model object.

name: Full HF name org/model.
hash: 40-char snapshot commit of the selected snapshot, or null.
size_bytes: Total size in bytes of files under the selected path (snapshot preferred, else model root).
last_modified: ISO-8601 UTC timestamp (with Z) of the selected path.
framework: "MLX" | "GGUF" | "PyTorch" | "Unknown".
model_type: "chat" | "embedding" | "base" | "unknown".
capabilities: e.g., ["text-generation", "chat"], ["embeddings"], or ["text-generation", "chat", "vision"].
health: "healthy" | "unhealthy" (always present).
runtime_compatible: true | false (0.1.5+, always present).
reason: string | null (0.1.5+, describes first problem found, null when both checks pass).
cached: true.

Notes:

No human-readable size field; only size_bytes.
No human-readable "modified" field; last_modified is authoritative.
No absolute filesystem paths are exposed.
runtime_compatible and reason fields added in spec version 0.1.5 (Issue #36).
vision capability added in 0.1.5 as a backward-compatible enum extension (ADR-012 Phase 1a).

Supported Commands

Command	Description	JSON-Only in 2.0	Alpha Feature
`list`	List models with metadata and hash codes	✅	-
`show`	Detailed model inspection with files/config	✅	-
`health`	Check model integrity and corruption	✅	-
`pull`	Download models from HuggingFace	✅	-
`rm`	Delete models from cache	✅	-
`clone`	Clone models to workspace directory	✅	`MLXK2_ENABLE_ALPHA_FEATURES=1`
`push`	Upload a local folder to Hugging Face (experimental)	✅	`MLXK2_ENABLE_ALPHA_FEATURES=1`
`run`	Execute model inference	✅	-
`serve`/`server`	OpenAI-compatible API server	✅	-

Note: Commands marked with Alpha Feature require MLXK2_ENABLE_ALPHA_FEATURES=1 environment variable to be available.

Model Discovery & Metadata

Model Type & Capabilities

Model Types:

"chat" - Language models with chat/instruction capability
"embedding" - Embedding models for vector representations
"completion" - Base models for text completion (no chat template)
"unknown" - Cannot determine model type from config

Capabilities Array:

"text-generation" - Can generate text
"chat" - Supports chat template/instruction format
"embeddings" - Can generate embeddings
"completion" - Text completion without chat format
"vision" - Accepts image inputs (detected via model_type in vision families or presence of preprocessor_config.json)

Vision Example (Phase 1a, ADR-012):

{
  "status": "success",
  "command": "list",
  "data": {
    "models": [
      {
        "name": "mlx-community/llava-1.5-7b-hf-4bit-mlx",
        "hash": "a5339a41b2e3abcdefgh1234567890ab12345678",
        "size_bytes": 4613734656,
        "last_modified": "2024-12-03T10:00:00Z",
        "framework": "MLX",
        "model_type": "chat",
        "capabilities": ["text-generation", "chat", "vision"],
        "health": "healthy",
        "runtime_compatible": true,
        "reason": null,
        "cached": true
      }
    ],
    "count": 1
  },
  "error": null
}

`mlxk-json list [pattern] --json`

Basic Usage:

mlxk-json list --json                        # All models with full validation
mlxk-json list "mlx-community" --json        # Filter by pattern
mlxk-json list "Llama" --json                # Fuzzy matching

Behavior:

Returns all cached models with complete metadata
Performs both integrity and runtime compatibility checks (0.1.5+)
Pattern filter is a case-insensitive substring match on name

JSON Schema:

{
  "status": "success",
  "command": "list",
  "data": {
    "models": [
      {
        "name": "mlx-community/Phi-3-mini-4k-instruct-4bit",
        "hash": "a5339a41b2e3abcdefgh1234567890ab12345678",
        "size_bytes": 4613734656,
        "last_modified": "2024-10-15T08:23:41Z",
        "framework": "MLX",
        "model_type": "chat",
        "capabilities": ["text-generation", "chat"],
        "health": "healthy",
        "runtime_compatible": true,
        "reason": null,
        "cached": true
      },
      {
        "name": "mlx-community/mxbai-embed-large-v1",
        "hash": "b5679a5f90abcdef1234567890abcdef12345678",
        "size_bytes": 1200000000,
        "last_modified": "2024-10-20T10:30:15Z",
        "framework": "MLX",
        "model_type": "embedding",
        "capabilities": ["embeddings"],
        "health": "healthy",
        "runtime_compatible": true,
        "reason": null,
        "cached": true
      },
      {
        "name": "TheBloke/TinyLlama-1.1B-Chat-v1.0-GGUF",
        "hash": "e96c7a5f90abcdef1234567890abcdef12345678",
        "size_bytes": 16900000000,
        "last_modified": "2024-09-20T14:15:22Z",
        "framework": "GGUF",
        "model_type": "chat",
        "capabilities": ["text-generation", "chat"],
        "health": "healthy",
        "runtime_compatible": false,
        "reason": "Framework GGUF not executable with mlx-lm (requires MLX)",
        "cached": true
      },
      {
        "name": "mlx-community/Qwen3-Next-80B-A3B-Instruct-4bit",
        "hash": "f1234a5f90abcdef1234567890abcdef12345678",
        "size_bytes": 45000000000,
        "last_modified": "2024-10-01T09:15:30Z",
        "framework": "MLX",
        "model_type": "chat",
        "capabilities": ["text-generation", "chat"],
        "health": "healthy",
        "runtime_compatible": false,
        "reason": "Model architecture 'qwen3_next' requires mlx-lm >= 0.28.0 (current: 0.27.1)",
        "cached": true
      },
      {
        "name": "corrupted/incomplete-download",
        "hash": "c9876a5f90abcdef1234567890abcdef12345678",
        "size_bytes": 2500000000,
        "last_modified": "2024-09-15T12:00:00Z",
        "framework": "MLX",
        "model_type": "unknown",
        "capabilities": [],
        "health": "unhealthy",
        "runtime_compatible": false,
        "reason": "config.json missing",
        "cached": true
      }
    ],
    "count": 12
  },
  "error": null
}

Empty Cache:

{
  "status": "success",
  "command": "list",
  "data": {
    "models": [],
    "count": 0
  },
  "error": null
}

`mlxk-json health [pattern] --json`

Usage:

mlxk-json health --json                      # Check all models
mlxk-json health "Phi-3" --json              # Check specific pattern
mlxk-json health "Qwen3@e96" --json          # Check specific hash

Healthy Models:

{
  "status": "success",
  "command": "health",
  "data": {
    "healthy": [
      {
        "name": "mlx-community/Phi-3-mini-4k-instruct-4bit",
        "status": "healthy",
        "reason": "Model is healthy"
      }
    ],
    "unhealthy": [],
    "summary": {
      "total": 1,
      "healthy_count": 1,
      "unhealthy_count": 0
    }
  },
  "error": null
}

Unhealthy Models (Real Scenario):

{
  "status": "success", 
  "command": "health",
  "data": {
    "healthy": [],
    "unhealthy": [
      {
        "name": "mlx-community/Phi-3-mini-4k-instruct-4bit",
        "status": "unhealthy",
        "reason": "config.json missing"
      },
      {
        "name": "corrupted/model", 
        "status": "unhealthy",
        "reason": "LFS pointers instead of files: model.safetensors"
      }
    ],
    "summary": {
      "total": 2,
      "healthy_count": 0,
      "unhealthy_count": 2
    }
  },
  "error": null
}

Ambiguous Pattern:

{
  "status": "error",
  "command": "health", 
  "data": null,
  "error": {
    "type": "ambiguous_match",
    "message": "Multiple models match 'Llama'",
    "matches": [
      "mlx-community/Llama-3.2-1B-Instruct-4bit",
      "mlx-community/Llama-3.2-3B-Instruct-4bit"
    ]
  }
}

`mlxk-json show <model> --json`

Usage:

mlxk-json show "Phi-3-mini" --json               # Short name expansion
mlxk-json show "mlx-community/Phi-3-mini" --json # Full name
mlxk-json show "Qwen3@e96" --json                # Specific hash
mlxk-json show "Phi-3-mini" --files --json       # Include file listing
mlxk-json show "Phi-3-mini" --config --json      # Include config.json content

Basic Model Information:

{
  "status": "success",
  "command": "show",
  "data": {
    "model": {
      "name": "mlx-community/Phi-3-mini-4k-instruct-4bit",
      "hash": "a5339a41b2e3abcdefgh1234567890ab12345678",
      "size_bytes": 4613734656,
      "framework": "MLX",
      "model_type": "chat",
      "capabilities": ["text-generation", "chat"],
      "last_modified": "2024-10-15T08:23:41Z",
      "health": "healthy",
      "runtime_compatible": true,
      "reason": null,
      "cached": true
    },
    "metadata": {
      "model_type": "phi3",
      "quantization": "4bit",
      "context_length": 4096,
      "vocab_size": 32064,
      "hidden_size": 3072,
      "num_attention_heads": 32,
      "num_hidden_layers": 32
    }
  },
  "error": null
}

With Files Listing (--files):

{
  "status": "success",
  "command": "show",
  "data": {
    "model": {
      "name": "mlx-community/Phi-3-mini-4k-instruct-4bit",
      "hash": "a5339a41b2e3abcdefgh1234567890ab12345678",
      "size_bytes": 4613734656,
      "framework": "MLX",
      "model_type": "chat",
      "capabilities": ["text-generation", "chat"],
      "last_modified": "2024-10-15T08:23:41Z",
      "health": "healthy",
      "runtime_compatible": true,
      "reason": null,
      "cached": true
    },
    "files": [
      {"name": "config.json", "size": "1.2KB", "type": "config"},
      {"name": "model.safetensors", "size": "2.3GB", "type": "weights"},
      {"name": "model-00001-of-00002.safetensors", "size": "1.8GB", "type": "weights"},
      {"name": "model-00002-of-00002.safetensors", "size": "200MB", "type": "weights"},
      {"name": "tokenizer.json", "size": "2.1MB", "type": "tokenizer"},
      {"name": "tokenizer_config.json", "size": "3.4KB", "type": "config"},
      {"name": "special_tokens_map.json", "size": "588B", "type": "config"}
    ],
    "metadata": null
  },
  "error": null
}

With Config Content (--config):

{
  "status": "success",
  "command": "show",
  "data": {
    "model": {
      "name": "mlx-community/Phi-3-mini-4k-instruct-4bit",
      "hash": "a5339a41b2e3abcdefgh1234567890ab12345678",
      "size_bytes": 4613734656,
      "framework": "MLX",
      "model_type": "chat",
      "capabilities": ["text-generation", "chat"],
      "last_modified": "2024-10-15T08:23:41Z",
      "health": "healthy",
      "runtime_compatible": true,
      "reason": null,
      "cached": true
    },
    "config": {
      "architectures": ["Phi3ForCausalLM"],
      "model_type": "phi3",
      "vocab_size": 32064,
      "hidden_size": 3072,
      "intermediate_size": 8192,
      "num_hidden_layers": 32,
      "num_attention_heads": 32,
      "max_position_embeddings": 4096,
      "rope_theta": 10000.0,
      "quantization": {
        "bits": 4,
        "group_size": 64
      }
    },
    "metadata": null
  },
  "error": null
}

Model Not Found:

{
  "status": "error",
  "command": "show",
  "data": null,
  "error": {
    "type": "model_not_found",
    "message": "No model found matching 'nonexistent-model'"
  }
}

Ambiguous Match:

{
  "status": "error",
  "command": "show",
  "data": null,
  "error": {
    "type": "ambiguous_match",
    "message": "Multiple models match 'Llama'",
    "matches": [
      "mlx-community/Llama-3.2-1B-Instruct-4bit",
      "mlx-community/Llama-3.2-3B-Instruct-4bit"
    ]
  }
}

Changes in 0.1.6 (Alpha)

ADR-016 Preparation: System Memory Information

Added system object to version command response
system.memory_total_bytes: Total physical RAM in bytes (from sysctl hw.memsize)
system is null on non-macOS platforms where sysctl is unavailable
Enables ADR-016 Memory-Aware Model Loading (pre-load memory checks)

ADR-012: Vision Support - Model Discovery

Vision models detected via preprocessor_config.json presence
vision capability added to model discovery (backward-compatible enum extension)
Visible in mlxk list --json, mlxk show --json, mlxk health --json
Example: "capabilities": ["text-generation", "chat", "vision"]

Note on mlxk run --image (CLI):

mlxk run --image command exists for vision models (ADR-012 Phase 1b)
Current output: Text mode only (Markdown table with filename mapping)
JSON output: Deferred to ADR-017 Phase 2 (requires formal schema extension)
Server OpenAI Vision API documented in docs/SERVER-HANDBOOK.md

Changes in 0.1.5 (Alpha)

Issue #36: Separate Integrity and Runtime Compatibility Checks

Added runtime_compatible: boolean field to modelObject
Added reason: string | null field to modelObject
Both fields always present in JSON output
runtime_compatible checks:
- Framework must be MLX (GGUF/PyTorch fail)
- Model architecture must be supported by installed mlx-lm version
- Respects MODEL_REMAPPING for aliased architectures
Gate logic: Runtime check requires passing integrity check first
reason field describes first problem found (integrity > runtime priority)

Changes in 0.1.2 (Alpha)

Introduced a common minimal Model Object for consistency across commands.
Replaced human-readable size with machine-friendly size_bytes.
Removed human-readable modified; last_modified (ISO-8601 UTC) is authoritative.

Operations

`mlxk-json pull <model> --json`

Usage:

mlxk-json pull "Phi-3-mini" --json               # Short name expansion
mlxk-json pull "mlx-community/Phi-3-mini" --json # Full name
mlxk-json pull "microsoft/DialoGPT-small" --json # Non-MLX model

Successful Download:

{
  "status": "success",
  "command": "pull",
  "data": {
    "model": "mlx-community/Phi-3-mini-4k-instruct-4bit",
    "download_status": "success",
    "message": "Successfully downloaded model",
    "expanded_name": "mlx-community/Phi-3-mini-4k-instruct-4bit"
  },
  "error": null
}

Already Exists (Bug - doesn't detect corruption):

{
  "status": "success",
  "command": "pull",
  "data": {
    "model": "mlx-community/Phi-3-mini-4k-instruct-4bit",
    "download_status": "already_exists",
    "message": "Model mlx-community/Phi-3-mini-4k-instruct-4bit already exists in cache",
    "expanded_name": null
  },
  "error": null
}

Download Failed:

{
  "status": "error",
  "command": "pull",
  "data": {
    "model": "nonexistent/model",
    "download_status": "failed",
    "message": "",
    "expanded_name": null
  },
  "error": {
    "type": "download_failed",
    "message": "Repository not found for url: https://huggingface.co/api/models/nonexistent/model"
  }
}

Validation Error:

{
  "status": "error",
  "command": "pull", 
  "data": {
    "model": null,
    "download_status": "error",
    "message": "",
    "expanded_name": null
  },
  "error": {
    "type": "ValidationError",
    "message": "Model name too long: 105/96 characters"
  }
}

Ambiguous Match:

{
  "status": "error",
  "command": "pull",
  "data": {
    "model": null,
    "download_status": "unknown",
    "message": "",
    "expanded_name": null
  },
  "error": {
    "type": "ambiguous_match",
    "message": "Multiple models match 'Llama'",
    "matches": [
      "mlx-community/Llama-3.2-1B-Instruct-4bit",
      "mlx-community/Llama-3.2-3B-Instruct-4bit"
    ]
  }
}

`mlxk-json rm <model> [--force] --json`

Usage:

mlxk-json rm "Phi-3-mini" --json                 # Direct deletion (no locks)
mlxk-json rm "Phi-3-mini" --force --json         # Force deletion (ignores locks)
mlxk-json rm "locked-model" --json               # Error: requires --force due to locks

Successful Deletion:

{
  "status": "success", 
  "command": "rm",
  "data": {
    "model": "mlx-community/Phi-3-mini-4k-instruct-4bit",
    "action": "deleted",
    "message": "Successfully deleted mlx-community/Phi-3-mini-4k-instruct-4bit"
  },
  "error": null
}

Model has Active Locks (requires --force):

{
  "status": "error",
  "command": "rm", 
  "data": {
    "model": "mlx-community/Phi-3-mini-4k-instruct-4bit",
    "locks_detected": true,
    "lock_files": [".locks/model-lock-12345.lock"]
  },
  "error": {
    "type": "locks_present",
    "message": "Model has active locks. Use --force to override."
  }
}

Model Not Found:

{
  "status": "error",
  "command": "rm",
  "data": null,
  "error": {
    "type": "model_not_found",
    "message": "No models found matching 'nonexistent-model'"
  }
}

Ambiguous Pattern:

{
  "status": "error",
  "command": "rm",
  "data": {
    "matches": [
      "mlx-community/Llama-3.2-1B-Instruct-4bit",
      "mlx-community/Llama-3.2-3B-Instruct-4bit"
    ]
  },
  "error": {
    "type": "ambiguous_match", 
    "message": "Multiple models match 'Llama'. Please specify which model to delete."
  }
}

Permission Error:

{
  "status": "error",
  "command": "rm",
  "data": {
    "model": "mlx-community/Phi-3-mini-4k-instruct-4bit"
  },
  "error": {
    "type": "PermissionError",
    "message": "Permission denied: Cannot delete read-only files"
  }
}

`mlxk-json clone <model> <target_dir> --json`

Requires: MLXK2_ENABLE_ALPHA_FEATURES=1

Usage:

mlxk-json clone "Phi-3-mini" ./workspace --json              # Clone to workspace directory
mlxk-json clone "mlx-community/Phi-3-mini" ./my-model --json # Full name to custom directory
mlxk-json clone "microsoft/DialoGPT-small" ./workspace --json # Non-MLX model

Successful Clone:

{
  "status": "success",
  "command": "clone",
  "data": {
    "model": "mlx-community/Phi-3-mini-4k-instruct-4bit",
    "clone_status": "success",
    "message": "Cloned to ./workspace",
    "target_dir": "./workspace",
    "expanded_name": "mlx-community/Phi-3-mini-4k-instruct-4bit"
  },
  "error": null
}

Target Directory Not Empty:

{
  "status": "error",
  "command": "clone",
  "data": {
    "model": null,
    "clone_status": "error",
    "target_dir": "./workspace"
  },
  "error": {
    "type": "ValidationError",
    "message": "Target directory './workspace' already exists and is not empty"
  }
}

Clone Failed:

{
  "status": "error",
  "command": "clone",
  "data": {
    "model": "nonexistent/model",
    "clone_status": "failed",
    "target_dir": "./workspace"
  },
  "error": {
    "type": "clone_failed",
    "message": "Repository not found for url: https://huggingface.co/api/models/nonexistent/model"
  }
}

Access Denied:

{
  "status": "error",
  "command": "clone",
  "data": {
    "model": "gated/model",
    "clone_status": "access_denied",
    "target_dir": "./workspace"
  },
  "error": {
    "type": "access_denied",
    "message": "Access denied: gated/private model 'gated/model'. Accept terms and set HF_TOKEN."
  }
}

APFS Filesystem Error:

{
  "status": "error",
  "command": "clone",
  "data": {
    "model": "org/model",
    "clone_status": "filesystem_error",
    "target_dir": "./workspace"
  },
  "error": {
    "type": "FilesystemError",
    "message": "APFS required for clone operations."
  }
}

`mlxk-json push <dir> <org/model> [--create] [--private] [--branch <b>] [--commit "..."] [--verbose] [--check-only] --json`

Requires: MLXK2_ENABLE_ALPHA_FEATURES=1

Behavior:

Requires HF_TOKEN env.
Default branch: main (subject to change).
Fails if repo missing unless --create is provided.
Sends folder as-is to the specified branch using huggingface_hub.upload_folder.
--verbose affects only human output; JSON remains unchanged in structure.
--check-only performs a local, content-oriented workspace validation and does not contact the Hub (no token required). Results are included under data.workspace_health.

Successful Upload (with changes):

{
  "status": "success",
  "command": "push",
  "data": {
    "repo_id": "org/model",
    "branch": "main",
    "commit_sha": "abcdef1234567890abcdef1234567890abcdef12",
    "commit_url": "https://huggingface.co/org/model/commit/abcdef1",
    "repo_url": "https://huggingface.co/org/model",
    "uploaded_files_count": 3,
    "local_files_count": 11,
    "no_changes": false,
    "created_repo": false,
    "change_summary": {"added": 1, "modified": 2, "deleted": 0},
    "message": "Push successful. Clone operations require APFS filesystem.",
    "experimental": true,
    "disclaimer": "Experimental feature (M0: upload only). No validation/filters; review on the Hub."
  },
  "error": null
}

No Changes (no-op commit avoided):

{
  "status": "success",
  "command": "push",
  "data": {
    "repo_id": "org/model",
    "branch": "main",
    "commit_sha": null,
    "commit_url": null,
    "repo_url": "https://huggingface.co/org/model",
    "uploaded_files_count": 0,
    "local_files_count": 11,
    "no_changes": true,
    "created_repo": false,
    "message": "No files changed; skipped empty commit.",
    "experimental": true,
    "disclaimer": "Experimental feature (M0: upload only). No validation/filters; review on the Hub.",
    "hf_logs": ["No files have been modified since last commit. Skipping to prevent empty commit."]
  },
  "error": null
}

Check-only (no network):

{
  "status": "success",
  "command": "push",
  "data": {
    "repo_id": "org/model",
    "branch": "main",
    "commit_sha": null,
    "commit_url": null,
    "repo_url": "https://huggingface.co/org/model",
    "local_files_count": 11,
    "no_changes": null,
    "created_repo": false,
    "message": "Check-only: no upload performed.",
    "workspace_health": {
      "files_count": 11,
      "total_bytes": 289612345,
      "config": {"exists": true, "valid_json": true, "path": "/path/to/config.json"},
      "weights": {"count": 3, "formats": ["safetensors"], "index": {"has_index": true, "missing": []}, "pattern_complete": true},
      "anomalies": [],
      "healthy": true
    },
    "experimental": true,
    "disclaimer": "Experimental feature (M0: upload only). No validation/filters; review on the Hub."
  },
  "error": null
}

Missing Token:

{
  "status": "error",
  "command": "push",
  "data": {
    "repo_id": "org/model",
    "branch": "main",
    "repo_url": "https://huggingface.co/org/model",
    "uploaded_files_count": null,
    "local_files_count": null,
    "no_changes": null,
    "created_repo": false,
    "experimental": true,
    "disclaimer": "Experimental feature (M0: upload only). No validation/filters; review on the Hub."
  },
  "error": {
    "type": "auth_error",
    "message": "HF_TOKEN not set"
  }
}

Error Handling

All errors follow consistent format with detailed error types:

Error Types

Validation Errors:

ValidationError - Invalid input (96 char limit, empty names)
ambiguous_match - Multiple models match pattern
model_not_found - No models match pattern

Network Errors:

download_failed - HuggingFace API errors, network timeouts
NetworkError - Connection issues

System Errors:

PermissionError - File system permission denied
OperationError - Cache corruption, disk full
InternalError - Unexpected system errors

Error Response Schema:

{
  "status": "error",
  "command": "pull",
  "data": { /* partial data if available */ },
  "error": {
    "type": "ValidationError",
    "message": "Repository name exceeds HuggingFace Hub limit: 105/96 characters"
  }
}

Real-World Error Examples

Cache Corruption (Health Check Bug):

{
  "status": "success",
  "command": "health", 
  "data": {
    "healthy": [],
    "unhealthy": [{
      "name": "mlx-community/Phi-3-mini-4k-instruct-4bit",
      "status": "unhealthy",
      "reason": "config.json missing"
    }],
    "summary": {
      "total": 1,
      "healthy_count": 0,
      "unhealthy_count": 1
    }
  },
  "error": null
}

Pull Refuses Corrupted Model (Bug):

{
  "status": "success",
  "command": "pull",
  "data": {
    "download_status": "already_exists",
    "message": "Model already exists in cache"
  },
  "error": null
}

Agent Integration Examples

Model Management Automation:

# List all MLX models with hashes
mlxk-json list --json | jq -r '.data.models[] | select(.framework=="MLX") | "\(.name)@\(.hash)"'

# Get model hashes for pattern matching
mlxk-json list "Qwen" --json | jq -r '.data.models[] | .hash'

# Count models by framework
mlxk-json list --json | jq '.data.models | group_by(.framework) | map({framework: .[0].framework, count: length})'

# Health summary
mlxk-json health --json | jq '.data.summary'

# Find unhealthy models
mlxk-json health --json | jq -r '.data.unhealthy[].name'

# Filter by pattern
mlxk-json list "Llama" --json | jq '.data.count'

# Model sizes with hashes
mlxk-json list --json | jq -r '.data.models[] | "\(.name)@\(.hash): \(.size_bytes)"'

# Get detailed model info
mlxk-json show "Phi-3-mini" --json | jq '.data.model'

# List all files in a model
mlxk-json show "Phi-3-mini" --files --json | jq -r '.data.files[] | "\(.name): \(.size)"'

# Extract model config
mlxk-json show "Phi-3-mini" --config --json | jq '.data.config.quantization'

Automated Health Monitoring:

#!/bin/bash
# Check if any models are unhealthy
unhealthy_count=$(mlxk-json health --json | jq '.data.summary.unhealthy_count')
if [ "$unhealthy_count" -gt 0 ]; then
  echo "Warning: $unhealthy_count unhealthy models found"
  mlxk-json health --json | jq -r '.data.unhealthy[] | "UNHEALTHY: \(.name) - \(.reason)"'
fi

Batch Operations:

# Pull multiple models
for model in "Phi-3-mini" "Llama-3.2-1B"; do
  echo "Pulling $model..."
  mlxk-json pull "$model" --json | jq '.data.download_status'
done

# Clean up old models
mlxk-json list --json | jq -r '.data.models[] | select(.size | test("GB")) | .name' | while read model; do
  echo "Found large model: $model"
done

Design Principles

No implementation details: No cache paths, internal directories, or implementation specifics
No user-specific data: No usernames in paths or environment-dependent information
Consistent schema: All commands follow same status/command/data/error structure
Scriptable output: Rich structured data optimized for jq and automation
Backward compatible: Exit codes remain unchanged for script compatibility

Exit Codes

All commands use consistent exit codes for scripting:

0 - Success
1 - General error (validation, not found, etc.)
2 - Network/download error
3 - Permission/filesystem error

Version History

2.0.0-alpha: JSON-only implementation with mlxk-json --json
2.0.0-alphha.1: Full implementation with both JSON and human-readable output
2.0.0-alphha.2: Push function protocol extension (json-0.1.3)

31 KiB Raw Blame History

MLX-Knife 2.0 JSON API Specification

Motivation

Health Check Concepts (0.1.5)

Integrity Check (health field)

Runtime Compatibility Check (runtime_compatible field)

Gate Logic & Reason Field

Example Scenarios

CLI Usage

Version Reporting

Commands Overview

Core Schema Pattern

Common Model Object

Supported Commands

Model Discovery & Metadata

Model Type & Capabilities

mlxk-json list [pattern] --json

mlxk-json health [pattern] --json

mlxk-json show <model> --json

Changes in 0.1.6 (Alpha)

Changes in 0.1.5 (Alpha)

Changes in 0.1.2 (Alpha)

Operations

mlxk-json pull <model> --json

mlxk-json rm <model> [--force] --json

mlxk-json clone <model> <target_dir> --json

mlxk-json push <dir> <org/model> [--create] [--private] [--branch <b>] [--commit "..."] [--verbose] [--check-only] --json

Error Handling

Error Types

Real-World Error Examples

Agent Integration Examples

Design Principles

Exit Codes

Version History

31 KiB

Raw Blame History

Integrity Check (`health` field)

Runtime Compatibility Check (`runtime_compatible` field)

`mlxk-json list [pattern] --json`

`mlxk-json health [pattern] --json`

`mlxk-json show <model> --json`

`mlxk-json pull <model> --json`

`mlxk-json rm <model> [--force] --json`

`mlxk-json clone <model> <target_dir> --json`

`mlxk-json push <dir> <org/model> [--create] [--private] [--branch <b>] [--commit "..."] [--verbose] [--check-only] --json`