mlx-lm

Here are 20 public repositories matching this topic...

A high-performance API server that exposes OpenAI-compatible endpoints for MLX models. Built with Python and FastAPI, it runs MLX-based vision and language models locally behind an OpenAI-compatible interface.

  • Updated Dec 31, 2025
  • Python

Prompt LLM Bench is a platform that discovers compatible Hugging Face models on the fly, runs reproducible multi-model evaluations, and recommends the best prompt–LLM pair based on accuracy, latency, and resource efficiency.

  • Updated Dec 30, 2025
  • TypeScript
