Originally published at deepu.tech.
In my recent post about my fully offline AI-assisted Linux development machine, I dropped a small detail near the bottom. I run my local model with an alias.
llamaServer
Enter fullscreen mode
Exit fullscreen mode
A fast TUI, CLI, daemon, and OpenAI-compatible proxy for running local LLMs via llama.cpp, in one Rust binary
Originally published at deepu.tech.
In my recent post about my fully offline AI-assisted Linux development machine, I dropped a small detail near the bottom. I run my local model with an alias.
llamaServer
Enter fullscreen mode
Exit fullscreen mode

A reproducible benchmark of LlamaStash against raw llama-server, Ollama, and LM Studio on AMD APU, Apple Silicon, and NVIDIA

LLM-Manager is a lightweight, modular Bash suite with a dual JSON/Interactive interface designed to...

A Blog post by ggml-org on Hugging Face

We'll show you how to train Alpaca, a fine-tuned version of LLaMA that can respond to instructions like ChatGPT.

A Blog post by ggml-org on Hugging Face

I Built a 12MB Desktop App for Running Uncensored AI Models Locally (Tauri + Rust + Ollama)...