ds4.pinokio is a new launcher and browser interface that brings the massive DeepSeek V4 Flash AI model to Apple Silicon Macs. It builds on the ds4.c Metal-only inference engine created […]
Tools
About AI tool releases
Latest AI tools
Coming across tokens-per-second benchmarks is easy, but truly understanding what "47 tok/s" feels like while you work is much harder. A new open-source tool called Tokenspeed solves this problem by […]
ExLlamaV3 is an inference library that lets you run large language models on consumer graphics cards. It introduces the EXL3 quantization format, which compresses models to very low bitrates while […]
The new release, needle, is a tiny 26-million parameter open-source AI model purpose-built for function calling, or tool use. It interprets a user's plain text query and outputs a structured […]
Lucebox-hub is a collection of hand-tuned LLM inference servers that push consumer GPUs to their limits. The latest release adds DFlash speculative decoding and PFlash speculative prefill for AMD Ryzen […]
Derpy-Turtle-The-Kokoro-Trainer is a Windows GUI that blends Kokoro’s text-to-speech with RVC voice conversion to build better local voice clones. It lets you search for and refine Kokoro voice tensors, train […]
TextGen is a desktop application that runs large language models locally on your own computer. The latest update transforms the project from a web interface into a no-install portable app […]
Merlin-community is the free, open-core release of a deduplication engine that strips repeated text chunks from AI prompts before they reach the model. The tool now ships with a transparent […]
A small, privacy-first web tool called AI Metadata Viewer now gives anyone a quick way to read all the hidden creation data tucked inside AI-generated images. You simply drag a […]