Foundation-1 Crafts Structured Loops for Producers

Foundation-1 is a text-to-sample model built for structured music production. It generates tempo-synced, key-aware loops that slot directly into production workflows instead of producing generic audio textures.
RoyalCities developed this tool to give producers separate control over instrumentation, timbre, effects, and musical notation. The layered approach creates samples that fit specific roles in a mix rather than random audio clips.
Model Size: 2.43GB & VRAM GPU: 8GB required
Sample generation with musical intent
- Produces 4-bar and 8-bar loops at BPM values from 100 to 150.
- Locks to major and minor keys across western music theory.
- Separates instrument identity from tonal and textural character.
- Responds to FX tags including reverb, delay, distortion, and modulation.
- Accepts notation-style prompts to shape phrasing and rhythm.
Electronic music producers can generate bass lines, synth leads, pads, and melodic elements that integrate immediately into their projects. Sound designers building sample packs or custom libraries may find the timbral control useful for creating cohesive sound collections.
Running locally on consumer hardware
The model requires approximately 7GB of VRAM during generation, with 8GB recommended for reliable operation. On an RTX 3090, sample generation takes roughly 7-8 seconds. This release provides only a 16-bit version, reducing file size without quality loss compared to prior dual-format releases.
RoyalCities emphasizes that prompt quality significantly affects output. The documentation states that
'structured layered prompts outperform vague natural language.'
Users should learn the tagging vocabulary for best results. The model works best with the RC Stable Audio Fork interface, which handles timing alignment between bar length, BPM, and generation duration. It is optimized for sample generation rather than full songs or percussion. The Stability AI Community License permits non-commercial use and limited commercial use for entities earning under $1 million annually.
Download Foundation-1 on Hugging Face.