Music Prompts

The Song Node generates a full music track from a short text prompt. Good music prompts are shorter than you’d think — focused, emotionally specific, and anchored in a clear genre. This page covers what actually moves the needle.

Every strong music prompt layers the same elements:

Genre → Mood → Instrumentation → Tempo → Use case

Example: “A slow, melancholic piano melody over ambient synth textures, suitable for a tragic film scene, 70 BPM, instrumental only.”

That’s one sentence, five layers. The model has everything it needs.
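If you build prompts programmatically, the layering can be sketched as a small helper. This is only an illustration: the `MusicPrompt` class and its field names are hypothetical, not part of the Song Node, and the result is just a string you would paste into the prompt field.

```python
from dataclasses import dataclass


@dataclass
class MusicPrompt:
    """Hypothetical container for the five prompt layers."""
    genre: str
    mood: str
    instrumentation: str
    tempo: str
    use_case: str

    def render(self) -> str:
        # Mood and genre lead; the remaining layers follow as short clauses.
        opening = f"{self.mood} {self.genre}"
        opening = opening[:1].upper() + opening[1:]
        return (
            f"{opening} with {self.instrumentation}, "
            f"{self.tempo}, suitable for {self.use_case}."
        )


prompt = MusicPrompt(
    genre="ambient electronic",
    mood="melancholic",
    instrumentation="slow piano melody over synth textures",
    tempo="70 BPM",
    use_case="a tragic film scene",
)
print(prompt.render())
```

Keeping each layer in its own field makes it easy to swap one element (say, the tempo) and regenerate without rewriting the whole sentence.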

Mood descriptors are the most efficient words in a music prompt. “Melancholic,” “triumphant,” “unsettling,” “foreboding,” “dreamy,” “tense,” “serene” — each one bundles harmonic progression, tempo, and instrumentation choices the model makes for you.

You don’t need music-theory vocabulary to prompt music. “Sense of building dread” gives the model more to work with than a chord chart.

Lead with genre. It anchors the sonic world the model draws from:

  • Cinematic orchestral — strings, brass, percussion. Epic, dramatic.
  • Ambient electronic — synth pads, drones, sparse rhythm. Atmospheric.
  • Indie folk — acoustic guitar, soft vocals, banjo, warm room.
  • Jazz — piano, upright bass, brushed drums, horns. Cafe, noir.
  • Hip-hop — 808 drums, bass, sampled textures.
  • Rock / post-rock — electric guitar, live drums, build-and-release dynamics.
  • Lo-fi — warm analog textures, tape hiss, relaxed tempo.

Blending is fine: “Ambient electronic with jazz piano influence” works. “Lo-fi hip-hop with orchestral strings” works.

Name specific instruments when you want them. “Solo piano” means the piano leads. “808 drums and sub-bass” sets a specific low end. “String quartet” is not the same as “orchestral strings.”

Tempo can be given as BPM or as feel:

  • Slow / 60–80 BPM — ballads, reflective, tragic
  • Mid / 90–110 BPM — conversational, warm, everyday
  • Upbeat / 120–140 BPM — action, energetic, driving
  • Fast / 140+ BPM — chase, tension, adrenaline
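The tempo bands above can be kept as a small lookup if you want consistent wording across prompts. A minimal sketch: the `TEMPO_BANDS` table and `tempo_phrase` function are hypothetical names, and choosing the midpoint of each band is an assumption made here, not something the model requires.

```python
from typing import Dict, Optional, Tuple

# Bands copied from the list above; None marks an open-ended upper bound.
TEMPO_BANDS: Dict[str, Tuple[int, Optional[int]]] = {
    "slow": (60, 80),
    "mid": (90, 110),
    "upbeat": (120, 140),
    "fast": (140, None),
}


def tempo_phrase(feel: str) -> str:
    """Turn a tempo feel into a phrase that carries both feel and BPM."""
    key = feel.strip().lower()
    low, high = TEMPO_BANDS[key]
    if high is None:
        return f"{key} tempo, {low}+ BPM"
    # Assumption: aim for the middle of the band.
    return f"{key} tempo, around {(low + high) // 2} BPM"


print(tempo_phrase("slow"))
print(tempo_phrase("Fast"))
```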

Music in a scene rarely stays at one intensity. Direct the arc:

  • “Builds from tense to explosive over thirty seconds”
  • “Starts quiet and minimal, adds layers as it progresses”
  • “Steady throughout, no dynamic shifts”
  • “Resolves to silence in the final five seconds”

Arc descriptions work best for longer tracks. Short cues (under fifteen seconds) don’t need one — the model won’t have room to develop it.
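
The fifteen-second rule is easy to encode if you assemble prompts in a script. A rough sketch, with `with_arc` as a hypothetical helper that only appends the arc when the cue is long enough to develop it:

```python
SHORT_CUE_SECONDS = 15  # threshold taken from the guidance above


def with_arc(prompt: str, arc: str, duration_seconds: float) -> str:
    """Append an arc description unless the cue is too short to use it."""
    if duration_seconds < SHORT_CUE_SECONDS:
        return prompt
    base = prompt.rstrip().rstrip(".")
    return f"{base}. {arc.strip().rstrip('.')}."


print(with_arc("Tense thriller strings, 120 BPM.",
               "Builds from tense to explosive over thirty seconds", 45))
print(with_arc("Tense thriller strings, 120 BPM.",
               "Builds from tense to explosive over thirty seconds", 10))
```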

Common mistakes to avoid:

  • Contradictory moods. “Energetic but calm” or “aggressive yet peaceful” forces the model to reconcile opposites. Pick one primary direction.
  • Artist imitation. “Sound exactly like [artist]” produces inconsistent results. Describe the qualities you want instead: “Warm analog synths, reverbed vocals, dreamy feel.”
  • Keyword stacking. “Best, high quality, studio grade, professional mastering” does nothing. The model already defaults to high fidelity.
  • Too much structure for a short cue. A fifteen-second bed doesn’t need a verse-chorus-bridge instruction. Save structural direction for longer pieces.
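
Some of these pitfalls can be caught before you spend a generation. The following is a rough sketch of a prompt check. The `lint_prompt` function and its word lists are hypothetical and deliberately small: they cover only the examples named above, and simple pattern matching will miss many real cases.

```python
import re
from typing import List

# Word lists drawn from the examples above; extend them for your own use.
CONTRADICTORY_PAIRS = [("energetic", "calm"), ("aggressive", "peaceful")]
FILLER_KEYWORDS = ["best", "high quality", "studio grade", "professional mastering"]
IMITATION_PATTERNS = [r"\bsounds? (exactly )?like\b", r"\bin the style of\b"]


def lint_prompt(prompt: str) -> List[str]:
    """Return a list of warnings for common music-prompt mistakes."""
    text = prompt.lower()
    warnings: List[str] = []
    for first, second in CONTRADICTORY_PAIRS:
        if re.search(rf"\b{first}\b", text) and re.search(rf"\b{second}\b", text):
            warnings.append(f"contradictory moods: {first} / {second}")
    for keyword in FILLER_KEYWORDS:
        if re.search(rf"\b{re.escape(keyword)}\b", text):
            warnings.append(f"filler keyword: {keyword}")
    if any(re.search(pattern, text) for pattern in IMITATION_PATTERNS):
        warnings.append("possible artist imitation")
    return warnings


print(lint_prompt("Energetic but calm lo-fi, best, high quality"))
print(lint_prompt("Slow, melancholic piano over ambient synths, 70 BPM"))
```

An empty list means none of these patterns matched, not that the prompt is good.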

By default, music prompts produce instrumental tracks. If you want vocals:

  • Write “with vocals” and describe the vocal quality (“warm female voice,” “raspy male tenor,” “choral”).
  • Or provide lyrics. The model structures vocals to match.

For instrumental scoring, explicitly write “instrumental only” — otherwise some models add vocal elements you didn’t ask for.
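
If you generate prompts from a script, it helps to make the vocal decision explicit every time. A minimal sketch, assuming a hypothetical `apply_vocal_direction` helper that either adds a vocal description or guarantees the phrase “instrumental only” is present:

```python
from typing import Optional


def apply_vocal_direction(prompt: str, vocal_quality: Optional[str] = None) -> str:
    """Append a vocal description, or make sure the prompt says 'instrumental only'."""
    base = prompt.rstrip().rstrip(".")
    if vocal_quality:
        return f"{base}, with vocals ({vocal_quality})."
    if "instrumental only" in base.lower():
        return f"{base}."  # already explicit, nothing to add
    return f"{base}, instrumental only."


print(apply_vocal_direction("Warm indie folk, acoustic guitar, 95 BPM"))
print(apply_vocal_direction("Warm indie folk, acoustic guitar, 95 BPM", "warm female voice"))
```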

Use the visual mood of your storyboard to guide the music prompt. A noir scene calls for different music than a warm family memory, even if the beats are the same length.

Visual mood | Music direction
Dark, moody, noir | Sparse piano, upright bass, brushed drums, low ambient drone
Bright, upbeat | Major-key acoustic, steady rhythm, warm strings
Tense, thriller | Staccato strings, deep bass pulse, percussive stabs
Dreamy, ethereal | Reverbed synth pads, gentle chimes, slow movement
Action, chase | Driving percussion, orchestral brass, 140 BPM
Reflective | Solo piano or acoustic guitar, slow tempo, minor key
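
The same pairings can be kept as a lookup so storyboard tags map to a starting music direction. This is a sketch only: `MOOD_TABLE` and `direction_for` are hypothetical names, and the entries simply restate the table above.

```python
from typing import List, Optional, Tuple

# Each entry: (visual mood tags, music direction), restating the table above.
MOOD_TABLE: List[Tuple[Tuple[str, ...], str]] = [
    (("dark", "moody", "noir"),
     "sparse piano, upright bass, brushed drums, low ambient drone"),
    (("bright", "upbeat"),
     "major-key acoustic, steady rhythm, warm strings"),
    (("tense", "thriller"),
     "staccato strings, deep bass pulse, percussive stabs"),
    (("dreamy", "ethereal"),
     "reverbed synth pads, gentle chimes, slow movement"),
    (("action", "chase"),
     "driving percussion, orchestral brass, 140 BPM"),
    (("reflective",),
     "solo piano or acoustic guitar, slow tempo, minor key"),
]


def direction_for(visual_mood: str) -> Optional[str]:
    """Return a starting music direction for a visual mood tag, or None if unknown."""
    key = visual_mood.strip().lower()
    for tags, direction in MOOD_TABLE:
        if key in tags:
            return direction
    return None


print(direction_for("Noir"))
print(direction_for("chase"))
```

Treat the returned text as a first draft of the instrumentation layer, then add genre, tempo, and use case around it.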

How long can a generated track be? Length depends on your tier and the model. For most scenes you’ll generate thirty seconds to three minutes at a time. Chain tracks or loop a short cue in the Timeline for longer sequences.
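
To estimate how many tracks or loops a long sequence needs, a quick calculation is enough. A small sketch: `plan_segments` is a hypothetical helper, and the durations passed in are example values, since actual limits depend on your tier and model.

```python
import math


def plan_segments(sequence_seconds: float, track_seconds: float) -> int:
    """Number of chained tracks (or loops of one cue) needed to cover a sequence."""
    if sequence_seconds <= 0 or track_seconds <= 0:
        raise ValueError("durations must be positive")
    return math.ceil(sequence_seconds / track_seconds)


# A ten-minute sequence covered by three-minute tracks.
print(plan_segments(600, 180))
# A forty-five-second scene covered by looping a fifteen-second cue.
print(plan_segments(45, 15))
```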

Can I specify BPM exactly? Yes. “120 BPM” will pull the model toward that tempo. It’s an approximation — expect to land within five BPM of the number you give.

Why did my prompt produce vocals when I didn’t ask for them? Some models default to adding vocal elements. Explicitly write “instrumental only” in the prompt to force a pure instrumental.

Can I generate music that sounds like a specific artist? Not reliably, and not for commercial release — imitation prompts are inconsistent and legally murky. Describe the qualities you want (instrumentation, feel, era, production style) instead of naming an artist.

What if the music doesn’t fit my scene? Regenerate with a tighter mood direction, or adjust the genre. Music is the element most often regenerated, so expect to try two or three versions before one lands.

Can I edit the music after it’s generated? You can trim, loop, and fade on the Timeline. Deeper edits (changing an instrument, shifting structure) are not currently supported after generation; regenerate with an adjusted prompt instead.