Music industry terminology
Dynamic mix
Also called: TV mix, Talk-over mix, Dialogue mix
A dynamic mix is a version of a music cue where the mid-frequencies (typically 200 Hz to 3 kHz, the human vocal range) are reduced so dialogue or voiceover can sit cleanly on top. Dynamic mixes are usually delivered alongside the full mix as an ALT version, giving the music editor a ready-made option for any scene where someone is talking.
In film, TV, and ad post-production, the music editor often needs music that does not fight with dialogue. A full mix has rich mid-frequency content (vocals, lead synths, guitar chords) that compete with the spoken voice. A dynamic mix solves the problem at the source by carving out vocal-range space in the music itself, before the editor has to do it under deadline.
Why it matters
On every production with dialogue, the music editor needs to balance music against speech. They have three options: duck the music in real time (which sounds artificial), automate the EQ on every dialogue line (which is tedious), or use a music cue that was already mixed to sit under voice (which is easy). Guess which one wins.
Composers who deliver dynamic mixes proactively make the editor's job dramatically easier. Editors remember composers who deliver dynamic mixes. Composers who only deliver the full mix often do not get repeat work because every cue requires extra editing labor, and editors have memories like elephants for who made their week harder.
How it works
A dynamic mix is created by reducing the mid-frequency content in the music while keeping the low end (kick, bass) and high end (cymbals, percussion, air) intact. The reduction is typically 3 to 6 dB across the 200 Hz to 3 kHz range, with the heaviest cut around 1 to 2 kHz where the human voice has its strongest presence.
There are several common approaches: a static EQ cut on the full bus (simplest), removing the lead vocal or lead melody instrument entirely (most aggressive), or sidechain compression triggered by a placeholder dialogue track (most adaptive but harder to deliver as a static file).
Dynamic mixes are different from instrumentals. An instrumental has the vocals removed but keeps every other element at full level. A dynamic mix may keep the vocals (reduced) but drops the entire mid-range. They are complementary, and library composers often deliver both because covering both bases costs nothing extra at bounce time.
Examples
- A 60-second corporate brand video uses a dynamic mix under the founder voiceover. The music sets warmth and emotion without any of the typical "music too loud, can't hear the words" notes from the client at 4pm on a Friday.
- A trailer composer ships a 90-second hero cue with three ALT mixes: full mix (for the climax with no dialogue), instrumental (for moments where vocals would distract), and dynamic mix (for the act-one section where the trailer narrator is speaking). The editor uses all three across the trailer.
- A library composer's standard delivery for every track includes a full mix, an instrumental, and a dynamic mix as the three core ALT versions. Their tracks get licensed at a higher rate per submission than catalogs that only deliver the full mix and a wing and a prayer.
Common mistakes
- ●Making the dynamic mix too quiet. A dynamic mix should still be musically present at normal listening levels. If you reduce 12 dB across the mid-range, the music feels hollow and weak. Three to six dB is typically right.
- ●Calling an instrumental a dynamic mix. They are different things. Instrumental removes vocals but keeps the full instrumental energy. Dynamic mix carves out vocal-range space across all elements. Mislabeling here costs you trust.
- ●Not labeling clearly. "Track Title (Dynamic).wav" or "Track Title (TV Mix).wav" is the standard. "Track Title v3.wav" is unhelpful and slightly insulting.
- ●Skipping the ALT entirely. If you only deliver the full mix and the production needs music under dialogue, the editor will pass on your cue and license something else. Easy yes, easy no.
How DropCue handles this
DropCue groups dynamic mixes under their parent track automatically. Supervisors see one cue in the playlist and can expand to the dynamic version when their scene has dialogue, without juggling separate files.