> ## Documentation Index
> Fetch the complete documentation index at: https://veniceai-mintlify-d2fddb8a.mintlify.site/llms.txt
> Use this file to discover all available pages before exploring further.

# Music & Sound Effects Models

> Venice music and audio models for AI-generated songs, instrumental tracks, and sound effects synthesis, with model IDs, pricing, and prompt guidance.

<div id="model-search-placeholder" data-filter="music">Loading models...</div>

## Model Categories

**Song Generation:** Create full songs with optional lyrics and vocal support

* ACE-Step 1.5, ElevenLabs Music, MiniMax Music 2.0

**Music & Sound Effects:** Generate instrumental music or sound effects from text prompts

* Stable Audio 2.5

**Sound Effects:** Synthesize audio effects and ambient sounds from text prompts

* ElevenLabs Sound Effects, MMAudio V2

<Tip>
  ElevenLabs Music is the only model that supports `force_instrumental` to generate music without vocals.
</Tip>

<Note>
  Audio generation uses an async queue system. See the [Audio Queue API](/api-reference/endpoint/audio/queue) to start generation and [Audio Retrieve API](/api-reference/endpoint/audio/retrieve) to fetch results.
</Note>

## Pricing

Pricing varies by model:

* **Per-generation:** Fixed price per audio clip (MiniMax Music 2.0, Stable Audio 2.5)
* **Duration-tiered:** Price scales with duration tier (ElevenLabs Music, ACE-Step 1.5)
* **Per-second:** Price based on output duration (ElevenLabs Sound Effects, MMAudio V2)

For exact quotes before generation, use the [Audio Quote API](/api-reference/endpoint/audio/quote).

### Duration-Tiered Pricing

Models with duration-tiered pricing accept any `duration_seconds` within the model's `min_duration`–`max_duration` range. The price is determined by which tier the requested duration falls into. Tier ranges are returned in the `/models` response under `pricing.durations`, with `min_seconds` and `max_seconds` for each tier.

For example, ElevenLabs Music accepts 3–600 seconds (up to 10 minutes) at \$0.75 per minute, rounded up to the nearest minute:

| Duration Range | Tier Key | Base Price |
| -------------- | -------- | ---------- |
| 3–60s          | `60`     | \$0.75     |
| 61–120s        | `120`    | \$1.50     |
| 121–180s       | `180`    | \$2.25     |
| 181–240s       | `240`    | \$3.00     |
| 241–300s       | `300`    | \$3.75     |
| 301–360s       | `360`    | \$4.50     |
| 361–420s       | `420`    | \$5.25     |
| 421–480s       | `480`    | \$6.00     |
| 481–540s       | `540`    | \$6.75     |
| 541–600s       | `600`    | \$7.50     |

These are base prices before markup. Use the [Audio Quote API](/api-reference/endpoint/audio/quote) to get the exact price you will be charged.

## Key Parameters

| Parameter            | Description                                                                   |
| -------------------- | ----------------------------------------------------------------------------- |
| `prompt`             | Text description of the audio to generate                                     |
| `lyrics_prompt`      | Song lyrics for vocal models (required when model has `lyrics_required=true`) |
| `duration_seconds`   | Output length in seconds                                                      |
| `force_instrumental` | Generate without vocals (where supported)                                     |
