Generate 3D models (Hunyuan), music (ACE-Step), and sound effects (MireloSFX).
The Hunyuan model generates 3D models from text descriptions. Generated models can be previewed in the dashboard's built-in 3D viewer and downloaded in standard formats.
| Parameter | Type | Required | Description |
|---|---|---|---|
| prompt | string | Yes | Text description of the 3D model. |
| format | enum | No | Output format: "glb" (default), "obj", or "fbx". |
Example prompt: "A low-poly medieval castle with a drawbridge, stone walls, and a red flag on the tallest tower."
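The parameter table above can be checked on the client before a request is sent. The following is a minimal sketch, not the platform's API: `ModelFormat` and `build_model_request` are hypothetical names introduced for illustration, and only the parameter names, the allowed formats, and the default come from the table.

```python
from enum import Enum


class ModelFormat(str, Enum):
    """Output formats listed in the parameter table."""
    GLB = "glb"
    OBJ = "obj"
    FBX = "fbx"


def build_model_request(prompt: str, format: str = "glb") -> dict:
    """Return a parameter dict for a 3D generation call (hypothetical helper)."""
    if not prompt.strip():
        raise ValueError("prompt is required")
    # ModelFormat(...) raises ValueError for anything other than glb, obj, or fbx.
    return {"prompt": prompt, "format": ModelFormat(format).value}


request = build_model_request(
    "A low-poly medieval castle with a drawbridge, stone walls, "
    "and a red flag on the tallest tower.",
    format="obj",
)
print(request["format"])
```

Using an enum keeps the list of accepted formats in one place, so an unsupported value fails locally instead of at generation time.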
The ACE-Step model composes original music tracks from text descriptions. You can specify genre, mood, tempo, instrumentation, and structure.
| Parameter | Type | Required | Description |
|---|---|---|---|
| prompt | string | Yes | Description of the music to generate. |
| duration | number | No | Track length in seconds (default 30, max 180). |
| format | enum | No | Output format: "mp3" (default) or "wav". |
Example prompt: "An upbeat lo-fi hip hop track with mellow piano chords, a steady drum beat, vinyl crackle, and a jazzy bass line. 90 BPM."
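Because a music prompt combines several attributes, it can help to assemble it from structured fields and validate the other parameters at the same time. This sketch assumes nothing about how the request is actually submitted: `compose_music_prompt` and `build_music_request` are hypothetical helpers, and only the defaults and limits (30 seconds by default, 180 at most, "mp3" or "wav") are taken from the table.

```python
from typing import Optional

MUSIC_MAX_SECONDS = 180  # "max 180" in the parameter table


def compose_music_prompt(genre: str, mood: str, instruments: list,
                         bpm: Optional[int] = None) -> str:
    """Join structured attributes into one text prompt (hypothetical helper)."""
    parts = [f"{mood.capitalize()} {genre} track with {', '.join(instruments)}."]
    if bpm is not None:
        parts.append(f"{bpm} BPM.")
    return " ".join(parts)


def build_music_request(prompt: str, duration: float = 30,
                        format: str = "mp3") -> dict:
    """Validate parameters against the documented limits and return them."""
    if not 0 < duration <= MUSIC_MAX_SECONDS:
        raise ValueError(f"duration must be between 0 and {MUSIC_MAX_SECONDS} seconds")
    if format not in ("mp3", "wav"):
        raise ValueError("format must be 'mp3' or 'wav'")
    return {"prompt": prompt, "duration": duration, "format": format}


prompt = compose_music_prompt(
    genre="lo-fi hip hop",
    mood="upbeat",
    instruments=["mellow piano chords", "a steady drum beat"],
    bpm=90,
)
request = build_music_request(prompt, duration=60)
```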
MireloSFX generates short sound effects from text descriptions. It is useful for game development, video editing, app design, and other creative projects.
| Parameter | Type | Required | Description |
|---|---|---|---|
| prompt | string | Yes | Description of the sound effect. |
| duration | number | No | Duration in seconds (default 3, max 10). |
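Projects often need many sound effects at once, so the duration limits are worth applying in bulk. The sketch below clamps over-long durations to the documented maximum rather than rejecting them, which is a design choice of this example and not documented behavior. `build_sfx_requests` is a hypothetical helper, and the prompts are made up for illustration.

```python
SFX_DEFAULT_SECONDS = 3   # "default 3" in the parameter table
SFX_MAX_SECONDS = 10      # "max 10" in the parameter table


def build_sfx_requests(effects: dict) -> list:
    """Map {prompt: seconds or None} to a list of parameter dicts.

    None falls back to the default duration; values above the maximum
    are clamped to it; non-positive values are rejected.
    """
    requests = []
    for prompt, seconds in effects.items():
        if seconds is None:
            seconds = SFX_DEFAULT_SECONDS
        if seconds <= 0:
            raise ValueError(f"duration for {prompt!r} must be positive")
        requests.append({"prompt": prompt, "duration": min(seconds, SFX_MAX_SECONDS)})
    return requests


# Illustrative prompts only; they do not come from the product documentation.
sfx_requests = build_sfx_requests({
    "A heavy wooden door creaking open": None,
    "Rain on a tin roof": 25,
})
```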
Example prompts:
Agents can use multiple media skills in a single heartbeat. For example, a game asset agent might generate a 3D model, a matching texture image, and associated sound effects all in one task cycle. Assign all the relevant skills to the agent and describe the full scope in the task description.
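One way to keep such a multi-skill task consistent is to record the assigned skills next to the task description and check that nothing the task needs is missing. This is only a sketch: the `AgentTask` class and the skill identifiers (`"hunyuan-3d"`, `"image-generation"`, `"mirelo-sfx"`) are assumptions made for illustration, and the real identifiers may differ.

```python
from dataclasses import dataclass


@dataclass
class AgentTask:
    """A task plus the skills assigned to the agent (hypothetical structure)."""
    agent: str
    skills: list
    description: str

    def missing_skills(self, required: list) -> list:
        """Return the required skills that are not assigned, sorted by name."""
        return sorted(set(required) - set(self.skills))


task = AgentTask(
    agent="game-asset-agent",
    # Skill identifiers are placeholders, not confirmed names.
    skills=["hunyuan-3d", "image-generation", "mirelo-sfx"],
    description=(
        "Create a low-poly medieval castle as a GLB model, a matching stone "
        "texture image, and a drawbridge sound effect."
    ),
)

missing = task.missing_skills(["hunyuan-3d", "image-generation", "mirelo-sfx"])
```

An empty `missing` list means every skill the description relies on has been assigned before the task runs.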