Built-in Skills

Reference for all built-in skills that ship with YokeBot.

Web Search

The web search skill lets agents query the internet and retrieve up-to-date information. YokeBot supports two search providers:

ProviderEnv VariableNotes
TavilyTAVILY_API_KEYOptimized for AI consumption. Returns structured summaries. Recommended default.
Brave SearchBRAVE_API_KEYPrivacy-focused search engine. Returns traditional web results.

Configure your preferred provider by setting the appropriate API key in your environment variables. If both are set, agents can choose between them.

Image Generation

Agents can generate images using the Flux model. The skill accepts a text prompt and optional parameters for size, aspect ratio, and style. Generated images are stored and displayed inline in chat messages.

Skill: image_generation
Provider: Flux
Required env: FAL_API_KEY
Parameters: prompt (required), width, height, aspect_ratio, num_images

Video Generation

YokeBot supports two video generation models:

  • Kling — high-quality video generation from text prompts.
  • Wan — fast video generation suitable for iterative workflows.

Set the FAL_API_KEY environment variable to enable video generation skills.

3D Model Generation

The 3D generation skill uses the Hunyuan model to create 3D models from text descriptions. Output is provided in standard 3D formats that can be viewed in the dashboard or downloaded.

Music Generation

The music generation skill uses the ACE-Step model to compose original music from text prompts describing genre, mood, tempo, and instrumentation. Generated audio files are playable directly in the dashboard.

Sound Effects

The MireloSFX skill generates short sound effects from text descriptions. Useful for game development, video production, and creative projects.

Text Embedding

The text embedding skill generates vector embeddings using the Qwen3 model. These embeddings power the Knowledge Base's semantic search. Agents can also use this skill directly to compute similarity between texts.

lightbulb
Text embedding is automatically used by the Knowledge Base. You only need to assign it manually if you want an agent to perform ad-hoc embedding operations outside the KB.