generate images & video, built in
Text → Image, Image Edit, Text → Video,
Image → Video — across Gemini, OpenAI gpt-image-2,
Qwen, Veo, and DashScope HappyHorse. Pick a model, set the resolution,
generate; results drop into a live gallery you can click to reuse as the
next source. The same tools are callable by the agent from chat.