HappyHorse 1.1 R2V
Image-to-Video • AlibabaAlibaba's HappyHorse 1.1 reference-to-video model. Takes 1-9 reference images (characters and scenes) and a prompt that choreographs them into a single video, keeping each subject's identity consistent. Supports 720P and 1080P output with durations from 3 to 15 seconds.
| Model Info | |
|---|---|
| Terms and License | link ↗ |
| More information | link ↗ |
| Pricing | View pricing in the Cloudflare dashboard ↗ |
Usage
const response = await env.AI.run( 'alibaba/hh1.1-r2v', { prompt: 'The person in image 1 walks through the futuristic city in image 2 and meets the person in image 3.', images: [ 'https://pub-04a6d208d361438ea01b797e6973bd19.r2.dev/catalog/bytedance__seedream-5-lite/portrait-photo-0.jpeg', 'https://pub-04a6d208d361438ea01b797e6973bd19.r2.dev/catalog/google__nano-banana-2/futuristic-city.png', 'https://pub-04a6d208d361438ea01b797e6973bd19.r2.dev/catalog/google__nano-banana-2/high-resolution-portrait.jpg', ], duration: 8, ratio: '16:9', resolution: '1080P', },)console.log(response)curl https://api.cloudflare.com/client/v4/accounts/$CLOUDFLARE_ACCOUNT_ID/ai/run \ --header "Authorization: Bearer $CLOUDFLARE_API_TOKEN" \ --header "Content-Type: application/json" \ --data '{ "model": "alibaba/hh1.1-r2v", "input": { "prompt": "The person in image 1 walks through the futuristic city in image 2 and meets the person in image 3.", "images": [ "https://pub-04a6d208d361438ea01b797e6973bd19.r2.dev/catalog/bytedance__seedream-5-lite/portrait-photo-0.jpeg", "https://pub-04a6d208d361438ea01b797e6973bd19.r2.dev/catalog/google__nano-banana-2/futuristic-city.png", "https://pub-04a6d208d361438ea01b797e6973bd19.r2.dev/catalog/google__nano-banana-2/high-resolution-portrait.jpg" ], "duration": 8, "ratio": "16:9", "resolution": "1080P" }}'{ "state": "Completed", "result": { "video": "https://examples.aig.cloudflare.com/alibaba/hh1.1-r2v/multi-image-reference.mp4" }, "gatewayMetadata": { "keySource": "Unified" }}Parameters
stringrequiredminLength: 1maxLength: 2500arrayrequiredminItems: 1maxItems: 9format: uristringenum: 720P, 1080Pstringenum: 16:9, 9:16, 3:4, 4:3, 1:1, 21:9, 9:21, 5:4, 4:5integerminimum: 3maximum: 15integerminimum: 0maximum: 2147483647booleanstringformat: uri