HappyHorse 1.1 R2V

Image-to-Video • Alibaba

Alibaba's HappyHorse 1.1 reference-to-video model. Takes 1-9 reference images (characters and scenes) and a prompt that choreographs them into a single video, keeping each subject's identity consistent. Supports 720P and 1080P output with durations from 3 to 15 seconds.

Model Info
Terms and License	link ↗
More information	link ↗
Pricing	View pricing in the Cloudflare dashboard ↗

const response = await env.AI.run(
  'alibaba/hh1.1-r2v',
  {
    prompt:
      'The person in image 1 walks through the futuristic city in image 2 and meets the person in image 3.',
    images: [
      'https://pub-04a6d208d361438ea01b797e6973bd19.r2.dev/catalog/bytedance__seedream-5-lite/portrait-photo-0.jpeg',
      'https://pub-04a6d208d361438ea01b797e6973bd19.r2.dev/catalog/google__nano-banana-2/futuristic-city.png',
      'https://pub-04a6d208d361438ea01b797e6973bd19.r2.dev/catalog/google__nano-banana-2/high-resolution-portrait.jpg',
    ],
    duration: 8,
    ratio: '16:9',
    resolution: '1080P',
  },
)
console.log(response)

curl https://api.cloudflare.com/client/v4/accounts/$CLOUDFLARE_ACCOUNT_ID/ai/run \
  --header "Authorization: Bearer $CLOUDFLARE_API_TOKEN" \
  --header "Content-Type: application/json" \
  --data '{
  "model": "alibaba/hh1.1-r2v",
  "input": {
    "prompt": "The person in image 1 walks through the futuristic city in image 2 and meets the person in image 3.",
    "images": [
      "https://pub-04a6d208d361438ea01b797e6973bd19.r2.dev/catalog/bytedance__seedream-5-lite/portrait-photo-0.jpeg",
      "https://pub-04a6d208d361438ea01b797e6973bd19.r2.dev/catalog/google__nano-banana-2/futuristic-city.png",
      "https://pub-04a6d208d361438ea01b797e6973bd19.r2.dev/catalog/google__nano-banana-2/high-resolution-portrait.jpg"
    ],
    "duration": 8,
    "ratio": "16:9",
    "resolution": "1080P"
  }
}'

Output
Raw response

{
  "state": "Completed",
  "result": {
    "video": "https://examples.aig.cloudflare.com/alibaba/hh1.1-r2v/multi-image-reference.mp4"
  },
  "gatewayMetadata": {
    "keySource": "Unified"
  }
}

Parameters

Input
Output

prompt

stringrequiredminLength: 1maxLength: 2500

▶images[]

arrayrequiredminItems: 1maxItems: 9format: uri

resolution

stringenum: 720P, 1080P

ratio

stringenum: 16:9, 9:16, 3:4, 4:3, 1:1, 21:9, 9:21, 5:4, 4:5

duration

integerminimum: 3maximum: 15

seed

integerminimum: 0maximum: 2147483647

watermark

boolean

video

stringformat: uri

API Schemas (Raw)

Input

Output