Question 1

What is Grok Imagine?

Accepted Answer

Grok Imagine is xAI's multimodal generation model supporting text-to-image, image-to-image, text-to-video, and image-to-video. Videos automatically include synchronized audio.

Question 2

Does Grok Imagine support video generation?

Accepted Answer

Yes! Grok Imagine supports text-to-video and image-to-video — generating 6-second clips at 480p or 720p with automatically synchronized background audio.

Question 3

What are the generation modes?

Accepted Answer

Three modes: Normal for standard results, Fun for expressive creative takes, and Spicy Mode for more intense interpretations. Note: Spicy mode is not available when using external image inputs for video.

Question 4

What aspect ratios does Grok Imagine support?

Accepted Answer

Image aspect ratios: 1:1 and various portrait/landscape formats. Video: 2:3, 3:2, 1:1, 16:9, and 9:16.

Grok Imagine

Key Features

Text & Image to Video

Synchronized Audio

Creative Generation Modes

Image Generation & Editing

How to Use

Choose Output Type

Select Mode

Generate

Frequently Asked Questions

Start creating with Grok Imagine

Explore More AI Features