ImageGeneration​Image​Generation

Generate an image with LLMs

Generate images with LLMs using a natural language prompt.

yaml
type: "io.kestra.plugin.ai.completion.ImageGeneration"

Generate an image using OpenAI (DALL-E 3)

yaml
id: image_generation
namespace: company.ai

tasks:
  - id: image_generation
    type: io.kestra.plugin.ai.completion.ImageGeneration
    prompt: >
      Four-panel comic page about a data engineer shipping a workflow.
      Clean modern line art with soft colors and ample white space.
      Panel 1: Early morning desk setup with dual monitors, coffee, and a workflow DAG on screen; calm focused mood.
      Panel 2: Debugging a failing task; close-up of terminal and error icon; speech bubble: "hmm…"
      Panel 3: Fix applied; green checks ripple through the pipeline; small celebratory detail (cat paw, fist pump).
      Panel 4: Deployed dashboard showing metrics trending up; sticky note says "ship it".
      Include subtle tech props (cloud icons, database cylinder) but no logos.
      Minimal readable text only in tiny bubbles/notes; no large paragraphs of text.
    provider:
      type: io.kestra.plugin.ai.provider.OpenAI
      apiKey: "{{ kv('OPENAI_API_KEY') }}"
      modelName: dall-e-3
Properties

Image prompt

The input prompt for the image generation model

Language Model Provider

Possible Values
STOPLENGTHTOOL_EXECUTIONCONTENT_FILTEROTHER

Finish reason

Generated image URL

The URL of the generated image

Token usage

API endpoint

The Azure OpenAI endpoint in the format: https://{resource}.openai.azure.com/

Model name

API Key

Client ID

Client secret

API version

Tenant ID

Endpoint URL

Project location

Model name

Project ID

API Key

Model name

API Key

Model name

API base URL

Model endpoint

Model name

API Key

Model name

API base URL

API Key

Model name

Default https://api.deepseek.com/v1

API base URL

AWS Access Key ID

Model name

AWS Secret Access Key

Default COHERE
Possible Values
COHERETITAN

Amazon Bedrock Embedding Model Type

API Key

Model name