Step Types Reference

Overview

Step types extend Dagu's capabilities beyond simple shell commands. Available step types:

  • Shell (default) - Execute shell commands
  • Docker - Run commands in Docker containers
  • SSH - Execute commands on remote hosts
  • S3 - S3 operations (upload, download, list, delete)
  • HTTP - Make HTTP requests
  • Chat - Execute LLM requests (OpenAI, Anthropic, Gemini, etc.)
  • Archive - Extract, create, and list archive files
  • Mail - Send emails
  • JQ - Process JSON data
  • Redis - Execute Redis commands and operations
  • HITL - Human-in-the-loop approval gates
  • GitHub Actions (experimental) - Run marketplace actions locally with nektos/act

TIP

For detailed documentation on each step type, click the links above to visit the feature pages.

Shell (Default)

INFO

For detailed Shell step type documentation, see Shell Guide.

The default step type runs commands in the system shell. Set a DAG-level shell to choose the shell program and its flags once; steps inherit that shell unless they specify their own.

yaml
shell: ["/bin/bash", "-e"]  # Default shell for the workflow
steps:
  - command: echo "Hello World"
    
  - command: echo $BASH_VERSION   # Uses DAG shell
  - shell: /usr/bin/zsh           # Step-level override
    command: echo "Uses zsh"

Shell Selection

yaml
steps:
  - name: default-shell
    command: echo "Uses DAG shell or system default"
    
  - name: bash-specific
    shell: ["bash", "-e", "-u"]   # Array form for flags
    command: echo "Uses bash features"
    
  - name: custom-shell
    shell: /usr/bin/zsh
    command: echo "Uses zsh"

Docker

INFO

For detailed Docker step type documentation, see Docker Guide.

Run commands in Docker containers for isolation and reproducibility. The container field supports two modes:

  • Image mode: Create a new container from a Docker image
  • Exec mode: Execute commands in an already-running container

Image Mode (Create New Container)

Use the container field to run a step in its own container:

yaml
steps:
  - name: run-in-container
    container:
      image: alpine:latest
    command: echo "Hello from container"

TIP

The container is automatically removed after execution. Set keepContainer: true to preserve it.
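
For example, a minimal sketch that keeps a container around for post-run inspection (step and image names are illustrative):

yaml
steps:
  - name: debug-run
    container:
      image: alpine:latest
      keepContainer: true   # skip automatic removal so the container can be inspected later
    command: ls /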

Exec Mode (Use Existing Container)

Execute commands in an already-running container:

yaml
steps:
  # String form - exec with container's defaults
  - name: run-migration
    container: my-app-container
    command: php artisan migrate

  # Object form with overrides
  - name: admin-task
    container:
      exec: my-app-container
      user: root
      workingDir: /app
    command: chown -R app:app /data

Exec mode is ideal for running commands in containers started by Docker Compose or other orchestration tools.
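
For instance, a step can exec into a service container started by Docker Compose (the container name below is illustrative; Compose typically names containers <project>-<service>-1):

yaml
steps:
  - name: seed-database
    container: myproject-db-1   # illustrative Compose-generated container name
    command: psql -U app -d appdb -f /seed.sql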

Image Pull Options

yaml
steps:
  - name: pull-always
    container:
      image: myapp:latest
      pullPolicy: always      # Always pull from registry
    command: ./app

  - name: pull-if-missing
    container:
      image: myapp:latest
      pullPolicy: missing     # Default - pull only if not local
    command: ./app

  - name: never-pull
    container:
      image: local-image:dev
      pullPolicy: never       # Use local image only
    command: ./test

Registry Authentication

yaml
# Configure authentication for private registries
registryAuths:
  docker.io:
    username: ${DOCKER_USERNAME}
    password: ${DOCKER_PASSWORD}
  ghcr.io:
    username: ${GITHUB_USER}
    password: ${GITHUB_TOKEN}

steps:
  - name: use-private-image
    container:
      image: ghcr.io/myorg/private-app:latest
    command: echo "Running"

Authentication can also be configured via DOCKER_AUTH_CONFIG environment variable.
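
A sketch of that approach, assuming DOCKER_AUTH_CONFIG takes the standard Docker config.json content (the credential value is a placeholder for a base64-encoded user:token pair):

yaml
env:
  - DOCKER_AUTH_CONFIG: |
      {
        "auths": {
          "ghcr.io": { "auth": "${BASE64_USER_AND_TOKEN}" }
        }
      }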

Volume Mounts

yaml
steps:
  - name: with-volumes
    container:
      image: python:3.13
      volumes:
        - /host/data:/container/data:ro      # Read-only
        - /host/output:/container/output:rw  # Read-write
        - ./config:/app/config               # Relative path
    command: python process.py /container/data

Environment Variables

yaml
env:
  - API_KEY: secret123

steps:
  - name: with-env
    container:
      image: node:22
      env:
        - NODE_ENV=production
        - API_KEY=${API_KEY}  # Pass from DAG env
        - DB_HOST=postgres
    command: npm start

Network Configuration

yaml
steps:
  - name: custom-network
    container:
      image: alpine
      network: my-network
    command: ping other-service

Platform Selection

yaml
steps:
  - name: specific-platform
    container:
      image: myapp:latest
      platform: linux/amd64  # Force platform
    command: ./app

Working Directory

yaml
steps:
  - name: custom-workdir
    container:
      image: python:3.13
      workingDir: /app
      env:
        - PYTHONPATH=/app
      volumes:
        - ./src:/app
    command: python main.py

Complete Docker Example

yaml
steps:
  - name: run-postgres
    container:
      name: test-db
      image: postgres:17
      pullPolicy: missing
      platform: linux/amd64
      keepContainer: true
      env:
        - POSTGRES_USER=test
        - POSTGRES_PASSWORD=test
        - POSTGRES_DB=testdb
      volumes:
        - postgres-data:/var/lib/postgresql/data
      ports:
        - "127.0.0.1:5432:5432"
      network: bridge
    command: postgres

SSH

INFO

For detailed SSH step type documentation, see SSH Guide.

Execute commands on remote hosts over SSH.

Basic SSH

yaml
steps:
  - name: remote-command
    type: ssh
    config:
      user: deploy
      host: server.example.com
      port: 22
      key: /home/user/.ssh/id_rsa
    command: ls -la /var/www

With Environment

yaml
steps:
  - name: remote-with-env
    type: ssh
    config:
      user: deploy
      host: 192.168.1.100
      key: ~/.ssh/deploy_key
    command: |
      export APP_ENV=production
      cd /opt/app
      echo "Deploying"

Multiple Commands

yaml
steps:
  - name: remote-script
    type: ssh
    config:
      user: admin
      host: backup.server.com
      key: ${SSH_KEY_PATH}
    script: |
      #!/bin/bash
      set -e
      
      echo "Starting backup..."
      tar -czf /backup/app-$(date +%Y%m%d).tar.gz /var/www
      
      echo "Cleaning old backups..."
      find /backup -name "app-*.tar.gz" -mtime +7 -delete
      
      echo "Backup complete"

S3

INFO

For detailed S3 step type documentation, see S3 Guide.

Execute S3 operations including upload, download, list, and delete. Supports AWS S3 and S3-compatible services (MinIO, GCS, DigitalOcean Spaces).

DAG-Level Configuration

yaml
s3:
  region: us-east-1
  accessKeyId: ${AWS_ACCESS_KEY_ID}
  secretAccessKey: ${AWS_SECRET_ACCESS_KEY}
  bucket: my-bucket

steps:
  - name: upload-file
    type: s3
    config:
      key: data/file.txt
      source: /tmp/file.txt
    command: upload

Upload

yaml
steps:
  - name: upload-report
    type: s3
    config:
      bucket: my-bucket
      key: reports/daily.csv
      source: /tmp/report.csv
      contentType: text/csv
      storageClass: STANDARD_IA
    command: upload

Download

yaml
steps:
  - name: download-config
    type: s3
    config:
      bucket: my-bucket
      key: config/settings.json
      destination: /tmp/settings.json
    command: download

List Objects

yaml
steps:
  - name: list-logs
    type: s3
    config:
      bucket: my-bucket
      prefix: logs/2024/
      maxKeys: 100
      recursive: true
    command: list
    output: OBJECTS
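
A downstream step can then consume the listing like any other captured output (continuing the workflow above; the exact format of OBJECTS depends on the list command's output):

yaml
  - name: print-listing
    command: echo "${OBJECTS}"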

Delete

yaml
steps:
  # Single object
  - name: delete-file
    type: s3
    config:
      bucket: my-bucket
      key: temp/old-file.txt
    command: delete

  # Batch delete by prefix
  - name: cleanup
    type: s3
    config:
      bucket: my-bucket
      prefix: logs/2023/
    command: delete

S3-Compatible Services

yaml
# MinIO
s3:
  endpoint: http://localhost:9000
  accessKeyId: minioadmin
  secretAccessKey: minioadmin
  bucket: my-bucket
  forcePathStyle: true

# Google Cloud Storage
s3:
  endpoint: https://storage.googleapis.com
  accessKeyId: ${GCS_HMAC_KEY}
  secretAccessKey: ${GCS_HMAC_SECRET}
  bucket: my-gcs-bucket

HTTP

INFO

For detailed HTTP step type documentation, see HTTP Guide.

Make HTTP requests to APIs and web services.

GET Request

yaml
steps:
  - name: simple-get
    type: http
    config:
      silent: true  # Output body only
    command: GET https://api.example.com/status

POST with Body

yaml
steps:
  - name: post-json
    type: http
    config:
      headers:
        Content-Type: application/json
        Authorization: Bearer ${API_TOKEN}
      body: |
        {
          "name": "test",
          "value": 123
        }
      timeout: 30
    command: POST https://api.example.com/data

Query Parameters

yaml
steps:
  - name: search-api
    type: http
    config:
      query:
        q: "dagu workflow"
        limit: "10"
        offset: "0"
      silent: true
    command: GET https://api.example.com/search

Form Data

yaml
steps:
  - name: form-submit
    type: http
    config:
      headers:
        Content-Type: application/x-www-form-urlencoded
      body: "username=user&password=pass&remember=true"
    command: POST https://example.com/login

Self-Signed Certificates

yaml
steps:
  - name: internal-api
    type: http
    config:
      skipTLSVerify: true  # Skip certificate verification
      headers:
        Authorization: Bearer ${INTERNAL_TOKEN}
    command: GET https://internal-api.local/data

Complete HTTP Example

yaml
steps:
  - name: api-workflow
    type: http
    config:
      headers:
        Accept: application/json
        X-API-Key: ${API_KEY}
      timeout: 60
      silent: false
    command: GET https://api.example.com/data
    output: API_RESPONSE
    
  - name: process-response
    command: echo "${API_RESPONSE}" | jq '.data[]'

Archive

INFO

For detailed Archive step type documentation, see Archive Guide.

Manipulate archives without shelling out to tar, zip, or other external tools.

Extract Archive

yaml
steps:
  - name: unpack
    type: archive
    config:
      source: logs.tar.gz
      destination: ./logs
      verifyIntegrity: true
    command: extract

Create Archive

yaml
steps:
  - name: package
    type: archive
    config:
      source: ./logs
      destination: logs-backup.tar.gz
      include:
        - "**/*.log"
    command: create

List Contents

yaml
steps:
  - name: inspect
    type: archive
    config:
      source: logs-backup.tar.gz
    command: list
    output: ARCHIVE_INDEX

Mail

INFO

For detailed Mail step type documentation, see Mail Guide.

Send emails for notifications and alerts.

Basic Email

yaml
smtp:
  host: smtp.gmail.com
  port: "587"
  username: sender@gmail.com
  password: ${SMTP_PASSWORD}

steps:
  - name: send-notification
    type: mail
    config:
      to: recipient@example.com
      from: sender@gmail.com
      subject: "Workflow Completed"
      message: "The data processing workflow has completed successfully."

With Attachments

yaml
steps:
  - name: send-report
    type: mail
    config:
      to: team@company.com
      from: reports@company.com
      subject: "Daily Report - ${TODAY}"
      message: |
        Please find attached the daily report.

        Generated at: ${TIMESTAMP}
      attachments:
        - /tmp/daily-report.pdf
        - /tmp/summary.csv

Multiple Recipients

yaml
steps:
  - name: alert-team
    type: mail
    config:
      to:
        - ops@company.com
        - alerts@company.com
        - oncall@company.com
      from: dagu@company.com
      subject: "[ALERT] Process Failed"
      message: |
        The critical process has failed.

        Error: ${ERROR_MESSAGE}
        Time: ${TIMESTAMP}

HTML Email

yaml
steps:
  - name: send-html
    type: mail
    config:
      to: marketing@company.com
      from: notifications@company.com
      subject: "Weekly Stats"
      contentType: text/html
      message: |
        <html>
        <body>
          <h2>Weekly Statistics</h2>
          <p>Users: <strong>${USER_COUNT}</strong></p>
          <p>Revenue: <strong>${REVENUE}</strong></p>
        </body>
        </html>

JQ

INFO

For detailed JQ step type documentation, see JQ Guide.

Process and transform JSON data using jq syntax.

Raw Output

Set config.raw: true to mirror jq's -r flag and emit unquoted primitives.

yaml
steps:
  - name: list-emails
    type: jq
    config:
      raw: true
    command: '.data.users[].email'
    script: |
      {
        "data": {
          "users": [
            {"email": "user1@example.com"},
            {"email": "user2@example.com"}
          ]
        }
      }

Output:

text
user1@example.com
user2@example.com

Format JSON

yaml
steps:
  - name: pretty-print
    type: jq
    script: |
      {"name":"test","values":[1,2,3],"nested":{"key":"value"}}

Output:

json
{
  "name": "test",
  "values": [1, 2, 3],
  "nested": {
    "key": "value"
  }
}

Query JSON

yaml
steps:
  - name: extract-value
    type: jq
    command: '.data.users[] | select(.active == true) | .email'
    script: |
      {
        "data": {
          "users": [
            {"id": 1, "email": "user1@example.com", "active": true},
            {"id": 2, "email": "user2@example.com", "active": false},
            {"id": 3, "email": "user3@example.com", "active": true}
          ]
        }
      }

Output:

"user1@example.com"
"user3@example.com"

Transform JSON

yaml
steps:
  - name: transform-data
    type: jq
    command: '{id: .id, name: .name, total: (.items | map(.price) | add)}'
    script: |
      {
        "id": "order-123",
        "name": "Test Order",
        "items": [
          {"name": "Item 1", "price": 10.99},
          {"name": "Item 2", "price": 25.50},
          {"name": "Item 3", "price": 5.00}
        ]
      }

Output:

json
{
  "id": "order-123",
  "name": "Test Order",
  "total": 41.49
}

Complex Processing

yaml
steps:
  - name: analyze-logs
    type: jq
    command: |
      group_by(.level) |
      map({
        level: .[0].level,
        count: length,
        messages: map(.message)
      })
    script: |
      [
        {"level": "ERROR", "message": "Connection failed"},
        {"level": "INFO", "message": "Process started"},
        {"level": "ERROR", "message": "Timeout occurred"},
        {"level": "INFO", "message": "Process completed"}
      ]

Redis

INFO

For detailed Redis step type documentation, see Redis Guide.

Execute commands against Redis servers.

Basic Usage

yaml
steps:
  - name: ping
    type: redis
    config:
      host: localhost
      port: 6379
      command: PING

DAG-Level Configuration

Define connection defaults at the DAG level:

yaml
redis:
  host: localhost
  port: 6379
  password: ${REDIS_PASSWORD}

steps:
  - name: set-value
    type: redis
    config:
      command: SET
      key: mykey
      value: "hello"

  - name: get-value
    type: redis
    config:
      command: GET
      key: mykey
    output: RESULT

String Operations

yaml
steps:
  - name: cache-user
    type: redis
    config:
      command: SET
      key: user:${USER_ID}
      value: '{"name": "John", "email": "john@example.com"}'

  - name: get-user
    type: redis
    config:
      command: GET
      key: user:${USER_ID}
    output: USER_DATA

Hash Operations

yaml
steps:
  - name: set-user-field
    type: redis
    config:
      command: HSET
      key: user:1
      field: email
      value: "john@example.com"

  - name: get-all-fields
    type: redis
    config:
      command: HGETALL
      key: user:1
    output: USER_HASH

Pipeline Operations

yaml
steps:
  - name: batch-ops
    type: redis
    config:
      pipeline:
        - command: SET
          key: key1
          value: "value1"
        - command: SET
          key: key2
          value: "value2"
        - command: MGET
          keys: ["key1", "key2"]

Connection Modes

yaml
# Standalone (default)
redis:
  host: localhost
  port: 6379

# Sentinel
redis:
  mode: sentinel
  sentinelMaster: mymaster
  sentinelAddrs:
    - sentinel1:26379
    - sentinel2:26379

# Cluster
redis:
  mode: cluster
  clusterAddrs:
    - node1:6379
    - node2:6379

Chat

INFO

For detailed Chat step type documentation, see Chat Guide.

Execute requests to Large Language Model providers.

Basic Chat Request

yaml
steps:
  - type: chat
    llm:
      provider: openai
      model: gpt-4o
    messages:
      - role: user
        content: "What is 2+2?"
    output: ANSWER

Supported Providers

Provider     Environment Variable
openai       OPENAI_API_KEY
anthropic    ANTHROPIC_API_KEY
gemini       GOOGLE_API_KEY
openrouter   OPENROUTER_API_KEY
local        (none)

Aliases ollama, vllm, and llama map to local.
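
For example, using an alias directly (a sketch; assumes a model server is running locally):

yaml
steps:
  - type: chat
    llm:
      provider: ollama   # alias; resolves to the local provider
      model: llama3
    messages:
      - role: user
        content: "Hello!"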

Multi-turn Conversation

yaml
type: graph

steps:
  - name: setup
    type: chat
    llm:
      provider: openai
      model: gpt-4o
      system: "You are a helpful assistant."
    messages:
      - role: user
        content: "What is 2+2?"

  - name: followup
    depends: [setup]
    type: chat
    llm:
      provider: openai
      model: gpt-4o
    messages:
      - role: user
        content: "Now multiply that by 3."

Steps inherit conversation history from dependencies.

Variable Substitution

yaml
params:
  - TOPIC: "quantum computing"

steps:
  - type: chat
    llm:
      provider: anthropic
      model: claude-sonnet-4-20250514
    messages:
      - role: user
        content: "Explain ${TOPIC} briefly."

Local Models (Ollama)

yaml
steps:
  - type: chat
    llm:
      provider: local
      model: llama3
    messages:
      - role: user
        content: "Hello!"

DAG-Level Configuration

Define LLM defaults at the DAG level to share configuration across steps:

yaml
llm:
  provider: openai
  model: gpt-4o
  system: "You are a helpful assistant."
  temperature: 0.7

steps:
  - type: chat
    messages:
      - role: user
        content: "First question"

  - type: chat
    llm:
      provider: anthropic
      model: claude-sonnet-4-20250514
    messages:
      - role: user
        content: "Override with different provider"

When a step specifies llm:, it completely replaces DAG-level config (no field merging).
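
Because nothing is merged, a step-level llm: must restate every field it needs. A sketch of the pitfall, assuming the DAG-level defaults above:

yaml
steps:
  - type: chat
    llm:
      provider: openai   # DAG-level system and temperature do NOT carry over;
      model: gpt-4o      # restate them here if this step needs them
    messages:
      - role: user
        content: "This step runs without the DAG-level system prompt."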

HITL (Human in the Loop)

INFO

For detailed HITL documentation, see HITL Guide.

Pause workflow execution until human approval or rejection. This enables human-in-the-loop (HITL) workflows where manual review or authorization is required before proceeding.

Basic Usage

yaml
steps:
  - command: ./deploy.sh staging
  - type: hitl
  - command: ./deploy.sh production

With Prompt and Inputs

yaml
steps:
  - command: ./deploy.sh staging
  - type: hitl
    config:
      prompt: "Approve production deployment?"
      input: [APPROVED_BY, RELEASE_NOTES]
      required: [APPROVED_BY]
  - command: |
      echo "Approved by: ${APPROVED_BY}"
      ./deploy.sh production

Configuration Options

Option    Type      Description
prompt    string    Message displayed to the approver
input     string[]  Parameter names to collect from the approver
required  string[]  Parameters that must be provided (subset of input)

Approval and Rejection

HITL steps can be approved or rejected via the Web UI or REST API:

  • Approval: The step succeeds and execution continues
  • Rejection: The step enters Rejected status, the DAG status becomes Rejected, and dependent steps are aborted

GitHub Actions

INFO

For the full guide, see GitHub Actions.

Run marketplace actions (e.g. actions/checkout@v4) inside Dagu steps.

yaml
secrets:
  - name: GITHUB_TOKEN
    provider: env
    key: GITHUB_TOKEN

steps:
  - name: checkout
    command: actions/checkout@v4
    type: gha               # Aliases: github_action, github-action
    config:
      runner: node:24-bookworm
    params:
      repository: dagu-org/dagu
      ref: main
      token: "${GITHUB_TOKEN}"

WARNING

This executor is experimental. It depends on Docker, downloads images on demand, and currently supports single-action invocations per step.

DAG (Subworkflow)

INFO

The DAG step type allows running other workflows as steps. See Nested Workflows.

Execute other workflows as steps, enabling workflow composition.

Execute External DAG

yaml
steps:
  - name: run-etl
    type: dag
    command: workflows/etl-pipeline.yaml
    params: "DATE=${TODAY} ENV=production"

Execute Local DAG

yaml
name: main-workflow
steps:
  - name: prepare-data
    type: dag
    command: data-prep
    params: "SOURCE=/data/raw"

---

name: data-prep
params:
  - SOURCE: /tmp
steps:
  - name: validate
    command: validate.sh ${SOURCE}
  - name: clean
    command: clean.py ${SOURCE}

Capture DAG Output

yaml
steps:
  - name: analyze
    type: dag
    command: analyzer.yaml
    params: "FILE=${INPUT_FILE}"
    output: ANALYSIS
    
  - name: use-results
    command: |
      echo "Status: ${ANALYSIS.outputs.status}"
      echo "Count: ${ANALYSIS.outputs.record_count}"

Error Handling

yaml
steps:
  - name: may-fail
    type: dag
    command: risky-process.yaml
    continueOn:
      failure: true
    retryPolicy:
      limit: 3
      intervalSec: 300

Dynamic DAG Selection

yaml
steps:
  - name: choose-workflow
    command: |
      if [ "${ENVIRONMENT}" = "prod" ]; then
        echo "production-workflow.yaml"
      else
        echo "staging-workflow.yaml"
      fi
    output: WORKFLOW_FILE
    
  - name: run-selected
    type: dag
    command: ${WORKFLOW_FILE}
    params: "ENV=${ENVIRONMENT}"

See Also

  • Shell - Shell command execution details
  • Docker - Container execution guide
  • SSH - Remote execution guide
  • S3 - S3 operations guide
  • HTTP - API interaction guide
  • Chat - LLM integration guide
  • Mail - Email notification guide
  • JQ - JSON processing guide
  • Redis - Redis operations guide
  • HITL - Human-in-the-loop approval guide
  • Writing Workflows - Using step types in workflows
