Workflow output definitions¶

Nextflow provides a powerful new syntax for managing workflow outputs that centralizes output configuration and enables automatic generation of output manifests.

Learning goals¶

In this side quest, you'll learn how to use the workflow output definition syntax, which provides a cleaner alternative to the traditional publishDir directive.

By the end of this side quest, you'll be able to:

Understand the limitations of the traditional publishDir approach
Use the publish: section in workflows to declare outputs
Configure the output {} block to organize published files
Use dynamic paths based on metadata to organize outputs
Generate index files (manifests) documenting your outputs

These skills will help you build workflows with cleaner output management and better documentation of results.

Prerequisites¶

Before taking on this side quest, you should:

Have completed the Hello Nextflow tutorial or equivalent beginner's course.
Be comfortable using basic Nextflow concepts (processes, channels, operators)

0. Get started¶

Open the training codespace¶

If you haven't yet done so, make sure to open the training environment as described in the Environment Setup.

Move into the project directory¶

Let's move into the directory where the files for this tutorial are located.

cd side-quests/workflow_outputs

You can set VSCode to focus on this directory:

code .

Review the materials¶

You'll find a simple workflow file called main.nf, a modules directory containing a module file with two processes, and a greetings.csv file containing sample data.

Directory contents

.
├── greetings.csv
├── main.nf
├── modules
│   └── greetings.nf
└── nextflow.config

This directory contains a simple greeting pipeline similar to what you built in Hello Nextflow. The CSV file contains greetings in different languages that we'll process and organize by language.

Review the assignment¶

Your challenge is to refactor this workflow to use the new workflow output definition syntax instead of publishDir, organizing outputs by language and generating index files that document what was produced.

Readiness checklist¶

Think you're ready to dive in?

I understand the goal of this course and its prerequisites
My codespace is up and running
I've set my working directory appropriately
I understand the assignment

If you can check all the boxes, you're good to go.

1. The traditional approach: publishDir¶

1.1. Review the current workflow¶

Let's start by examining how the current workflow uses publishDir to manage outputs.

Take a look at the modules file:

modules/greetings.nf
#!/usr/bin/env nextflow

/*
 * Create a greeting file from input text
 */
process SAY_HELLO {

    publishDir 'results/greetings', mode: 'copy'

    input:
        tuple val(meta), val(greeting)

    output:
        tuple val(meta), path("${greeting}-output.txt")

    script:
    """
    echo '$greeting' > '${greeting}-output.txt'
    """
}

/*
 * Convert greeting to uppercase
 */
process CONVERT_TO_UPPER {

    publishDir 'results/uppercase', mode: 'copy'

    input:
        tuple val(meta), path(input_file)

    output:
        tuple val(meta), path("UPPER-${input_file}")

    script:
    """
    cat '$input_file' | tr '[a-z]' '[A-Z]' > 'UPPER-${input_file}'
    """
}

Each process has its own publishDir directive that specifies where outputs should be copied. The processes use the common [meta, file] tuple pattern, where meta is a map containing metadata like the greeting text and language.

Now look at the main workflow file:

main.nf
#!/usr/bin/env nextflow

// Enable workflow output definition syntax (required for Nextflow < 25.10)
nextflow.preview.output = true

/*
 * Pipeline parameters
 */
params.input = 'greetings.csv'

// Include modules
include { SAY_HELLO } from './modules/greetings.nf'
include { CONVERT_TO_UPPER } from './modules/greetings.nf'

workflow {

    // Create a channel from the CSV file with metadata
    greeting_ch = channel.fromPath(params.input)
                        .splitCsv(header: true)
                        .map { row -> [[id: row.greeting, language: row.language], row.greeting] }

    // Create greeting files
    SAY_HELLO(greeting_ch)

    // Convert to uppercase
    CONVERT_TO_UPPER(SAY_HELLO.out)
}

The workflow parses the CSV and creates [meta, greeting] tuples where meta is a map containing both the greeting ID and language. This pattern is standard in nf-core pipelines and makes metadata easy to access throughout the workflow.

1.2. Run the workflow¶

Let's run the workflow and see how outputs are organized:

nextflow run main.nf

Output

N E X T F L O W   ~  version 25.04.3

Launching `main.nf` [friendly_ride] DSL2 - revision: abc123

executor >  local (10)
[12/abc123] SAY_HELLO (1)        | 5 of 5 ✔
[34/def456] CONVERT_TO_UPPER (1) | 5 of 5 ✔

Check the results directory:

ls -la results/

You should see two subdirectories: greetings/ and uppercase/, each containing the relevant output files.

1.3. Limitations of publishDir¶

While publishDir works well for simple cases, it has some limitations as workflows grow more complex:

Scattered configuration: Whether you define publishDir in processes or via configuration selectors like withName and withLabel, the publish settings for each process are specified separately. To understand the full output structure of a large workflow, you need to piece together information from many different places.
No automatic record of outputs: When a workflow completes, you're left with a directory of files but no machine-readable summary of what was produced. If you want a CSV or JSON file listing all outputs with their associated metadata (often called a "manifest"), you have to build that yourself.
Repetitive patterns: If multiple processes need similar publish configurations, you end up repeating yourself—either in process definitions or across many configuration selectors.
Coupling between processes and output structure: The output organization is tied to process-level configuration. If you want to reorganize outputs (say, grouping by sample instead of by process), you need to update the configuration for each affected process.

The workflow output definition syntax addresses these limitations by centralizing output configuration in one place.

Takeaway¶

The traditional publishDir approach works but scatters output configuration across process definitions and doesn't provide automatic documentation of outputs.

What's next?¶

In the next section, we'll introduce the workflow output definition syntax that centralizes output management.

2. Introducing workflow outputs¶

2.1. A different approach to publishing¶

With publishDir, each process is responsible for publishing its own outputs. The process definition (or its configuration) specifies where files should go. This means publishing logic is distributed across your workflow—every process that produces user-facing outputs needs its own publish configuration.

The workflow output definition syntax takes a different approach: publishing is handled at the workflow level, not the process level.

Instead of processes deciding where to put their outputs, you:

Write processes that simply emit their outputs to channels (no publishDir)
Declare in your workflow which channels contain outputs worth publishing
Configure in a separate output {} block how those outputs should be organized

This separation means processes focus purely on computation, while output organization is managed in one central place.

2.2. The syntax¶

The workflow output definition syntax uses two constructs:

A publish: section inside your workflow that declares which channels to publish
An output {} block that configures how those outputs are organized

Feature flag

This training environment uses an older version of Nextflow that requires a feature flag for workflow outputs. The starter script already includes this line:

nextflow.preview.output = true

In Nextflow 25.10 and later, this flag is no longer needed and can be removed.

Let's modify our workflow to use this new syntax.

2.3. Remove publishDir from processes¶

Edit modules/greetings.nf to remove the publishDir directives from both processes:

AfterBefore

modules/greetings.nf
#!/usr/bin/env nextflow

/*
 * Create a greeting file from input text
 * Note: No publishDir - outputs managed by workflow output block
 */
process SAY_HELLO {

    input:
        tuple val(meta), val(greeting)

    output:
        tuple val(meta), path("${greeting}-output.txt")

    script:
    """
    echo '$greeting' > '${greeting}-output.txt'
    """
}

/*
 * Convert greeting to uppercase
 * Note: No publishDir - outputs managed by workflow output block
 */
process CONVERT_TO_UPPER {

    input:
        tuple val(meta), path(input_file)

    output:
        tuple val(meta), path("UPPER-${input_file}")

    script:
    """
    cat '$input_file' | tr '[a-z]' '[A-Z]' > 'UPPER-${input_file}'
    """
}

modules/greetings.nf
#!/usr/bin/env nextflow

/*
 * Create a greeting file from input text
 */
process SAY_HELLO {

    publishDir 'results/greetings', mode: 'copy'

    input:
        tuple val(meta), val(greeting)

    output:
        tuple val(meta), path("${greeting}-output.txt")

    script:
    """
    echo '$greeting' > '${greeting}-output.txt'
    """
}

/*
 * Convert greeting to uppercase
 */
process CONVERT_TO_UPPER {

    publishDir 'results/uppercase', mode: 'copy'

    input:
        tuple val(meta), path(input_file)

    output:
        tuple val(meta), path("UPPER-${input_file}")

    script:
    """
    cat '$input_file' | tr '[a-z]' '[A-Z]' > 'UPPER-${input_file}'
    """
}

The only change is removing the publishDir directives. The inputs, outputs, and script remain exactly the same.

2.4. Add the publish section to main.nf¶

Now update main.nf to add the publish: section inside the workflow:

AfterBefore

main.nf
#!/usr/bin/env nextflow

// Enable workflow output definition syntax (required for Nextflow < 25.10)
nextflow.preview.output = true

/*
 * Pipeline parameters
 */
params.input = 'greetings.csv'

// Include modules
include { SAY_HELLO } from './modules/greetings.nf'
include { CONVERT_TO_UPPER } from './modules/greetings.nf'

workflow {

    main:
    // Create a channel from the CSV file with metadata
    greeting_ch = channel.fromPath(params.input)
                        .splitCsv(header: true)
                        .map { row -> [[id: row.greeting, language: row.language], row.greeting] }

    // Create greeting files
    SAY_HELLO(greeting_ch)

    // Convert to uppercase
    CONVERT_TO_UPPER(SAY_HELLO.out)

    publish:
    greetings = SAY_HELLO.out
    uppercase = CONVERT_TO_UPPER.out
}

main.nf
#!/usr/bin/env nextflow

// Enable workflow output definition syntax (required for Nextflow < 25.10)
nextflow.preview.output = true

/*
 * Pipeline parameters
 */
params.input = 'greetings.csv'

// Include modules
include { SAY_HELLO } from './modules/greetings.nf'
include { CONVERT_TO_UPPER } from './modules/greetings.nf'

workflow {

    // Create a channel from the CSV file with metadata
    greeting_ch = channel.fromPath(params.input)
                        .splitCsv(header: true)
                        .map { row -> [[id: row.greeting, language: row.language], row.greeting] }

    // Create greeting files
    SAY_HELLO(greeting_ch)

    // Convert to uppercase
    CONVERT_TO_UPPER(SAY_HELLO.out)
}

When using a publish: section, the workflow content must be placed in a main: block. The publish: section then declares named outputs—here greetings and uppercase—each assigned to an output channel from a process.

2.5. Add the output block¶

Now add the output {} block after the workflow to configure how outputs are organized:

AfterBefore

main.nf
#!/usr/bin/env nextflow

// Enable workflow output definition syntax (required for Nextflow < 25.10)
nextflow.preview.output = true

/*
 * Pipeline parameters
 */
params.input = 'greetings.csv'

// Include modules
include { SAY_HELLO } from './modules/greetings.nf'
include { CONVERT_TO_UPPER } from './modules/greetings.nf'

workflow {

    main:
    // Create a channel from the CSV file with metadata
    greeting_ch = channel.fromPath(params.input)
                        .splitCsv(header: true)
                        .map { row -> [[id: row.greeting, language: row.language], row.greeting] }

    // Create greeting files
    SAY_HELLO(greeting_ch)

    // Convert to uppercase
    CONVERT_TO_UPPER(SAY_HELLO.out)

    publish:
    greetings = SAY_HELLO.out
    uppercase = CONVERT_TO_UPPER.out
}

/*
 * Output block defines how published outputs are organized
 */
output {
    greetings {
        mode 'copy'
        path 'greetings'
    }

    uppercase {
        mode 'copy'
        path 'uppercase'
    }
}

main.nf
#!/usr/bin/env nextflow

// Enable workflow output definition syntax (required for Nextflow < 25.10)
nextflow.preview.output = true

/*
 * Pipeline parameters
 */
params.input = 'greetings.csv'

// Include modules
include { SAY_HELLO } from './modules/greetings.nf'
include { CONVERT_TO_UPPER } from './modules/greetings.nf'

workflow {

    main:
    // Create a channel from the CSV file with metadata
    greeting_ch = channel.fromPath(params.input)
                        .splitCsv(header: true)
                        .map { row -> [[id: row.greeting, language: row.language], row.greeting] }

    // Create greeting files
    SAY_HELLO(greeting_ch)

    // Convert to uppercase
    CONVERT_TO_UPPER(SAY_HELLO.out)

    publish:
    greetings = SAY_HELLO.out
    uppercase = CONVERT_TO_UPPER.out
}

The output {} block:

Configures each named output with its subdirectory path
Sets the publish mode to copy for each output
The base output directory defaults to results (override with -output-dir)

2.6. Run the updated workflow¶

Clean up previous results and run:

rm -rf results work .nextflow*
nextflow run main.nf

The outputs should be organized the same way as before, but now the configuration is centralized in one place.

Takeaway¶

The workflow output definition syntax separates output configuration from process definitions:

The publish: section declares which channels to publish
The output {} block configures paths and options

What's next?¶

In the next section, we'll use dynamic paths to organize outputs by metadata.

3. Dynamic publish paths¶

3.1. Organizing by metadata¶

One powerful feature of workflow outputs is the ability to use closures to dynamically determine output paths based on the data itself.

Since our outputs include language metadata, we can organize files by language.

Update the output {} block to use dynamic paths:

AfterBefore

main.nf
/*
 * Output block defines how published outputs are organized
 */
output {
    greetings {
        mode 'copy'
        path { meta, file -> "greetings/${meta.language}" }
    }

    uppercase {
        mode 'copy'
        path { meta, file -> "uppercase/${meta.language}" }
    }
}

main.nf
/*
 * Output block defines how published outputs are organized
 */
output {
    greetings {
        mode 'copy'
        path 'greetings'
    }

    uppercase {
        mode 'copy'
        path 'uppercase'
    }
}

The closure receives the elements of the output tuple (meta map and file) and returns the subdirectory path. Since meta is a map, we access fields with dot notation like meta.language.

3.2. Run with dynamic paths¶

Clean up and run:

rm -rf results work .nextflow*
nextflow run main.nf

Now check the results:

find results -type f

Output

results/greetings/English/Hello-output.txt
results/greetings/French/Bonjour-output.txt
results/greetings/German/Hallo-output.txt
results/greetings/Italian/Ciao-output.txt
results/greetings/Spanish/Holà-output.txt
results/uppercase/English/UPPER-Hello-output.txt
results/uppercase/French/UPPER-Bonjour-output.txt
results/uppercase/German/UPPER-Hallo-output.txt
results/uppercase/Italian/UPPER-Ciao-output.txt
results/uppercase/Spanish/UPPER-Holà-output.txt

The outputs are now organized by language, making it easy to find results for specific languages.

3.3. Override the output directory¶

You can override the output directory from the command line:

nextflow run main.nf -output-dir my_results

This creates outputs in my_results/ instead of results/.

Takeaway¶

Dynamic paths let you organize outputs based on metadata:

Use closures to compute paths from output tuple elements
The -output-dir flag overrides the base directory

What's next?¶

In the next section, we'll add index files to document our outputs.

4. Index files¶

4.1. Generating output manifests¶

Index files are CSV, JSON, or YAML manifests that document what outputs were produced. They're useful for:

Downstream pipelines that need to consume your outputs
Documentation of what was generated
Quality control and auditing

4.2. Add index file configuration¶

Update the output {} block to generate index files:

AfterBefore

main.nf
/*
 * Output block defines how published outputs are organized
 */
output {
    greetings {
        mode 'copy'
        path { meta, file -> "greetings/${meta.language}" }
        index {
            path 'greetings/index.json'
        }
    }

    uppercase {
        mode 'copy'
        path { meta, file -> "uppercase/${meta.language}" }
        index {
            path 'uppercase/index.json'
        }
    }
}

main.nf
/*
 * Output block defines how published outputs are organized
 */
output {
    greetings {
        mode 'copy'
        path { meta, file -> "greetings/${meta.language}" }
    }

    uppercase {
        mode 'copy'
        path { meta, file -> "uppercase/${meta.language}" }
    }
}

4.3. Run and check the index files¶

Clean up and run:

rm -rf results work .nextflow*
nextflow run main.nf

View the generated index file:

cat results/greetings/index.json

Output

[
    [
        {
            "id": "Hallo",
            "language": "German"
        },
        "/workspaces/training/side-quests/workflow_outputs/results/greetings/German/Hallo-output.txt"
    ],
    [
        {
            "id": "Bonjour",
            "language": "French"
        },
        "/workspaces/training/side-quests/workflow_outputs/results/greetings/French/Bonjour-output.txt"
    ],
    [
        {
            "id": "Holà",
            "language": "Spanish"
        },
        "/workspaces/training/side-quests/workflow_outputs/results/greetings/Spanish/Holà-output.txt"
    ],
    [
        {
            "id": "Ciao",
            "language": "Italian"
        },
        "/workspaces/training/side-quests/workflow_outputs/results/greetings/Italian/Ciao-output.txt"
    ],
    [
        {
            "id": "Hello",
            "language": "English"
        },
        "/workspaces/training/side-quests/workflow_outputs/results/greetings/English/Hello-output.txt"
    ]
]

The index file contains the metadata map with named fields (id, language) plus the absolute path to each file.

JSON works best with meta maps

We use JSON format because it properly expands the meta map into an object with named fields. CSV format would serialize the map as a string like "[id:Hello, language:English]", which is less useful. You can also use YAML format by changing the file extension to .yaml.

Takeaway¶

Index files provide automatic documentation of workflow outputs:

JSON format works best with meta maps (expands to objects with named fields)
Include all metadata from output tuples
Useful for downstream pipelines and auditing

What's next?¶

Let's summarize what we've learned.

Summary¶

Key patterns¶

Workflow output definition syntax:

workflow {
    main:
    // ... process calls ...

    publish:
    output_name = PROCESS.out
}

output {
    output_name {
        mode 'copy'
        path { meta, file -> "subdir/${meta.language}" }
        index {
            path 'index.json'
        }
    }
}

Key benefits:

Centralized output configuration
Dynamic paths based on metadata
Automatic index file generation
Override output directory with -output-dir

When to use workflow outputs vs publishDir¶

Use Case	Approach
Simple pipelines with few outputs	`publishDir` is fine
Complex output organization	Workflow outputs
Need output manifests	Workflow outputs
Multiple processes publishing to same structure	Workflow outputs
Quick prototyping	`publishDir`

Additional resources¶

What's next?¶

Congratulations on completing this side quest! You've learned how to use workflow output definitions to organize your pipeline outputs and generate documentation.

Return to the Side Quests menu to continue your training journey.