Flush RRD only when TXGs contain data #18138

oshogbo · 2026-01-16T10:49:08Z

Description

This change modifies the behavior of spa_sync_time_logger when flushing the RRD database.

Previously, once the sync interval elapsed, a flush would always be generated. On solid-state devices, especially when the pool was otherwise idle, this caused disks to wake up solely to write RRD data. Since RRD is best-effort telemetry, this behavior is unnecessary and wasteful.

With this change, spa_sync_time_logger delays flushing until a TXG that already contains data is being synced. The RRD update is appended to that TXG instead of forcing the creation of a new write-only TXG.

During pool export, flushing is forced regardless of whether the TXG contains user data. At that stage, data durability takes precedence and a write must be issued.

This fixes #18082
This change was inspired from @amotin in comments #18120.

Sponsored by: [Wasabi Technology, Inc.; Klara, Inc.]

How Has This Been Tested?

I have added logs to check when the database is flushed and what is the size of database.

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Performance enhancement (non-breaking change which improves efficiency)
Code cleanup (non-breaking change which makes code smaller or more readable)
Quality assurance (non-breaking change which makes the code more robust against bugs)
Breaking change (fix or feature that would cause existing functionality to change)
Library ABI change (libzfs, libzfs_core, libnvpair, libuutil and libzfsbootenv)
Documentation (a change to man pages or other documentation)

Checklist:

My code follows the OpenZFS code style requirements.
I have updated the documentation accordingly.
I have read the contributing document.
I have added tests to cover my changes.
I have run the ZFS Test Suite with this change applied.
All commit messages are properly formatted and contain Signed-off-by.

amotin

I am not 100% sure dp_dirty_pertxg at this point reliably means there is nothing to be written in this TXG. It may need a deeper look. But yea, this might be the direction.

amotin · 2026-01-16T14:41:30Z

module/zfs/spa.c

+	if (force ||
+	    (txg > spa->spa_last_noted_txg &&
+	    curtime >= spa->spa_last_noted_txg_time + spa_note_txg_time)) {


This condition does not look right. We should not care about force if we already logged the txg.

I figured it’s better to log during export, but we can omit it if you prefer so.

Again, this condition is wrong. I haven't looked whether on export txg can happen to be equal to spa->spa_last_noted_txg, but if it can, force will make it to insert a duplicate value. I suppose it should be:
txg > spa->spa_last_noted_txg && (force || curtime >= spa->spa_last_noted_txg_time + spa_note_txg_time).

module/zfs/spa.c

This change modifies the behavior of spa_sync_time_logger when flushing the RRD database. Previously, once the sync interval elapsed, a flush would always be generated. On solid-state devices, especially when the pool was otherwise idle, this caused disks to wake up solely to write RRD data. Since RRD is best-effort telemetry, this behavior is unnecessary and wasteful. With this change, spa_sync_time_logger delays flushing until a TXG that already contains data is being synced. The RRD update is appended to that TXG instead of forcing the creation of a new write-only TXG. During pool export, flushing is forced regardless of whether the TXG contains user data. At that stage, data durability takes precedence and a write must be issued. Sponsored by: [Wasabi Technology, Inc.; Klara, Inc.] Signed-off-by: Mariusz Zaborski <mariusz.zaborski@klarasystems.com>

oshogbo force-pushed the oshogbo/flush_bad branch from 7382076 to dca9998 Compare January 16, 2026 10:50

oshogbo mentioned this pull request Jan 16, 2026

[2.4] TXG timestamp DB sync if idle causes unnecessary disk access/prevent spin down #18082

Open

amotin requested changes Jan 16, 2026

View reviewed changes

Bronek mentioned this pull request Jan 17, 2026

Fix unnecessary writes of transaction database #18120

Closed

14 tasks

oshogbo force-pushed the oshogbo/flush_bad branch from dca9998 to eda07a7 Compare January 22, 2026 15:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Flush RRD only when TXGs contain data #18138

Flush RRD only when TXGs contain data #18138

oshogbo commented Jan 16, 2026 •

edited

Loading

Uh oh!

amotin left a comment

Uh oh!

amotin Jan 16, 2026

Uh oh!

oshogbo Jan 22, 2026

Uh oh!

amotin Jan 27, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Flush RRD only when TXGs contain data #18138

Are you sure you want to change the base?

Flush RRD only when TXGs contain data #18138

Conversation

oshogbo commented Jan 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

How Has This Been Tested?

Types of changes

Checklist:

Uh oh!

amotin left a comment

Choose a reason for hiding this comment

Uh oh!

amotin Jan 16, 2026

Choose a reason for hiding this comment

Uh oh!

oshogbo Jan 22, 2026

Choose a reason for hiding this comment

Uh oh!

amotin Jan 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

oshogbo commented Jan 16, 2026 •

edited

Loading

amotin Jan 27, 2026 •

edited

Loading