Conversation

@bnestere bnestere commented Oct 15, 2025

The binary log could be corrupted when committing a large transaction
(i.e. one whose data exceeds the binlog_cache_size limit and spills
into a tmp file) in binlog_format=row if the server's --tmp-dir is
full. The resulting corruption is that only the GTID of the errored
transaction is written into the binary log, without any body or
finalizing events. This happens because the content of the transaction
was not flushed at the proper time, so the transaction's binlog cache
data was not durable when the server tried to copy the content from
the binlog cache file into the binary log itself. While switching
the tmp file from a WRITE_CACHE to a READ_CACHE, the server would see
there is still data to flush in the cache, and first try to flush it.
This is not a valid time to flush that data to the temporary file
though, as:

  1. The GTID event has already been written directly to the binary
    log. So if this flushing fails, it leaves the binary log in a
    corrupted state.

  2. This is done during group commit, and will slow down other
    concurrent transactions, which are otherwise ready to commit.

This patch fixes these issues by ensuring all transaction data is
fully flushed to its temporary file (if used) before starting any
critical paths, i.e. in binlog_flush_cache(). Note that if the binlog
cache is solely in-memory, this flush-to-temporary-file is skipped.
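
To make the ordering concrete, here is a minimal illustrative skeleton of
the idea (not the patch itself; the function name is made up, while
IO_CACHE, my_b_get_pos_in_file() and flush_io_cache() are the existing
mysys helpers from my_sys.h):

/*
  Illustrative skeleton only; the real change lives in binlog_flush_cache().
  Step 1 can fail safely: the error surfaces before anything has reached the
  binary log, so the transaction rolls back and the binlog stays intact.
  Once step 2 has happened, any failure leaves a lone GTID in the binlog.
*/
static int binlog_flush_cache_sketch(IO_CACHE *cache)
{
  /* 1. New with this fix: push the spilled cache's in-memory tail to its
        temporary file (skipped when the cache never left memory).          */
  if (my_b_get_pos_in_file(cache) && flush_io_cache(cache))
    return 1;                     /* e.g. ENOSPC because --tmp-dir is full  */

  /* 2. Group-commit critical path: write the GTID event to the binary log. */
  /* 3. Re-open the cache as a READ_CACHE and copy its content across.      */
  /* 4. Write the finalizing event(s).                                      */
  return 0;
}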

This PR is organized as follows:
Commit 1: A regression test that reproduces the bug report
Commit 2: The fix
Any further commits will address review feedback.

The flush itself is expected to happen in
THD::binlog_flush_pending_rows_event(). However, if there is no pending
event, the flush is skipped.
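
With that flush skipped, the buffered tail of a spilled cache only reaches
the temporary file later, during the WRITE_CACHE to READ_CACHE switch
described above. Roughly (an illustrative fragment, not the server source;
reinit_io_cache() is the mysys routine that performs the switch and flushes
any pending write-buffer bytes first):

/* Runs after the GTID event is already in the binary log: re-open the
   binlog cache for reading. reinit_io_cache() first flushes the bytes
   still buffered in memory, and on a full --tmp-dir that flush fails,
   leaving only the lone GTID in the binlog. */
if (reinit_io_cache(cache, READ_CACHE, 0L, 0, 0))
  ... /* too late to fail cleanly */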

@andrelkin andrelkin left a comment

The work looks good. Special thanks for a solid analysis!
I have only a few cosmetic notes.

Returns TRUE on success, FALSE on error.
*/
static my_bool binlog_cache_reconcile_data_in_storage(IO_CACHE *info)

The new function's purpose is clear, but introducing it as a static function feels like overkill...

ready-to-commit (concurrent) transactions could be stalled
*/
if (using_stmt && !thd->binlog_flush_pending_rows_event(TRUE, FALSE) &&
binlog_cache_reconcile_data_in_storage(
@andrelkin andrelkin Nov 13, 2025

... so instead, my_b_get_pos_in_file(info) && flush_io_cache(info) would be fair enough at this point, as a new inline function; why not flush_pending_bytes(info)? Having "flush" in the name reads naturally and keeps it consistent with the names of the caller and its sibling.
The suggested name also calls for (much) shorter comments than your static function's header has.
Indeed, the whole point of the caller function is to get everything to disk; as this bug shows, that just happens at a few levels.
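
For concreteness, a minimal sketch of the inline helper this suggestion
describes (the name flush_pending_bytes comes from the comment above; the
body and return convention are assumptions matching the replaced function's
"TRUE on success" contract, not the merged code):

/*
  Sketch only. Flush any bytes still buffered in memory to the binlog
  cache's temporary file, but only if the cache has already spilled to
  disk. Returns TRUE on success, FALSE on error; note that flush_io_cache()
  itself returns 0 on success.
*/
static inline my_bool flush_pending_bytes(IO_CACHE *info)
{
  if (!my_b_get_pos_in_file(info))
    return TRUE;                     /* purely in-memory cache: nothing to do */
  return flush_io_cache(info) == 0;  /* 0 from flush_io_cache() means success */
}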
