Comment by Sesse__

2 months ago

A write-ahead log isn't a performance tool to batch changes, it's a tool to get durability of random writes. You write your intended changes to the log, fsync it (which means you get a 4k write), then make the actual changes on disk just as if you didn't have a WAL.

If you want to get some sort of sub-block batching, you need a structure that isn't random in the first place, for instance an LSM (where you write all of your changes sequentially to a log and then do compaction later)—and then solve your durability in some other way.

7 comments

Sesse__

throw0101a 2 months ago

> A write-ahead log isn't a performance tool to batch changes, it's a tool to get durability of random writes.

¿Por qué no los dos?

Sesse__ 2 months ago
Because it is in addition to your writes, not instead of them. That's what “ahead” points to.
- _bohm 2 months ago
  
  The actual writes don’t need to be persisted on transaction commit, only the WAL. In most DBs the actual writes won’t be persisted until the written page is evicted from the page cache. In this sense, writing WAL generally does provide better perf than synchronously doing a random page write
- Tostino 2 months ago
  
  Look up how "checkpointing" works in Postgres.
  
  2 replies →

toolslive 2 months ago

you can unify database with write-ahead log using a persistent data structure. It also gives you cheap/free snapshots/checkpoints.