why is the volume of data on the target DB much larger than on the source? #194

Jamic28 · 2024-11-30T14:37:08Z

I am doing synchronous replication from standalone Postgres version 14 to a database cluster version 16. On the source database the data volume is 270 GB, and on the target cluster it is already 500 GB and the data copy is still going (no errors)...

root@db-1:/home/administrator# du -sh /var/lib/postgresql/16/main/base/
500G    /var/lib/postgresql/16/main/base/

"replication_stats_count_by_state": {
    "replicating": 128,
    "data_is_being_copied": 148
  },
  "message_lsn_receipts": [
    {
      "received_lsn": "394/3B0D3A58",
      "last_msg_send_time": "2024-11-30 14:38:54 UTC",
      "last_msg_receipt_time": "2024-11-30 14:38:49 UTC",
      "latest_end_lsn": "394/3B0D3A58",
      "latest_end_time": "2024-11-30 14:38:54 UTC"
    },
    {
      "received_lsn": null,
      "last_msg_send_time": "2024-11-30 14:38:04 UTC",
      "last_msg_receipt_time": "2024-11-30 14:38:04 UTC",
      "latest_end_lsn": null,
      "latest_end_time": "2024-11-30 14:38:04 UTC"
    },
    {
      "received_lsn": null,
      "last_msg_send_time": "2024-11-30 14:37:48 UTC",
      "last_msg_receipt_time": "2024-11-30 14:37:48 UTC",
      "latest_end_lsn": null,
      "latest_end_time": "2024-11-30 14:37:48 UTC"
    }
  ],
  "sync_started_at": "2024-11-30 08:22:38 UTC",
  "sync_failed_at": null,
  "switchover_completed_at": null

why is the volume of data on the target DB much larger than on the source?

The text was updated successfully, but these errors were encountered:

Jamic28 · 2024-12-01T09:07:51Z

The inspection table of mydb on source is 237GB

public | inspection                                         | table | mydb | permanent   | heap          | 237 GB     |

And on target is growing while start synch

 public | inspection                                         | table | mydb | permanent   | heap          | 500 GB     |

could this be related to the fact that a record is being written to the table on the source base during synchronization?

joetynan · 2024-12-26T17:14:41Z

I've run into this as well a couple times. and I'm not entirely sure why - but the target table is VASTLY larger.

shayonj · 2024-12-26T17:25:36Z

Does running vacuum on it help ?

joetynan · 2025-01-02T18:08:40Z

Upon further review - looks like (in my instance at least) - a table that has a large amount of read/writes to it (say, if it's a cache table), will generate a rather large TOAST table on the target side. since vacuum can't run until the table's in a replicating state (I'm guessing there's a lock to prevent that from happening while the data copy is running), it can't clean it up until after it's in a replicating state.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

why is the volume of data on the target DB much larger than on the source? #194

why is the volume of data on the target DB much larger than on the source? #194

Jamic28 commented Nov 30, 2024 •

edited

Loading

Jamic28 commented Dec 1, 2024 •

edited

Loading

joetynan commented Dec 26, 2024

shayonj commented Dec 26, 2024

joetynan commented Jan 2, 2025

why is the volume of data on the target DB much larger than on the source? #194

why is the volume of data on the target DB much larger than on the source? #194

Comments

Jamic28 commented Nov 30, 2024 • edited Loading

Jamic28 commented Dec 1, 2024 • edited Loading

joetynan commented Dec 26, 2024

shayonj commented Dec 26, 2024

joetynan commented Jan 2, 2025

Jamic28 commented Nov 30, 2024 •

edited

Loading

Jamic28 commented Dec 1, 2024 •

edited

Loading