Autovacuum Configuration, Duplicate Key Violations, Pipelining, Tuning max_wal_size | Scaling Postgres 207

In this episode of Scaling Postgres, we discuss how best to configure autovacuum, the dangers of duplicate key violations, how to set up pipelining, and how to tune max_wal_size.

Transcript

All right. I hope you, your friends, family, and coworkers continue to do well. Our first piece of content is "Is my autovacuum configured properly?". This is from depesz.com, and he's talking about a scenario where you're wondering if you have autovacuum configured correctly. There are a couple of different ways you can check this, but one way he suggests is to look at how active your autovacuum workers are. So he basically says to query pg_stat_activity, see how many processes are currently running autovacuum, and then compare that with what autovacuum_max_workers is set to. He also suggests logging this periodically, say every five minutes. And basically, if you're constantly using all of the workers, then it probably means you need to do more configuration of your autovacuum.
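
As a rough sketch of that kind of check (the exact query in the post may differ; these are standard Postgres views and settings):

```sql
-- Count backends currently running autovacuum...
SELECT count(*) AS active_autovacuum_workers
FROM pg_stat_activity
WHERE query LIKE 'autovacuum:%';

-- ...and compare against the configured ceiling.
SHOW autovacuum_max_workers;
```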

You can also do things like look at how often tables are being vacuumed using the Postgres system views. But this is a pretty good way to gauge if you might need to do some additional configuration. As he shows here, there are a lot of different configurations you can adjust for autovacuum. But the key ones that determine how fast vacuum runs are the four he lists here. Now two of these are related to vacuum, and two are related to autovacuum. But the thing about autovacuum is that some of its settings inherit from the vacuum settings. Basically, these are the defaults here. So there's a cost delay and a cost limit. Each task that vacuum has to do is assigned a particular cost, and as those tasks are done, the costs get added up. Once it hits this limit, vacuum or autovacuum pauses for the delay mentioned here; you can see that vacuum has no delay, but autovacuum has a ten-millisecond delay.
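
For reference, these are the four settings in question, shown here as a sketch (the defaults in the comments vary by Postgres version):

```sql
SHOW vacuum_cost_delay;             -- 0 by default, so a manual VACUUM never pauses
SHOW vacuum_cost_limit;             -- 200 by default
SHOW autovacuum_vacuum_cost_delay;  -- how long autovacuum pauses when the limit is hit
SHOW autovacuum_vacuum_cost_limit;  -- -1 by default, meaning it inherits vacuum_cost_limit
```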

So the way you would get vacuum to work more quickly is to increase the cost limit before it pauses, or to reduce the length of the pause that happens when the cost limit is hit. By adjusting these, you can make vacuum or autovacuum faster, or even slower if you notice it's causing a problem. So this post goes through the way you could do that. He also mentions that there are table-level configurations you can set. So you can change the cost_limit and the cost_delay per table. If you have a particularly large table, you can assign these settings at the table level and then have a generic setting for all the other tables. But this is a pretty simple post that shows you exactly what you would need to tweak to make autovacuum run faster or slower just by changing a few configuration variables.
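
A minimal sketch of what those tweaks could look like if you want autovacuum to run faster (the values and the table name are illustrative assumptions, not recommendations):

```sql
-- Cluster-wide: allow more work before pausing, and pause for less time.
ALTER SYSTEM SET autovacuum_vacuum_cost_limit = 2000;
ALTER SYSTEM SET autovacuum_vacuum_cost_delay = '2ms';
SELECT pg_reload_conf();

-- Per-table override for a particularly large table (hypothetical name).
ALTER TABLE big_events SET (
  autovacuum_vacuum_cost_limit = 2000,
  autovacuum_vacuum_cost_delay = 2
);
```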

The next piece of content- "Hidden dangers of duplicate key violations in PostgreSQL and how to avoid them". This is from aws.amazon.com, and they're talking about when you do an INSERT and you get this error in your log that says "duplicate key value violates unique constraint" on a particular index. That happens to be the primary key here. When this happens, if it happens a lot, it can cause some issues. Now, I've just considered it a nuisance in terms of the log, but whenever this happens, a full transaction is completed, even though you see this error. So you're going to get transaction IDs being used up because each transaction is indeed used when you get this duplicate key violation when you're doing an INSERT. The row actually does get inserted, but it's not considered an active row, it's a dead tuple. So you're going to get a dead tuple generated when this happens.

So that also means that autovacuum now has to do more cleanup with regard to it. And finally, you've inserted a dead tuple, so you've got more storage being used up. So having this type of key violation that causes an INSERT to fail results in transaction IDs being used up and rows actually being inserted, with those knock-on effects. Now, a way to avoid it, this post mentions, is to use ON CONFLICT DO NOTHING. What that means is that when there is a conflict, it's not going to do anything. It's not going to return an error; it will just skip the INSERT, as they show here. But the benefit of that is that there's no row entered and there's no transaction consumed.
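
As a hedged illustration of the difference (the table and column names are hypothetical, not from the post):

```sql
CREATE TABLE accounts (id bigint PRIMARY KEY, email text);
INSERT INTO accounts VALUES (1, 'a@example.com');

-- A plain INSERT of a duplicate key raises
-- "duplicate key value violates unique constraint",
-- consumes a transaction ID, and leaves a dead tuple behind.
INSERT INTO accounts VALUES (1, 'a@example.com');

-- With ON CONFLICT DO NOTHING the conflicting row is simply skipped.
INSERT INTO accounts VALUES (1, 'a@example.com')
ON CONFLICT (id) DO NOTHING;
```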

So none of this happens because it's handled as a speculative INSERT, which I believe is what they're describing here. So this is a long blog post that goes into a lot more depth about why this happens and gives specific examples of its effects. But in a nutshell, if you have a lot of these duplicate key violations in your log, you may want to consider doing an INSERT ... ON CONFLICT DO NOTHING. That'll avoid those messages in the logs and avoid creating the transaction and the row. So that could potentially get you more throughput. But if you're interested in learning more, definitely check out this blog post.

The next piece of content- "PIPELINE MODE FOR BETTER POSTGRESQL PERFORMANCE ON SLOW NETWORKS". This is from cybertec-postgresql.com and they're talking about the new mode that's been added to PostgreSQL 14, where the libpq C API supports Pipelining. That means it doesn't have to wait for a response immediately. You can send multiple transactions to the database without waiting for the first one to return. They go through the different messages that have been added to the protocol that supports this. This post explores the performance advantages of Pipelining. So they have a simple example here where they have an account table and they're going to within a transaction update one account, decrement the other account. Now, without Pipelining, you need to send the BEGIN transaction, wait for the server to return, send the UPDATE, wait for it to return, send the other UPDATE, wait for it to return, and then do the COMMIT.

Whereas if you're using pipelining, they mention that a pipeline is already implicitly part of a transaction, so that extra communication isn't needed; once you're in pipeline mode, you simply send one UPDATE and then the next, and the results come back whenever the server returns them. So this can significantly reduce latency between the client and the server. They measure it here, and what they developed is a C program that actually uses pipeline mode to come up with these performance results. Now, they added a 50-millisecond delay between the client and the server for these tests. And what you notice is that with pipelining, you essentially pay two times the network latency to send the updates and get the results back, whereas it's eight times the network latency without pipelining. So there are definitely some use cases where pipelining can really result in a performance improvement.

Now, they said that another way to avoid network round trips is to put your code into a function, because essentially the function just gets called once with a single CALL statement and the transaction, with both updates, happens entirely within the function. So that's a way to achieve similar results. But as he mentions, a lot of programming shops don't really like to use a whole lot of functions for that purpose, although it does give you that advantage. Now, this is an interesting post, but personally, I'm waiting for a lot of the programming language libraries to catch up and implement pipelining so that we can use it easily. For example, I do a lot of programming work in Ruby, and I would love for the pg Ruby library to support this type of pipelining. But if you want to learn more, you can definitely check out this blog post.
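
A minimal sketch of that function-based alternative, assuming the same hypothetical account table from above (this is not the code from the post):

```sql
CREATE PROCEDURE transfer(from_id bigint, to_id bigint, amt numeric)
LANGUAGE plpgsql
AS $$
BEGIN
  -- Both updates run inside the procedure's transaction on the server.
  UPDATE account SET amount = amount - amt WHERE id = from_id;
  UPDATE account SET amount = amount + amt WHERE id = to_id;
END;
$$;

-- One round trip instead of four.
CALL transfer(1, 2, 100);
```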

The next piece of content- "Tuning max_wal_size in PostgreSQL". This is from enterprisedb.com. He's talking about the advantages of tuning max_wal_size and that from a stock configuration of Postgres, they say that the greatest advantage comes from adjusting the configuration of the shared buffer. They saw a 3.5 performance increase over the stock configuration, whereas tuning max_wal_size resulted in a 1.5 times increase in performance. But they actually had one test they did that, and they saw a 10x performance. So that's pretty large and they go through what this is important for in terms of maintaining wal_size. And the main reason is to minimize checkpoints and they show different ways to go through the tuning and monitoring process for it. But I kind of do it a little bit differently because they don't mention a setting here that is highly important for me to use in terms of configuring the max_wal_size. So what I do is I basically determine how much wall generation you have, and based upon that, let's pick a checkpoint timeout.

Do you want it to be ten minutes? Do you want 30 minutes? A longer checkpoint_timeout can ease WAL writing pressure, but it means that if the server crashes and needs to restart, it's going to take a long time to replay all that WAL, whereas with a shorter checkpoint_timeout restarts are faster, but you're going to have more WAL writing pressure depending on the activity of the database. Once that checkpoint_timeout is selected, I choose a checkpoint_warning close to that checkpoint_timeout. What that checkpoint_warning does is tell you when the amount of WAL you're generating has exceeded your max_wal_size and Postgres is doing a checkpoint earlier than the timeout you've selected. So basically, if you see these warnings in the logs, it means you need to increase your max_wal_size until you no longer see early checkpoints and they're pretty much happening at your checkpoint_timeout interval. But that's personally how I like to set max_wal_size based upon the activity. If you want to learn more about it, you can check out this blog post.
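
A hedged sketch of that approach (the values are purely illustrative and would depend on your WAL generation rate):

```sql
ALTER SYSTEM SET checkpoint_timeout = '30min';  -- the checkpoint spacing you chose
ALTER SYSTEM SET checkpoint_warning = '29min';  -- warn if WAL volume forces an earlier checkpoint
ALTER SYSTEM SET max_wal_size = '8GB';          -- raise this until the warnings stop
SELECT pg_reload_conf();

-- The log warning to watch for looks roughly like:
--   LOG:  checkpoints are occurring too frequently (25 seconds apart)
--   HINT: Consider increasing the configuration parameter "max_wal_size".
```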

The next piece of content- "PG Phriday: Wrapping Transactions Right Round". This is from enterprisedb.com as well, and it was a little bit hard to read some of this post because their code looks to be all on one line here as opposed to multiple lines. Basically, they're talking about transaction IDs and Postgres and it's basically a 32-bit ID that wraps around and therefore you have about 2 billion transactions that you can use before they wrap around. They describe some of the implementation as well as the importance of freezing those tuples such that you avoid the transaction ID wraparound. And they also bring up some issues that can actually cause it. The first one they mention is prepared transactions.

So this is where you're basically doing two-phase commits and you have an orphaned transaction that hangs around in the system and prevents those transaction IDs from being reused. The next one they talk about is long-running transactions. So if you have a transaction that's open for a really long time, the transaction IDs it holds back can't be recycled and reused, which will cause problems. And the third one they mention is hot standby feedback, which again is a case where transaction IDs can't be reused. The other cause I've seen much more often is orphaned replication slots. They don't really mention that here, but orphaned replication slots can definitely prevent transaction IDs from being recycled. So that's something else to watch out for. But if you want to learn more, you can definitely check out this blog post.
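
Some rough checks for the wraparound risks mentioned above, as a sketch using standard system views (the one-hour threshold is an arbitrary example):

```sql
-- Orphaned prepared transactions hanging around.
SELECT gid, prepared, owner, database
FROM pg_prepared_xacts
ORDER BY prepared;

-- Long-running transactions holding back cleanup.
SELECT pid, xact_start, state, query
FROM pg_stat_activity
WHERE xact_start < now() - interval '1 hour';

-- Replication slots, including inactive (possibly orphaned) ones.
SELECT slot_name, active, xmin, catalog_xmin
FROM pg_replication_slots;
```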

The next piece of content- "Using pgTAP to automate database testing". This is from endpointdev.com. They're talking about a tool called pgTAP, which apparently the TAP means Test Anything Protocol. Basically, it's an extension you can install in Postgres, and it gives you certain functions that enable you to create tests to validate that your database is in a particular state. Now, their particular use case is that they were setting up, it looks like row-level security and subsecurity policies, and they wanted to ensure that they were accurate. So they created tests within the database using this pgTAP extension to be able to validate that their configuration settings were correct. So it's definitely an interesting tool, but I imagine you could use any testing protocol in any language as long as it's communicating to the database to do a similar thing. Like, for example, I use Ruby. So I could imagine using a Ruby testing library that just talks to the database and queries it exactly like it's doing here, and it gives you the same results. But if you want to learn about a database extension that does it, definitely check out this blog post.

The next piece of content- "Hooks: The secret feature powering the Postgres ecosystem". This is from ctodive.com, and they're talking about the capabilities in Postgres that allow extensions to hook into the core functionality of postgres to alter it. They show some of the different hook areas that exist in terms of security hooks that go into security functions related to passwords and user creation. Function hooks that work during function execution. The Planner hooks executor hooks for processing statements, as well as PL/PG SQL hooks for dealing a little bit more with functions. And in this blog post, they show how some well-known extensions like Timescale, pg_stat_statements, and Supabase use these hooks to make their products do what they do. So if you want to learn more about this, definitely check out this blog post.

The next piece of content- "How to benchmark performance of Citus and Postgres with HammerDB on Azure''. This is from Citusdata.com, and they're talking about the benchmarking performance of Citus using the HammerDB tool and how they're measuring the performance of OLTP workloads, OLAP workloads as well as HTAP workloads. But I would say it's also a pretty good post on how to use HammerDB, which is an open-source tool for doing these types of benchmarks. So if you want to learn more, you can check out this blog post.

Related to this is the episode that pganalyze.com did for their "Five minutes of Postgres". In episode nine, they cover "PostGIS vs GPUs, tuning of parallel queries, and automating Citus extension benchmarks with HammerDB". This goes into more depth on the HammerDB post we just covered. So I definitely encourage you to check out this piece of content as well.

The next piece of content- "Distributed Database with PostgreSQL Atomic Commit Problems". This is from highgo.ca. Now, they're saying this is a post about problems, but there's not a lot of solutions. So this is basically talking about the problem of creating a distributed database. So they're talking about the scenario where Postgres is looking to use Foreign Data Wrappers that communicate to Foreign multiple PostgreSQL servers that have been partitioned in some way. So maybe you have a set of servers and your, say, accounts are partitioned across those servers so that you could answer questions in parallel or even dedicate a server for all account-related queries with regard to a particular customer. Basically a way to scale out Postgres as opposed to just scaling it up.

But the problem comes when you have a query that needs to span multiple foreign servers. So maybe you want to increment one account and decrement another account that reside on different servers, and you want those changes to commit all at once or not at all. One approach they are looking into is the two-phase commit feature that already exists, where you do a PREPARE TRANSACTION and then a COMMIT PREPARED. Things work fine when you're preparing a transaction, because you can send that prepared transaction to multiple hosts, and if there's an issue with it, or you don't hear back, it can be rolled back easily.
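
For reference, the existing two-phase commit commands they're referring to look like this (the transaction identifier is arbitrary, and max_prepared_transactions must be set above zero):

```sql
BEGIN;
UPDATE account SET amount = amount - 100 WHERE id = 1;
PREPARE TRANSACTION 'xfer_42';   -- phase 1: made durable, but not yet visible

-- Phase 2, issued later by the coordinator:
COMMIT PREPARED 'xfer_42';
-- or, if any participant failed to prepare:
-- ROLLBACK PREPARED 'xfer_42';
```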

But the problem is: what if all three of these servers respond that they're ready for the transaction, but then in the second step of actually doing the commit, the first two respond and are able to commit while the third hangs? Now you have this orphaned uncommitted transaction out there on instance DN3, but it's already committed on DN1 and DN2. So how do you reconcile that? There's no great way of resolving that at this point. So basically this post talks about some of those issues and unfortunately doesn't have a lot of solutions for them right now. It's definitely something that they're going to keep discussing. But if you want to learn more, you can check out this blog post.

The next piece of content- "Run PostgreSQL on Kubernetes with Percona Operator &Pulumi. This is from percona.com and they're talking about running Kubernetes using their operator along with this service provider called Pulumi, where you use your own language to deploy to Cloud infrastructure. So if you want to learn more about doing that, you can definitely check out this blog post.

The next piece of content- "Spatial Filters in pg_featureserv with CQL". This is from blog.crunchydata.com and they're talking about pg_featureserv, which is a web service that allows you to query into PostGIS and Postgres to do spatial analysis. And this is particularly talking about the spatial features that have been added. So if you want to learn more, you can check out this blog post.

The next piece of content. The PostgreSQL person of the week is Jaime Casanova. If you're interested in learning more about Jaime and his contributions to Postgres, definitely check out this blog post.

The last piece of content: we had another episode of The Rubber Duck Dev Show this past Wednesday. This last episode was on "Real World Performance Enhancements" that I made to my application. So if you want to learn about all the different changes I discovered that can improve performance, including a database-related one involving a lateral join, I encourage you to check out our show.
