Question 1

What relation lock modes are there, and which of them conflict?

Accepted Answer

A table has eight lock modes, from ACCESS SHARE to ACCESS EXCLUSIVE,
ordered roughly by strength. A plain `SELECT` takes the weakest, ACCESS
SHARE; `INSERT`/`UPDATE`/`DELETE` take ROW EXCLUSIVE; `CREATE INDEX`
(without `CONCURRENTLY`) takes SHARE; most forms of `ALTER TABLE`, plus
`DROP TABLE`, `TRUNCATE`, and `VACUUM FULL`, take the strongest, ACCESS
EXCLUSIVE, which conflicts with every mode, including the ACCESS SHARE of
a plain `SELECT`. What conflicts is not the commands but the modes: ACCESS
SHARE is compatible with everything except ACCESS EXCLUSIVE, so reads never
block each other, while ACCESS EXCLUSIVE is compatible with nothing. A
request that conflicts with a lock already held waits in a queue.
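
To see which modes a session actually holds, one can query `pg_locks` from
inside an open transaction. The sketch below assumes a scratch database; the
`lock_mode_demo` table is hypothetical and exists only for this
illustration. Note that `pg_locks` spells the modes in CamelCase with a
`Lock` suffix, for example `AccessShareLock`.

```sql
-- Hypothetical table, created only for this illustration.
CREATE TABLE lock_mode_demo (id int PRIMARY KEY, balance numeric);

BEGIN;
SELECT count(*) FROM lock_mode_demo;          -- takes ACCESS SHARE
INSERT INTO lock_mode_demo VALUES (1, 100);   -- takes ROW EXCLUSIVE

-- Table-level locks this session holds on the demo table.
-- Both locks stay until the transaction ends, so the query is
-- expected to list AccessShareLock and RowExclusiveLock.
SELECT mode, granted
FROM pg_locks
WHERE locktype = 'relation'
  AND relation = 'lock_mode_demo'::regclass
  AND pid = pg_backend_pid();
ROLLBACK;
```

While that transaction is still open, a second session running
`LOCK TABLE lock_mode_demo IN ACCESS EXCLUSIVE MODE NOWAIT;` should fail at
once, because ACCESS EXCLUSIVE conflicts with both modes held here.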

Question 2

How are row locks implemented? Where are they stored?

Accepted Answer

Row locks are not kept in shared memory; with millions of locked rows that
would be far too expensive. The lock mark is written into the row version
itself: `xmax` gets the id of the locking transaction, and bits in
`t_infomask` say what kind of lock it is: an exclusive one taken for an
update or delete, or a shared one (FOR SHARE/FOR KEY SHARE). If several
transactions hold the row at once, a MultiXact id is placed in `xmax`. The
only things in memory are short-lived: the buffer lock taken while the
version is being changed and, when a transaction has to wait for a locked
row, a heavyweight lock entry for the waiter. So the number of locked rows
is not limited by the size of the lock table, and locked rows cost
essentially nothing in memory.
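
The effect can be observed directly through the `xmax` system column. This
is a minimal sketch on a hypothetical `row_lock_demo` table; it assumes a
cluster whose transaction counter has not wrapped around, so that
`txid_current()` and `xmax` are directly comparable, and the commented-out
part assumes the `pgrowlocks` contrib module is installed.

```sql
-- Hypothetical table, created only for this illustration.
CREATE TABLE row_lock_demo (id int PRIMARY KEY, note text);
INSERT INTO row_lock_demo VALUES (1, 'first'), (2, 'second');

BEGIN;
SELECT * FROM row_lock_demo WHERE id = 1 FOR UPDATE;

-- xmax of the locked row now carries this transaction's id;
-- the untouched row still shows xmax = 0.
SELECT id, xmax, txid_current() AS my_xid
FROM row_lock_demo
ORDER BY id;

-- pg_locks lists relation and transaction-id entries for this
-- backend, but no per-row entry for the locked row.
SELECT locktype, mode, granted
FROM pg_locks
WHERE pid = pg_backend_pid();

-- Optional: decode the lock stored in the tuple header.
-- CREATE EXTENSION IF NOT EXISTS pgrowlocks;
-- SELECT * FROM pgrowlocks('row_lock_demo');
ROLLBACK;
```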

Question 3

FOR UPDATE, FOR SHARE, SKIP LOCKED, NOWAIT: when do you use each?

Accepted Answer

`SELECT ... FOR UPDATE` locks the selected rows against modification: other
transactions that try to `UPDATE`, `DELETE`, or lock the same rows wait
until it ends. `FOR SHARE` is weaker: other transactions may take a shared
lock on the same rows too, but cannot change them. `FOR NO KEY UPDATE` and
`FOR KEY SHARE` are finer-grained variants: an `UPDATE` that does not touch
key columns takes the former, foreign-key checks take the latter, and the
two are compatible, so such updates and checks can proceed in parallel.
Two modifiers control what happens when a row is already locked: `NOWAIT`
fails right away with an error, while `SKIP LOCKED` skips locked rows and
takes the next free ones. The pairing `FOR UPDATE SKIP LOCKED` is the
canonical way to build a task queue without races.
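
A task-queue claim built on `FOR UPDATE SKIP LOCKED` might look like the
sketch below. The `tasks` table, its columns, and the status values are
hypothetical and chosen only for the example; a real queue would also need
retry and failure handling.

```sql
-- Hypothetical queue table, created only for this illustration.
CREATE TABLE tasks (
    id      bigserial PRIMARY KEY,
    payload text NOT NULL,
    status  text NOT NULL DEFAULT 'pending'
);
INSERT INTO tasks (payload) VALUES ('a'), ('b'), ('c');

BEGIN;
-- Claim one pending task. Rows already locked by other workers are
-- skipped instead of waited for, so concurrent workers running the
-- same statement pick different rows.
WITH next_task AS (
    SELECT id
    FROM tasks
    WHERE status = 'pending'
    ORDER BY id
    LIMIT 1
    FOR UPDATE SKIP LOCKED
)
UPDATE tasks AS t
SET status = 'running'
FROM next_task
WHERE t.id = next_task.id
RETURNING t.id, t.payload;
COMMIT;

-- NOWAIT variant: if another transaction holds the row, this fails
-- immediately with SQLSTATE 55P03 (lock_not_available).
BEGIN;
SELECT * FROM tasks WHERE id = 1 FOR UPDATE NOWAIT;
ROLLBACK;
```

If no free pending row exists, the claim statement simply returns zero rows,
which a worker can treat as "nothing to do right now".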

Question 4

How does a deadlock arise, and what does PostgreSQL do about it?

Accepted Answer

A deadlock is a wait cycle: transaction A holds resource 1 and waits for
resource 2, while transaction B holds resource 2 and waits for resource 1.
Neither can proceed on its own. PostgreSQL does not wait forever: once a
transaction has waited for `deadlock_timeout` (one second by default), it
checks the wait graph, and if it finds a cycle, one of the transactions is
aborted with a `deadlock detected` error, which breaks the cycle. The
classic cause is two transactions updating the same rows in different
orders. The cure is a consistent lock order and short transactions, not a
higher timeout.
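
The classic two-row case and the consistent-order fix can be sketched as
follows. The `deadlock_demo` table is hypothetical; the commented steps are
meant to be typed into two separate sessions in the order shown, and only
the final block is the suggested pattern.

```sql
-- Hypothetical table, created only for this illustration.
CREATE TABLE deadlock_demo (id int PRIMARY KEY, balance numeric NOT NULL);
INSERT INTO deadlock_demo VALUES (1, 100), (2, 100);

-- Step 1, session A:
--   BEGIN;
--   UPDATE deadlock_demo SET balance = balance - 10 WHERE id = 1;
-- Step 2, session B:
--   BEGIN;
--   UPDATE deadlock_demo SET balance = balance - 10 WHERE id = 2;
-- Step 3, session A (blocks: row 2 is locked by B):
--   UPDATE deadlock_demo SET balance = balance + 10 WHERE id = 2;
-- Step 4, session B (closes the cycle; after deadlock_timeout one of
-- the two sessions gets "ERROR: deadlock detected"):
--   UPDATE deadlock_demo SET balance = balance + 10 WHERE id = 1;

-- Consistent lock order: every transfer locks both rows in ascending
-- id order before changing anything, so no cycle can form.
BEGIN;
SELECT id FROM deadlock_demo WHERE id IN (1, 2) ORDER BY id FOR UPDATE;
UPDATE deadlock_demo SET balance = balance - 10 WHERE id = 2;
UPDATE deadlock_demo SET balance = balance + 10 WHERE id = 1;
COMMIT;
```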

Question 5

Why is a transaction in the idle in transaction state dangerous?

Accepted Answer

Idle in transaction is a session with an open transaction that is doing
nothing: the application took a connection, issued `BEGIN`, ran a query,
and forgot to commit or roll back. While the transaction is alive, it can
hold its snapshot, and with it the cleanup horizon: vacuum cannot remove
row versions that became dead after that snapshot was taken, so tables and
indexes bloat. If the transaction also locked something, it keeps those
locks too, and the queue behind it grows. The defense is
`idle_in_transaction_session_timeout`, which terminates such sessions, plus
discipline in the code: do not leave transactions open.
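
Such sessions can be found in `pg_stat_activity`. The query below is a
sketch for spotting them; the five-minute timeout is an arbitrary example
value, not a recommendation.

```sql
-- Sessions sitting idle inside an open transaction, oldest first.
-- backend_xid / backend_xmin show whether the session is pinning
-- a transaction id or a snapshot.
SELECT pid,
       usename,
       now() - xact_start   AS xact_age,
       now() - state_change AS idle_for,
       backend_xid,
       backend_xmin,
       left(query, 60)      AS last_query
FROM pg_stat_activity
WHERE state = 'idle in transaction'
ORDER BY xact_start;

-- Terminate sessions that stay idle in a transaction too long.
-- Set here for the current session; the same parameter can be set
-- per role or per database with ALTER ROLE / ALTER DATABASE ... SET.
SET idle_in_transaction_session_timeout = '5min';

-- A single offender can also be ended by hand:
-- SELECT pg_terminate_backend(<pid from the query above>);
```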

Question 6

Why can a harmless ALTER TABLE take down a loaded database?

Accepted Answer

Heavy DDL takes ACCESS EXCLUSIVE, which is incompatible with every other
mode. If a long query is running on the table, the ALTER queues behind it
and cannot start. The trouble is that the queued ALTER already blocks
everyone who arrives after it: even fast SELECTs line up behind it. One
long query plus one ALTER add up to a full stall on the table. The defense
is a `lock_timeout` set before the DDL (better to fail than to freeze the
queue), migrations in small steps, and techniques that lower the lock
strength or shorten the time the lock is held:
`CREATE INDEX CONCURRENTLY`, `ADD COLUMN` without rewriting the table,
adding constraints through `NOT VALID` followed by `VALIDATE`.
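
Put together, a cautious migration could look like the sketch below. The
`orders` table, the index and constraint names, and the two-second timeout
are all hypothetical. The statements are meant to be run one by one outside
an explicit transaction block, since `CREATE INDEX CONCURRENTLY` cannot run
inside one.

```sql
-- Hypothetical table, created only for this illustration.
CREATE TABLE orders (
    id          bigserial PRIMARY KEY,
    customer_id bigint  NOT NULL,
    amount      numeric NOT NULL
);

-- Give up quickly instead of queueing behind a long query; on timeout
-- the statement fails with "canceling statement due to lock timeout"
-- and can be retried later.
SET lock_timeout = '2s';

-- Still ACCESS EXCLUSIVE, but only a catalog change: no table rewrite.
ALTER TABLE orders ADD COLUMN note text;

-- Brief ACCESS EXCLUSIVE lock; existing rows are not checked yet.
ALTER TABLE orders
    ADD CONSTRAINT orders_amount_positive CHECK (amount > 0) NOT VALID;

RESET lock_timeout;

-- Scans the table under a weaker lock (SHARE UPDATE EXCLUSIVE),
-- so reads and writes continue during validation.
ALTER TABLE orders VALIDATE CONSTRAINT orders_amount_positive;

-- Builds the index without blocking writes to the table.
CREATE INDEX CONCURRENTLY orders_customer_id_idx ON orders (customer_id);
```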

Question 7

How do lightweight locks and predicate locks differ from ordinary ones?

Accepted Answer

Heavyweight locks protect objects like tables and rows, are visible in
`pg_locks`, can wait in a queue, and take part in deadlock detection.
Lightweight locks (LWLocks) protect internal structures in shared memory,
such as buffers, lists, and caches; they are held briefly, come in
shared/exclusive modes, and have no deadlock detection. Lower still are
spinlocks, held for a few
instructions. Predicate locks (SIRead) are a special beast at the
Serializable level: they block no one and only mark what was read, so SSI
can later find dangerous dependencies. In `pg_locks` they appear as rows
with mode `SIReadLock`, but unlike ordinary locks they never make anyone
wait.
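
The three kinds show up in different places in the monitoring views. The
queries below are a sketch; the `wait_event_type` values `Lock` and `LWLock`
are those used by recent PostgreSQL releases, and older releases may label
lightweight-lock waits differently.

```sql
-- Who is waiting on what right now: heavyweight locks appear with
-- wait_event_type = 'Lock', lightweight ones with 'LWLock'.
SELECT pid, wait_event_type, wait_event, state, left(query, 60) AS query
FROM pg_stat_activity
WHERE wait_event_type IN ('Lock', 'LWLock');

-- For heavyweight waits, the sessions each waiter is queued behind.
SELECT pid, pg_blocking_pids(pid) AS blocked_by
FROM pg_stat_activity
WHERE wait_event_type = 'Lock';

-- Predicate locks held by serializable transactions; these rows are
-- bookkeeping only and never cause a wait.
SELECT locktype, relation::regclass AS relation, page, tuple, pid
FROM pg_locks
WHERE mode = 'SIReadLock';
```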

Row locks, relation locks, deadlocks