Commit 3d9b822

Merge branch 'main' into 'main'

Days 14 to 20. See merge request postgres-ai/postgresql-consulting/postgres-howtos!3
Originally from: [tweet](https://twitter.com/samokhvalov/status/1711575029006418218), [LinkedIn post](...).

---

# How to decide when a query is too slow and needs optimization

> I post a new PostgreSQL "howto" article every day. Join me in this
> journey – [subscribe](https://twitter.com/samokhvalov/), provide feedback, share!
"Slow" is a relative concept. In some cases, we might be happy with query latency 1 minute (or no?), while in other
11+
scenarios, even 1 ms might seem to be too slow.
Deciding when to apply optimization techniques is important for efficiency – as Donald Knuth famously stated in "The Art
of Computer Programming":

> The real problem is that programmers have spent far too much time worrying about efficiency in the wrong places and at
> the wrong times; premature optimization is the root of all evil (or at least most of it) in programming.
Below we assume that we work with OLTP or hybrid workloads and need to decide if a certain query is too slow and
requires optimization.

## How to conclude that a query is too slow
1. Do you have an OLTP case, an analytical one, or hybrid? For OLTP cases, requirements are stricter and dictated
   by human perception (see: [What is a slow SQL query?](https://postgres.ai/blog/20210909-what-is-a-slow-sql-query)),
   while for analytical needs, we can usually wait a minute or two – unless it's also user-facing. If it is, we probably
   consider 1 minute too slow. In this case, consider using column-store database systems (and the Postgres ecosystem
   has a new offering here: check out [@hydradatabase](https://twitter.com/hydradatabase)). For OLTP, the majority of
   user-facing queries should be below 100 ms – ideally, below 10 ms – so that the complete requests users make to your
   backends do not exceed 100-200 ms (each request can issue several SQL queries, depending on the case). Of course,
   non-user-facing queries such as those coming from background jobs, `pg_dump`, and so on can last longer – assuming
   that the principles below are met.
2. In the case of OLTP, the second question should be: is this query read-only, or does it change data (be it DDL or
   just data-writing DML – INSERT/UPDATE/DELETE)? If it changes data, in OLTP we shouldn't allow it to run longer than
   a second or two, unless we are 100% sure that this query won't block other queries for long. For massive writes,
   consider splitting them into batches so that each batch doesn't last longer than 1-2 seconds. For DDL, be careful
   with lock acquisition and lock chains (read these posts:
   [Common DB schema change mistakes](https://postgres.ai/blog/20220525-common-db-schema-change-mistakes#case-5-acquire-an-exclusive-lock--wait-in-transaction)
   and
   [Useful queries to analyze PostgreSQL lock trees (a.k.a. lock queues)](https://postgres.ai/blog/20211018-postgresql-lock-trees)).
3. If you're dealing with a read-only query, make sure it's not running for too long either – long-running transactions
   make Postgres hold old dead tuples for a long time (the "xmin horizon" is not advancing), so autovacuum cannot
   delete dead tuples that became dead after the start of our transaction. Aim to avoid transactions that last longer
   than one or a few hours (and if you absolutely need such long transactions, prefer running them at low-activity
   hours, when the XID is progressing slowly, and do not run them often).
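To check for such offenders, a minimal sketch of a `pg_stat_activity` query that lists transactions open for more than one hour (the 1-hour threshold is an assumption; adjust to your workload):

```sql
-- Sessions whose transactions started more than 1 hour ago
select
  pid,
  now() - xact_start as xact_duration,
  state,
  left(query, 100) as query
from pg_stat_activity
where xact_start < now() - interval '1 hour'
order by xact_start;
```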
48+
49+
4. Finally, even if a query is relatively fast – for instance, 10ms – it might still be considered too slow if its
50+
frequency is high. For example, 10ms query running 1,000 times per second (you can check it via
51+
`pg_stat_statements.calls`), then Postgres needs to spend 10 seconds *every* second to process this group of queries.
52+
In this case, if lowering down the frequency is hard, the query should be considered slow, and an optimization
53+
attempt needs to be performed, to reduce resource consumption (the goal here is to reduce
54+
`pg_stat_statements.total_exec_time` – see
55+
the [previous #PostgresMarathon posts about pgss](https://twitter.com/search?q=%23PostgresMarathon%20pg_stat_statements&src=typed_query&f=live)).
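To spot such query groups, a sketch along these lines can help (assuming the `pg_stat_statements` extension is installed; column names are per Postgres 13+, where `total_exec_time` replaced `total_time`):

```sql
-- Top-10 query groups by total execution time
select
  queryid,
  calls,
  round(mean_exec_time::numeric, 2) as mean_ms,
  round(total_exec_time::numeric, 2) as total_ms,
  left(query, 80) as query_sample
from pg_stat_statements
order by total_exec_time desc
limit 10;
```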
## Summary

- All user-facing queries that last longer than 100-200 ms should be considered slow. Good queries are those that are
  below 10 ms.
- It's OK for background processing queries to last longer. If they modify data and might block user-facing queries,
  they should not be allowed to last longer than 1-2 s.
- Be careful with DDL – make sure it doesn't cause massive writes (if it does, it should be split into batches as
  well), and use a low `lock_timeout` and retries to avoid blocking chains.
- Do not allow long-running transactions. Make sure the xmin horizon is progressing and autovacuum can remove dead
  tuples promptly – do not allow transactions that last too long (>1-2h).
- Optimize even fast (<100 ms) queries if the corresponding `pg_stat_statements.calls` and
  `pg_stat_statements.total_exec_time` are high.
Originally from: [tweet](https://twitter.com/samokhvalov/status/1711982079838507225), [LinkedIn post](...).

---

# How to monitor CREATE INDEX / REINDEX progress in Postgres 12+

> I post a new PostgreSQL "howto" article every day. Join me in this
> journey – [subscribe](https://twitter.com/samokhvalov/), provide feedback, share!
To monitor the progress of long-running index building or rebuilding, you can use this query:

```sql
select
  now(),
  query_start as started_at,
  now() - query_start as query_duration,
  format('[%s] %s', a.pid, a.query) as pid_and_query,
  index_relid::regclass as index_name,
  relid::regclass as table_name,
  (pg_size_pretty(pg_relation_size(relid))) as table_size,
  nullif(wait_event_type, '') || ': ' || wait_event as wait_type_and_event,
  phase,
  format(
    '%s (%s of %s)',
    coalesce((round(100 * blocks_done::numeric / nullif(blocks_total, 0), 2))::text || '%', 'N/A'),
    coalesce(blocks_done::text, '?'),
    coalesce(blocks_total::text, '?')
  ) as blocks_progress,
  format(
    '%s (%s of %s)',
    coalesce((round(100 * tuples_done::numeric / nullif(tuples_total, 0), 2))::text || '%', 'N/A'),
    coalesce(tuples_done::text, '?'),
    coalesce(tuples_total::text, '?')
  ) as tuples_progress,
  current_locker_pid,
  (select nullif(left(query, 150), '') || '...' from pg_stat_activity a where a.pid = current_locker_pid) as current_locker_query,
  format(
    '%s (%s of %s)',
    coalesce((round(100 * lockers_done::numeric / nullif(lockers_total, 0), 2))::text || '%', 'N/A'),
    coalesce(lockers_done::text, '?'),
    coalesce(lockers_total::text, '?')
  ) as lockers_progress,
  format(
    '%s (%s of %s)',
    coalesce((round(100 * partitions_done::numeric / nullif(partitions_total, 0), 2))::text || '%', 'N/A'),
    coalesce(partitions_done::text, '?'),
    coalesce(partitions_total::text, '?')
  ) as partitions_progress,
  (
    select
      format(
        '%s (%s of %s)',
        coalesce((round(100 * n_dead_tup::numeric / nullif(reltuples::numeric, 0), 2))::text || '%', 'N/A'),
        coalesce(n_dead_tup::text, '?'),
        coalesce(reltuples::int8::text, '?')
      )
    from pg_stat_all_tables t, pg_class tc
    where t.relid = p.relid and tc.oid = p.relid
  ) as table_dead_tuples
from pg_stat_progress_create_index p
left join pg_stat_activity a on a.pid = p.pid
order by p.index_relid
; -- in psql, use "\watch 5" instead of semicolon
```
The same query, in [a better formatted form](https://gitlab.com/-/snippets/2138417).

How this query works:

1. The basis of it is `pg_stat_progress_create_index`,
   [added in Postgres 12](https://postgresql.org/docs/current/progress-reporting.html#CREATE-INDEX-PROGRESS-REPORTING).
2. The documentation also has a list of `CREATE INDEX` phases. As you can see from the table provided in the docs, the
   advanced variants, `CREATE INDEX CONCURRENTLY` / `REINDEX CONCURRENTLY` (a.k.a. CIC and RC), which take longer but
   act in a non-blocking fashion suitable for loaded production systems, have more phases. The current phase is
   presented in the "phase" column of the output.
3. The index name (a temporary one in the case of CIC/RC) and table name are presented (using the useful trick of
   converting OIDs to names – note, e.g., `index_relid::regclass as index_name`). Additionally, the table size is
   shown, which is essential for forming expectations of overall duration – the bigger the table, the longer the index
   creation is going to take.
4. `pg_stat_activity` (`pgsa`) provides a lot of additional useful information:
   - the PID of the Postgres backend
   - the actual SQL query used
   - the moment when the work started (`query_start`), allowing us to understand the elapsed time (`query_duration`)
   - `wait_event_type` & `wait_event` to understand what the process is currently waiting on
   - it is also used (in a separate sub-query) to get information about the session that blocks our process, when such
     an event occurs (`current_locker_pid`, `current_locker_query`)
5. The function `format(...)` is very useful for consolidating data in a convenient form without having to worry about
   `NULL`s, which would be a problem if we used regular concatenation without `coalesce(...)`.

6. However, in certain cases we use `coalesce(...)` just to put special symbols in if the value is missing
   (`IS NULL`) – e.g., "?" or "N/A".

7. Another interesting trick is the combination of `coalesce(...)` and `nullif(...)`. The latter allows us to avoid
   division-by-zero errors (replacing `0` with `NULL`, making the division result also `NULL`), while the former,
   again, is used to substitute the resulting `NULL` with some non-empty value (in this case, 'N/A').
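A minimal standalone illustration of this `coalesce` + `nullif` pattern, with a deliberately zero denominator:

```sql
-- nullif(0, 0) turns the denominator into NULL, the division then yields NULL,
-- and coalesce replaces the resulting NULL with 'N/A':
select coalesce(
  (round(100 * 5::numeric / nullif(0, 0), 2))::text || '%',
  'N/A'
);  -- returns 'N/A'
```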
When executed in `psql`, it can be convenient to use `\watch [seconds]` to run this report in a loop and observe the
progress live:

![tracking the progress of index building/rebuilding](files/0015_reindex.gif)
Originally from: [tweet](https://twitter.com/samokhvalov/status/1712342522314572064), [LinkedIn post](...).

---

# How to get into trouble using some Postgres features

> I post a new PostgreSQL "howto" article every day. Join me in this
> journey – [subscribe](https://twitter.com/samokhvalov/), provide feedback, share!
Today we have quite entertaining material, but knowing (and avoiding) these things can save you time and effort.

## NULLs

`NULL`s, while being very common, are the most popular way to get into trouble when using SQL in general, and Postgres
is no exception.
For example, one may forget that concatenation (`||`), arithmetic operations (`*`, `/`, `+`, `-`), and traditional
comparison operators (`=`, `<`, `>`, `<=`, `>=`, `<>`) are all not NULL-safe, and later be very surprised that the
result is lost.

It especially hurts when you're building a startup and some important business logic depends on a query that doesn't
handle `NULL`s properly, leading to loss of users, money, or time (or all of those):
```
nik=# \pset null ∅
Null display is "∅".

nik=# select null + 1;
 ?column?
----------
        ∅
(1 row)
```
`NULL`s can be really dangerous, and even experienced engineers continue to bump into issues when working with them.
Some useful materials to educate yourself:

- [NULLs: the good, the bad, the ugly, and the unknown](https://postgres.fm/episodes/nulls-the-good-the-bad-the-ugly-and-the-unknown)
  (podcast)
- [What is the deal with NULLs?](http://thoughts.davisjeff.com/2009/08/02/what-is-the-deal-with-nulls/)
A couple of tips on how to make your code NULL-safe:

- Consider using expressions like `COALESCE(val, 0)` to replace `NULL`s with some value (usually `0` or `''`).
- For comparison, instead of `=` or `<>`, use `IS [NOT] DISTINCT FROM` (check out the `EXPLAIN` plan though).
- Instead of concatenation, use: `format('%s %s', var1, var2)`.
- Don't use `WHERE NOT IN (SELECT ...)` – use `NOT EXISTS` instead
  (see this [JOOQ blog post](https://jooq.org/doc/latest/manual/reference/dont-do-this/dont-do-this-sql-not-in/)).
- Just be careful. `NULL`s are treacherous.
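A few minimal illustrations of these NULL-safe patterns:

```sql
select coalesce(null::int, 0);          -- returns 0 instead of NULL
select null is distinct from 1;         -- returns true (NULL-safe comparison)
select format('%s and %s', 'a', null);  -- returns 'a and ' – the result is not lost
```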
## Subtransactions under heavy loads

If you aim to grow to dozens or hundreds of thousands of TPS and want to have various issues, use subtransactions.
Probably, you use them implicitly – e.g., if you use Django, Rails, or `BEGIN/EXCEPTION` blocks in PL/pgSQL.

Why you might want to get rid of subtransactions completely:
[PostgreSQL Subtransactions Considered Harmful](https://postgres.ai/blog/20210831-postgresql-subtransactions-considered-harmful)
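To see where subtransactions sneak in implicitly, here is a small sketch of a PL/pgSQL `BEGIN/EXCEPTION` block – each entry into such a block opens a subtransaction under the hood (the table name `t1` is hypothetical):

```sql
do $$
begin
  begin  -- this inner block with an EXCEPTION clause implicitly creates a subtransaction
    insert into t1 values (1);
  exception when others then
    raise notice 'swallowed error: %', sqlerrm;
  end;
end
$$;
```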
## int4 PK

Zero-downtime conversion of an `int4` (a.k.a. int, a.k.a. integer) PK to `int8` when the table has 1B rows requires a
lot of effort. Meanwhile, a table `(id int4, created_at timestamptz)` is going to occupy the same disk space as
`(id int8, created_at timestamptz)` due to [alignment padding](https://stackoverflow.com/a/7431468/459391).
## (Exotic) SELECT INTO is not what you think it is

One day I was debugging a PL/pgSQL function, and copy-pasted a query like this to `psql`:

```
nik=# select * into var from t1 limit 1;
SELECT 1
```

It worked! This was a huge surprise – in the SQL context (not PL/pgSQL),
[SELECT INTO](https://postgresql.org/docs/current/sql-selectinto.html) is a DDL command that creates a table and
inserts data into it (shouldn't this be deprecated already?).
## Thinking that "transactional DDL" is easy

Yes, Postgres has "transactional DDL" and you can benefit from it a lot – until you cannot. Under load, you cannot rely
on it – instead, you need to start using zero-downtime methodologies and avoid mistakes
(read: [common db schema change mistakes](https://postgres.ai/blog/20220525-common-db-schema-change-mistakes)), and
rely on "non-transactional" DDL such as `CREATE INDEX CONCURRENTLY`, assuming that some attempts might fail, after
which cleanup is needed before retrying.

A big problem with DDL deployment under load is that, by default, you can experience downtime while attempting to
deploy a very light schema change – unless you implement logic with a low `lock_timeout` and retries
(see: [zero-downtime postgres schema migrations lock timeout and retries](https://postgres.ai/blog/20210923-zero-downtime-postgres-schema-migrations-lock-timeout-and-retries)).
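A sketch of the idea (the table and column names are hypothetical; the retry loop itself usually lives in the migration tool or application code):

```sql
-- Fail fast instead of queueing behind long-running transactions; on failure
-- (SQLSTATE 55P03, lock_not_available), the caller waits briefly and retries.
set lock_timeout to '100ms';
alter table t1 add column new_col int8;
```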
## DELETE a lot of rows with one command

This is a good way to get into trouble: issue a `DELETE` of millions of rows and wait. If the `checkpointer` is not
tuned (e.g., `max_wal_size = 1GB`), if tuples are deleted via an `IndexScan` access path (meaning the process of making
pages dirty is quite "random"), and disk IO is throttled, this may put your system down. And even if it survives the
stress, you'll get:

- risks of locking issues (the `DELETE` blocking some writes issued by other users),
- a large number of dead tuples produced, to be converted to bloat later by `autovacuum`.

What to do:

- split the work into batches,
- if a massive write is inevitable, consider raising `max_wal_size` temporarily, which doesn't require a restart
  (however, this potentially increases recovery time if the server crashes during this procedure).

Read [common db schema change mistakes](https://postgres.ai/blog/20220525-common-db-schema-change-mistakes#case-4-unlimited-massive-change).
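The batching approach can be sketched like this (the table `t1` and the retention condition are hypothetical; the loop runs on the application side until zero rows are affected):

```sql
-- One batch: delete up to 5,000 of the oldest matching rows
with batch as (
  select id
  from t1
  where created_at < now() - interval '1 year'
  limit 5000
)
delete from t1
where id in (select id from batch);
```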
## Other "Don't do" articles

- [Depesz: Don’t do these things in PostgreSQL](https://depesz.com/2020/01/28/dont-do-these-things-in-postgresql/)
- [PostgreSQL Wiki: Don't Do This](https://wiki.postgresql.org/wiki/Don't_Do_This)
- [JOOQ: Don't do this](https://jooq.org/doc/latest/manual/reference/dont-do-this/)
