PostgreSQL Advanced Admin Guide
PostgreSQL Advanced Admin Guide
PostgreSQL Administration
BRUCE MOMJIAN
1 / 109
Outline
1. Installation
2. Configuration
3. Maintenance
4. Monitoring
5. Recovery
2 / 109
Installation
◮ Click-through Installers
◮ MS Windows
◮ Linux
◮ OS X
◮ Ports
◮ RPM
◮ DEB
◮ PKG
◮ other packages
◮ Source
◮ obtaining
◮ build options
◮ installing
3 / 109
Initialization (initdb)
$ initdb
All of PostgreSQL successfully made. Ready to install.
PostgreSQL installation complete.
The files belonging to this database system will be owned by user "postgres".
Main
Libpq
Postmaster
Postgres Postgres
Parse Statement
utility Utility
Traffic Cop
Command
Query e.g. CREATE TABLE, COPY
SELECT, INSERT, UPDATE, DELETE
Rewrite Query
Generate Paths
Optimal Path
Generate Plan
Plan
Execute Plan
6 / 109
Starting Postmaster
◮ manually
◮ pg_ctl start
◮ on boot
7 / 109
Stopping Postmaster
◮ manually
◮ pg_ctl stop
◮ on shutdown
8 / 109
Connections
9 / 109
Authentication
◮ trust
◮ reject
◮ passwords
◮ md5
◮ password (cleartext)
◮ local authentication
◮ socket permissions
◮ ’peer’ socket user name passing
◮ host ident using local identd
10 / 109
Authentication (continued)
◮ remote authentication
◮ host ident using pg_ident.conf
◮ kerberos
◮ gss
◮ sspi
◮ pam
◮ ldap
◮ radius
◮ cert
11 / 109
Access
12 / 109
pg_hba.conf Default
13 / 109
pg_hba.conf Example
14 / 109
Permissions
15 / 109
Data Directory
$ ls -CF
base/ pg_hba.conf pg_replslot/ pg_subtrans/ postgresql.auto.conf
global/ pg_ident.conf pg_serial/ pg_tblspc/ postgresql.conf
pg_clog/ pg_logical/ pg_snapshots/ pg_twophase/ postmaster.opts
pg_commit_ts/ pg_multixact/ pg_stat/ PG_VERSION postmaster.pid
pg_dynshmem/ pg_notify/ pg_stat_tmp/ pg_xlog/
16 / 109
Database Directories
$ ls -CF global/
1136 1214_fsm 1261_vm 2671 2846 2967 6000_vm
1136_fsm 1214_vm 1262 2672 2846_vm 3592 6001
1136_vm 1232 1262_fsm 2676 2847 3592_vm 6002
1137 1233 1262_vm 2677 2964 3593 pg_control
1213 1260 2396 2694 2964_vm 4060 pg_filenode.map
1213_fsm 1260_fsm 2396_fsm 2695 2965 4060_vm pg_internal.init
1213_vm 1260_vm 2396_vm 2697 2966 4061
1214 1261 2397 2698 2966_vm 6000
$ ls -CF base/
1/ 12406/ 12407/ 16384/
$ ls -CF base/16384
112 1249_fsm 2606_vm 2652 2699 3081 3598_vm
113 1249_vm 2607 2653 2701 3085 3599
12242 1255 2607_fsm 2654 2702 3118 3600
12242_fsm 1255_fsm 2607_vm 2655 2703 3118_vm 3600_fsm
12242_vm 1255_vm 2608 2656 2704 3119 3600_vm
12244 1259 2608_fsm 2657 2753 3164 3601
12246 1259_fsm 2608_vm 2658 2753_fsm 3256 3601_fsm
…
17 / 109
Transaction/WAL Directories
$ ls -CF pg_xlog/
000000010000000000000001 archive_status/
$ ls -CF pg_clog/
0000
18 / 109
Configuration Directories
$ ls -CF share/
conversion_create.sql postgres.bki snowball_create.sql
extension postgres.description sql_features.txt
information_schema.sql postgresql.conf.sample system_views.sql
pg_hba.conf.sample postgres.shdescription timezone
pg_ident.conf.sample psqlrc.sample timezonesets
pg_service.conf.sample recovery.conf.sample tsearch_data
19 / 109
Configuration of postgresql.conf
https://www.flickr.com/photos/mwichary/
20 / 109
postgresql.conf
# -----------------------------
# PostgreSQL configuration file
# -----------------------------
#
# This file consists of lines of the form:
#
# name = value
#
# (The "=" is optional.) Whitespace may be used. Comments are introduced with
# "#" anywhere on a line. The complete list of parameter names and allowed
# values can be found in the PostgreSQL documentation.
#
# The commented-out settings shown in this file represent the default values.
# Re-commenting a setting is NOT sufficient to revert it to the default value;
# you need to reload the server.
21 / 109
postgresql.conf (Continued)
# This file is read on server startup and when the server receives a SIGHUP
# signal. If you edit the file on a running system, you have to SIGHUP the
# server for the changes to take effect, or use "pg_ctl reload". Some
# parameters, which are marked below, require a server shutdown and restart to
# take effect.
#
# Any parameter can also be given as a command-line option to the server, e.g.,
# "postgres -c log_connections=on". Some parameters can be changed at run time
# with the "SET" SQL command.
#
# Memory units: kB = kilobytes Time units: ms = milliseconds
# MB = megabytes s = seconds
# GB = gigabytes min = minutes
# TB = terabytes h = hours
# d = days
22 / 109
Configuration File Location
# The default values of these variables are driven from the -D command-line
# option or PGDATA environment variable, represented here as ConfigDir.
23 / 109
Connections and Authentication
24 / 109
Security and Authentication
25 / 109
TCP/IP Control
26 / 109
Memory Usage
27 / 109
Memory Usage (Continued)
Recovery
fsync
fsync
Disk Blocks
28 / 109
Sizing Shared Memory
Page In (bad)
Kernel
29 / 109
Disk and Kernel Resources
# - Disk -
30 / 109
Vacuum and Background Writer
#vacuum_cost_delay = 0 # 0-100 milliseconds
#vacuum_cost_page_hit = 1 # 0-10000 credits
#vacuum_cost_page_miss = 10 # 0-10000 credits
#vacuum_cost_page_dirty = 20 # 0-10000 credits
#vacuum_cost_limit = 200 # 1-10000 credits
# - Background Writer -
# - Asynchronous Behavior -
#effective_io_concurrency = 1 # 1-1000; 0 disables prefetching
#max_worker_processes = 8 # (change requires restart)
#max_parallel_workers_per_gather = 0 # taken from max_worker_processes
#old_snapshot_threshold = -1 # 1min-60d; -1 disables; 0 is immediate
# (change requires restart)
#backend_flush_after = 0 # measured in pages, 0 disables
31 / 109
Write-Ahead Log (WAL)
#wal_level = minimal # minimal, replica, or logical
# (change requires restart)
#fsync = on # flush data to disk for crash safety
# (turning this off can cause
# unrecoverable data corruption)
#synchronous_commit = on # synchronization level;
# off, local, remote_write, remote_apply, o
#wal_sync_method = fsync # the default is the first option
# supported by the operating system:
# open_datasync
# fdatasync (default on Linux)
# fsync
# fsync_writethrough
# open_sync
#full_page_writes = on # recover from partial page writes
#wal_compression = off # enable compression of full-page writes
#wal_log_hints = off # also do full page writes of non-critical
# (change requires restart)
#wal_buffers = -1 # min 32kB, -1 sets based on shared_buffers
# (change requires restart)
#wal_writer_delay = 200ms # 1-10000 milliseconds
#wal_writer_flush_after = 1MB # measured in pages, 0 disables
Recovery
fsync
fsync
Disk Blocks
33 / 109
Checkpoints and Archiving
# - Archiving -
34 / 109
Write-Ahead Logging (Continued)
PostgreSQL Shared Buffer Cache Write−Ahead Log
1111
0000 1111
0000 111
000
Begin 1
0000
1111
0000
1111
1
0000
1111
0000
1111
1
000
111
000
111
1
0000
1111 0000
1111 000
111
1111
0000 1111111
0000000 111
000
0000
1111
0000
1111
0000
1111 0000
1111
0000
1111000
111 000
111
0000
1111
0000000
1 2 111
2 000
111
11111
0000
2
0000
1111 1111000
111 000
111
0000
1111
1111
0000
0000
1111000
111
000
111 1111
0000
0000
1111
End 1 0000
1111
0000000
2 111
1111
2
000
111 0000
1111
2
0000
1111
1111
0000
0000
1111000
111
000
111 1111
0000
0000
1111
Rotate 0000
1111
0000000
2 111
1111
2
000
111 0000
1111
2
0000
1111
35 / 109
Sending Server
36 / 109
Primary Replication Server
37 / 109
Standby Replication Server
38 / 109
Planner Method Tuning
#enable_bitmapscan = on
#enable_hashagg = on
#enable_hashjoin = on
#enable_indexscan = on
#enable_indexonlyscan = on
#enable_material = on
#enable_mergejoin = on
#enable_nestloop = on
#enable_seqscan = on
#enable_sort = on
#enable_tidscan = on
39 / 109
Planner Constants
40 / 109
Planner GEQO
#geqo = on
#geqo_threshold = 12
#geqo_effort = 5 # range 1-10
#geqo_pool_size = 0 # selects default based on effort
#geqo_generations = 0 # selects default based on effort
#geqo_selection_bias = 2.0 # range 1.5-2.0
#geqo_seed = 0.0 # range 0.0-1.0
41 / 109
Miscellaneous Planner Options
42 / 109
Where To Log
43 / 109
Where To Log (rotation)
44 / 109
Where to Log (syslog)
#syslog_facility = ’LOCAL0’
#syslog_ident = ’postgres’
#syslog_sequence_numbers = on
#syslog_split_messages = on
45 / 109
When to Log
47 / 109
What to Log
#debug_print_parse = off
#debug_print_rewritten = off
#debug_print_plan = off
#debug_pretty_print = on
#log_checkpoints = off
#log_connections = off
#log_disconnections = off
#log_duration = off
#log_error_verbosity = default # terse, default, or verbose messages
#log_hostname = off
48 / 109
What To Log: Log_line_prefix
49 / 109
What to Log (Continued)
# - Process Title -
50 / 109
Runtime Statistics
#track_activities = on
#track_counts = on
#track_io_timing = off
#track_functions = none # none, pl, all
#track_activity_query_size = 1024 # (change requires restart)
#stats_temp_directory = ’pg_stat_tmp’
# - Statistics Monitoring -
#log_parser_stats = off
#log_planner_stats = off
#log_executor_stats = off
#log_statement_stats = off
51 / 109
Autovacuum
53 / 109
Locale, Formatting, and Full Text Search
datestyle = ’iso, mdy’
#intervalstyle = ’postgres’
timezone = ’US/Eastern’
#timezone_abbreviations = ’Default’ # Select the set of available time zone
# abbreviations. Currently, there are
# Default
# Australia (historical usage)
# India
# You can create your own file in
# share/timezonesets/.
#extra_float_digits = 0 # min -15, max 3
#client_encoding = sql_ascii # actually, defaults to database
# encoding
#dynamic_library_path = ’$libdir’
#local_preload_libraries = ’’
#session_preload_libraries = ’’
55 / 109
Lock Management
#deadlock_timeout = 1s
#max_locks_per_transaction = 64 # min 10
# (change requires restart)
#max_pred_locks_per_transaction = 64 # min 10
# (change requires restart)
56 / 109
Version/Platform Compatibility
#array_nulls = on
#backslash_quote = safe_encoding # on, off, or safe_encoding
#default_with_oids = off
#escape_string_warning = on
#lo_compat_privileges = off
#operator_precedence_warning = off
#quote_all_identifiers = off
#sql_inheritance = on
#standard_conforming_strings = on
#synchronize_seqscans = on
#transform_null_equals = off
57 / 109
Error Handling
58 / 109
Config File Includes
59 / 109
Interfaces
◮ Installing
◮ Compiled Languages (C, ecpg)
◮ Scripting Language (Perl, Python, PHP)
◮ SPI
◮ Connection Pooling
60 / 109
Include Files
$ ls -CF include/
ecpg_config.h libpq/ pgtypes_date.h sql3types.h
ecpgerrno.h libpq-events.h pgtypes_error.h sqlca.h
ecpg_informix.h libpq-fe.h pgtypes_interval.h sqlda-compat.h
ecpglib.h pg_config_ext.h pgtypes_numeric.h sqlda.h
ecpgtype.h pg_config.h pgtypes_timestamp.h sqlda-native.h
informix/ pg_config_manual.h postgres_ext.h
internal/ pg_config_os.h server/
61 / 109
Library Files
$ ls -CF lib/
ascii_and_mic.so* libpgcommon.a utf8_and_big5.so*
cyrillic_and_mic.so* libpgfeutils.a utf8_and_cyrillic.so*
dict_snowball.so* libpgport.a utf8_and_euc2004.so*
euc2004_sjis2004.so* libpgtypes.a utf8_and_euc_cn.so*
euc_cn_and_mic.so* libpgtypes.so@ utf8_and_euc_jp.so*
euc_jp_and_sjis.so* libpgtypes.so.3@ utf8_and_euc_kr.so*
euc_kr_and_mic.so* libpgtypes.so.3.7* utf8_and_euc_tw.so*
euc_tw_and_big5.so* libpq.a utf8_and_gb18030.so*
latin2_and_win1250.so* libpq.so@ utf8_and_gbk.so*
latin_and_mic.so* libpq.so.5@ utf8_and_iso8859_1.so*
libecpg.a libpq.so.5.9* utf8_and_iso8859.so*
libecpg_compat.a libpqwalreceiver.so* utf8_and_johab.so*
libecpg_compat.so@ pgxs/ utf8_and_sjis2004.so*
libecpg_compat.so.3@ pkgconfig/ utf8_and_sjis.so*
libecpg_compat.so.3.8* plperl.so* utf8_and_uhc.so*
libecpg.so@ plpgsql.so* utf8_and_win.so*
libecpg.so.6@ plpython2.so*
libecpg.so.6.8* utf8_and_ascii.so*
62 / 109
Maintenance
63 / 109
Backup
64 / 109
Continuous Archiving /
Point-In-Time Recovery (PITR)
0
:0
:0
:0
:0
02
09
11
13
WAL AL AL
W W
65 / 109
PITR Backup Procedures
1. archive_mode = on
2. wal_level = archive
3. archive_command = ’cp -i %p /mnt/server/pgsql/%f <
/dev/null’
4. SELECT pg_start_backup(’label’);
5. Perform file system-level backup (can be inconsistent)
6. SELECT pg_stop_backup();
pg_basebackup does this automatically and can be run on version
9.2+ standbys.
66 / 109
PITR Recovery
5
:0
:3
:4
:5
17
17
17
17
WAL
AL AL
W W
67 / 109
PITR Recovery Procecdures
1. Stop postmaster
2. Restore file system-level backup
3. Make adjustments as outlined in the documentation
4. Create recovery.conf
5. restore_command = ’cp /mnt/server/pgsql/%f %p’
6. Start the postmaster
68 / 109
Data Maintenance
69 / 109
Automating Tasks
70 / 109
Checkpoints
71 / 109
Monitoring Active Sessions
72 / 109
ps
$ ps -f -Upostgres
postgres 825 1 0 Tue12AM ?? 0:06.57 /u/pgsql/bin/postmaster -i
postgres 829 825 0 Tue12AM ?? 0:35.03 writer process (postmaster)
postgres 830 825 0 Tue12AM ?? 0:16.07 wal writer process (postmaster)
postgres 831 825 0 Tue12AM ?? 0:11.34 autovacuum launcher process (postmaster)
postgres 832 825 0 Tue12AM ?? 0:07.63 stats collector process (postmaster)
postgres 13003 825 0 3:44PM ?? 0:00.01 postgres test [local] idle (postmaster)
postgres 13002 12997 0 3:44PM ttyq1 0:00.03 /u/pgsql/bin/psql test
73 / 109
top
$ top
load averages: 0.56, 0.39, 0.36 18:25:58
138 processes: 5 running, 130 sleeping, 3 zombie
CPU states: 50.0% user, 0.0% nice, 0.0% system, 0.0% interrupt, 50.0% idle
Memory: Real: 96M/133M Virt: 535M/1267M Free: 76M
PID USERNAME PRI NICE SIZE RES STATE TIME WCPU CPU COMMAND
23785 postgres 57 0 11M 5336K run/0 0:07 30.75% 30.66% postmaster
23784 postgres 2 0 10M 11M sleep 0:00 2.25% 2.25% psql
74 / 109
Query Monitoring
75 / 109
Access Statistics
76 / 109
Database Statistics
77 / 109
Table Activity
78 / 109
Table Block Activity
79 / 109
Analyzing Activity
80 / 109
CPU
$ vmstat 5
procs memory page disks faults cpu
r b w avm fre flt re pi po fr sr s0 s0 in sy cs us sy id
1 0 0 501820 48520 1234 86 2 0 0 3 5 0 263 2881 599 10 4 86
3 0 0 512796 46812 1422 201 12 0 0 0 3 0 259 6483 827 4 7 88
3 0 0 542260 44356 788 137 6 0 0 0 8 0 286 5698 741 2 5 94
4 0 0 539708 41868 576 65 13 0 0 0 4 0 273 5721 819 16 4 80
4 0 0 547200 32964 454 0 0 0 0 0 5 0 253 5736 948 50 4 46
4 0 0 556140 23884 461 0 0 0 0 0 2 0 249 5917 959 52 3 44
1 0 0 535136 46280 1056 141 25 0 0 0 2 0 261 6417 890 24 6 70
81 / 109
I/O
$ iostat 5
tty sd0 sd1 sd2 % cpu
tin tout sps tps msps sps tps msps sps tps msps usr nic sys int idl
7 119 244 11 6.1 0 0 27.3 0 0 18.1 9 1 4 0 86
0 86 20 1 1.4 0 0 0.0 0 0 0.0 2 0 2 0 96
0 82 61 4 3.6 0 0 0.0 0 0 0.0 2 0 2 0 97
0 65 6 0 0.0 0 0 0.0 0 0 0.0 1 0 2 0 97
12 90 31 2 5.4 0 0 0.0 0 0 0.0 4 0 3 0 93
24 173 6 0 4.9 0 0 0.0 0 0 0.0 48 0 3 0 49
0 91 3594 63 4.6 0 0 0.0 0 0 0.0 11 0 4 0 85
82 / 109
Disk Usage
83 / 109
Database File Mapping - oid2name
$ oid2name
All databases:
---------------------------------
18720 = test1
1 = template1
18719 = template0
18721 = test
18735 = postgres
18736 = cssi
84 / 109
Table File Mapping
$ cd /usr/local/pgsql/data/base
$ oid2name
All databases:
---------------------------------
16817 = test2
16578 = x
16756 = test
1 = template1
16569 = template0
16818 = test3
16811 = floattest
$ cd 16756
$ ls 1873*
18730 18731 18732 18735 18736 18737 18738 18739
85 / 109
$ oid2name -d test -o 18737
Tablename of oid 18737 from database "test":
---------------------------------
18737 = ips
87 / 109
Per-Database Tablespaces
88 / 109
Per-Object Tablespaces
89 / 109
Analyzing Locking
$ ps -f -Upostgres
PID TT STAT TIME COMMAND
9874 ?? I 0:00.07 postgres test [local] idle in transaction (postmaster)
9835 ?? S 0:00.05 postgres test [local] UPDATE waiting (postmaster)
10295 ?? S 0:00.05 postgres test [local] DELETE waiting (postmaster)
90 / 109
Miscellaneous Tasks
91 / 109
Administration Tools
◮ pgadmin
◮ phppgadmin
92 / 109
External Monitoring Tools
93 / 109
Recovery
https://www.flickr.com/photos/coastguardnews/
94 / 109
Client Application Crash
95 / 109
Graceful Postgres Server Shutdown
96 / 109
Abrupt Postgres Server Crash
97 / 109
Operating System Crash
98 / 109
Disk Failure
99 / 109
Accidental DELETE
100 / 109
Write-Ahead Log (WAL) Corruption
101 / 109
File Deletion
102 / 109
Accidental DROP TABLE
103 / 109
Accidental DROP INDEX
Recreate index.
104 / 109
Accidental DROP DATABASE
105 / 109
Non-Starting Installation
106 / 109
Index Corruption
Use REINDEX.
107 / 109
Table Corruption
Try reindexing the table. Try identifying the corrupt OID of the
row and transfer the valid rows into another table using
SELECT…INTO…WHERE oid != ###. Use http://sources.
redhat.com/rhdb/tools.html to analyze the internal structure of
the table.
108 / 109
Conclusion
http://momjian.us/presentations
109 / 109