February 2025 — start/stop PostgreSQL 12 and find hostname and port

One can, in general, use either of the following two programs:

systemctl <command> <service-name> (does not require sudo; start and stop prompt for authentication instead), or,
sudo service <service-name> <command>

The service name can be given either as postgresql.service or simply as postgresql on the systemctl command line, but only as postgresql (without the .service suffix) on the service command line.

Finally, a further degree of freedom is that either postgresql or postgresql@12-main may be used (if one knows the version of PostgreSQL and the name of the cluster: in the example below, 12 and main respectively).

As a result, all of the following commands to start/stop or get the status of a locally running PostgreSQL cluster (database server) are valid:

$ systemctl [ start/stop/status ] [ postgresql/postgresql.service/postgresql@12-main/postgresql@12-main.service ]
$ sudo service [ postgresql/postgresql@12-main ] [ start/stop/status ]
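
A minimal sketch of how the per-cluster unit name is put together from the version and the cluster name. The helper name pg_unit_name is my own invention, not something shipped with PostgreSQL or systemd:

```shell
# Build the systemd unit name for one PostgreSQL cluster.
# Usage: pg_unit_name VERSION CLUSTER
pg_unit_name() {
    printf 'postgresql@%s-%s.service\n' "$1" "$2"
}

pg_unit_name 12 main    # prints postgresql@12-main.service
```

The result can then be handed to systemctl, e.g. systemctl status "$(pg_unit_name 12 main)".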

February 2025 — restore a binary dump from PostgreSQL 15 into a PostgreSQL 12 database

In February 2025 I was able to restore a binary dump taken from a PostgreSQL 15 database into a PostgreSQL 12 database using a custom restore script. The script had further customizations and did not restore all tables present in the original dump. It is available at: ~/repos/prj/cognitera-iacs/iacs-ui.24.cognitor.COGNITERA/database/eae.pg_dump.RESTORE.sh

February 2025 — find the version of PostgreSQL from which a binary dump file was obtained

$ pg_restore -l eae.pg_dump | head -n 10
;
; Archive created at 2025-01-28 18:30:01 EET
;     dbname: core
;     TOC Entries: 12325
;     Compression: -1
;     Dump Version: 1.14-0
;     Format: CUSTOM
;     Integer: 4 bytes
;     Offset: 8 bytes
;     Dumped from database version: 15.7
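
To pull just the version out of that header, something along these lines should work. The name dump_server_version is hypothetical, and the sketch assumes the header format shown above:

```shell
# Print the server version recorded in the header of a `pg_restore -l` listing.
# The listing is read from standard input.
dump_server_version() {
    sed -n 's/^;[[:space:]]*Dumped from database version:[[:space:]]*//p' | head -n 1
}

# Intended use (assumes a dump file named eae.pg_dump, as in the note above):
#   pg_restore -l eae.pg_dump | dump_server_version
```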

February 2025 — find list of databases on the local cluster

psql -l

February 2025 — change the password for user postgres

I simply connected using the psql program as user postgres:

$ ./eae.pg_dump.RESTORE.sh

postgres=# \password postgres

… full interaction from the command line shown in the below figure:

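I have no definite explanation as to why I was not prompted for a password on the first step when I launched the psql program; possibly the local (Unix-socket) connection was covered by a peer or trust entry in pg_hba.conf.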

February 2025 — find hostname and port

To find the hostname and port all I had to do, in my local installation at least, was connect as user postgres:

psql postgres

\conninfo
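
The port can also be extracted non-interactively. The sketch below assumes the wording of the \conninfo message as I remember psql printing it (ending in: at port "5432".); conninfo_port is a made-up helper name:

```shell
# Print the port number found in a psql \conninfo message read from stdin.
conninfo_port() {
    sed -n 's/.* at port "\([0-9][0-9]*\)".*/\1/p'
}

# Intended use (needs a running cluster and a role you can connect as):
#   psql postgres -c '\conninfo' | conninfo_port
```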

January 2025 — configure PostgreSQL 12 to use a data directory in some other location (disk)

I used the procedure outlined below to move the data directory of my PostgreSQL 12 installation (on my ThinkStation P320 machine running Ubuntu 20.04) to a second, much larger, disk, as I needed to import a huge (~100GB) binary dump of some PostgreSQL database that wasn't going to fit in the current location of the data directory. The procedure below of course only covers the data directory migration part, not the import part (which I fear will be a separate ordeal). I basically followed the instructions I found here (archived locally here). The instructions were almost entirely accurate, with three notable exceptions:

The instructions assume that the config file is located inside the data directory. This was not true in my case: the Debian/Ubuntu packages keep the configuration files under /etc/postgresql, separate from the data directory, so perhaps the instructions I found were written with a different kind of installation in mind.
It was not at all necessary to edit the file at: /lib/systemd/system/postgresql.service
The SELinux part was not at all applicable (I don't even know what that is; as far as I can tell, Ubuntu does not enable SELinux by default).

The steps I followed were:

Take note of the config file and the data directory locations. Do this by connecting to PostgreSQL as the postgres user with
```
psql postgres
```
and then execute show config_file; and show data_directory; at the psql prompt.
stop the PostgreSQL server (cluster in PostgreSQL terminology)
```
systemctl stop postgresql
```

prepare the destination directory. In my case I executed the following commands:

mkdir /media/hddb/postgresql-12-data
sudo chown postgres:postgres /media/hddb/postgresql-12-data
sudo chmod 700 /media/hddb/postgresql-12-data/

copy over the existing data directory to its new location:

rsync -av /var/lib/postgresql/12/main/ /media/hddb/postgresql-12-data/

Change the data_directory parameter value in the postgresql.conf configuration file. The parameter was previously commented out (so the default value was used); I set it to the new location:
```
$ cat /etc/postgresql/12/main/postgresql.conf | grep -i data_directory | grep -v ^#
data_directory = '/media/hddb/postgresql-12-data'
```
NB: the single quotes were actually entered in the config file. As far as I know they are required whenever the value is not a simple identifier or number, which is the case for a path.
NB2: in contrast to the instructions I was following, as already mentioned above, in my case the configuration file (/etc/postgresql/12/main/postgresql.conf) was in an entirely different subtree than the data directory (/var/lib/postgresql/12/main). So I changed the postgresql.conf file in the existing configuration location, and not in the new one (in the instructions the configuration file was located inside the data directory so, in that case, you'd have wanted to edit the configuration file in the new location).
the step in the instructions under the header systemd configuration was entirely skipped as explained in the introduction
I then restarted the PostgreSQL cluster (database server) using:
```
systemctl daemon-reload
systemctl start postgresql.service
systemctl status postgresql.service
```
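NB: in my case the first command (systemctl daemon-reload) turned out to be essential after this change (merely stopping and restarting the cluster didn't cut it). Note that it makes systemd re-read its unit definitions; for ordinary parameter changes in postgresql.conf a restart or reload of the cluster is normally enough.
the step in the instructions under the header SELinux was also skipped as explained in the introduction
to confirm that everything was ok I entirely removed the old data directory and connected to PostgreSQL to verify that the new location is now recognized, using show data_directory; and show config_file; (see the section on discovering the data directory and configuration file below).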

May 2024 — how I configured my local PostgreSQL 12 to allow users to connect over DBeaver

I tried to connect over DBeaver with an existing user and password and got:

FATAL: no pg_hba.conf entry for host "127.0.0.1", user "mperdikeas", database "test", SSL on

After some investigation and reading this SO answer I got it to work by adding the following line to my /etc/postgresql/12/main/pg_hba.conf file:

host all           mperdikeas           127.0.0.1/32    md5

NB: It is important to use the /32 notation. Simply writing 127.0.0.1 fails with the hardly elucidating message: Connection refused (Connection refused)

No changes were necessary in the sibling file /etc/postgresql/12/main/postgresql.conf IIRC

After the above change one, obviously, has to restart PostgreSQL:

service postgresql restart
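
Since the missing /32 is easy to forget, here is a small sketch that prints a host line and appends the mask when it is absent. The function name hba_host_line and the column spacing are my own choices; the output still has to be added to pg_hba.conf by hand:

```shell
# Print a pg_hba.conf "host" line for one user (all databases), appending /32
# when the address is given without a CIDR mask.
# Usage: hba_host_line USER ADDRESS [METHOD]
hba_host_line() (
    user=$1 addr=$2 method=${3:-md5}
    case $addr in
        */*) ;;                  # a mask is already present
        *)   addr="$addr/32" ;;  # bare address: treat it as a single host
    esac
    printf 'host    all    %s    %s    %s\n' "$user" "$addr" "$method"
)

hba_host_line mperdikeas 127.0.0.1
```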

how I created tablespaces in PostgreSQL

I created a number of directories using:

mkdir ~/postgresql-tblspaces && cd ~/postgresql-tblspaces
mkdir userspay2019 && chown -R postgres:postgres *

I then connected with psql as the postgres user and created the tablespace:

psql -U postgres
CREATE TABLESPACE userspay2019 LOCATION '/home/mperdikeas/postgresql-tblspaces/userspay2019';

how to discover data directory and configuration file in PostgreSQL

postgres=# show data_directory;
       data_directory        
-----------------------------
 /var/lib/postgresql/12/main
(1 row)

postgres=# show config_file;
               config_file               
-----------------------------------------
 /etc/postgresql/12/main/postgresql.conf
(1 row)
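
When the server is not running, a setting can also be read from the configuration file itself. A rough sketch, assuming simple name = value lines; conf_value is a hypothetical helper, and it ignores include directives and settings made elsewhere (e.g. on the server command line). It prints nothing when the setting is commented out, i.e. when the default applies:

```shell
# Print the active (uncommented) value of a setting in a postgresql.conf-style
# file. If the setting appears more than once, the last occurrence is printed.
# Usage: conf_value SETTING FILE
conf_value() {
    sed -n "s/^[[:space:]]*$1[[:space:]]*=[[:space:]]*'\{0,1\}\([^'#]*\)'\{0,1\}.*/\1/p" "$2" |
        sed 's/[[:space:]]*$//' | tail -n 1
}

# Intended use (path as in the notes above; reading it may require sudo):
#   conf_value data_directory /etc/postgresql/12/main/postgresql.conf
```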

streamlined procedure for setting up PostgreSQL 12 (December 2021)

purge previous version

$ sudo apt remove --purge postgresql
$ sudo apt remove --purge postgresql-12*

install PostgreSQL

$ sudo apt install postgis postgresql-12-postgis-3
$ sudo apt install postgresql-12

change password for PostgreSQL user postgres

At this point you want to take advantage of the initial, default peer authentication mode to change the PostgreSQL password for the PostgreSQL user postgres. This will come into play later when you change the authentication mode for the postgres user (and any other user for that matter) from peer to md5:

$ sudo -i -u postgres psql postgres
psql (12.9 (Ubuntu 12.9-2.pgdg20.04+1))
Type "help" for help.
postgres=# \password postgres
Enter new password: 
Enter it again:

Note that (according to my understanding) the above only affects the PostgreSQL user postgres, not the Unix user postgres. The latter in fact exists as can be verified by:

$ cat /etc/passwd | grep -i postgres | wc -l
1

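The account is, however, locked, as explained in this SO answer (source); this can be verified with (note the L in the second field):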

$ sudo passwd --status postgres
postgres L 05/26/2021 0 99999 7 -1

In general, you don't want to do anything with the UNIX user postgres and you DEFINITELY don't want to set a password for that user (as that would unlock the account) by doing a:

DANGER: do not do the below:

$ sudo passwd postgres

in this SO answer

create user and database

In the same vein you might also want to create an actual user (not postgres) while the peer authentication mode is still applicable:

$ sudo -i -u postgres createuser --interactive johndoe
$ sudo -i -u postgres psql
psql (12.9 (Ubuntu 12.9-2.pgdg20.04+1))
Type "help" for help.

postgres=# ALTER USER johndoe WITH PASSWORD 'super.secret';
ALTER ROLE

$ sudo -u postgres psql -c 'create database acmeindustries';
CREATE DATABASE

make copies of the basic configuration files

$ sudo cp /etc/postgresql/12/main/pg_hba.conf /etc/postgresql/12/main/pg_hba.conf.ORIGINAL
$ sudo cp /etc/postgresql/12/main/postgresql.conf  /etc/postgresql/12/main/postgresql.conf.ORIGINAL
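
A small sketch that makes such backups idempotent, so that running it a second time does not overwrite the pristine copy with an already modified file. backup_original is a made-up name; for files under /etc/postgresql it would have to be run with sufficient privileges:

```shell
# Copy FILE to FILE.ORIGINAL unless that backup already exists.
# Usage: backup_original FILE
backup_original() {
    [ -e "$1.ORIGINAL" ] || cp -p "$1" "$1.ORIGINAL"
}

# Intended use:
#   backup_original /etc/postgresql/12/main/pg_hba.conf
#   backup_original /etc/postgresql/12/main/postgresql.conf
```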

change authentication mode from peer to md5

In file pg_hba.conf change the following line:

local   all             postgres                                peer

to:

local   all             postgres                                md5

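In the same file, I have also made the following change, but I don't know how essential it is.
Change the following line:

host    all             all             127.0.0.1/32            md5

to:

host    all             all             192.168.2.0/8           md5

NB: a /8 mask only compares the first octet, so this entry matches every 192.x.x.x address; to cover just the 192.168.2.x subnet, 192.168.2.0/24 would be the narrower choice.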
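
To see which client addresses a CIDR range such as 192.168.2.0/8 actually covers, the mask arithmetic can be sketched in plain shell. The helper names ip_to_int and cidr_contains are made up, and this only illustrates the arithmetic for IPv4; it is not how PostgreSQL itself is implemented:

```shell
# Convert a dotted-quad IPv4 address to a 32-bit integer.
ip_to_int() (
    IFS=.
    set -- $1
    echo $(( ($1 << 24) + ($2 << 16) + ($3 << 8) + $4 ))
)

# Succeed if address $1 falls inside the CIDR range $2 (e.g. 192.168.2.0/24).
cidr_contains() (
    net=${2%/*} bits=${2#*/}
    mask=$(( (0xFFFFFFFF << (32 - bits)) & 0xFFFFFFFF ))
    [ $(( $(ip_to_int "$1") & mask )) -eq $(( $(ip_to_int "$net") & mask )) ]
)

cidr_contains 192.10.0.1 192.168.2.0/8  && echo "/8 matches 192.10.0.1"
cidr_contains 192.10.0.1 192.168.2.0/24 || echo "/24 does not"
```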

create file ~/.pgpass

Now is a good time to create file ~/.pgpass to make connecting to the database easier. Typical contents:

$ cat ~/.pgpass
localhost:5432:postgres:postgres:super.secret
localhost:5432:acmeindustries:johndoe:duper.secret
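
One thing to keep in mind is that libpq ignores ~/.pgpass unless its permissions are 0600 or stricter. A sketch of appending an entry and fixing the permissions in one go; pgpass_add is a hypothetical helper, and it does not escape : or \ characters inside fields:

```shell
# Append one entry to a .pgpass-style file and restrict the file to its owner.
# Usage: pgpass_add FILE HOST PORT DATABASE USER PASSWORD
pgpass_add() {
    # umask in a subshell so that a newly created file is never world-readable
    ( umask 077; printf '%s:%s:%s:%s:%s\n' "$2" "$3" "$4" "$5" "$6" >> "$1" )
    chmod 600 "$1"
}

# Intended use:
#   pgpass_add ~/.pgpass localhost 5432 acmeindustries johndoe duper.secret
```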

You might also want to create the ~/.psqlrc file but this is less useful (and might even be confusing at times if you are not aware of its existence):

$ cat ~/.psqlrc 
\set ON_ERROR_STOP on

configure PostgreSQL to listen to remote connections

To do that you have to edit the postgresql.conf file and change the listen_addresses setting. Typical value to instruct your machine (assuming your IP is 192.168.2.2) to listen to the external NIC and not just to the local loopback:

listen_addresses = 'localhost, 192.168.2.2'

other configurations in postgresql.conf

At this point, you might also want to change the port setting to a value other than the default of 5432. This might be necessary if another PostgreSQL cluster (e.g. some previous version, say PostgreSQL 9) is also running on the same machine.

I have also found it necessary at times, e.g. when using migration tools, to increase the value of max_locks_per_transaction to 1024.

how to create user and database in PostgreSQL

Tried that on PostgreSQL 12.9:
so answer

how to take a dump of a PostgreSQL database

Text dump:

pg_dump -U mperdikeas -h 192.168.2.9 -p 5432 -Fp dbname > /path/to/dump.txt

Binary dump:

pg_dump -U mperdikeas -h 192.168.2.9 -p 5432 -Fc dbname > /path/to/dump.bin

how to restore a PostgreSQL dump

NB: the database dbname needs to exist before running the command given below. The command applies to binary (-Fc) dumps; a text (-Fp) dump is instead replayed with psql -f.

pg_restore -d dbname -U mperdikeas -h localhost /path/to/file.dump
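
To check which kind of dump a file is before choosing the restore tool: a custom-format (-Fc) archive starts with the bytes PGDMP, whereas a text (-Fp) dump is plain SQL. A sketch (dump_format is a made-up name; tar and directory format archives are simply reported as other):

```shell
# Report whether FILE looks like a pg_dump custom-format archive by checking
# for the "PGDMP" magic string in its first five bytes.
# Usage: dump_format FILE
dump_format() {
    if [ "$(dd if="$1" bs=5 count=1 2>/dev/null)" = "PGDMP" ]; then
        echo custom
    else
        echo other
    fi
}

# Intended use:
#   dump_format /path/to/file.dump
```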

useful commands at psql

find current database

SELECT current_database();

describe table

\d+ tablename

list tables and databases, change databases etc

See this

installation of PostGIS in PostgreSQL 12 (October 2021)

I followed the instructions from here

sudo apt update
sudo apt install postgis postgresql-12-postgis-3

… subsequently, I connected to the database I wanted to create the extension in with, e.g.:

psql -U mperdikeas -d acme_industries

… and then created the extension there with:

CREATE EXTENSION postgis;

So, apparently, the PostGIS extension is not installed on the PostgreSQL cluster as a whole, but on each particular database. This means that the extension has to be re-created whenever the database is dropped. Finally, I verified that the extension is now available with:

SELECT PostGIS_version();

installation and initial configuration of PostgreSQL 12 on Ubuntu 20.04 (May 2021)

I followed the instructions from here and asked, specifically, for PostgreSQL 12 to be installed on the last step:

sudo sh -c 'echo "deb http://apt.postgresql.org/pub/repos/apt $(lsb_release -cs)-pgdg main" > /etc/apt/sources.list.d/pgdg.list'
wget --quiet -O - https://www.postgresql.org/media/keys/ACCC4CF8.asc | sudo apt-key add -
sudo apt-get update
sudo apt-get -y install postgresql-12

I then verified that PostgreSQL was running by doing a:

/etc/init.d/postgresql status

Since the above does not report the version of PostgreSQL, I confirmed that it is PostgreSQL 12 that is running by doing a:

pgrep -u postgres -fa -- -D
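
To get just the version number out of that listing, a sketch like the following can be used. It assumes the Debian/Ubuntu layout in which the server binary lives under /usr/lib/postgresql/<version>/bin; running_pg_versions is a hypothetical helper name:

```shell
# Print the version directory of every postgres server process found in a
# `pgrep -fa` style listing read from stdin.
running_pg_versions() {
    sed -n 's|.*/usr/lib/postgresql/\([0-9][0-9.]*\)/bin/postgres .*|\1|p'
}

# Intended use:
#   pgrep -u postgres -fa -- -D | running_pg_versions
```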

Subsequently, to connect to PostgreSQL, I had to edit the file /etc/postgresql/12/main/pg_hba.conf. This controls the client authentication mechanisms and allows client-side tools (which use Unix Domain sockets) to connect to the database server. To that end, I was guided, more or less by this previous note

Namely, since I had forgotten, or wasn't able to use, the default PostgreSQL password, I initially changed the local authentication mode to 'trust'. I.e. I changed the line:

local   all             postgres                                peer

to:

local   all             all                                     trust
local   all             postgres                                trust

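NB: after any change to pg_hba.conf or postgresql.conf, PostgreSQL has to be restarted:

/etc/init.d/postgresql restart

I was then able to connect as postgres and set a new password (note the terminating ;):

ALTER USER postgres PASSWORD 'supersecret';

Finally, I changed the authentication method of the local and host entries to md5:

local   all             all                                    md5
host    all             all   192.168.2.0/8                    md5

where 192.168.2.0/8 is meant to cover my local network.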

However, following the above, I was still unable to connect to PostgreSQL from another machine (even though I had set host authentication to md5). Moreover, when trying to connect from other machines I was getting connection refused (or something).

To fix that I had to instruct PostgreSQL to listen on the network card interface as well and not just on the localhost loop (which is the initial configuration for obvious security reasons). To do that I edited file /etc/postgresql/12/main/postgresql.conf and changed the line

#listen_addresses = 'localhost'

to:

listen_addresses = 'localhost, 192.168.2.7'

initial files: pg_hba.conf postgresql.conf
modified files: pg_hba.conf postgresql.conf

how I set up the postgres user in PostgreSQL 9.5 and created user 'mperdikeas'

PostgreSQL allows one to authenticate using two mechanisms:

the so called IDENT/PEER authentication which uses UNIX accounts
the TCP authentication which uses PostgreSQL's own managed username / passwords

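I first made sure that the UNIX account postgres is locked:

sudo passwd --lock postgres

That postgres is indeed locked can be verified as follows (see here):

$ sudo cat /etc/shadow | grep -i postgres
postgres:!*:17117:0:99999:7:::
$ sudo passwd -S postgres
postgres L 11/12/2016 0 99999 7 -1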

update 2021-12-15 An easier way to check for the locked status of an account is offered here.
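
The check can be scripted by looking at the second field of the passwd -S output (L means locked). A small sketch; account_is_locked is my own helper name:

```shell
# Succeed if a `passwd -S` status line read from stdin reports a locked account.
account_is_locked() {
    [ "$(awk '{ print $2; exit }')" = "L" ]
}

# Intended use:
#   sudo passwd -S postgres | account_is_locked && echo "postgres is locked"
```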

I then changed the password of the TCP user postgres by doing:

$ sudo -i -u postgres psql postgres
psql (9.5.19)
Type "help" for help.

postgres=# \password postgres
Enter new password:
Enter it again:

Finally, I created a new TCP user mperdikeas and set his password:

$ sudo -i -u postgres createuser --interactive mperdikeas
Shall the new role be a superuser? (y/n) n
Shall the new role be allowed to create databases? (y/n) y
Shall the new role be allowed to create more new roles? (y/n) n
$ sudo -i -u postgres psql
psql (9.5.19)
Type "help" for help.

postgres=# ALTER USER mperdikeas WITH PASSWORD '<redacted>';
ALTER ROLE
postgres=# \q

You will notice that for all administrative commands we are using the UNIX user postgres

NB: be sure to set the authentication method to 'md5' (rather than trust or peer) in the following file:

/etc/postgresql/9.5/main/pg_hba.conf

why you shouldn't change the password of the postgres Linux user using sudo passwd postgres
Install Postgresql 9.5 in Ubuntu 14.04 Trusty Tahr
timestamps with or without timezones
how to write recursive SQL WITH queries

Today I implemented the following SSCCE to explore recursive WITH queries:

Let's first define a simple schema to represent trees so we can motivate a use case of recursive queries.
We can imagine having two tables to represent two kinds of nodes:

"proper" nodes
leaf nodes, which can hang under any "proper" node (including internal ones)

A possible approach would be the following:

    DROP TABLE IF EXISTS leaf;
    DROP TABLE IF EXISTS node;

    CREATE TABLE node (
        i         INTEGER NOT NULL,
        parent    INTEGER     NULL
    );
    ALTER TABLE node ADD PRIMARY KEY(i);

    CREATE TABLE leaf (
        i         SERIAL,
        leafName  VARCHAR NOT NULL,
        underNode INTEGER NOT NULL);
    ALTER TABLE leaf ADD PRIMARY KEY (i);
    ALTER TABLE leaf ADD FOREIGN KEY (underNode) REFERENCES node(i);

One can imagine the above schema to be populated with the below test data:

    INSERT INTO node VALUES
    (1, NULL), (2, NULL), (3, 1), (5, 1), (7, 1), (4, 2), (6, 2);

    INSERT INTO leaf(leafName, underNode) VALUES
    ('leaf under 1', 1), ('leaf under 2', 2), ('leaf under 3', 3), ('leaf under 5', 5), ('leaf under 7', 7);

Given the above, the following query fetches the names of all leaves hanging under the 'subtree' of node with key #1:

    WITH RECURSIVE NODES_IN_SUBTREE (i) AS (
        VALUES (1)
        UNION ALL SELECT a.i FROM NODES_IN_SUBTREE INNER JOIN node a
        ON a.parent = NODES_IN_SUBTREE.i
    )
    SELECT leafName FROM leaf
    WHERE underNode IN (SELECT i FROM NODES_IN_SUBTREE);

… and the following query fetches all leaves that live under the 'subtrees' of all root nodes, i.e. nodes with no parent (therefore, effectively fetches all leaves in the tree):

    WITH RECURSIVE NODES_IN_SUBTREE (i) AS (
        (SELECT i FROM node WHERE parent IS NULL)
        UNION ALL SELECT a.i FROM NODES_IN_SUBTREE INNER JOIN node a
        ON a.parent = NODES_IN_SUBTREE.i
    )
    SELECT leafName FROM leaf
    WHERE underNode IN (SELECT i FROM NODES_IN_SUBTREE);

how to create superuser "postgres" if none exists when the PostgreSQL is built from sources

relevant note

./configure
make
su
make install
adduser postgres
mkdir /usr/local/pgsql/data
chown postgres /usr/local/pgsql/data
su - postgres
/usr/local/pgsql/bin/initdb -D /usr/local/pgsql/data
/usr/local/pgsql/bin/postgres -D /usr/local/pgsql/data >logfile 2>&1 &
/usr/local/pgsql/bin/createdb test
/usr/local/pgsql/bin/psql test

/usr/local/pgsql

~/postgresql-9.4.5

./postgres-9.4.5/bin/postgres -D ./postgres-9.4.5/data/

psql

postgres

createuser --interactive postgres

I then changed ~/postgres-9.4.5/data/pg_hba.conf to contain:

# "local" is for Unix domain socket connections only
local   all             all                                     trust
# IPv4 local connections:
host    all             all             127.0.0.1/32            md5
# IPv6 local connections:
host    all             all             ::1/128                 md5

postgres

$ which psql
~/postgres9/bin/psql
rawdar@radacerd:~#
$ psql -U postgres
psql (9.4.5)
Type "help" for help.

postgres=# alter user postgres password 'secret';

postgres

psql

trust

psql

postgres

role

how to configure retention of unresponsive TCP connections in PostgreSQL

#tcp_keepalives_idle = 0                # TCP_KEEPIDLE, in seconds;
                                        # 0 selects the system default
tcp_keepalives_idle = 200               # TCP_KEEPIDLE, in seconds;
#tcp_keepalives_interval = 0            # TCP_KEEPINTVL, in seconds;
                                        # 0 selects the system default
tcp_keepalives_interval = 30            # TCP_KEEPINTVL, in seconds;
#tcp_keepalives_count = 0               # TCP_KEEPCNT;
                                        # 0 selects the system default
tcp_keepalives_count = 10               # TCP_KEEPCNT;

/postgresDB/data-9.1.14/postgresql.conf

GNU Gatekeeper keepalive page
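
As a rough rule of thumb, with these settings a dead peer is detected after about tcp_keepalives_idle + tcp_keepalives_interval * tcp_keepalives_count seconds. A tiny sketch of that arithmetic (the helper name is made up, and the actual timing depends on the operating system's TCP stack):

```shell
# Approximate number of seconds before an unresponsive connection is dropped.
# Usage: keepalive_detection_seconds IDLE INTERVAL COUNT
keepalive_detection_seconds() {
    echo $(( $1 + $2 * $3 ))
}

keepalive_detection_seconds 200 30 10    # prints 500
```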

how to kill non-responsive PostgreSQL queries

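    -- NB: procpid and current_query are the column names used before PostgreSQL 9.2;
    -- from 9.2 on, use pid instead of procpid and state = 'idle' instead of current_query = '<IDLE>'.
    select pg_terminate_backend(procpid)
    from pg_stat_activity
    where usename = 'yourusername'
     and current_query = '<IDLE>'
     and query_start < current_timestamp - interval '5 minutes'
     ;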

source

how to query PostgreSQL for currently executing queries

Using the pg_stat_activity table

SELECT * FROM pg_stat_activity

Using the pg_stat_statements table

in this dba.stackexchange post

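-- requires the pg_stat_statements extension; the column is named total_exec_time from PostgreSQL 13 on
SELECT * FROM pg_stat_statements ORDER BY total_time DESC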

sample configuration files used

postgresql.conf

PostgreSQL 9.3 in Ubuntu 14.04 (NP desktop)

/etc/postgresql/9.3/main/postgresql.conf

NB:

postgresql.conf.sample

/usr/share/postgresql

postgresql.conf

this StackOverflow post

pg_hba.conf

PostgreSQL 9.3 in Ubuntu 14.04 (NP desktop)

/etc/postgresql/9.3/main/pg_hba.conf

awesome PostgreSQL CLI client with auto-completion and syntax highlighting

https://github.com/dbcli/pgcli

sudo apt-get install python-pip
sudo apt-get install python-dev libpq-dev libevent-dev
sudo pip install pgcli

how to convert VARCHAR data to XML so that the XPATH function may be used on them

XMLPARSE

SELECT DISTINCT CAST (XPATH('@status',XMLPARSE(CONTENT "content")) AS VARCHAR), isdeleted
FROM vo_business.hosted_record_version

During PostgreSQL startup: could not open file "/etc/ssl/private/ssl-cert-snakeoil.key": Permission denied

sudo chown postgres /etc/ssl/private/ssl-cert-snakeoil.key
sudo chown postgres /etc/ssl/certs/ssl-cert-snakeoil.pem

Install PostgreSQL From Its Official Repository in Ubuntu 14.04

Debian 6.0 (squeeze), 7.0 (wheezy), and unstable (sid) 64/32 bit (amd64/i386)
Ubuntu 10.04 (lucid), 12.04 (precise), and 14.04 (trusty) 64/32 bit (amd64/i386)
PostgreSQL 8.4, 9.0, 9.1, 9.2, 9.3
Server extensions such as Slony-I, various PL languages, and datatypes
Applications like pgadmin3, pgbouncer, and pgpool-II

Create the PostgreSQL repository file by running the command below:

sudo emacs -nw /etc/apt/sources.list.d/pgdg.list

and add the following line to it:

deb http://apt.postgresql.org/pub/repos/apt/ trusty-pgdg main

Download & import the repository key:

wget --quiet -O - https://www.postgresql.org/media/keys/ACCC4CF8.asc | sudo apt-key add -

Update your system:

sudo apt-get update && sudo apt-get upgrade

Now you're able to install PostgreSQL with the command below:

sudo apt-get install postgresql-9.3 pgadmin3

NB:

postgresql

postgresql-contrib

postgresql-client

postgresql-x.y

postgresql-9.3

postgresql

source

how to execute PL/pgSQL blocks, function definitions, transactions in DbVisualizer

Execute

Execute Current

Execute Buffer

examining (and terminating, if need be) user sessions

SELECT * FROM pg_stat_activity;

SELECT pg_cancel_backend(<pid>);

SELECT pg_terminate_backend(<pid>);

source

configuration of logging directory in Postgresql (in Ubuntu 12.04)

sudo cat /etc/postgresql/9.1/main/postgresql.conf | grep log_directory

yet more complex XPath example

SELECT
DISTINCT ivoid, CAST(xpath('//*[local-name()=''capability'' and @xsi:type=''ssa:SimpleSpectralAccess'']/*[local-name()=''testQuery'']' , content, ARRAY[ARRAY['xsi', 'http://www.w3.org/2001/XMLSchema-instance']]) AS VARCHAR)
FROM rr.resourcecontent
WHERE CAST(xpath('//*[local-name()=''capability'' and @xsi:type=''ssa:SimpleSpectralAccess'']/*[local-name()=''testQuery'']' , content, ARRAY[ARRAY['xsi', 'http://www.w3.org/2001/XMLSchema-instance']]) AS VARCHAR)!='{}'

complex XPath example demonstrating namespace-agnostic local-name, testing if a value belongs in a list, and need to CAST from XML to VARCHAR

SELECT 
CAST(xpath('//*[local-name()=''capability'' and (@standardID=''ivo://ivoa.net/std/SIA'' or @standardID=''ivo://ivoa.net/std/SSA'')]/@standardID' , content) AS VARCHAR), count(*)
FROM someschema.sometable
GROUP BY CAST(xpath('//*[local-name()=''capability'' and (@standardID=''ivo://ivoa.net/std/SIA'' or @standardID=''ivo://ivoa.net/std/SSA'')]/@standardID' , content) AS VARCHAR)

downloading the results of a SQL query on local filesystem

source

psql -d RegTAP -t -A -F"," -c "SELECT pregraft FROM vo_business.harvest_record WHERE recordivoid='ivo://irsa.ipac/Spitzer/Images/SINGS'" > pregraft

using XPath with namespaces in PostgreSQL

PostgreSQL 9.1 docs

SELECT ivoidlowercase, xpath('@xsi:type', content, ARRAY[ARRAY['xsi', 'http://www.w3.org/2001/XMLSchema-instance']])
FROM rr.resourcecontent 
WHERE lower(CAST (xpath('@xsi:type', content, ARRAY[ARRAY['xsi', 'http://www.w3.org/2001/XMLSchema-instance']]) AS VARCHAR)) LIKE '%vs:dataservice%'

typical access control configurations

trust all local UNIX-domain sockets (that tools like psql are using to create database and user in the various scripts)
configure password-based authentication (md5) for TCP/IP sockets

$ sudo cat /etc/postgresql/9.1/main/pg_hba.conf  | grep -v ^# | uniq

local   all             postgres                                trust
local   all             all                                     trust
host    all             all             127.0.0.1/32            md5
host    all             all             ::1/128                 md5

Terminology on clusters, catalogs, databases and schemas

source

cluster

A computer may have one or multiple clusters.
A database server is a cluster.
A cluster has catalogs. ( Catalog = Database )
Catalogs have schemas. (Schema = namespace of tables, and security boundary)
Schemas have tables.
Tables have rows.
Rows have values, defined by columns.

online SQL playground / sandbox

SQL fiddle

Window functions (latest installment)

The PARTITION clause defines the window; however, some functions by default operate on a concept called the 'frame', which may, by default, include less than the full window. E.g. see what the PostgreSQL documentation says:
Note that first_value, last_value, and nth_value consider only the rows within the "window frame", which by default contains the rows from the start of the partition through the last peer of the current row. This is likely to give unhelpful results for last_value and sometimes also nth_value. You can redefine the frame by adding a suitable frame specification (RANGE or ROWS) to the OVER clause. See Section 4.2.8 for more information about frame specifications.
In the same vein:
- over (order by x)
- over (order by x rows between unbounded preceding and current row)
- over (order by x rows between unbounded preceding and unbounded following)
helpful site and other links:

-- "order by x" alone means "order by x range between unbounded preceding and current row",
-- which gives the same result as the rows variant below when x has no duplicates
select x, array_agg(x) over (rows between unbounded preceding and current row) from generate_series(1, 10) AS t(x)
select x, array_agg(x) over (order by x rows between unbounded preceding and current row) from generate_series(1, 10) AS t(x)
select x, array_agg(x) over (order by x) from generate_series(1, 10) AS t(x)

select foo.*, first_value(i) over (partition by a order by i desc) from foo

select foo.*, lag(i) over (partition by a order by i asc) from foo

select foo.*, lag(i, 2) over (partition by a order by i asc) from foo
select foo.*, lag(i, 2, -1) over (partition by a order by i asc) from foo

select x, array_agg(x) over (rows between current row and unbounded following) from generate_series(1, 10) AS t(x)
select x, array_agg(x) over () from generate_series(1, 10) AS t(x)
select x, array_agg(x) over (order by x) from generate_series(1, 10) AS t(x)

CREATE TABLE employee_salary(employee VARCHAR, department VARCHAR, salary INTEGER);
INSERT INTO employee_salary
VALUES
('mike', 'sales',  90000),
('john', 'sales', 130000),
('paul', 'sales',  70000),
('anna', 'dev'  ,  20000),
('peter','dev'  ,  50000);

SELECT employee, salary, department,
round(AVG(salary) OVER (PARTITION BY department),0) AS average_dept_salary,
rank() OVER (PARTITION BY department ORDER BY salary DESC) AS salary_rank,
lead(salary) OVER (PARTITION BY department ORDER BY salary ASC) AS next_higher,
lag (salary) OVER (PARTITION BY department ORDER BY salary ASC) AS prev_lower,
first_value(salary) OVER (PARTITION BY department ORDER BY salary ASC ROWS BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING) AS lowest_salary_in_department,
last_value (salary) OVER (PARTITION BY department ORDER BY salary ASC ROWS BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING) AS highest_salary_in_department
FROM employee_salary

how to install pgsphere

the pgSphere 1.1 project site

downloaded pgSphere sources from the pgSphere homepage

    wget http://pgfoundry.org/frs/download.php/2558/pgsphere-1.1.1.tar.gz

I opted for the second way to compile pgSphere (as instructed in the installation instructions page), which is the one that does not require the PostgreSQL sources but, instead, the configuration tool pg_config
since pg_config didn't exist I installed it by executing:
```
sudo apt-cache search postgresql-server-dev
sudo apt-get install postgresql-server-dev-9.1
```
Now, pg_config is installed and its location can be got with:
```
which pg_config
```
(we use that location below)
expand the pgSphere tarball we downloaded and cd into the directory that's created:
```
tar xvfz pgsphere-1.1.1.tar.gz
cd pgsphere-1.1.1/
```
follow the installation instructions linked above ("second way"), replacing "/path/to/pg_config" with the actual path

make USE_PGXS=1 PG_CONFIG=/usr/bin/pg_config
sudo make USE_PGXS=1 PG_CONFIG=/usr/bin/pg_config install

when I tried to check the installation as instructed:

To check the installation change into the pg_sphere source directory again and run:
shell> make installcheck

... I got the following error trace:
```
Makefile:29: ../../src/Makefile.global: No such file or directory
Makefile:30: /contrib/contrib-global.mk: No such file or directory
make: *** No rule to make target `/contrib/contrib-global.mk'.  Stop.
```
... but the installation was successful nonetheless because I was able to execute the last step as advised in "2.3. Creating a database with pgSphere" of the installation instructions linked above with:
```
psql -U postgres -d RegTAP -f ./pg_sphere.sql
```

Window functions (cont.)

PARTITION

OVER

ORDER

SELECT emp_name, salary, RANK() OVER (ORDER BY salary DESC) AS sal_pos 
FROM test_curation.employee
ORDER BY sal_pos ASC

ROW_NUMBER()

RANK

LIMIT

SELECT emp_name, salary
FROM test_curation.employee
ORDER BY salary DESC
LIMIT 3

RANK()

SELECT x.* FROM (
SELECT emp_name, salary, rank() OVER (ORDER BY salary DESC) AS sal_pos 
FROM test_curation.employee
ORDER BY sal_pos ASC) x
WHERE x.sal_pos<=3

DENSE_RANK()

RANK()

Window functions rock!

DROP TABLE IF EXISTS test_curation.employee;
CREATE TABLE test_curation.employee (
department VARCHAR,
emp_name VARCHAR,
salary INTEGER);

INSERT INTO test_curation.employee(department, emp_name, salary) VALUES ('SALES', 'MIKE', 3);
INSERT INTO test_curation.employee(department, emp_name, salary) VALUES ('SALES', 'MARJORIE', 5);
INSERT INTO test_curation.employee(department, emp_name, salary) VALUES ('SALES', 'ELIZABETH', 4);
INSERT INTO test_curation.employee(department, emp_name, salary) VALUES ('SALES', 'FLORA', 4);
INSERT INTO test_curation.employee(department, emp_name, salary) VALUES ('DEV', 'THOMAS', 10);
INSERT INTO test_curation.employee(department, emp_name, salary) VALUES ('DEV', 'GEORGE', 2);
INSERT INTO test_curation.employee(department, emp_name, salary) VALUES ('DEV', 'MENELAUS', 1);

-- simple example to demonstrate the use of the window function AVG
SELECT department, emp_name, AVG(salary) OVER (PARTITION BY department), rank() OVER (PARTITION BY department ORDER BY salary DESC)
FROM test_curation.employee;

-- using the window function rank, observe that both Elizabeth and Flora appear as rank() assigns the same number
-- in case of ties
SELECT department, emp_name, salary, avg FROM
(SELECT department, emp_name, salary, AVG(salary) OVER (PARTITION BY department), rank() OVER (PARTITION BY department ORDER BY salary DESC)
FROM test_curation.employee) r
WHERE rank<=2;

-- rank() assigns the same number in case of ties and also leaves gaps right after a tie:
SELECT department, emp_name, salary, avg FROM
(SELECT department, emp_name, salary, AVG(salary) OVER (PARTITION BY department), rank() OVER (PARTITION BY department ORDER BY salary DESC)
FROM test_curation.employee) r
WHERE rank=3;

-- row_number() always assigns different numbers; among tied rows the numbering is unspecified unless ORDER BY includes a tie-breaker
SELECT department, emp_name, salary, avg FROM
(SELECT department, emp_name, salary, AVG(salary) OVER (PARTITION BY department), row_number() OVER (PARTITION BY department ORDER BY salary DESC)
FROM test_curation.employee) r
WHERE row_number<=2;
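
The difference between rank(), dense_rank() and row_number() can be checked without a PostgreSQL server. The following is a minimal sketch in Python, assuming the SQLite library bundled with Python is version 3.25 or later (the first release with window functions). SQLite only stands in for PostgreSQL here, the table is named employee without the test_curation schema, and the emp_name tie-breaker in row_number() is an addition made to keep the output deterministic.

```python
import sqlite3

# Same rows as the INSERT statements above
rows = [
    ("SALES", "MIKE", 3), ("SALES", "MARJORIE", 5),
    ("SALES", "ELIZABETH", 4), ("SALES", "FLORA", 4),
    ("DEV", "THOMAS", 10), ("DEV", "GEORGE", 2), ("DEV", "MENELAUS", 1),
]

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employee (department TEXT, emp_name TEXT, salary INTEGER)")
conn.executemany("INSERT INTO employee VALUES (?, ?, ?)", rows)

query = """
SELECT emp_name,
       rank()       OVER (PARTITION BY department ORDER BY salary DESC) AS rnk,
       dense_rank() OVER (PARTITION BY department ORDER BY salary DESC) AS dense_rnk,
       row_number() OVER (PARTITION BY department ORDER BY salary DESC, emp_name) AS row_num
FROM employee
WHERE department = 'SALES'
ORDER BY row_num
"""

sales_ranking = conn.execute(query).fetchall()
for row in sales_ranking:
    print(row)
# ('MARJORIE', 1, 1, 1)
# ('ELIZABETH', 2, 2, 2)
# ('FLORA', 2, 2, 3)
# ('MIKE', 4, 3, 4)
```

After the tie on salary 4, rank() jumps to 4 while dense_rank() continues with 3, and row_number() never repeats a number.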

PostgreSQL-specific way to get maximum or minimum values of certain columns for every combination of other columns

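Unlike plain DISTINCT, DISTINCT ON keeps only the first row for each combination of the listed expressions, where "first" is determined by the ORDER BY clause.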

CREATE TABLE A (A1 INTEGER, A2 INTEGER, A3 INTEGER);

INSERT INTO A(A1, A2, A3) VALUES (1, 1, 1);
INSERT INTO A(A1, A2, A3) VALUES (2, 1, 1);
INSERT INTO A(A1, A2, A3) VALUES (2, 1, 2);
INSERT INTO A(A1, A2, A3) VALUES (3, 1, 2);
INSERT INTO A(A1, A2, A3) VALUES (2, 1, 2);
INSERT INTO A(A1, A2, A3) VALUES (4, 1, 2);
INSERT INTO A(A1, A2, A3) VALUES (4, 1, 5);
INSERT INTO A(A1, A2, A3) VALUES (3, 1, 5);

-- smallest a1 for every (a2, a3) combination
SELECT DISTINCT ON (a2, a3) a2, a3, a1 FROM A ORDER BY a2, a3, a1;

-- largest a1 for every (a2, a3) combination
SELECT DISTINCT ON (a2, a3) a2, a3, a1 FROM A ORDER BY a2, a3, a1 DESC;
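
What the two DISTINCT ON queries compute can be emulated in a few lines. This is only a sketch of the semantics in plain Python (sort, then keep the first row of each group), not of how PostgreSQL executes the query; the function name distinct_on_a2_a3 is made up for the illustration.

```python
from itertools import groupby

# (a1, a2, a3) tuples, same values as the INSERT statements above
rows = [(1, 1, 1), (2, 1, 1), (2, 1, 2), (3, 1, 2),
        (2, 1, 2), (4, 1, 2), (4, 1, 5), (3, 1, 5)]


def distinct_on_a2_a3(rows, a1_descending=False):
    """Emulate SELECT DISTINCT ON (a2, a3) a2, a3, a1 ... ORDER BY a2, a3, a1 [DESC]."""
    sign = -1 if a1_descending else 1
    # ORDER BY a2, a3, a1 (a1 optionally descending)
    ordered = sorted(rows, key=lambda r: (r[1], r[2], sign * r[0]))
    result = []
    # DISTINCT ON (a2, a3): keep only the first row of each (a2, a3) group
    for (a2, a3), group in groupby(ordered, key=lambda r: (r[1], r[2])):
        first = next(group)
        result.append((a2, a3, first[0]))
    return result


minima = distinct_on_a2_a3(rows)
maxima = distinct_on_a2_a3(rows, a1_descending=True)
print(minima)  # [(1, 1, 1), (1, 2, 2), (1, 5, 3)]
print(maxima)  # [(1, 1, 2), (1, 2, 4), (1, 5, 4)]
```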

how to drop a database in PostgreSQL

command line only:

dropdb <database name>

more refined, once connected as postgres superuser:

DROP DATABASE IF EXISTS <database name>

to run the above from the command line:

psql -U postgres postgres -f <file with the above script>

how to handle namespace prefixes in xpath queries

SELECT ( CAST (xpath('/*/@xsi:type', content, array[array['xsi', 'http://www.w3.org/2001/XMLSchema-instance']]) AS TEXT[]))[1] from rr.resourcecontent

SELECT ( CAST (xpath('/ri:Resource/@xsi:type', content, array[array['xsi', 'http://www.w3.org/2001/XMLSchema-instance'],
                                                              array['ri', 'http://www.ivoa.net/xml/RegistryInterface/v1.0']]) AS TEXT[]))[1] from rr.resourcecontent
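
The prefix/URI pairs passed as the third argument of xpath are only local aliases: what has to match the document is the namespace URI, not the prefix. The same idea can be tried outside the database. Below is a minimal sketch using Python's standard xml.etree.ElementTree; the XML record and its values are hypothetical, and because ElementTree implements only a subset of XPath, the attribute is read with get() rather than with an /@attribute step.

```python
import xml.etree.ElementTree as ET

# Hypothetical registry record, made up for this illustration
content = """<ri:Resource xmlns:ri="http://www.ivoa.net/xml/RegistryInterface/v1.0"
             xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
             xsi:type="vs:CatalogService">
  <identifier>ivo://example/resource</identifier>
</ri:Resource>"""

# Prefixes chosen here need not match the ones used inside the document
ns = {
    "x": "http://www.w3.org/2001/XMLSchema-instance",
    "r": "http://www.ivoa.net/xml/RegistryInterface/v1.0",
}

root = ET.fromstring(content)

# ElementTree stores qualified names as {namespace-uri}localname
is_resource = root.tag == "{%s}Resource" % ns["r"]
xsi_type = root.get("{%s}type" % ns["x"])
# identifier is in no namespace, so no prefix is needed
identifier = root.find("identifier").text

print(is_resource, xsi_type, identifier)
# True vs:CatalogService ivo://example/resource
```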

another very useful syntax for doing xpath in PostgreSQL

SELECT ( CAST (xpath('/*/identifier', content) AS TEXT[]))[1] from rr.resourcecontent

how to check if a value appears in an array

The following expression evaluates to true:

select 'a' = ANY ('{a , b}'::varchar[])

how to use xpath in where clauses in PostgreSQL

The xpath function returns an xml[] array, so cast the result before comparing it:

select count(*) from rr.resourcecontent where cast (xpath('/*/capability/@standardID', content) as text[])='{ivo://ivoa.net/std/ConeSearch}'

(casting to text instead of text[] allows use of the PostgreSQL trim function, but works only if exactly one item is returned)

select count(*) from rr.resourcecontent where trim( cast (xpath('/*/capability/@standardID', content) as text))='{ivo://ivoa.net/std/ConeSearch}'

(if an array of values may be returned by the XPath expression)

select count(*) from rr.resourcecontent where 'ivo://ivoa.net/std/ConeSearch' = ANY (cast (xpath('/*/capability/@standardID', content) as text[]) )
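
The difference between comparing the whole array and testing membership with ANY shows up as soon as a record has more than one capability. A small sketch in Python with three hypothetical, namespace-free documents (the real rr.resourcecontent rows will look different):

```python
import xml.etree.ElementTree as ET

CONE = "ivo://ivoa.net/std/ConeSearch"

# Hypothetical records: one capability, two capabilities, no ConeSearch at all
docs = [
    "<Resource><capability standardID='ivo://ivoa.net/std/ConeSearch'/></Resource>",
    "<Resource><capability standardID='ivo://ivoa.net/std/ConeSearch'/>"
    "<capability standardID='ivo://ivoa.net/std/TAP'/></Resource>",
    "<Resource><capability standardID='ivo://ivoa.net/std/TAP'/></Resource>",
]


def standard_ids(xml_text):
    """Rough equivalent of xpath('/*/capability/@standardID', content)."""
    root = ET.fromstring(xml_text)
    return [cap.get("standardID") for cap in root.findall("capability")]


# like: cast(xpath(...) as text[]) = '{ivo://ivoa.net/std/ConeSearch}'
exact_matches = sum(1 for d in docs if standard_ids(d) == [CONE])
# like: 'ivo://ivoa.net/std/ConeSearch' = ANY (cast(xpath(...) as text[]))
any_matches = sum(1 for d in docs if CONE in standard_ids(d))

print(exact_matches, any_matches)  # 1 2
```

The exact comparison misses the record that offers ConeSearch alongside another capability, which is why the ANY form is the safer one when several values may be returned.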

various types of inserts in PostgreSQL

VSI - very silly inserts (executing queries built by string concatenation, one by one)
SPI - stupid prepared inserts (executing prepared INSERT statements one by one)
BPI - batch prepared inserts (executing prepared INSERT statements in batches)
CPI - copy inserts (using the 'proprietary' COPY FROM API offered by the PostgreSQL driver)
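
The first three styles can be sketched with Python's built-in sqlite3 module. This is an illustration of the call patterns only, under the assumption that SQLite is an acceptable stand-in: it says nothing about relative speed against PostgreSQL, and COPY has no SQLite counterpart, so CPI is left out. The table t and its columns are made up.

```python
import sqlite3

rows = [(i, "name-%d" % i) for i in range(1000)]

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (id INTEGER, name TEXT)")

# VSI: SQL text built by string concatenation, one statement per row
# (shown for contrast only; concatenation invites quoting bugs and SQL injection)
for id_, name in rows:
    conn.execute("INSERT INTO t VALUES (" + str(id_) + ", '" + name + "')")

# SPI: one parameterized statement, executed once per row
for row in rows:
    conn.execute("INSERT INTO t VALUES (?, ?)", row)

# BPI: the same parameterized statement, executed for the whole batch in one call
conn.executemany("INSERT INTO t VALUES (?, ?)", rows)

conn.commit()
total = conn.execute("SELECT count(*) FROM t").fetchone()[0]
print(total)  # 3000
```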

location of pg_ctl in PostgreSQL 9.2

/usr/lib/postgresql/9.2/bin/pg_ctl

NULL values in foreign key columns

source

how to log SQL statements in PostgreSQL 9.1

(see also this SO discussion)

You have to change certain values in the file postgresql.conf and restart the server.

The file is located at: /etc/postgresql/9.1/main/postgresql.conf

Diff of the changes I made is shown below:

$ diff /etc/postgresql/9.1/main/postgresql.conf.safe.2012-01-15 /etc/postgresql/9.1/main/postgresql.conf
276c276
< #log_destination = 'stderr'    # Valid values are combinations of
---
> log_destination = 'stderr'    # Valid values are combinations of
282c282
< #logging_collector = off    # Enable capturing of stderr and csvlog
---
> logging_collector = on    # Enable capturing of stderr and csvlog
288c288
< #log_directory = 'pg_log'    # directory where log files are written,
---
> log_directory = 'pg_log'    # directory where log files are written,
290c290
< #log_filename = 'postgresql-%Y-%m-%d_%H%M%S.log'    # log file name pattern,
---
> log_filename = 'postgresql-%Y-%m-%d_%H%M%S.log'    # log file name pattern,
398c398
< #log_statement = 'none'    # none, ddl, mod, all
---
> log_statement = 'all'    # none, ddl, mod, all

The log_directory value ('pg_log') is relative to the data directory, which can be found with:

$ grep -i data /etc/postgresql/9.1/main/postgresql.conf
# option or PGDATA environment variable, represented here as ConfigDir.
data_directory = '/var/lib/postgresql/9.1/main'# use data in another directory

sudo -i
tail -f /var/lib/postgresql/9.1/main/pg_log/postgresql-2013-01-15_182646.log
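
Because log_directory is relative to data_directory, the log location can be derived from the active (uncommented) settings. A minimal sketch in Python; conf_text is a made-up excerpt that mirrors the settings discussed above, and the comment stripping is deliberately naive (it would mishandle a '#' inside a quoted value).

```python
import posixpath

# Made-up excerpt in postgresql.conf syntax
conf_text = """\
#log_min_messages = warning
data_directory = '/var/lib/postgresql/9.1/main'    # use data in another directory
log_destination = 'stderr'
logging_collector = on
log_directory = 'pg_log'
log_filename = 'postgresql-%Y-%m-%d_%H%M%S.log'
log_statement = 'all'
"""


def active_settings(text):
    """Return the uncommented key = value pairs as a dict of strings."""
    settings = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments (naive)
        if "=" not in line:
            continue
        key, value = line.split("=", 1)
        settings[key.strip()] = value.strip().strip("'")
    return settings


settings = active_settings(conf_text)
log_dir = posixpath.join(settings["data_directory"], settings["log_directory"])
print(settings["log_statement"])  # all
print(log_dir)                    # /var/lib/postgresql/9.1/main/pg_log
```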

useful PostgreSQL functions:

        select current_database();
        select current_schema();
        select current_user;
        select extract('epoch' from now());
        select extract(epoch from now())::integer;
        select extract('epoch' from current_timestamp);
        select now();
        select current_timestamp;

use of the command-line pg_dump utility:

 pg_dump -h 172.333.444.555 -p 5444 -U username -F p -E UTF8 -C -O -n <schema-name> -v -f dumpfile database-name

precedence in PostgreSQL pg_hba.conf files

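pg_hba.conf records are examined in order and the first record that matches the connection is used, so the most specific rules must come first.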

typical configuration of PostgreSQL 9.1 I am using:

local   all             all                                 trust
host   all             all   192.168.2.2/24                 md5

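The local line trusts every connection made over a Unix-domain socket (no password); the host line requires an md5 password from clients on the listed network.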

And in file /etc/postgresql/9.1/main/postgresql.conf, to enable remote access:

listen_addresses = '*'

Configure PostgreSQL 9.1 to allow password access to users from a specific subnet (say 172.31.0.0/16, i.e. the 172.31 subnet)

sudo -i
cd /etc/postgresql/9.1/main

In pg_hba.conf, add:

host all all 172.31.0.0/16 md5

In postgresql.conf, change:

listen_addresses='localhost'

to:

listen_addresses='*'
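
Whether a given client address falls under a host rule, and which rule wins when several match, can be checked with Python's ipaddress module. This is a simplified sketch: it looks only at the address column, ignores the database and user columns, and the second rule (reject everything else) is a hypothetical addition used to show that order matters.

```python
import ipaddress

# (type, database, user, address, method); the reject rule is hypothetical
rules = [
    ("host", "all", "all", "172.31.0.0/16", "md5"),
    ("host", "all", "all", "0.0.0.0/0", "reject"),
]


def auth_method(client_ip, rules):
    """Return the method of the first rule whose address range contains client_ip."""
    ip = ipaddress.ip_address(client_ip)
    for _type, _database, _user, cidr, method in rules:
        if ip in ipaddress.ip_network(cidr):
            return method
    return None  # no rule matched; PostgreSQL would refuse such a connection


inside = auth_method("172.31.5.9", rules)
outside = auth_method("173.31.5.9", rules)
# With the catch-all rule first, the specific rule is never reached
shadowed = auth_method("172.31.5.9", list(reversed(rules)))

print(inside, outside, shadowed)  # md5 reject reject
```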

Configure PostgreSQL 9.1 to not require password for users

This can be used, e.g. to automate the creation of users by means of a script. Basically the following line has to be edited in the pg_hba.conf file:

local   all              all                                trust

Configure PostgreSQL 9.1 to accept remote connections

original article

It's a two-step process, followed by a restart:

enable client authentication (in pg_hba.conf):

host    all             all             192.168.2.0/24          md5

set the daemon to listen on the network interfaces (in postgresql.conf):

listen_addresses = '*'

then restart the server:

sudo /etc/init.d/postgresql restart

Find PostgreSQL version

Connect as an existing user to an existing database and run the "select version()" query, or from the command line:

psql -Uhr -d ab -c 'select version()'

… alternatively, if you don't know the passwords of any existing users but you have sudo privileges, do:

sudo -u postgres psql postgres -c 'SELECT version()' | grep PostgreSQL
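
If the version is needed inside a script, the line printed by the command above can be parsed. A minimal sketch in Python; both sample strings are hypothetical examples of what select version() may print, and the function name parse_version is made up.

```python
import re

# Hypothetical samples of the line returned by select version()
sample_old = " PostgreSQL 9.1.24 on x86_64-unknown-linux-gnu, compiled by gcc, 64-bit"
sample_new = " PostgreSQL 12.22 (Ubuntu) on x86_64-pc-linux-gnu, compiled by gcc, 64-bit"


def parse_version(version_line):
    """Extract the numeric version components as a tuple of ints."""
    match = re.search(r"PostgreSQL (\d+)(?:\.(\d+))?(?:\.(\d+))?", version_line)
    if match is None:
        raise ValueError("no PostgreSQL version found in: %r" % version_line)
    return tuple(int(part) for part in match.groups() if part is not None)


print(parse_version(sample_old))  # (9, 1, 24)
print(parse_version(sample_new))  # (12, 22)
```

Tuples compare element by element, so a check such as parse_version(line) >= (9, 2) works for both the old three-part and the newer two-part numbering.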