Configuration

Enabling Pekko Persistence Postgres plugin
Database Schema
Reference Configuration
Choosing journal schema variants
Tags caching
Explicitly shutting down the database connections

Enabling Pekko Persistence Postgres plugin

The plugin relies on Slick-pg to create the SQL dialect for the database in use, so the following must be configured in application.conf:

Configure pekko-persistence:

instruct pekko persistence to use the postgres-journal plugin,
instruct pekko persistence to use the postgres-snapshot-store plugin,

Configure slick.db:

set up the connection pool
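
As a minimal sketch of the two steps above, an application.conf could look like the following. The pekko.persistence.journal.plugin and pekko.persistence.snapshot-store.plugin keys are standard Pekko Persistence settings, and url, user, password, driver, numThreads, maxConnections and minConnections are regular Slick connection settings. The placement of the db block under postgres-journal.slick, the JDBC URL, the credentials and the pool sizes are illustrative assumptions, so check them against the plugin's reference.conf.

```hocon
pekko.persistence {
  journal.plugin = "postgres-journal"
  snapshot-store.plugin = "postgres-snapshot-store"
}

postgres-journal {
  slick {
    db {
      // Placeholder connection details, not values taken from the plugin docs
      url = "jdbc:postgresql://localhost:5432/mydb"
      user = "myuser"
      password = "mypassword"
      driver = "org.postgresql.Driver"
      // Assumed pool sizing, tune for your workload
      numThreads = 5
      maxConnections = 5
      minConnections = 1
    }
  }
}
```

With one pool per journal type, the snapshot store and the read journal would presumably need an equivalent slick block of their own.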

Database Schema

Choose the schema that matches the journal variant you use; the variants are described under Choosing journal schema variants.

Reference Configuration

pekko-persistence-postgres provides the defaults as part of the reference.conf. This file documents all the values which can be configured.

There are several ways to configure how your database connections are loaded. The options are explained below.

One database connection pool per journal type

You can create a separate database connection pool per journal type (one pool for the write-journal, one pool for the snapshot-journal, and one pool for the read-journal). This is the default, and the example configuration shows how it is set up.

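Sharing the database connection pool between the journals

It is also possible to create a single connection pool that is shared between all journals. Each journal then refers to the shared database through its use-shared-db setting.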
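
A shared pool could be configured roughly as follows. This is a sketch rather than the plugin's reference configuration: the shared-databases block and the use-shared-db = "slick" value mirror the JNDI example later on this page, while the db block, its connection details and the presence of use-shared-db on postgres-read-journal are assumptions.

```hocon
pekko-persistence-postgres {
  shared-databases {
    // "slick" is the name that use-shared-db refers to below
    slick {
      db {
        // Placeholder connection details
        url = "jdbc:postgresql://localhost:5432/mydb"
        user = "myuser"
        password = "mypassword"
        driver = "org.postgresql.Driver"
      }
    }
  }
}

postgres-journal.use-shared-db = "slick"
postgres-snapshot-store.use-shared-db = "slick"
// Assumed to be supported by the read journal as well
postgres-read-journal.use-shared-db = "slick"
```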

Customized loading of the db connection

It is also possible to load a custom database connection. To do so, create a custom implementation of SlickDatabaseProvider. The method that needs to be implemented supplies the Slick Database to the journals.

To enable your custom SlickDatabaseProvider, the fully qualified class name of the SlickDatabaseProvider needs to be configured in the application.conf. In addition, you might want to consider whether you want the database to be closed automatically:

pekko-persistence-postgres {
  database-provider-fqcn = "com.mypackage.CustomSlickDatabaseProvider"
}
postgres-journal {
  use-shared-db = "enabled" // setting this to any non-empty string prevents the journal from closing the database on shutdown
}
postgres-snapshot-store {
  use-shared-db = "enabled" // setting this to any non-empty string prevents the snapshot-journal from closing the database on shutdown
}

DataSource lookup by JNDI name

The plugin uses Slick as the database access library. Slick supports JNDI for looking up DataSources.

To enable the JNDI lookup, you must add the following to your application.conf:

postgres-journal {
  slick {
    jndiName = "java:jboss/datasources/PostgresDS"
  }
}

When using the use-shared-db = slick setting, the following configuration can serve as an example:

pekko-persistence-postgres {
  shared-databases {
    slick {
      jndiName = "java:/jboss/datasources/bla"
    }
  }
}

Choosing journal schema variants

Currently, the plugin supports three variants of the journal table schema:

flat journal - a single table, similar to what the JDBC plugin provides. All events are appended to the table. The schema can be found here. This is the default schema.

journal with nested partitions by persistenceId and sequenceNumber - this version allows you to shard your events by the persistenceId. Additionally, each of the shards is split by sequenceNumber range to cap the indexes. You can find the schema here. This variant is aimed at services that have a finite and/or small number of unique persistence aggregates, each of which has a big journal.

journal partitioned by ordering (offset) values - this schema fits scenarios with a huge or unbounded number of unique persistence units. Because ordering (offset) is used as the partition key, the plugin can leverage partition pruning while reading from the journal, which improves read performance. You can find the schema here.

Using flat journal

This is the default variant, a schema without any partitions similar to what’s used by Pekko Persistence JDBC.

You do not have to override anything in order to start using it, although if you’d like to set it up explicitly, here’s the necessary config:

postgres-journal.dao = "org.apache.pekko.persistence.postgres.journal.dao.FlatJournalDao"

Using journal partitioned by persistence id and sequence number

In order to start using journal with nested partitions, you have to create a table with nested partitions (here is the schema) and set the Journal DAO FQCN:

postgres-journal.dao = "org.apache.pekko.persistence.postgres.journal.dao.NestedPartitionsJournalDao"

Partition size

The size of the nested partitions (sequence_number’s range) can be changed by setting postgres-journal.tables.journal.partitions.size. By default partition size is set to 10000000 (10M).

The plugin creates partitions automatically, in advance. NestedPartitionsJournalDao keeps track of the partitions it has created and, once a sequence_number falls outside the range of every known partition, creates a new one.

Partition table names

Partitions follow the prefix_sanitizedPersistenceId_partitionNumber naming pattern. The prefix can be configured by changing the postgres-journal.tables.journal.partitions.prefix value. By default it’s set to j. sanitizedPersistenceId is the PersistenceId with all non-word characters replaced by _. partitionNumber is the ordinal number of the partition for a given persistence id.

Example partition names: j_myActor_0, j_myActor_1, j_worker_0 etc.

Keep in mind that the default maximum length for a table name in Postgres is 63 bytes, so you should avoid any non-ASCII characters in your persistenceIds and keep the prefix reasonably short.

Once any of the partitioning settings under the postgres-journal.tables.journal.partitions branch has been set and partitions have been created, you should never change it. Otherwise you might end up with PostgresExceptions caused by table name or range conflicts.
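
To illustrate how these settings fit together, here is a sketch of a nested-partitions setup with a custom partition size and prefix. The keys are the ones described above; the values 5000000 and "ev" are hypothetical and chosen only for the example.

```hocon
postgres-journal {
  dao = "org.apache.pekko.persistence.postgres.journal.dao.NestedPartitionsJournalDao"
  tables.journal.partitions {
    // Hypothetical size: each nested partition covers 5M sequence numbers
    size = 5000000
    // Hypothetical prefix, kept short because of the 63-byte table name limit
    prefix = "ev"
  }
}
```

Following the naming pattern above, a persistenceId such as order-42 would be sanitized to order_42, so with this configuration its partitions would be named ev_order_42_0, ev_order_42_1 and so on.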

Using journal partitioned by ordering (offset)

In order to start using partitioned journal, you have to apply this schema and set the Journal DAO FQCN:

postgres-journal.dao = "org.apache.pekko.persistence.postgres.journal.dao.PartitionedJournalDao"

Partition size

The size of each partition (ordering’s range) can be changed by setting postgres-journal.tables.journal.partitions.size. By default partition size is set to 10000000 (10M).

The plugin creates partitions automatically, in advance. PartitionedJournalDao keeps track of the partitions it has created and, once an ordering value falls outside the range of every known partition, creates a new one.

Partition table names

Partitions follow the prefix_partitionNumber naming pattern. The prefix can be configured by changing the postgres-journal.tables.journal.partitions.prefix value. By default it’s set to j. partitionNumber is the ordinal number of the partition.

Example partition names: j_0, j_1, j_2 etc.

Once any of the partitioning settings under the postgres-journal.tables.journal.partitions branch has been set and partitions have been created, you should never change it. Otherwise you might end up with PostgresExceptions caused by table name or range conflicts.
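
Since these settings must stay stable, one option is to pin the documented defaults explicitly in application.conf so that they are visible alongside the rest of your configuration. This is only a suggestion, not something the plugin requires; the values below simply repeat the defaults stated above.

```hocon
postgres-journal {
  dao = "org.apache.pekko.persistence.postgres.journal.dao.PartitionedJournalDao"
  tables.journal.partitions {
    // Documented default: 10M ordering values per partition
    size = 10000000
    // Documented default prefix, giving partitions j_0, j_1, j_2, ...
    prefix = "j"
  }
}
```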

Tags caching

Tags are mapped to unique integer ids and stored in a column of type int[].

To provide fast access, the plugin caches those mappings. You can define how long a given mapping entry remains in the cache before it is evicted by setting the postgres-journal.tags.cacheTtl (used by the write journal when persisting events) and postgres-read-journal.tags.cacheTtl (used by the read journal when querying events by tags) config parameters.

The default value is 1 hour.
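
For example, the two TTLs could be set independently as sketched below. The keys are the ones named above; the 30 minutes value is hypothetical, and the sketch assumes the settings accept the usual HOCON duration format.

```hocon
// Used by the write journal when persisting events (documented default)
postgres-journal.tags.cacheTtl = 1 hour
// Used by the read journal when querying events by tag (hypothetical value)
postgres-read-journal.tags.cacheTtl = 30 minutes
```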

Explicitly shutting down the database connections

The plugin automatically shuts down the HikariCP connection pool when the ActorSystem is terminated. This is done using ActorSystem.registerOnTermination.
