Partitioning support for Event Store #770

jokokko · 2017-05-23T13:58:31Z

To avoid table bloat with the Marten Event Store, the events table (mt_events by default) could be partitioned using PostgreSQL table partitioning.

While the table can already be partitioned manually after creation using table inheritance in PG 9.x, PG 10 will introduce native table partitioning with new prerequisites and limitations (see https://www.postgresql.org/docs/10/static/ddl-partitioning.html). Given that "It is not possible to turn a regular table into a partitioned table or vice versa.", it would be extremely useful to be able to define a partitioning scheme upfront, as Marten creates the necessary tables.
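Because native partitioning has to be declared when the table is created, the scheme effectively becomes part of the generated DDL. A minimal sketch of that idea in Python follows. The column set is a simplified, hypothetical stand-in for mt_events, and `partitioned_table_sql` is an illustrative helper, not anything Marten provides.

```python
# Hypothetical, simplified column set; the real mt_events schema is generated by Marten.
EVENT_COLUMNS = {
    "seq_id": "bigint NOT NULL",
    "stream_id": "uuid NOT NULL",
    "version": "bigint NOT NULL",
    "type": "varchar(500) NOT NULL",
    "data": "jsonb NOT NULL",
    "timestamp": "timestamptz NOT NULL DEFAULT now()",
}

# PG 10 supports RANGE and LIST partitioning; HASH arrived in PG 11.
STRATEGIES = ("RANGE", "LIST", "HASH")


def partitioned_table_sql(table, columns, strategy, key):
    """Build a CREATE TABLE statement that declares the partitioning scheme upfront."""
    if strategy not in STRATEGIES:
        raise ValueError(f"unsupported strategy: {strategy}")
    if key not in columns:
        raise ValueError(f"partition key {key!r} is not a column of {table}")
    body = ",\n    ".join(f"{name} {sql_type}" for name, sql_type in columns.items())
    return f"CREATE TABLE {table} (\n    {body}\n) PARTITION BY {strategy} ({key});"


ddl = partitioned_table_sql("mt_events", EVENT_COLUMNS, "RANGE", "seq_id")
print(ddl)
```

The parent table created this way holds no rows itself; each row lands in a child partition that has to be created separately.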

cocowalla · 2017-07-07T15:25:07Z

It would be great to also support this on arbitrary tables when using Marten as a document database, not just as an Event Store.

jokokko · 2017-07-08T08:59:58Z

Indeed. But if it's implemented for Event Store tables (well, mainly mt_events), it should not be too big a step to generalize it to all tables. The hardest part is likely offering a flexible way to define the actual partitioning scheme. Then again, being able to do it via Marten would be a huge advantage, as, afaik, tables cannot be partitioned afterwards (talking about native partitioning in PG 10).

turbo · 2018-09-21T19:40:27Z

I'm relying on Marten for most persistence in my app, but there are two massive tables that are still raw PGSQL because they need partitioning support (by tenant, timestamp). It would be nice to handle this in Marten.

Although I'd urge you to wait for PG11, which will have quite a few partition improvements.

oskardudycz · 2018-09-24T05:34:58Z

@turbo currently you can use Conjoined Tenancy, which will at least give you the possibility to split it by tenant (this also applies to the stream and events tables). I'll have a look at partitioning in PG 10.

isen-ng · 2018-11-15T03:32:44Z

Seems like PG11 has been released
https://www.postgresql.org/about/news/1894/

jeremydmiller · 2019-07-08T00:44:46Z

My thoughts:

Just use what PostgreSQL 11 can do

Partitioning by tenant id should work with native PostgreSQL support, but I'd rather have someone test that before claiming so
Allow users to identify streams by long numbers backed by a sequence and then allow native PostgreSQL to do its thing?
See if PostgreSQL "range" partitioning can work by sequential UUIDs?
Allow users to partition streams by the creation date (not sure how valuable that would be, seems like you'd be mostly querying by stream Id or metadata)
If we did stream level metadata by duplicated columns, it seems like that would greatly aid in partitioning
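For the tenant id option above, native list partitioning needs one child partition per tenant value, or a DEFAULT partition (available from PostgreSQL 11) to catch the rest. A rough sketch of the DDL that would involve, assuming the parent table was created with `PARTITION BY LIST (tenant_id)`. The partition naming scheme and helper names are made up for illustration.

```python
import re


def quote_literal(value):
    """Quote a string as a SQL literal by doubling embedded single quotes."""
    return "'" + value.replace("'", "''") + "'"


def tenant_partition_sql(parent, tenant_id):
    """DDL for a child partition holding exactly one tenant's rows."""
    # Hypothetical naming scheme: lower-case the tenant id and replace unsafe characters.
    suffix = re.sub(r"[^a-z0-9_]", "_", tenant_id.lower())
    return (
        f"CREATE TABLE {parent}_{suffix} PARTITION OF {parent} "
        f"FOR VALUES IN ({quote_literal(tenant_id)});"
    )


def default_partition_sql(parent):
    """DDL for a catch-all partition (PostgreSQL 11+) for tenants without their own partition."""
    return f"CREATE TABLE {parent}_default PARTITION OF {parent} DEFAULT;"


print(tenant_partition_sql("mt_events", "acme"))
print(default_partition_sql("mt_events"))
```

Without a DEFAULT partition, inserting a row for a tenant that has no partition fails, so new tenants would need their partition created before their first event is appended.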

If we programmatically "route" events to separate tables

Gotta figure out how events/streams get assigned to a logical partition. Do you do it by stream type? That would require the stream type in StartStream(), and that hasn't been popular. Could you do it by event type? That's easier to handle in the routing in a way.
The "routing" would need to be part of appending events, the async daemon (totally hoses up the async daemon the way it is today, but we can beat that), and any and all event store queries
Route by metadata elements? That's more complexity
Do we require users to explicitly choose the logical event store? That's easier, but might cause usage problems. Worth asking folks who care about this
If we separate out the events, with today's design it's an event table, the stream table, a generated function to append to the table, and a sequence to track event numbers. If we partition, do we have one of all of these per logical event store?
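To make the "route by event type" option concrete, here is a small sketch of what such routing could look like. It assumes each logical event store owns its own events table, stream table, sequence and append function, as described above. All class names and database object names are hypothetical and do not reflect Marten's actual design.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LogicalEventStore:
    """One logical event store and the database objects it would own (names are hypothetical)."""

    name: str

    @property
    def events_table(self):
        return f"mt_events_{self.name}"

    @property
    def streams_table(self):
        return f"mt_streams_{self.name}"

    @property
    def sequence(self):
        return f"mt_events_sequence_{self.name}"

    @property
    def append_function(self):
        return f"mt_append_event_{self.name}"


class EventTypeRouter:
    """Maps an event type name to the logical event store that should hold it."""

    def __init__(self, default):
        self._default = default
        self._routes = {}

    def route(self, event_type, store):
        self._routes[event_type] = store

    def resolve(self, event_type):
        # Unrouted event types fall back to the default store.
        return self._routes.get(event_type, self._default)


router = EventTypeRouter(default=LogicalEventStore("main"))
router.route("InvoiceIssued", LogicalEventStore("billing"))
print(router.resolve("InvoiceIssued").events_table)
print(router.resolve("UserRegistered").events_table)
```

Appending, the async daemon and every event store query would all need to go through the same `resolve` step, which is where the extra complexity mentioned above comes from.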

Async Daemon

The async daemon might need to take advantage of the partitioning for improved performance. Rebuilds might go much faster if you use "range" partitioning by event sequence #
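As a rough illustration of why range partitioning by event sequence could help rebuilds: a rebuild that reads a contiguous sequence range only touches the partitions whose bounds overlap that range. The sketch below assumes fixed-size partitions (the size is an arbitrary choice here) and follows PostgreSQL's convention that a range partition's lower bound is inclusive and its upper bound exclusive.

```python
def partition_bounds(seq_id, size):
    """Return the (inclusive lower, exclusive upper) bounds of the partition holding seq_id."""
    lower = (seq_id // size) * size
    return lower, lower + size


def partitions_touched(first_seq, last_seq, size):
    """List the bounds of every partition a read of first_seq..last_seq (inclusive) overlaps."""
    bounds = []
    lower, _ = partition_bounds(first_seq, size)
    while lower <= last_seq:
        bounds.append((lower, lower + size))
        lower += size
    return bounds


print(partition_bounds(1_500_000, 1_000_000))
print(partitions_touched(1, 2_500_000, 1_000_000))
```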

oskardudycz · 2019-07-26T10:15:59Z

@jeremydmiller I think that maybe we could start with native partitioning by stream type? Imho this could be the easiest way to start investigating the topic. I could try to work on a PoC (e.g. by adding a new Tenancy "ByStreamType").

If we have partitioning by stream type, then we could try to extend it with further levels, like by date, stream id, etc.

I agree that a duplicated field would be helpful, and that it would be worth checking whether the MetaData would help us with partitioning.

Imho it would also be worth checking TimescaleDB, as it might potentially simplify that process a lot (@cocowalla already did some initial investigation).

p.s. nice introduction to partitioning https://severalnines.com/blog/how-take-advantage-new-partitioning-features-postgresql-11

jeremydmiller · 2020-11-17T18:43:28Z

@oskardudycz @mysticmind Getting back into this a little bit today. Some thoughts:

Haven't spoken w/ y'all about this yet, but I kinda like the idea of letting folks use additional event collections instead of the single, default event collection we have today. That segments things real fast, but it adds all kinds of complexity down the line. So rather than today's stream type (or in addition to), we allow you to define a completely separate event collection (stream & events table)
If you're using the async daemon in a polling mode, I'd vote to have this segmented by a range of the seq_id because that's how the async daemon hits the table.
I agree about duplicating information from the stream to the event table if that helps the partitioning work

jeremydmiller · 2021-02-01T17:40:58Z

From the other day, @mysticmind, @oskardudycz, and I talked about:

Adding some kind of new "is_archived" flag, then partitioning on that first
Partition against the event sequence if users are using the async daemon
Use indexes against the version field if users want that one.
Index against stream id or key if the user intends to read events by stream

There's some opportunity to thin down the indexing for speed.
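A sketch of what "partition on is_archived first" could translate to in PostgreSQL DDL, assuming the parent table was declared with `PARTITION BY LIST (is_archived)`. The partition names are invented, and the optional sub-partitioning of the active partition by sequence is only one way to combine it with the second point above.

```python
def archive_partitions_sql(parent, hot_subpartition_key=None):
    """DDL for an active (hot) and an archived (cold) partition of a list-partitioned table."""
    hot = f"CREATE TABLE {parent}_active PARTITION OF {parent} FOR VALUES IN (false)"
    if hot_subpartition_key is not None:
        # The active partition is itself partitioned; it then needs its own
        # child partitions before any rows can be inserted into it.
        hot += f" PARTITION BY RANGE ({hot_subpartition_key})"
    cold = f"CREATE TABLE {parent}_archived PARTITION OF {parent} FOR VALUES IN (true)"
    return [hot + ";", cold + ";"]


for statement in archive_partitions_sql("mt_events", hot_subpartition_key="seq_id"):
    print(statement)
```

With this layout, queries that filter on `is_archived = false` only need to scan the active partition, which is where the performance benefit would come from.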

jeremydmiller · 2021-05-10T20:18:25Z

This isn't a slam dunk, and it's going to add some extra work for users. It makes perfect sense to partition against:

is_archived by list. This would be tremendously helpful for performance if folks would be rigorous about moving event streams to the archived state
tenant_id by either list or range if it exists. And I'd allow the users to choose how to do that.
Sequence? It'd help the async daemon tremendously, but it would require either guessing upfront how their event table is going to grow, or setting up work to create new partitions as necessary. I don't think it'd be that necessary if we do the event archiving
If using string identifiers for the stream id, maybe allow users to partition by the stream id? That might help tremendously if there's some kind of meaning.
Partition by aggregate/stream type. Seems like a no-brainer.
It might sometimes be valuable to partition by event type
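The sequence option above depends on "setting up work to create new partitions as necessary". One way that work could be planned is sketched below: given the highest upper bound that already has a partition and the current maximum sequence number, emit DDL for whatever partitions are missing, plus some headroom. The function name, partition naming and fixed partition size are all assumptions, and the existing bound is assumed to be a multiple of the size.

```python
def plan_new_partitions(parent, highest_upper_bound, current_max_seq, size, headroom=1):
    """Return DDL for the range partitions needed to stay `headroom` partitions ahead.

    highest_upper_bound is the exclusive upper bound of the newest existing partition.
    """
    # First sequence value that does not need a partition yet.
    target = (current_max_seq // size + 1 + headroom) * size
    statements = []
    lower = highest_upper_bound
    while lower < target:
        upper = lower + size
        statements.append(
            f"CREATE TABLE {parent}_seq_{lower}_{upper} PARTITION OF {parent} "
            f"FOR VALUES FROM ({lower}) TO ({upper});"
        )
        lower = upper
    return statements


for statement in plan_new_partitions("mt_events", 2_000_000, 1_950_000, 1_000_000):
    print(statement)
```

A job like this would have to run often enough that the sequence never outruns the newest partition, which is the operational cost the comment above is weighing against event archiving.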

For indexes:

There's already a unique index for version + stream + sometimes tenant_id, depending on configuration

nkosi23 · 2023-06-01T19:18:18Z

What would be the migration story of such a feature? Would there be a way for existing users to leverage such a new feature without too much pain?

oskardudycz · 2023-06-19T08:03:17Z

@nkosi23, we're still in the planning phase. We'll certainly provide a migration guide, but enabling partitioning may require copying data from the old tables to new ones. See more in:

…l the new 7.25 event store optimizations. Closes GH-770. Closes GH-3321
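Since a regular table cannot be turned into a partitioned one in place, "copying data from old tables to new" generally means something like the steps below. This is a generic PostgreSQL sketch, not Marten's migration procedure: the `_old` suffix and helper name are made up, and indexes, constraints, foreign keys and sequence ownership are deliberately left out and would need to be handled separately.

```python
def migration_steps(table, partition_clause, partition_ddl):
    """Ordered SQL for moving rows from a regular table into a partitioned replacement."""
    old = f"{table}_old"
    return [
        "BEGIN;",
        # Move the existing table out of the way, keeping its data.
        f"ALTER TABLE {table} RENAME TO {old};",
        # Recreate it with the same columns and defaults, this time partitioned.
        f"CREATE TABLE {table} (LIKE {old} INCLUDING DEFAULTS) {partition_clause};",
        # Child partitions must exist before any rows can be copied in.
        *partition_ddl,
        f"INSERT INTO {table} SELECT * FROM {old};",
        f"DROP TABLE {old};",
        "COMMIT;",
    ]


steps = migration_steps(
    "mt_events",
    "PARTITION BY LIST (is_archived)",
    [
        "CREATE TABLE mt_events_active PARTITION OF mt_events FOR VALUES IN (false);",
        "CREATE TABLE mt_events_archived PARTITION OF mt_events FOR VALUES IN (true);",
    ],
)
print("\n".join(steps))
```

On a large events table the copy can take a long time and holds locks for the duration of the transaction, so a real migration would likely need a maintenance window or a batched approach.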

jeremydmiller added the event store label May 30, 2017

jeremydmiller mentioned this issue May 30, 2017

Event Store Overhaul for 2.0 #781

Closed

jeremydmiller added the enhancement label Oct 7, 2017

jeremydmiller mentioned this issue Jul 7, 2019

ARCHIVED -- Event Store Improvements for v4 #1307

Closed

jeremydmiller added this to the 4.0 milestone Apr 28, 2020

jeremydmiller mentioned this issue Nov 18, 2020

Event Store Improvements for V4 #1608

Closed

jeremydmiller mentioned this issue Feb 10, 2021

Re-evaluate the AggregateType on Stream #1671

Closed

jeremydmiller modified the milestones: 4.0, 4.1.0 May 11, 2021

oskardudycz modified the milestones: 4.1.0, 4.2.0 Nov 12, 2021

oskardudycz removed this from the 4.2.0 milestone Nov 21, 2021

jeremydmiller added this to the 7.1.0 milestone Feb 28, 2024

jeremydmiller mentioned this issue Mar 17, 2024

Reboot Projection API Model #3052

Open

jeremydmiller added a commit that referenced this issue Jul 24, 2024

Event store partitioning for hot/cold storage and documentation on al…

cdc2df3

…l the new 7.25 event store optimizations. Closes GH-770. Closes GH-3321

jeremydmiller closed this as completed in 39e4532 Jul 24, 2024

jeremydmiller added a commit that referenced this issue Jul 24, 2024

Event store partitioning for hot/cold storage and documentation on al…

b48aeb7

…l the new 7.25 event store optimizations. Closes GH-770. Closes GH-3321
