
support multiple per-row updates in a single mutation #2768

Closed
0x777 opened this issue Aug 23, 2019 · 47 comments

@0x777
Member

0x777 commented Aug 23, 2019

Currently we allow updating multiple rows (through where), but all of them get the same update. We need to add support for cases where the updates differ per row, say you want to set name to "hello" for the row with id=1 and to "world" for the row with id=2. Something like this, maybe?

{
  update_user_many(
    updates: [
      { where: {id: {_eq: 1}}, _set: {name: "hello"} },
      { where: {id: {_eq: 2}}, _set: {name: "world"} }
    ]
  ) {
    affected_rows
  }
}

Notes:

  1. What happens when there is overlap across the where conditions? What would affected_rows and returning return?
  2. Constructing the updates argument would need a bit of boilerplate.

Maybe we can simplify the API to just use the primary key/unique constraints?

{
  update_user_many(
    updates: [
      { id: 1, _set: {name: "hello"}},
      { id: 2, _set: {name: "world"}}
    ]
  ) {
    affected_rows
  }
}

We can probably use update .. from as suggested here: https://stackoverflow.com/a/18799497
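
For reference, a minimal sketch of that UPDATE .. FROM approach, assuming a hypothetical users table with integer id and text name columns:

-- One statement applies a different value to each matched row.
UPDATE users AS u
SET    name = v.name
FROM   (VALUES
         (1, 'hello'),
         (2, 'world')
       ) AS v(id, name)
WHERE  u.id = v.id;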

@0x777 0x777 added c/server Related to server e/intermediate can be wrapped up in a week k/enhancement New feature or improve an existing feature p/high candidate for being included in the upcoming sprint labels Aug 23, 2019
@revskill10

@0x777 As I see it, for the simpler API we could also use where with a unique constraint key?

@hutber

hutber commented Dec 28, 2019

I would enjoy this feature

@levid

levid commented Jan 6, 2020

I would also really like this feature. Currently I have to update multiple records individually and this would be so much easier and more efficient.

@marcfalk

Would love this too! For now I'm using upsert although it's not highly recommended in the docs. What are the drawbacks of doing that until a multi-update feature is here?

@revskill10

Currently, I use multiple mutations in one GraphQL query to achieve this, as Hasura allows you to use multiple mutations inside one query, and all of them will execute in one transaction.
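
For illustration, a minimal sketch of that workaround (table and column names are hypothetical); each aliased field is an independent update, and Hasura runs the whole operation in one transaction:

mutation {
  # Each alias is a separate update; all of them run in a single transaction.
  u1: update_users(where: {id: {_eq: 1}}, _set: {name: "hello"}) { affected_rows }
  u2: update_users(where: {id: {_eq: 2}}, _set: {name: "world"}) { affected_rows }
}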

@tirumaraiselvan tirumaraiselvan added p/medium non-urgent issues/features that are candidates for being included in one of the upcoming sprints and removed p/high candidate for being included in the upcoming sprint labels Apr 11, 2020
@FarazPatankar

Is there an update on this at all? Also, I am considering going the upsert way like @marcfalk has and was wondering if there are any drawbacks as well. Would love some thoughts on this. @tirumaraiselvan

@michael-land

michael-land commented Apr 11, 2020

Is there an update on this at all? Also, I am considering going the upsert way like @marcfalk has and was wondering if there are any drawbacks as well. Would love some thoughts on this. @tirumaraiselvan

The PK may end up with gaps unless you use UUIDs (a conflicting upsert still consumes a serial sequence value).

e.g.
id 1
id 9
id 100

@FarazPatankar

Is there an update on this at all? Also, I am considering going the upsert way like @marcfalk has and was wondering if there are any drawbacks as well. Would love some thoughts on this. @tirumaraiselvan

The PK may end up with gaps unless you use UUIDs (a conflicting upsert still consumes a serial sequence value).

e.g.
id 1
id 9
id 100

Anything other than this? And does this have any bad effects apart from the fact that there are gaps?

@aaronbski

+1

@FarazPatankar

For anyone struggling with this, I ended up using the upsert mutation for this due to the lack of a response and it works perfectly.

@michael-land

Is this going to be implemented? I don't see why cron/scheduled jobs have a higher priority than multiple per-row updates and multiple auth roles.

@hafiztahajamil

@praveenweb This is an important feature. You must assign this to someone. Thanks

@BPiepmatz

would love to see this too 👍

@hafiztahajamil

@praveenweb Any updates ?

@hafiztahajamil

Currently, I use multiple mutations in one GraphQL query to achieve this, as Hasura allows you to use multiple mutations inside one query, and all of them will execute in one transaction.

@revskill10 How do you dynamically add multiple mutations to a single one, each with different variable values? How do you make use of the aliases dynamically?

@revskill10

@hafiztahajamil No, you can't. You have to embed query variables inside the mutations. It's fast.

@hafiztahajamil

hafiztahajamil commented Oct 6, 2020 via email

@revskill10

revskill10 commented Oct 18, 2020

@hafiztahajamil For example, if I want to update first_name of multiple users in one mutation, I would do:

mutation {
  update_users_1(objects: $users1_objects) { affected_rows }
  update_users_2(objects: $users2_objects) { affected_rows }
  update_users_3(objects: $users3_objects) { affected_rows }
  update_users_4(objects: $users4_objects) { affected_rows }
}

In the above mutation, I generated the update_users_X fields from code, as well as the usersX_objects variables, like:

usersX_objects = [{
  first_name: 'test'
}]

@valstu

valstu commented Mar 19, 2021

Any updates on this? Currently most of my updates need to fall back to upserts, which is something I'd rather not use.

@Lexe003

Lexe003 commented Mar 24, 2021

Updating multiple rows with different PKs by calling the API multiple times sounds really bad. Hope this will be implemented soon. This should be a core feature.

@harpyng

harpyng commented May 7, 2021

Please provide an update on this feature!

Although the upsert alternative is a possibility, the documentation specifically suggests otherwise:
https://hasura.io/docs/latest/graphql/core/databases/postgres/mutations/upsert.html
"Upsert is not a substitute for update¶
The upsert functionality is sometimes confused with the update functionality. However, they work slightly differently. An upsert mutation is used in the case when it’s not clear if the respective row is already present in the database. If it’s known that the row is present in the database, update is the functionality to use.

For an upsert, all columns that are necessary for an insert are required."

I have a use case where I want to update JSON fields across multiple ecommerce products at once and doing it by running a mutation with hundreds of individual updates or serially calling the API is not ideal.

Upsert is a possibility, but as stated above from the docs, it requires all columns necessary for an insert - so it doesn't work well if there are NOT NULL constraints to consider (or it requires a lot of additional and unnecessary information to be sent with each mutation).
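
For context, a minimal sketch of the upsert workaround under discussion (table, constraint, and column names are hypothetical); every column required for an insert must be re-sent, which is exactly the pain point described above:

mutation {
  insert_products(
    objects: [
      # name is NOT NULL, so it must be re-supplied even though only description changes.
      { id: 1, name: "existing name 1", description: "new description 1" },
      { id: 2, name: "existing name 2", description: "new description 2" }
    ],
    on_conflict: { constraint: products_pkey, update_columns: [description] }
  ) {
    affected_rows
  }
}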

@noel-dolan

noel-dolan commented Jun 3, 2021

Wow! First posted 23 Aug 2019 and still not a feature! This is exactly what I was/am looking for too; I won't hold my breath! :p upsert really isn't an option, as all fields are required, which would mean other data being wiped. Surely this should be high up the list of features to be added! :/

Edit:

Looks like the following might be a viable, if not quite perfect, solution - just add an alias to each update request.

mutation myUpdates {
  update1: update_users(where: {id: {_eq: 1}}, _set: { value: 10 }) { affected_rows }
  update2: update_users(where: {id: {_eq: 2}}, _set: { value: 15 }) { affected_rows }
  update3: update_users(where: {id: {_eq: 3}}, _set: { value: 20 }) { affected_rows }
}

@valstu

valstu commented Jun 4, 2021

I also feel this should have a higher priority; I've had multiple projects where this exact feature would have been useful. I mean, it is quite a common case in any project to update multiple rows at once. @0x777 any idea if this will progress or is it abandoned?

@carlpaten
Contributor

Since upsert can't work against partial unique indexes, this is causing issues with our ETL. I'd hate to have to go around Hasura and talk directly to Postgres, which up until now we've never had to do.

@SebasScript

Hi, I'm also in need of this; I will likely go with multiple mutations within one call for now to get around the limitation. But this is a feature that seems to be a very basic building block in the mutations toolset, and I agree with others on the priority of this.

@gilligan gilligan added iteration and removed k/enhancement New feature or improve an existing feature p/medium non-urgent issues/features that are candidates for being included in one of the upcoming sprints e/intermediate can be wrapped up in a week labels Feb 2, 2022
@mikolajkniejski

Please add this feature!

@0x777
Member Author

0x777 commented Mar 28, 2022

Hey folks, we will be picking this up soon. Can you share your use cases here? It'll be really helpful in designing the API.

@dariocravero
Contributor

@0x777 here's our use case. Say a list of fields needs to be updated by PK; right now we do this:

mutation update($id1: uuid!, $object1: ..., $id2: uuid!, $object2: ...) {
  u1: update_users_by_pk(pk_columns: {id: $id1}, _set: $object1) { id }
  u2: update_users_by_pk(pk_columns: {id: $id2}, _set: $object2) { id }
}

Ideally, we'll have something like:

mutation update($objects: [{ id, _set }, {id, _set}...]) {
  update_users_many($objects) { returning { id } }
}

So, effectively, it's like inserting many rows, but with a way to say what you update.

Thanks for taking this on. It'll help DX a lot and make mutations safer! (Right now we construct that mutation on the fly)

@lxblvs

lxblvs commented Mar 29, 2022

We are also updating a big list of objects. I would prefer something like this:

mutation updateMultiple($itemUpdates: [item_update_input!]!) {
  update_items(updates: $itemUpdates) {
    affected_rows
  }
}

where the parameter is

itemUpdates = [
  {where: {id: {_eq: 1}}, _set: {name: "potato"}},
  {where: {id: {_eq: 2}}, _set: {name: "another potato"}},
  {where: {name: {_eq: "not a potato"}}, _set: {name: "yet another potato"}},
]

@adimyth

adimyth commented May 23, 2022

Exactly! None of the solutions above take into account the possibility of needing a variable number of updates.

@tjad

tjad commented Jun 16, 2022

@0x777 I like the idea of having an [update_input!]! syntax which contains where and _set, similar to your first suggestion
or #2768 (comment)

This at least allows a simple syntax for multiple updates within a single transaction. We can easily create the list of update objects dynamically.

In terms of rows affected, I think it could return a value for each object/clause in order of the provided update objects.

K.I.S.S

This would be a good first iteration at the least, IMO - we can worry about overlapping in the next iteration (with a caveat in the documentation).

Off the top of my head, if the queries are run in a transaction, the overlapping should not be significant as the order of updates will be maintained.

@eviefp
Contributor

eviefp commented Jun 30, 2022

We've gone through a few internal design iterations on this, and I'm here to report what we've landed on and ask for feedback.

What I'm working on

I am currently working on implementing the version initially suggested by @0x777, which is essentially a multi-record update by primary key.

Internally, we will make sure the keys don't repeat. If they do, we will use the last value (since it's a list of updates, we'll just pick the one that is closest to the list's end).

This will get translated into a single Postgres UPDATE statement. This is important, because this means we get the best possible performance.

What were the designs we considered

One key technical fact is that Postgres will NOT update a row twice in the same statement. For example, say we wanted to allow this query:

{
  update_user_many(
    updates: [
      { where: {id: {_gt: 1}}, _set: {name: "hello"} },
      { where: {id: {_eq: 2}}, _set: {description: "world"} }
    ]
  ) {
    affected_rows
  }
}

If we have a record with id = 2, then we can't know, before running the query, whether the first or the second update will trigger. However, only one of them will. This sort of non-determinism is not something we want to carry over to our product, because it's unexpected and can easily introduce bugs in our users' code.

In the case of updates by primary key, it's fairly easy to detect when there's an overlap. However, as soon as we add more operators and other columns into the mix, the problem becomes incredibly complex (oftentimes not solvable).

This means we end up with two options:

  • the one we ended up picking: only allow equality on ID instead of a generalized where clause; this means very good performance and predictable results
  • allow generic where clauses and run multiple updates in sequence; this means worse performance but allows overlapping/updating the same row multiple times, while keeping the latest value

What about RETURNING?

In the primary key update version, returning is relatively simple to do and shouldn't surprise anyone.

However, the general where clause brings further complications: say we run two updates, and the first updates 2 rows, and the second 3. Should affected_rows be 5? Well, that might not be true because some of those rows could overlap. We could say "at least 2, at most 5". Or we could just return a list of affected_rows (one for each update).

Returning columns for affected rows for each operation can also be a bit tricky and impair performance.

Conclusion

So, in conclusion, we're going for the solution that:

  • is fast
  • will be out ASAP for everyone to have a look at and give us feedback
  • has predictable results

At the same time, we're wondering: how important is having a generic where clause? Are there any specific operators other than eq which you feel are essential for this feature?

@jackherizsmith

jackherizsmith commented Jul 7, 2022

Hello! I have just come across this thread; looks like great timing. Given the simplicity of option 1, the complexity of option 2, and the significant improvement this update will mean for a lot of developers, I think your conclusion is the right one for the product. In the meantime, developers looking for an all-in-one solution can simply run a prior query that returns the IDs they are looking to update.

@eviefp
Contributor

eviefp commented Jul 18, 2022

I'm happy to announce this feature has been merged: 84366c9.

You should be able to use this feature in the next release!

During these couple of weeks, we've iterated a few times on the solutions and ended up being able to provide a few more features than originally anticipated. You can read about them in the commit's CHANGELOG.

Essentially, this feature will create a new mutation field named update_<table>_many. This will take a list of updates, each with a where clause along with its own _set/_inc/etc. clause. These updates will run in sequence. The return type will be the equivalent of running each update separately. The advantage is that everything is run in a single transaction, so if one of them fails, everything gets rolled back.
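
For example, here is a sketch of the new field against a hypothetical users table (column names assumed from the earlier examples in this thread):

mutation {
  update_users_many(
    updates: [
      { where: {id: {_eq: 1}}, _set: {name: "hello"} },
      { where: {id: {_eq: 2}}, _set: {name: "world"} }
    ]
  ) {
    # One mutation response per entry in `updates`, in order.
    affected_rows
    returning { id name }
  }
}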

We're excited to hear back from you and get feedback on this new issue! Let us know how you end up using it.

@revskill10

@eviefp This is a deal breaker for Hasura. Cheers for the launch.

@eviefp
Contributor

eviefp commented Jul 18, 2022

Hey @revskill10, do you care to elaborate? What exactly is the deal breaker part?

@hutber

hutber commented Jul 18, 2022

I'd just like to say an extreme thank you to the devs working on this feature over the last 3 years :D I love you, my wife loves you, my wife's wife loves you!!! Everybody loves you!

@lxblvs

lxblvs commented Jul 18, 2022

This is very cool, thank you for this feature. I will definitely use it and praise the developers who wrote this code.

But I still hope that at some point you change your position on complex where clauses in the multi-updates. If we want to shoot ourselves in the foot and run conflicting mutations in a single call - please let us. Or maybe have a flag to run them sequentially and not as a single transaction?

For now we can do a preflight call to resolve the ids or maybe have an Apollo preprocessor in front of Hasura that does it for us.

@revskill10

Hi @eviefp It's almost the same as Prisma's Transaction feature, but with one more difference.

In a Prisma transaction, you can mix and match both queries and mutations, though.

@eviefp
Contributor

eviefp commented Jul 19, 2022

@lxblvs

But I still hope that at some point you change your position on complex where clauses in the multi-updates. If we want to shoot ourselves in the foot and run conflicting mutations in a single call - please let us.

We actually did change that! So right now we allow arbitrary where clauses that can freely overlap. We just run them in sequence, inside a transaction.
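
To illustrate (again with a hypothetical users table): overlapping updates run in order, so the later entry wins for any row both of them match:

mutation {
  update_users_many(
    updates: [
      # Matches every row, including id = 2 ...
      { where: {id: {_gt: 0}}, _set: {name: "first pass"} },
      # ... and then this runs, so row 2 ends up as "second pass".
      { where: {id: {_eq: 2}}, _set: {name: "second pass"} }
    ]
  ) {
    affected_rows
  }
}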

Or maybe have a flag to run them sequentially and not as a single transaction?

If there's user interest, we could definitely add a flag/option to allow running outside a transaction scope, ignoring errors.

@lxblvs

lxblvs commented Jul 19, 2022

@eviefp oh great! Are you also keeping your original update_table_many_by_pk? Running optimised mutations by PK would be quite badass for bulk deterministic mutations (like updating rows in a table where the ids are known by the front-end).

@eviefp
Contributor

eviefp commented Jul 19, 2022

@eviefp oh great! Are you also keeping your original update_table_many_by_pk? Running optimised mutations by PK would be quite badass for bulk deterministic mutations (like updating rows in a table where the ids are known by the front-end).

@lxblvs We gave that up in favor of this current iteration. However, if we get enough requests, we can definitely prioritize the original update_table_many_by_pk!

@ajohnson1200

bulk deterministic mutations (like updating rows in a table where ids are known by the front-end)

Just so that we deeply understand the problem, can you talk through your use case in a little more detail? i.e.: when you say "bulk", are you talking 10 rows, 100 rows, or 1000 rows? And does the new solution that @eviefp outlined above prohibit you from accomplishing that goal... or is it instead that you can accomplish the goal, it's just not as fast?

@sassela
Contributor

sassela commented Sep 15, 2022

Thanks for your patience with this, folks. I'm closing this issue as it was implemented at d76aab9 and released in v2.10.0. But please keep your feedback coming via comments on this issue; the Hasura team will be notified of it.

@sassela sassela closed this as completed Sep 15, 2022