
Add seeded execution to the Catalyst runtime #936

Merged
merged 42 commits into main from flaky_test_dynamic_one_shot on Jul 25, 2024

Conversation

paul0403
Contributor

@paul0403 paul0403 commented Jul 16, 2024

Context:
The test test_dynamic_one_shot_several_mcms in frontend/test/pytest/test_mid_circuit_measurement.py was marked as skipped because it was flaky: #842

After some investigation, it was found that the test involves a mid-circuit measurement, which gives random results. The qjit run of the test is unseeded (and therefore random), but the reference default.qubit run is seeded, so the results sometimes fall outside the tolerance of np.allclose.

To resolve this, we add a seeding mechanism for qjit.

Description of the Change:
Implemented a random seeding infrastructure for qjit.

The top-level qjit decorator can now take a string argument seed="some_string". The default value is an empty string, which means an unseeded run.

The string is propagated to the runtime ExecutionContext, which then initializes a PRNG (the C++ std::mt19937 pseudorandom number generator) in the context.
The seed and the PRNG are then passed on to the individual devices.
When performing measurements, the devices draw according to this PRNG.
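For illustration only, here is a minimal C++ sketch of this flow; the class names and the string-to-integer hashing are placeholders rather than the actual Catalyst runtime sources. A context owns a std::mt19937 seeded from the string, and a device draws from that generator when resolving a mid-circuit measurement.

// Minimal sketch, not the actual Catalyst runtime: a context that owns a
// std::mt19937 seeded from the user-provided string, and a device that draws
// from it when resolving a mid-circuit measurement.
#include <cstdint>
#include <functional>
#include <optional>
#include <random>
#include <string>

struct ExecutionContextSketch {
    std::optional<std::mt19937> gen; // empty => unseeded run

    explicit ExecutionContextSketch(const std::string &seed)
    {
        if (!seed.empty()) {
            // Placeholder: reduce the string to a 32-bit seed via std::hash.
            gen.emplace(static_cast<uint32_t>(std::hash<std::string>{}(seed)));
        }
    }
};

struct DeviceSketch {
    std::mt19937 *gen = nullptr; // borrowed from the context; nullptr => unseeded

    // Resolve a mid-circuit measurement on a qubit whose |1> probability is p1.
    bool Measure(double p1)
    {
        std::uniform_real_distribution<double> dist(0.0, 1.0);
        if (gen != nullptr) {
            return dist(*gen) < p1; // deterministic draw sequence for a fixed seed
        }
        std::random_device rd;
        std::mt19937 fresh(rd());
        return dist(fresh) < p1;
    }
};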

A suitable seed is found and set to deterministically resolve the flaky test test_dynamic_one_shot_several_mcms.

Benefits:

  • test_dynamic_one_shot_several_mcms now reliably passes
  • Users can now seed a qjit run

Possible Drawbacks:

Related GitHub Issues: closes #839

[sc-66696]

@paul0403 paul0403 added the reviewer:require-wheels (Pull Requests will need wheel building job successful before being merged) and author:build-wheels (Run the wheel building workflows on this Pull Request) labels on Jul 16, 2024

codecov bot commented Jul 16, 2024

Codecov Report

Attention: Patch coverage is 98.11321% with 2 lines in your changes missing coverage. Please review.

Project coverage is 97.90%. Comparing base (897630c) to head (276761e).

Files Patch % Lines
runtime/include/QuantumDevice.hpp 0.00% 1 Missing ⚠️
runtime/lib/capi/ExecutionContext.hpp 75.00% 1 Missing ⚠️
Additional details and impacted files
@@           Coverage Diff           @@
##             main     #936   +/-   ##
=======================================
  Coverage   97.90%   97.90%           
=======================================
  Files          72       73    +1     
  Lines       10341    10371   +30     
  Branches     1180     1185    +5     
=======================================
+ Hits        10124    10154   +30     
- Misses        171      172    +1     
+ Partials       46       45    -1     

☔ View full report in Codecov by Sentry.

… it on github, as on my local machine I cannot reproduce the flaky failure.
@paul0403 paul0403 force-pushed the flaky_test_dynamic_one_shot branch from 601a897 to da4dd08 on July 16, 2024 19:54
@paul0403
Contributor Author

paul0403 commented Jul 17, 2024

Confirming that this is the behaviour we want when we set a seed (below, "draw" is the name used for the internal random-state draw in the lightning simulator runtime):

  • Each execution of a qjit-compiled function should not return the same results, as the user would expect repeated executions of things like catalyst.measure to be random
  • However, when a seed is set, the evolution history of repeated executions of a qjit-compiled function should stay the same.

i.e.

# in file test.py
import pennylane as qml
from catalyst import qjit, measure, cond

dev = qml.device("lightning.qubit", wires=1, shots=None)

@qjit(seed="qwerty")
@qml.qnode(dev)
def circuit():
    qml.Hadamard(0)
    m = measure(0)

    @cond(m)
    def cfun0():
        qml.Hadamard(0)

    cfun0()
    return qml.probs()

print(circuit(), circuit(), circuit(), circuit())

If we run python test.py, the 4 executions should give results that are "random" relative to each other; however, running python test.py again should produce the same 4 results:

$ python3 test.py
[0.5 0.5] [0.5 0.5] [1. 0.] [0.5 0.5]
draw is 0.73658 draw is 0.797645 draw is 0.267068 draw is 0.793337 
$ python3 test.py
[0.5 0.5] [0.5 0.5] [1. 0.] [0.5 0.5]
draw is 0.73658 draw is 0.797645 draw is 0.267068 draw is 0.793337

The above is exact (shots=None). If we use a high number of shots (e.g. shots=10000), the probabilities will have small fluctuations:

$ python3 flaky_mcm.py
[0.5013 0.4987] [1. 0.] [1. 0.] [1. 0.]
draw is 0.838864 draw is 0.289431 draw is 0.405051 draw is 0.0953809 

$ python3 flaky_mcm.py
[0.5021 0.4979] [1. 0.] [1. 0.] [1. 0.]
draw is 0.838864 draw is 0.289431 draw is 0.405051 draw is 0.0953809

$ python3 flaky_mcm.py
[0.5019 0.4981] [1. 0.] [1. 0.] [1. 0.]
draw is 0.838864 draw is 0.289431 draw is 0.405051 draw is 0.0953809

$ python3 flaky_mcm.py
[0.4899 0.5101] [1. 0.] [1. 0.] [1. 0.]
draw is 0.838864 draw is 0.289431 draw is 0.405051 draw is 0.0953809
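For reference, the behaviour shown above is essentially what a seeded std::mt19937 provides on its own: re-running the program reproduces the same draw sequence, while consecutive draws within one run differ. A minimal standalone sketch (the numeric seed 42 is arbitrary and stands in for the hashed seed string):

// Sketch: a fixed std::mt19937 seed reproduces the same sequence of draws on
// every program run, while consecutive draws within one run still differ.
#include <iostream>
#include <random>

int main()
{
    std::mt19937 gen(42u); // same seed on every run => same "draw" sequence
    std::uniform_real_distribution<double> dist(0.0, 1.0);
    for (int i = 0; i < 4; ++i) {
        std::cout << "draw is " << dist(gen) << ' ';
    }
    std::cout << '\n';
    return 0;
}

Compiling and running this twice prints the same four draws both times, while the four draws within a single run differ, mirroring the qjit output above.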

@dime10 @mudit2812

@dime10
Collaborator

dime10 commented Jul 17, 2024

@paul0403 No, I would say each execution of the qjit function should produce the same results, since it is seeded.

However, if you were to invoke the same QNode multiple times within a single execution of the qjit function, they should produce different results.

@paul0403
Contributor Author

@dime10 Ah, so

dev = qml.device("lightning.qubit", wires=1, shots=None)

@qjit(seed="qwerty")
def workflow():
    @qml.qnode(dev)
    def circuit():
        qml.Hadamard(0)
        m = measure(0)
        @cond(m)
        def cfun0():
            qml.Hadamard(0)
        cfun0()
        return qml.probs()

    return circuit(), circuit(), circuit(), circuit()

print(workflow())
print("#######")
print(workflow())
(Array([0.4975, 0.5025], dtype=float64), Array([1., 0.], dtype=float64), Array([1., 0.], dtype=float64), Array([1., 0.], dtype=float64))
#######
(Array([0.4975, 0.5025], dtype=float64), Array([1., 0.], dtype=float64), Array([1., 0.], dtype=float64), Array([1., 0.], dtype=float64))

i.e. the histories of both workflow calls (workflow being the qjit-compiled function) are the same, but within each call the qnode results are different?

@dime10
Collaborator

dime10 commented Jul 17, 2024

@dime10 Ah, so the histories of both workflow calls (workflow being the qjit-compiled function) are the same, but within each call the qnode results are different?

Exactly :)

@mudit2812
Contributor

@dime10 @paul0403 When we talked yesterday the behaviour I had in mind was more consistent with the first implementation. This is really more of a product question though.

@paul0403
Contributor Author

@dime10 @paul0403 When we talked yesterday the behaviour I had in mind was more consistent with the first implementation. This is really more of a product question though.

@josh146

@dime10
Collaborator

dime10 commented Jul 17, 2024

@dime10 @paul0403 When we talked yesterday the behaviour I had in mind was more consistent with the first implementation. This is really more of a product question though.

That would be harder to realize because there is no persistent state in the runtime across qjit invocations. (and also unnecessary imo)

@paul0403 paul0403 requested review from dime10 and josh146 July 18, 2024 19:43
@paul0403
Contributor Author

paul0403 commented Jul 18, 2024

TODO: add frontend tests for seeded qjit; changelog done

@paul0403
Contributor Author

Question: right now only measurements are seeded. Should sampling (aka shots) be seeded as well?

Review comments (outdated, resolved) on:
runtime/lib/capi/RuntimeCAPI.cpp
frontend/catalyst/utils/gen_mlir.py
runtime/lib/backend/common/Utils.hpp
Collaborator

@dime10 dime10 left a comment
Nice work @paul0403 !

I think there are a few simplifications we can make, and the other question is whether we want to make the other update in the lightning device to use the same randomness for sampling (have you looked at whether this can easily be done?).

Review comments (outdated, resolved) on:
frontend/catalyst/third_party/oqc/src/OQCDevice.hpp
frontend/catalyst/third_party/oqc/src/OQCDevice.cpp
runtime/lib/backend/common/Utils.hpp
runtime/lib/capi/ExecutionContext.hpp
runtime/lib/capi/RuntimeCAPI.cpp
runtime/lib/capi/RuntimeCAPI.cpp
mlir/lib/Quantum/Transforms/ConversionPatterns.cpp
@paul0403
Contributor Author

paul0403 commented Jul 19, 2024

@erick-xanadu @dime10 Thanks for the suggestions! Yes, the root of the complexity is C not allowing overloading.

If we are fine with changing the signature to __catalyst__rt__initialize(char *) everywhere (and modifying all the necessary tests to reflect this), then this is by far the cleanest approach imo. The complex blocks in the conversion pass also become unnecessary.

I have done this in the most recent commit (without going over the tests right now).
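For concreteness, a rough sketch of what the single-entry-point signature could look like; this is illustrative only (the global-context wiring and string handling are placeholders, and the final form in the runtime may differ, e.g. to keep argument-less calls working as mentioned in the follow-up commit below):

// Sketch only: a single initialize entry point that takes an optional seed
// string; a null or empty argument means an unseeded run.
#include <cstdint>
#include <functional>
#include <memory>
#include <optional>
#include <random>
#include <string>

struct ContextSketch {
    std::optional<std::mt19937> gen; // empty => unseeded
};

static std::unique_ptr<ContextSketch> g_context; // placeholder for the runtime context

extern "C" void __catalyst__rt__initialize(char *seed_cstr)
{
    g_context = std::make_unique<ContextSketch>();
    const std::string seed = (seed_cstr != nullptr) ? seed_cstr : "";
    if (!seed.empty()) {
        // Placeholder: hash the string down to a 32-bit PRNG seed.
        g_context->gen.emplace(static_cast<uint32_t>(std::hash<std::string>{}(seed)));
    }
}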

…__rt__init, so the normal uses without supplying any arguments are possible
Collaborator

@dime10 dime10 left a comment

Thanks @paul0403 :)

Review comments (outdated, resolved) on:
runtime/include/RuntimeCAPI.h
runtime/lib/capi/ExecutionContext.hpp
runtime/lib/capi/RuntimeCAPI.cpp
runtime/lib/capi/RuntimeCAPI.cpp
runtime/tests/Test_LightningMeasures.cpp
@paul0403 paul0403 merged commit a49e0bc into main Jul 25, 2024
44 checks passed
@paul0403 paul0403 deleted the flaky_test_dynamic_one_shot branch July 25, 2024 21:52
paul0403 added a commit to PennyLaneAI/pennylane-lightning that referenced this pull request Jul 26, 2024
… We make a small update to track these changes.

The Catalyst PR adds seeding to qjit: PennyLaneAI/catalyst#936
paul0403 added a commit to PennyLaneAI/pennylane-lightning that referenced this pull request Jul 26, 2024

**Context:**
The lightning kokkos files in the Catalyst repo have changed since #770.
We make a small update to track these changes.

The Catalyst PR that made the changes added seeding to qjit:
PennyLaneAI/catalyst#936

**Description of the Change:**
The `lightning_kokkos/catalyst` files now have the MCM seeding support added in Catalyst:
PennyLaneAI/catalyst#936

**Benefits:** unblocks Kokkos with Catalyst

**Possible Drawbacks:** None

**Related GitHub Issues:** None

---------

Co-authored-by: ringo-but-quantum <github-ringo-but-quantum@xanadu.ai>
paul0403 added a commit that referenced this pull request Aug 13, 2024
…t_several_mcms as skipped.

Seeding for qjit was added in #936, but only measurements were seeded, and samples were not. Hence this test is still flaky.
Sample seeding needs to be done in Lightning. #999
paul0403 added a commit that referenced this pull request Aug 13, 2024
**Context:**
Seeding for qjit was added in #936, but only measurements were seeded, and samples were not. Hence this test is still flaky. Sample seeding needs to be done in Lightning. #999

**Description of the Change:**
Marking the test
test_mid_circuit_measurement.py/test_dynamic_one_shot_several_mcms as
xfail.

---------

Co-authored-by: David Ittah <dime10@users.noreply.github.com>
paul0403 added a commit to PennyLaneAI/pennylane-lightning that referenced this pull request Oct 3, 2024
…make the generated samples deterministic (#927)


**Context:**
[A while ago](PennyLaneAI/catalyst#936) a new `seed` option to `qjit` was added. The seed was used to make measurement results deterministic, but samples were still probabilistic. This is because within a `qjit` context, [measurements were controlled from the catalyst repo](https://github.com/PennyLaneAI/catalyst/blob/a580bada575793b780d5366aa77dff6157cd4f93/runtime/lib/backend/common/Utils.hpp#L274), but samples were controlled by lightning.

To resolve stochastically failing tests (i.e. flaky tests) in catalyst, we add seeding for samples in lightning.

**Description of the Change:**
When `qjit(seed=...)` receives an (unsigned 32-bit int) seed value from the user, the seed gets propagated through MLIR and [generates a `std::mt19937` rng instance in the catalyst execution context](https://github.com/PennyLaneAI/catalyst/blob/934726fe750043886415953dbd89a4c4ddeb9a80/runtime/lib/capi/ExecutionContext.hpp#L268). This rng instance eventually becomes a field of the `Catalyst::Runtime::Simulator::LightningSimulator` (and Kokkos) class: [catalyst/runtime/lib/backend/lightning/lightning_dynamic/LightningSimulator.hpp](https://github.com/PennyLaneAI/catalyst/blob/a580bada575793b780d5366aa77dff6157cd4f93/runtime/lib/backend/lightning/lightning_dynamic/LightningSimulator.hpp#L54).

To seed samples, catalyst uses this device rng instance on the state vector's `generate_samples` methods: PennyLaneAI/catalyst#1164.

In lightning, the `generate_samples` method now takes in a seed. The catalyst devices pass a seed into the lightning `generate_samples`; this seed is created deterministically from the aforementioned already-seeded catalyst context rng instance. This makes the generated samples deterministic.
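As a rough sketch of that scheme (function names and signatures here are illustrative stand-ins, not the exact Lightning/Catalyst APIs): the device's already-seeded rng is used to derive a per-call seed, which is then handed to the sample-generation routine.

// Sketch only: derive a deterministic per-call seed from the device PRNG and
// pass it to a generate_samples-style routine.
#include <cstddef>
#include <random>
#include <vector>

// Stand-in for Lightning's sample generation, now accepting a seed.
std::vector<int> generate_samples_sketch(std::size_t num_shots, double p1, std::size_t seed)
{
    std::mt19937 gen(static_cast<std::mt19937::result_type>(seed));
    std::bernoulli_distribution dist(p1);
    std::vector<int> samples(num_shots);
    for (auto &s : samples) {
        s = dist(gen) ? 1 : 0; // sample the |1> outcome with probability p1
    }
    return samples;
}

std::vector<int> DeviceSample(std::mt19937 &device_gen, std::size_t num_shots, double p1)
{
    // Deterministic for a fixed qjit seed, but different for each call within
    // one execution, since it advances the device PRNG.
    const std::size_t derived_seed = device_gen();
    return generate_samples_sketch(num_shots, p1, derived_seed);
}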


**Benefits:** 
Fewer (hopefully no) stochastically failing frontend tests in catalyst.

**Possible Drawbacks:**

**Related GitHub Issues:**
[sc-72878]

---------

Co-authored-by: ringo-but-quantum <github-ringo-but-quantum@xanadu.ai>
dime10 pushed a commit that referenced this pull request Oct 3, 2024
**Context:**
There are still quite a few frontend tests failing stochastically because the `seed` option in `qjit` only controls the measurements, but not the samples. We add seeding to the samples.

**Description of the Change:**
When `qjit(seed=...)` receives an (unsigned 32-bit int) seed value from the user, the seed gets propagated through MLIR and [eventually becomes a field of the `Catalyst::Runtime::Simulator::LightningSimulator` class, alongside the seeded `std::mt19937` rng instance](https://github.com/PennyLaneAI/catalyst/blob/a580bada575793b780d5366aa77dff6157cd4f93/runtime/lib/backend/lightning/lightning_dynamic/LightningSimulator.hpp#L54). This was done in #936.

In #936, [the device's rng instance is used during measurements](https://github.com/PennyLaneAI/catalyst/blob/a580bada575793b780d5366aa77dff6157cd4f93/runtime/lib/backend/lightning/lightning_dynamic/LightningSimulator.cpp#L451), but not during samples. This is because samples are performed from the `Pennylane::LightningQubit::Measures::Measurements` class through the `generate_samples` methods, which is controlled by the lightning repo.

To seed samples, we use the device rng instance to generate a deterministic seed and pass it on to the state vector's `generate_samples` methods. This is the only change in catalyst.

In lightning, the `generate_samples` method can now take in a seed. The catalyst devices pass a seed into the lightning `generate_samples`; this seed is created deterministically from the aforementioned already-seeded catalyst context rng instance. This makes the generated samples deterministic. The above is published on the lightning repo as the branch "seed_sample_lightning":
PennyLaneAI/pennylane-lightning#927

PennyLaneAI/pennylane-lightning@6f3e0d5

**Benefits:**
Fewer (hopefully no) stochastically failing frontend tests.


**Related GitHub Issues:** #999 
[sc-72878]

Successfully merging this pull request may close these issues.

Further investigate the flaky test test_dynamic_one_shot_several_mcms
4 participants