Numba-optimize findpulses and add inputfreq argument to LDDecode #768

oyvindln · 2022-08-07T17:29:22Z

Upstream the optimized version of findpulses used in vhs-decode. The function itself is significantly faster, but it didn't seem to make much difference to the overall speed when I tested.

Not using the lower threshold; not sure what its purpose would be anyhow, since if anything actually hits it, it will probably mess up the computation in either version.
The min/max length args are used in vhs-decode, but here they are just set so that they have no effect; removing the condition didn't seem to have any speed impact anyhow.
Benchmarking the function itself indicates it's 3-4x faster than the old one, though there is probably still a lot of room for optimizing it further. Ideally one would maybe re-use the array buffers between calls, use a vec-type data structure rather than a list (not sure what the list in numba is actually implemented as), etc. There are other bottleneck functions that would benefit more from porting to numba/cython or other native code, though; the major standouts from some quick profiling are compute_line_bursts and downscale_audio.
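
For readers unfamiliar with the approach, here is a minimal sketch of a numba-compiled pulse finder with a single upper threshold and min/max length limits. It is not the code from this PR: the names `find_pulses_sketch` and `_find_runs_below`, the "below threshold" convention, and the (start, length) return shape are all assumptions made for illustration. It falls back to plain Python if numba or numpy is not installed.

```python
try:
    import numpy as np
    from numba import njit
    HAVE_NUMBA = True
except ImportError:  # fall back to plain Python so the sketch still runs
    HAVE_NUMBA = False

    def njit(*args, **kwargs):
        """No-op stand-in for numba.njit (supports @njit and @njit(...))."""
        if len(args) == 1 and callable(args[0]) and not kwargs:
            return args[0]
        return lambda func: func


@njit
def _find_runs_below(samples, threshold, min_len, max_len):
    """Return (start, length) for each run of samples below `threshold`."""
    pulses = []
    start = -1  # -1 means "not currently inside a pulse"
    n = len(samples)
    for i in range(n):
        if samples[i] < threshold:
            if start < 0:
                start = i  # falling edge: a pulse begins here
        elif start >= 0:
            length = i - start  # rising edge: the pulse ended at i - 1
            if min_len <= length <= max_len:
                pulses.append((start, length))
            start = -1
    # Close a pulse that is still open at the end of the buffer.
    if start >= 0:
        length = n - start
        if min_len <= length <= max_len:
            pulses.append((start, length))
    return pulses


def find_pulses_sketch(samples, threshold, min_len=0, max_len=None):
    """Hypothetical wrapper; the default limits are chosen to have no effect."""
    if max_len is None:
        max_len = len(samples)
    # numba works best on NumPy arrays, so convert when it is available.
    data = np.asarray(samples, dtype=np.float64) if HAVE_NUMBA else samples
    found = _find_runs_below(data, float(threshold), int(min_len), int(max_len))
    return [(int(s), int(length)) for s, length in found]
```

With the default limits, every run below the threshold is reported, which mirrors the idea of setting the min/max length args so that they have no effect.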

Enable numba on unpack_data_4_40 (it was probably commented out by accident when making the PR for that; it makes the function 2x as fast, but its run time is pretty insignificant compared to the rest of the program anyhow).

Add inputfreq argument to LDDecode, mainly to optionally allow decoding at the input frequency in vhs-decode rather than resampling to 40 MHz, which could have some speed benefits since many people capture at lower frequencies with cx etc.

atsampson · 2022-08-07T19:52:54Z

Regarding the inputfreq change, that's how I did it originally back in 2019 (#247). I changed it to resample instead (#303) because the filters in ld-decode were all tuned with a sample rate of 40 MHz, and if you ran it at a different sample rate, then the output quality was significantly worse. It'd be worth checking whether that's still the case...

oyvindln · 2022-08-07T21:36:42Z

Yeah, not planning to run at native sample rate by default, at least not any time soon. The change just allows passing a different sample rate to the constructor so the option is there for testing it.
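
As a rough illustration of what an optional constructor argument like this enables, here is a minimal sketch. `DecoderSketch`, `capture_freq_mhz`, `needs_resample` and `resample_ratio` are hypothetical names invented for this example and are not part of LDDecode. The only details taken from the discussion are the argument name `inputfreq` and the 40 MHz default that the filters were tuned for.

```python
class DecoderSketch:
    """Hypothetical stand-in, not the real LDDecode class."""

    DEFAULT_FREQ_MHZ = 40.0  # rate the existing filters were tuned for

    def __init__(self, capture_freq_mhz, inputfreq=DEFAULT_FREQ_MHZ):
        # Rate at which the samples were captured (assumed parameter).
        self.capture_freq_mhz = float(capture_freq_mhz)
        # Rate the decoder runs at; defaults to 40 MHz unless overridden.
        self.freq = float(inputfreq)

    @property
    def needs_resample(self):
        """True when the capture rate differs from the decode rate."""
        return self.capture_freq_mhz != self.freq

    @property
    def resample_ratio(self):
        """Output samples produced per input sample when resampling."""
        return self.freq / self.capture_freq_mhz


# Default behaviour: a lower-rate capture is resampled up to 40 MHz.
default = DecoderSketch(28.636)
# Opt-in for testing: decode at the capture rate, no resampling.
native = DecoderSketch(28.636, inputfreq=28.636)
```

The 28.636 MHz capture rate is only an example value. As noted above, running at a non-40 MHz rate may reduce output quality unless the filters are retuned, so the default stays at 40 MHz.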

oyvindln added 2 commits August 7, 2022 19:16

numba-optimized findpulses + enable numba on load_packed_data_4_40

c85cda7

Add inputfreq parameter to LDDecode to allow adjusting it

d984a1f

happycube approved these changes Aug 7, 2022

happycube merged commit 8e3c161 into happycube:master Aug 7, 2022

oyvindln deleted the upstream_stuff branch August 7, 2022 23:11