Ringbuf remove option<T> #18747

ghost · 2014-11-07T20:41:27Z

Fix for task in Metabug #18009 (Rebased version of #18170)

This changes much of about how RingBuf functions. lo, nelts are replaced by a more traditional head andtail. The Vec<Option<T>> is replaced by a bare pointer that is managed by the RingBuf itself. This also expects the ring buffer to always be size that is a power of 2.

This change also includes a number of new tests to cover the some areas that could be of concern with manual memory management.

The benchmarks have been reworked since the old ones were benchmarking of the Ring buffers growth rather then the actual test.

The unit test suite have been expanded, and exposed some bugs in fn get() and fn get_mut()

Benchmark

Before:

test ring_buf::tests::bench_grow_1025                      ... bench:      8919 ns/iter (+/- 87)
test ring_buf::tests::bench_iter_1000                      ... bench:       924 ns/iter (+/- 28)
test ring_buf::tests::bench_mut_iter_1000                  ... bench:       918 ns/iter (+/- 6)
test ring_buf::tests::bench_new                            ... bench:        15 ns/iter (+/- 0)
test ring_buf::tests::bench_pop_100                        ... bench:       294 ns/iter (+/- 9)
test ring_buf::tests::bench_pop_front_100                  ... bench:       948 ns/iter (+/- 32)
test ring_buf::tests::bench_push_back_100                  ... bench:       291 ns/iter (+/- 16)
test ring_buf::tests::bench_push_front_100                 ... bench:       311 ns/iter (+/- 27

After:

test ring_buf::tests::bench_grow_1025                      ... bench:      2209 ns/iter (+/- 169)
test ring_buf::tests::bench_iter_1000                      ... bench:       534 ns/iter (+/- 27)
test ring_buf::tests::bench_mut_iter_1000                  ... bench:       515 ns/iter (+/- 28)
test ring_buf::tests::bench_new                            ... bench:        11 ns/iter (+/- 0)
test ring_buf::tests::bench_pop_100                        ... bench:       170 ns/iter (+/- 5)
test ring_buf::tests::bench_pop_front_100                  ... bench:       171 ns/iter (+/- 11)
test ring_buf::tests::bench_push_back_100                  ... bench:       172 ns/iter (+/- 13)
test ring_buf::tests::bench_push_front_100                 ... bench:       158 ns/iter (+/- 12)

rust-highfive · 2014-11-07T20:41:34Z

Warning

These commits modify unsafe code. Please review it carefully!

ghost · 2014-11-07T20:42:11Z

ping @gankro

Gankra · 2014-11-07T20:51:42Z

❤️ ❤️ ❤️

Thanks so much for persisting! Will review/trick-others-into-reviewing ASAP.

Gankra · 2014-11-07T20:53:44Z

src/libcollections/ring_buf.rs

-    pub fn reserve_exact(&mut self, additional: uint) {
-        // FIXME(Gankro): this is just wrong. The ringbuf won't actually use this space
-        self.elts.reserve_exact(additional);
+    #[deprecated = "use reserve, Ringbuf can no longer be an exact size."]


I think this is the right call, but we need to work out if we want to expose this level of implementation detail in the public APIs (that is, this isn't fundamental to a RingBuf). CC @aturon interested in thoughts. Applies to HashMap as well.

thestinger · 2014-11-10T23:54:18Z

Using debug_assert! extensively will result in this being slower in many cases than the current ring buffer. In that case, it doesn't make sense to merge this.

huonw · 2014-11-11T00:02:19Z

Once #17665 is solved, those problems will go away. We should optimise for the language we are aiming to have, not the one we currently have. It is ridiculously easy to manually do #17665 (i.e. sed -i /debug_assert!/d ringbuf.rs) on a case-by-case basis if it is not in-place by 1.0 but much harder to do the reverse if #17665 is implemented.

ghost · 2014-11-11T01:09:48Z

@gankro both of the oom cases should be covered by csherratt@3905a39

Gankra · 2014-11-11T01:59:20Z

Ah, that it is! (I was going off of clobbered comments)

Just need to figure out if we want the debug_asserts, then. I leave you in the strong, comforting, arms of @huonw then.

ghost · 2014-11-11T02:26:56Z

I added some extra debug_asserts. I was careful in where I placed them to avoid doing the same check twice. The only state that is invalid for the head and tail pointer to be in is if they are greater then or equal to the capacity of ring buffer. Since wrap_index is used in every case to modify the head and tail pointer placing an out of bound check in it will check that assertion everywhere.

The other suspect code is the logic in reserve. So I added a few extra asserts.

Performance wise the extra checks are not free. The largest changes are in the iteration rate.

Before

test ring_buf::tests::bench_grow_1025                      ... bench:      3078 ns/iter (+/- 43)
test ring_buf::tests::bench_iter_1000                      ... bench:       629 ns/iter (+/- 4)
test ring_buf::tests::bench_mut_iter_1000                  ... bench:       625 ns/iter (+/- 9)
test ring_buf::tests::bench_new                            ... bench:        17 ns/iter (+/- 0)
test ring_buf::tests::bench_pop_100                        ... bench:       284 ns/iter (+/- 0)
test ring_buf::tests::bench_pop_front_100                  ... bench:       235 ns/iter (+/- 0)
test ring_buf::tests::bench_push_back_100                  ... bench:       282 ns/iter (+/- 0)
test ring_buf::tests::bench_push_front_100                 ... bench:       234 ns/iter (+/- 1)

After

test ring_buf::tests::bench_grow_1025                      ... bench:      2883 ns/iter (+/- 202)
test ring_buf::tests::bench_iter_1000                      ... bench:       999 ns/iter (+/- 15)
test ring_buf::tests::bench_mut_iter_1000                  ... bench:       994 ns/iter (+/- 21)
test ring_buf::tests::bench_new                            ... bench:        17 ns/iter (+/- 0)
test ring_buf::tests::bench_pop_100                        ... bench:       223 ns/iter (+/- 1)
test ring_buf::tests::bench_pop_front_100                  ... bench:       209 ns/iter (+/- 1)
test ring_buf::tests::bench_push_back_100                  ... bench:       220 ns/iter (+/- 5)
test ring_buf::tests::bench_push_front_100                 ... bench:       215 ns/iter (+/- 1)

mahkoh · 2014-11-11T02:31:34Z

while None != deq.pop_front() {}

These benchmarks are still useless because they can be optimized away.

mahkoh · 2014-11-11T02:34:10Z

Ideally the push and pop benchmarks should handle their inverse with unsafe code so that it doesn't take any time.

ghost · 2014-11-11T02:39:48Z

@mahkoh that's not a bad idea. I'll give it a try.

mahkoh · 2014-11-11T02:39:52Z

src/libcollections/ring_buf.rs

-        self.elts[self.lo] = Some(t);
-        self.nelts += 1u;
+
+        self.tail = wrap_index(self.tail - 1, self.cap);


Most of these function calls can be replaced by

self.tail -= 1;

The only place where you might have to wrap indices is in the reallocating function. By removing these operations you get measurably better performance. Furthermore, if you don't wrap indices you can reduce memory usage by up to 2x because head == tail if and only if the container is empty and therefore you don't need an extra space.

This optimisation can be done in a follow-up PR. Also, what happens here when self.tail = 0?

The only thing that matters is head - tail.

The actual number does matter, e.g. the self.buffer_write(tail, t) below is writing t to self.ptr.offset(tail), where self.ptr points to the start of the allocated buffer (so tail better be positive). As a random other example (that's not reallocation), wrapping also matters in get,

fn get(&self, i: uint) -> Option<&T> { if i < self.len() { unsafe { Some(&mut *self.ptr.offset((self.tail + i) as int)) } } else { None } }

is incorrect if tail can be "negative".

You have to wrap when you actually access the memory, that's clear.

So

The only place where you might have to wrap indices is in the reallocating function

was not correct?

huonw · 2014-11-11T10:55:24Z

I'm happy to r+ this after feedback on #18747 (comment) (everything else can be done later).

ghost · 2014-11-14T05:47:40Z

@huonw I think I have satisfied the asserts you wanted.

huonw · 2014-11-14T06:07:50Z

src/libcollections/ring_buf.rs

+
+    #[test]
+    fn test_drop() {
+        static mut drops: uint = 0;


General point: this could be a non-mut static storing an AtomicUint which would avoid the need for unsafety.

I believe modifying an non-mut static is UB even if it is an Atomic*.

No, the whole purpose of introducing const was to allow non-unsafe mutation of statics by building on top of the Sync built-in trait and only allowing & references to be taken.

(The compiler allows it without unsafe, which is a pretty good indication that something is not UB.)

-Adds unit tests for fn get() and fn get_mut() which are currently untested -Adds unit tests to verify growth of the ringbuffer when reserve is called. -Adds unit tests to confirm that dropping of items is correct Move ringbuf to use a raw buffer instead of Option<T>

Use is_some() in clear to simplify the clear loop.

… tests.

ghost · 2014-11-15T19:58:28Z

Can I get a retry? The first build failed because I got unlucky and this got hit by a breaking change #18827

Fix for task in Metabug #18009 (Rebased version of #18170) This changes much of about how RingBuf functions. `lo`, `nelts` are replaced by a more traditional `head` and`tail`. The `Vec<Option<T>>` is replaced by a bare pointer that is managed by the `RingBuf` itself. This also expects the ring buffer to always be size that is a power of 2. This change also includes a number of new tests to cover the some areas that could be of concern with manual memory management. The benchmarks have been reworked since the old ones were benchmarking of the Ring buffers growth rather then the actual test. The unit test suite have been expanded, and exposed some bugs in `fn get()` and `fn get_mut()` ## Benchmark **Before:** ``` test ring_buf::tests::bench_grow_1025 ... bench: 8919 ns/iter (+/- 87) test ring_buf::tests::bench_iter_1000 ... bench: 924 ns/iter (+/- 28) test ring_buf::tests::bench_mut_iter_1000 ... bench: 918 ns/iter (+/- 6) test ring_buf::tests::bench_new ... bench: 15 ns/iter (+/- 0) test ring_buf::tests::bench_pop_100 ... bench: 294 ns/iter (+/- 9) test ring_buf::tests::bench_pop_front_100 ... bench: 948 ns/iter (+/- 32) test ring_buf::tests::bench_push_back_100 ... bench: 291 ns/iter (+/- 16) test ring_buf::tests::bench_push_front_100 ... bench: 311 ns/iter (+/- 27 ``` **After:** ``` test ring_buf::tests::bench_grow_1025 ... bench: 2209 ns/iter (+/- 169) test ring_buf::tests::bench_iter_1000 ... bench: 534 ns/iter (+/- 27) test ring_buf::tests::bench_mut_iter_1000 ... bench: 515 ns/iter (+/- 28) test ring_buf::tests::bench_new ... bench: 11 ns/iter (+/- 0) test ring_buf::tests::bench_pop_100 ... bench: 170 ns/iter (+/- 5) test ring_buf::tests::bench_pop_front_100 ... bench: 171 ns/iter (+/- 11) test ring_buf::tests::bench_push_back_100 ... bench: 172 ns/iter (+/- 13) test ring_buf::tests::bench_push_front_100 ... bench: 158 ns/iter (+/- 12) ```

Gankra · 2014-11-17T00:49:45Z

Yessss eat @bors it meeeeerged!

🎉

ghost · 2014-11-17T00:51:11Z

@gankro 4th times the charm.

Gankra reviewed Nov 7, 2014
View reviewed changes

mahkoh reviewed Nov 11, 2014
View reviewed changes

huonw reviewed Nov 14, 2014
View reviewed changes

csherratt added 6 commits November 14, 2014 03:41

Handle allocate/reallocate errors in ring_buf

ba24e33

Use is_some() in clear to simplify the clear loop.

Added some extra debug_asserts to ring_buf.

4cae9ad

Manually reset the ringbuffer before or after the ringbuffer push/pop…

5e549d8

… tests.

Added population count assertion in reserve. Cleaned up wrap_index.

4019118

Update ring_buf.rs from fallout of #18827.

6277e3b

bors closed this Nov 17, 2014

bors merged commit 6277e3b into rust-lang:master Nov 17, 2014

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ringbuf remove option<T> #18747

Ringbuf remove option<T> #18747

ghost commented Nov 7, 2014

rust-highfive commented Nov 7, 2014

ghost commented Nov 7, 2014

Gankra commented Nov 7, 2014

Gankra Nov 7, 2014

thestinger commented Nov 10, 2014

huonw commented Nov 11, 2014

ghost commented Nov 11, 2014

Gankra commented Nov 11, 2014

ghost commented Nov 11, 2014

mahkoh commented Nov 11, 2014

mahkoh commented Nov 11, 2014

ghost commented Nov 11, 2014

mahkoh Nov 11, 2014

huonw Nov 11, 2014

mahkoh Nov 11, 2014

huonw Nov 11, 2014

mahkoh Nov 11, 2014

huonw Nov 11, 2014

mahkoh Nov 11, 2014

huonw commented Nov 11, 2014

ghost commented Nov 14, 2014

huonw Nov 14, 2014

tbu- Nov 14, 2014

huonw Nov 14, 2014

ghost commented Nov 15, 2014

Gankra commented Nov 17, 2014

ghost commented Nov 17, 2014

Ringbuf remove option<T> #18747

Ringbuf remove option<T> #18747

Conversation

ghost commented Nov 7, 2014

Benchmark

rust-highfive commented Nov 7, 2014

ghost commented Nov 7, 2014

Gankra commented Nov 7, 2014

Choose a reason for hiding this comment

thestinger commented Nov 10, 2014

huonw commented Nov 11, 2014

ghost commented Nov 11, 2014

Gankra commented Nov 11, 2014

ghost commented Nov 11, 2014

mahkoh commented Nov 11, 2014

mahkoh commented Nov 11, 2014

ghost commented Nov 11, 2014

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

huonw commented Nov 11, 2014

ghost commented Nov 14, 2014

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ghost commented Nov 15, 2014

Gankra commented Nov 17, 2014

ghost commented Nov 17, 2014