New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

[profiling] Fuzz profiler API #449

Merged

taegyunkim merged 54 commits into main from dsn/fuzz-profile-api

Jun 10, 2024

Contributor

danielsn commented May 23, 2024

What does this PR do?

Adds fuzzable versions of the data structures, and uses them to fuzz the profiler API.

Motivation

R&D week

Additional Notes

Anything else we should know when reviewing?

How to test the change?

Describe here in detail how the change can be validated.

github-actions bot added the profiling label

codecov-commenter commented May 23, 2024 •

edited

Loading

Codecov Report

Attention: Patch coverage is 94.24779% with 26 lines in your changes are missing coverage. Please review.

Project coverage is 69.36%. Comparing base (4709a67) to head (db18e70).

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #449      +/-   ##
==========================================
+ Coverage   68.93%   69.36%   +0.42%     
==========================================
  Files         193      194       +1     
  Lines       25574    26006     +432     
==========================================
+ Hits        17630    18039     +409     
- Misses       7944     7967      +23

Components	Coverage Δ
crashtracker	`19.34% <ø> (ø)`
datadog-alloc	`98.76% <ø> (ø)`
data-pipeline	`51.45% <ø> (ø)`
data-pipeline-ffi	`0.00% <ø> (ø)`
ddcommon	`85.16% <ø> (ø)`
ddcommon-ffi	`74.15% <ø> (ø)`
ddtelemetry	`56.09% <ø> (ø)`
ipc	`81.69% <ø> (ø)`
profiling	`79.18% <94.24%> (+1.20%)`	⬆️
profiling-ffi	`60.00% <ø> (ø)`
serverless	`0.00% <ø> (ø)`
sidecar	`37.27% <ø> (ø)`
sidecar-ffi	`0.00% <ø> (ø)`
spawn-worker	`54.98% <ø> (ø)`
trace-mini-agent	`69.12% <ø> (ø)`
trace-normalization	`97.79% <ø> (ø)`
trace-obfuscation	`95.74% <ø> (ø)`
trace-protobuf	`33.33% <ø> (ø)`
trace-utils	`92.25% <ø> (ø)`

taegyunkim force-pushed the dsn/fuzz-profile-api branch from 2a1d752 to 900d192 Compare

May 28, 2024 17:41

datadog-datadog-prod-us1 bot commented May 28, 2024 •

edited

Loading

Software Composition Analysis

✅ No library vulnerabilities found (compared 09ff22a against b462829).

danielsn commented

View reviewed changes

profiling/src/internal/profile.rs Outdated Show resolved Hide resolved

taegyunkim force-pushed the dsn/fuzz-profile-api branch from 3971f7a to af75caa Compare

May 29, 2024 17:22

taegyunkim marked this pull request as ready for review

May 29, 2024 17:22

taegyunkim requested a review from a team as a code owner

May 29, 2024 17:22

taegyunkim force-pushed the dsn/fuzz-profile-api branch from bf22384 to 385f554 Compare

May 29, 2024 20:15

danielsn commented

View reviewed changes

profiling/src/internal/profile.rs Outdated Show resolved Hide resolved

profiling/src/internal/profile.rs Outdated Show resolved Hide resolved

profiling/src/internal/profile.rs Outdated Show resolved Hide resolved

danielsn commented

View reviewed changes

profiling/src/internal/profile/mod.rs Outdated Show resolved Hide resolved

profiling/src/internal/profile/fuzz_tests.rs Outdated Show resolved Hide resolved

profiling/src/internal/profile/fuzz_tests.rs Outdated Show resolved Hide resolved

profiling/src/internal/profile/fuzz_tests.rs Outdated Show resolved Hide resolved

morrisonlevi force-pushed the dsn/fuzz-profile-api branch from c9dc1f5 to dc8a9df Compare

May 31, 2024 11:39

taegyunkim force-pushed the dsn/fuzz-profile-api branch from 027ca3e to db18e70 Compare

May 31, 2024 14:53

danielsn commented

View reviewed changes

profiling/src/internal/profile/fuzz_tests.rs

+                  }
+              }
+              /// PartialEq and Hash work on just the key to avoid getting duplicate

Contributor Author

danielsn May 31, 2024

Are we using Eq to compare labels in the tests? Will this miss issues where the values don't agree?

Contributor

taegyunkim Jun 10, 2024

std::hash::Hash doc says this property must hold when implementing Eq and Hash, k1 == k2 -> hash(k1) == hash(k2). In our case, values can disagree even if the hashes are the same.

This was introduced mainly to use Labels as keys for HashMap in https://github.com/DataDog/libdatadog/blob/dsn/fuzz-profile-api/profiling/src/internal/profile/fuzz_tests.rs#L369

profiling/src/internal/profile/fuzz_tests.rs Outdated Show resolved Hide resolved

profiling/src/internal/profile/fuzz_tests.rs Show resolved Hide resolved

profiling/src/internal/profile/fuzz_tests.rs Show resolved Hide resolved

profiling/src/internal/profile/fuzz_tests.rs Show resolved Hide resolved

profiling/src/internal/profile/fuzz_tests.rs Show resolved Hide resolved

profiling/src/internal/profile/fuzz_tests.rs Show resolved Hide resolved

profiling/src/internal/profile/fuzz_tests.rs

+                      if let Some(expected_sample) = expected_timestamped_samples.next() {
+                          assert_eq!(owned_locations, expected_sample.locations);
+                          assert_eq!(sample.values, expected_sample.values);
+                          assert_eq!(owned_labels, expected_sample.labels);

Contributor Author

danielsn May 31, 2024

These should potentially be sorted: there is no requirement that labels remain in the same order

Contributor

morrisonlevi May 31, 2024

Locally I'm sorting the labels and got a new failure:

======================== Test Failure ========================

Input: 
(
    [],
    [
        (
            None,
            Sample {
                locations: [],
                values: [],
                labels: [
                    Label {
                        key: "local root span id",
                        str: None,
                        num: 281474976710656,
                        num_unit: None,
                    },
                    Label {
                        key: "",
                        str: Some(
                            "",
                        ),
                        num: 0,
                        num_unit: Some(
                            "",
                        ),
                    },
                ],
            },
        ),
    ],
)

Error: 
panicked at profiling/src/internal/profile/fuzz_tests.rs:402:18:
Value not found for an aggregated sample
note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace.

==============================================================

Line 402 is this expect:

            let key = (&owned_locations, &owned_labels);
            let expected_values = samples_without_timestamps
                .get(&key)
                .expect("Value not found for an aggregated sample");
            assert_eq!(&sample.values, expected_values);

I think this happens because Vec<Label> and such are used as keys in the hash, and then they don't work out because of order?

Contributor

morrisonlevi May 31, 2024 •

edited

Loading

I pushed this up with function fuzz_failure_001 to reproduce.

Contributor

taegyunkim Jun 10, 2024

Thanks Levi for capturing this test case!

profiling/src/internal/profile/fuzz_tests.rs Outdated Show resolved Hide resolved

profiling/src/internal/profile/fuzz_tests.rs Show resolved Hide resolved

taegyunkim force-pushed the dsn/fuzz-profile-api branch 2 times, most recently from 9cfe3b5 to 9ad1b94 Compare

June 4, 2024 23:47

r1viollet approved these changes

View reviewed changes

Contributor

r1viollet left a comment

LGTM
I don't know much about what a bolero_generator is, but the changes look fine 👍

Contributor

r1viollet commented Jun 6, 2024

I am guessing we have some amount of work to fix the tests before we can merge this ?

test internal::profile::fuzz_tests::fuzz_failure_001 ... FAILED

taegyunkim force-pushed the dsn/fuzz-profile-api branch from 139f2bd to ef5762e Compare

June 10, 2024 13:31

Contributor

taegyunkim commented Jun 10, 2024 •

edited

Loading

I am guessing we have some amount of work to fix the tests before we can merge this ?
test internal::profile::fuzz_tests::fuzz_failure_001 ... FAILED

Yes, the test fails because input labels are not ordered in an expected way.

The label with 'local root span id' should follow other labels, looking at this function.

libdatadog/profiling/src/internal/profile/mod.rs

Lines 493 to 497 in 6f37a28

    
           self.get_label_set(sample.labels)? 
        
               .iter() 
        
               .map(|l| self.get_label(*l).copied()) 
        
               .chain(self.get_endpoint_for_labels(sample.labels).transpose()) 
        
               .chain(timestamp.map(|ts| Ok(Label::num(self.timestamp_key, ts.get(), None))))

This failing test case has been introduced with these code lines.

libdatadog/profiling/src/internal/profile/fuzz_tests.rs

Lines 231 to 247 in 6f37a28

    
           let mut labels = std::collections::HashSet::<Label>::gen_with().generate(driver)?; 
        
           // Ensure that the label has "local root span id" key 
        
           // Generate non-zero num value for the label 
        
           let num = i64::gen().generate(driver)?; 
        
           // zero num is considered as invalid, see Profile::validate_sample_labels 
        
           if num == 0 { 
        
               return None; 
        
           } 
        
           labels.insert(Label { 
        
               key: "local root span id".into(), 
        
               str: None, 
        
               num, 
        
               num_unit: None, 
        
           }); 
        
           let labels = labels.into_iter().collect();

I wanted to have bolero generated Samples to have a label with key 'local root span id', but didn't make sure that the orders are as expected.

danielsn and others added 10 commits

June 11, 2024 02:24


          [profiling] Fuzz profiler API

041973b


          make owned types orthogonal to api types

50929a6


          make owned types orthogonal to api types

2ddda7a


          reformat

ef3de36


          reformat

f23896c


          all the nice types

aa6fd91


          initial fuzz test, fails with duplicate labels on sample

82cbc79


          add timestamp to fuzz test

1ccf691


          serialize the pprof at the end

afdb3b6


          remove arbitrary from deps

f41836c

taegyunkim and others added 26 commits

June 11, 2024 02:24


          fuzz with add_endpoint

799a4e8


          fix clippy

383fe4b


          assert, expect

88bde3d


          comments

fa1680e


          remove unneeded

cdb9236


          remove is_well_formed

5b576f7


          update expect

6a11770


          refactor into add_sample

583f483


          clippy

5c8c354


          narrow search space by not using times

a94c126


          reduce range for miri


          refactor: move stuff for better diff, convert String to Box<str>

41a5deb


          refactor: move fuzz tests for profile.rs into their own file

58c1317


          refactor: simplify derives, rename method

ce4b453


          constructors

65e64fa


          use constructors

301961b


          string_table_fetch helper function

ba175c2


          also use fuzz_add_sample


          add fuzz_failure_001 to help debug

c1cba8d


          resolve comments

be84229


          explain -1

95d2987


          add comment

17134b8


          fix failure case


          add comments


          revive is_well_formed

7de94c0


          use different default ranges for miri

288ff5c

taegyunkim force-pushed the dsn/fuzz-profile-api branch from cc81644 to 288ff5c Compare

June 10, 2024 17:25


          sort not needed

7417cf1

taegyunkim merged commit 2b45d24 into main

26 checks passed

taegyunkim deleted the dsn/fuzz-profile-api branch

June 10, 2024 18:35

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels