[FEA] Consider fuzz testing with hypothesis #16129

vyasr · 2024-06-28T19:04:20Z

Is your feature request related to a problem? Please describe.
Currently, libcudf, pylibcudf, and cuDF Python are all tested with a large set of manually written tests. While we endeavor to achieve high coverage of the APIs, we inevitably miss data-dependent edge cases, particularly around inputs such as empty data sets.

Describe the solution you'd like
We should consider using hypothesis or another fuzz testing library to add more systematic verification of different inputs. I recommend doing this at the Python layer since there is better and simpler tooling available, and because pylibcudf testing can be treated as a superset of libcudf testing in this respect to ensure good coverage of the C++.
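As a rough illustration of the idea, here is a minimal sketch of a hypothesis property test over nullable integer columns, including the empty and all-null cases. To keep it runnable without a GPU, the function under test, `sort_nulls_last`, is a hypothetical pure-Python stand-in and not a real cuDF or pylibcudf API. In an actual test, that call would presumably be replaced by a pylibcudf operation checked against a reference such as pyarrow or pandas.

```python
from hypothesis import given, settings, strategies as st


# Hypothetical stand-in for the API under test (an assumption for this
# sketch). A real test would call pylibcudf or cuDF here instead.
def sort_nulls_last(values):
    """Sort a nullable integer column ascending, placing nulls (None) last."""
    non_null = sorted(v for v in values if v is not None)
    return non_null + [None] * (len(values) - len(non_null))


# Strategy for nullable int64 "columns". Lists may be empty or all-null,
# which are exactly the edge cases hand-written tests tend to miss.
nullable_int_columns = st.lists(
    st.one_of(st.none(), st.integers(min_value=-(2**63), max_value=2**63 - 1)),
    max_size=50,
)


@settings(max_examples=200, deadline=None)
@given(nullable_int_columns)
def test_sort_nulls_last(values):
    result = sort_nulls_last(values)
    n_null = values.count(None)

    # The operation must preserve the row count and the null count.
    assert len(result) == len(values)
    assert result.count(None) == n_null

    # All nulls must sit at the end of the result.
    split = len(result) - n_null
    assert all(v is None for v in result[split:])

    # The non-null prefix must match an independently computed reference.
    assert result[:split] == sorted(v for v in values if v is not None)
```

Under pytest, the decorated test is collected like any other test. On failure, hypothesis shrinks the input to a minimal failing example, which is what makes data-dependent bugs easy to reproduce.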

Describe alternatives you've considered
We could also implement fuzz testing in C++ directly using e.g. Google's fuzztest, but that will be a bit more cumbersome to do.

vyasr added the "feature request" (New feature or request) and "tests" (Unit testing for project) labels on Jun 28, 2024

vyasr mentioned this issue Jun 28, 2024

Start migrating I/O writers to pylibcudf (starting with JSON) #15952

Merged

3 tasks

lithomas1 added the "pylibcudf" (Issues specific to the pylibcudf package) and "Python" (Affects Python cuDF API.) labels on Jun 29, 2024
