Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEA] Create a multi-threaded nvbenchmark for groupby_max #16134

Closed
GregoryKimball opened this issue Jun 28, 2024 · 0 comments · Fixed by #16154
Closed

[FEA] Create a multi-threaded nvbenchmark for groupby_max #16134

GregoryKimball opened this issue Jun 28, 2024 · 0 comments · Fixed by #16154
Assignees
Labels
feature request New feature or request libcudf Affects libcudf (C++/CUDA) code.
Milestone

Comments

@GregoryKimball
Copy link
Contributor

GregoryKimball commented Jun 28, 2024

In the IO microbenchmarks for libcudf, we have parquet_reader_multithread, which is a tool for studying GPU saturation with higher host thread counts and one CUDA stream per thread (PTDS, per-thread-default-stream).

https://github.com/rapidsai/cudf/blob/branch-24.08/cpp/benchmarks/io/parquet/parquet_reader_multithread.cpp

Let's please add a similar benchmark for groupby_max that lets us run multiple groupby aggregations concurrently to reach higher GPU saturation.

https://github.com/rapidsai/cudf/blob/branch-24.08/cpp/benchmarks/groupby/group_max.cpp

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature request New feature or request libcudf Affects libcudf (C++/CUDA) code.
Projects
Status: Done
Status: No status
Development

Successfully merging a pull request may close this issue.

2 participants