[FEA] Create a multi-threaded nvbenchmark for groupby_max #16134

GregoryKimball · 2024-06-28T21:21:31Z

In the IO microbenchmarks for libcudf, we have parquet_reader_multithread, which is a tool for studying GPU saturation with higher host thread counts and one CUDA stream per thread (PTDS, per-thread-default-stream).

https://github.com/rapidsai/cudf/blob/branch-24.08/cpp/benchmarks/io/parquet/parquet_reader_multithread.cpp

Let's please add a similar benchmark for groupby_max that lets us run multiple groupby aggregations concurrently to reach higher GPU saturation.

https://github.com/rapidsai/cudf/blob/branch-24.08/cpp/benchmarks/groupby/group_max.cpp

The text was updated successfully, but these errors were encountered:

GregoryKimball added the feature request New feature or request label Jun 28, 2024

GregoryKimball modified the milestones: CSV continuous improvement, Benchmarking Jun 28, 2024

GregoryKimball assigned srinivasyadav18 Jun 28, 2024

GregoryKimball added the libcudf Affects libcudf (C++/CUDA) code. label Jun 28, 2024

srinivasyadav18 mentioned this issue Jul 1, 2024

Add groupby_max multi-threaded benchmark #16154

Merged

3 tasks

rapids-bot bot closed this as completed in #16154 Jul 10, 2024

rapids-bot bot closed this as completed in f592e9c Jul 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEA] Create a multi-threaded nvbenchmark for groupby_max #16134

[FEA] Create a multi-threaded nvbenchmark for groupby_max #16134

GregoryKimball commented Jun 28, 2024 •

edited

Loading

[FEA] Create a multi-threaded nvbenchmark for groupby_max #16134

[FEA] Create a multi-threaded nvbenchmark for groupby_max #16134

Comments

GregoryKimball commented Jun 28, 2024 • edited Loading

GregoryKimball commented Jun 28, 2024 •

edited

Loading