Fix profiler to report time spent on GPU kernels again instead of on 'wait for parallel tasks'. #8453

mcourteaux · 2024-11-04T11:04:23Z

So instead of suspend_thread() while waiting for a GPU kernel to complete, which decrements the active thread count and sets the current func to "wait for parallel tasks", this PR now only decrements the active number of threads, such that the current function is still the Func that the kernel is actually producing. As such, you can see the time spent on that kernel, but still see that it was done by 0 threads, indicating it was the GPU that took care of it.

Also don't report the "wait for parallel tasks" if it doesn't contain any time, like for overhead.

Additional drive-by typo fix.

steven-johnson · 2024-11-04T16:06:34Z

We should extract the LLVM fix and land it separately

mcourteaux · 2024-11-04T16:17:11Z

I'll make separate PR. It's here: #8454. @steven-johnson

…on 'wait for parallel tasks'.

abadams

Looks good to me. Feel free to merge once the bots are all green.

mcourteaux requested review from abadams and halidebuildbots November 4, 2024 11:05

mcourteaux force-pushed the fix-profiler-gpu branch from 87a2549 to 2fcf06b Compare November 4, 2024 12:43

Fix profiler to report runtime spent on gpu kernels again instead of …

7165cda

…on 'wait for parallel tasks'.

mcourteaux force-pushed the fix-profiler-gpu branch from 2fcf06b to 7165cda Compare November 4, 2024 16:28

abadams approved these changes Nov 4, 2024

View reviewed changes

mcourteaux merged commit 9ba1829 into halide:main Nov 4, 2024

BrewTestBot mentioned this pull request Dec 17, 2024

halide 19.0.0 Homebrew/homebrew-core#201454

Merged

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix profiler to report time spent on GPU kernels again instead of on 'wait for parallel tasks'. #8453

Fix profiler to report time spent on GPU kernels again instead of on 'wait for parallel tasks'. #8453

Uh oh!

mcourteaux commented Nov 4, 2024 •

edited

Loading

Uh oh!

steven-johnson commented Nov 4, 2024

Uh oh!

mcourteaux commented Nov 4, 2024 •

edited

Loading

Uh oh!

abadams left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Fix profiler to report time spent on GPU kernels again instead of on 'wait for parallel tasks'. #8453

Fix profiler to report time spent on GPU kernels again instead of on 'wait for parallel tasks'. #8453

Uh oh!

Conversation

mcourteaux commented Nov 4, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

steven-johnson commented Nov 4, 2024

Uh oh!

mcourteaux commented Nov 4, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

abadams left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

mcourteaux commented Nov 4, 2024 •

edited

Loading

mcourteaux commented Nov 4, 2024 •

edited

Loading