feat: accelerate inference by begumcig · Pull Request #405 · PrunaAI/pruna

begumcig · 2025-10-14T15:34:57Z

Description

Related Issue

Fixes #(issue number)

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

How Has This Been Tested?

Checklist

My code follows the style guidelines of this project
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

Additional Notes

Note

Revamps device management across engine/evaluation/metrics with accelerate-aware utilities, adds a new image sharpness metric, updates tutorials/docs, and expands tests for multi-device compatibility.

Engine/Runtime:
- Add robust device utilities: device_to_string, split_device, find_bytes_free_per_gpu, enhanced set_to_best_available_device, _resolve_cuda_device, and move_to_device with accelerate device map support and GPTQ consistency.
- get_device now preserves indices and accelerate state; unify batch device moves via move_batch_to_device in InferenceHandler.
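
The summary names these utilities without showing them. As a rough illustration only, the sketch below shows how a device string might be split into a type and an index, and how a nested batch might be moved in one call. The names `split_device_sketch`, `device_to_string_sketch`, and `move_batch_sketch` are hypothetical stand-ins, not the implementations in this PR, and the batch helper assumes only that tensor-like objects expose a `.to(device)` method, as `torch.Tensor` does.

```python
from typing import Any, Optional, Tuple


def split_device_sketch(device: str) -> Tuple[str, Optional[int]]:
    """Split 'cuda:1' into ('cuda', 1); a bare 'cpu' yields ('cpu', None)."""
    kind, sep, index = device.partition(":")
    if not sep:
        return kind, None
    return kind, int(index)


def device_to_string_sketch(kind: str, index: Optional[int] = None) -> str:
    """Inverse of split_device_sketch: ('cuda', 1) -> 'cuda:1'."""
    return kind if index is None else f"{kind}:{index}"


def move_batch_sketch(batch: Any, device: str) -> Any:
    """Recursively move every tensor-like leaf of a nested batch to `device`."""
    if isinstance(batch, dict):
        return {key: move_batch_sketch(value, device) for key, value in batch.items()}
    if isinstance(batch, list):
        return [move_batch_sketch(value, device) for value in batch]
    if isinstance(batch, tuple):
        # Assumption: named tuples are flattened to plain tuples in this sketch.
        return tuple(move_batch_sketch(value, device) for value in batch)
    if hasattr(batch, "to"):
        return batch.to(device)
    return batch  # plain Python values (ints, strings, None) pass through
```

Keeping the index in the parsed result is what lets a caller distinguish `cuda:0` from `cuda:1` instead of collapsing both to `cuda`.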
Evaluation/Task:
- Task now auto-selects devices, separates inference vs. stateful metric devices (stateful_metric_device, low_memory), and builds metrics accordingly.
- EvaluationAgent ensures model/task device consistency, preserves/restores device map, and runs metrics on correct devices.
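
To illustrate what preserving and restoring a device map around an evaluation could look like, here is a minimal sketch built on a context manager. Reading the map from an `hf_device_map` attribute is an assumption (models loaded through transformers with a `device_map` expose one), the helper name `preserved_device_map` is hypothetical, and the EvaluationAgent in this PR may handle this differently.

```python
import copy
from contextlib import contextmanager
from typing import Any, Iterator

_MISSING = object()  # sentinel for "attribute was not set on entry"


@contextmanager
def preserved_device_map(model: Any, attr: str = "hf_device_map") -> Iterator[Any]:
    """Snapshot model.<attr> on entry and put it back on exit, even on error.

    This only restores the bookkeeping attribute; moving weights back to
    their original devices would be a separate step.
    """
    current = getattr(model, attr, _MISSING)
    saved = _MISSING if current is _MISSING else copy.deepcopy(current)
    try:
        yield model
    finally:
        if saved is _MISSING:
            # The attribute did not exist before, so remove any value set inside.
            if hasattr(model, attr):
                delattr(model, attr)
        else:
            setattr(model, attr, saved)
```

The deep copy matters because a device map is typically a mutable dict, so in-place edits during evaluation would otherwise leak into the saved snapshot.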
Metrics:
- Introduce runs_on/device validation and movement in StatefulMetric/BaseMetric and metric implementations.
- Update time/energy/memory/architecture/CLIP metrics to use new device flow; add sync timing across multiple CUDA devices.
- New SharpnessMetric for blind (no-reference) image quality assessment, exported via metrics/__init__.py.
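
The summary does not say how SharpnessMetric computes its score. One widely used no-reference sharpness measure is the variance of the Laplacian, sketched below in plain Python for a grayscale image given as nested lists. Treat this as an assumption about the general technique; the function name `laplacian_variance` is hypothetical and the metric in this PR may compute something different.

```python
from typing import Sequence


def laplacian_variance(image: Sequence[Sequence[float]]) -> float:
    """Variance of the 4-neighbour Laplacian over the interior pixels.

    Sharp images have strong local intensity changes, which produce large
    Laplacian responses and therefore a high variance; blurry or flat
    images produce a variance near zero.
    """
    height, width = len(image), len(image[0])
    if height < 3 or width < 3:
        raise ValueError("image must be at least 3x3")
    responses = []
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            responses.append(
                image[y - 1][x]
                + image[y + 1][x]
                + image[y][x - 1]
                + image[y][x + 1]
                - 4 * image[y][x]
            )
    mean = sum(responses) / len(responses)
    return sum((r - mean) ** 2 for r in responses) / len(responses)
```

No reference image is needed, which is what makes a measure like this usable for blind quality assessment.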
Docs/Tutorials:
- Update metric customization guide with device support (runs_on, move_to_device).
- Notebooks: switch to move_to_device(...) helper.
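
As a sketch of the idea behind `runs_on` and device validation, the hypothetical classes below declare the device types a metric supports and reject anything else before any state is moved. The class names, the `_validate` helper, and the method signature are assumptions for illustration and are not the BaseMetric or StatefulMetric interface from this PR.

```python
from typing import List


class SketchMetric:
    """Hypothetical metric base that declares where it can run via `runs_on`."""

    runs_on: List[str] = ["cpu", "cuda"]

    def __init__(self, device: str = "cpu") -> None:
        self.device = self._validate(device)

    def _validate(self, device: str) -> str:
        kind = device.split(":", 1)[0]  # 'cuda:1' -> 'cuda'
        if kind not in self.runs_on:
            raise ValueError(
                f"{type(self).__name__} runs on {self.runs_on}, got '{device}'"
            )
        return device

    def move_to_device(self, device: str) -> None:
        """Validate the target first; a real metric would also move its state."""
        self.device = self._validate(device)


class CpuOnlyMetric(SketchMetric):
    """Example of a metric that restricts itself to CPU."""

    runs_on = ["cpu"]
```

Validating before assignment means a rejected move leaves the metric on its previous device rather than in a half-moved state.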
Tests:
- Add comprehensive device-compatibility and accelerate-distributed tests; adjust existing tests to new APIs.
Tooling:
- Pre-commit: tweak trufflehog invocation and refine local grep hook.

Written by Cursor Bugbot for commit 72c8c4a. This will update automatically on new commits.


review-notebook-app · 2025-10-22T09:50:55Z

Check out this pull request on ReviewNB.

See visual diffs & provide feedback on Jupyter Notebooks.


begumcig requested a review from johannaSommer October 15, 2025 09:11

johannaSommer approved these changes Oct 20, 2025


begumcig added 10 commits October 22, 2025 08:43

feat: add device compatibility checks to inference and evals for accelerate

1f276af

feat: add supported devices for metrics

e0da55b

test: add distributed latency test, doc updates, comments

819659a

chore: device casting refactoring for stateful metrics

b6de67c

refactor: support more agnostic utility functions

5e0c753

fix: data utilities bug

99548f8

feat: add utils required for accelerate support to new metrics

72b83d7

refactor: set device map for base models and catch batch movement related oom errors

b5325d8

refactor: add comments and add default device movement function for stateful metrics

e6e2eee

refactor: match device type hint to device type

06c8c5d

begumcig force-pushed the feat/accelerate-inference branch from 4f8f23d to 06c8c5d on October 22, 2025 08:45

docs: update notebooks with the new move_to_device interface

72c8c4a

begumcig closed this Oct 22, 2025

begumcig reopened this Oct 22, 2025

begumcig merged commit 792f980 into main Oct 22, 2025
10 checks passed

begumcig merged 11 commits into main from feat/accelerate-inference

2 participants
