feat: add pre-smash-hook for model preparation by simlang · Pull Request #309 · PrunaAI/pruna

simlang · 2025-08-20T12:14:58Z

Description

This PR introduces new functionality, such that any algorithm can implement a setup function which is called before any smashing algorithm is applied.

To do pre-smash-setup an algorithm has to override _pre_smash_setup to apply in-place operations on the model based on information provided in the SmashConfig for that specific algorithm.

Related Issue

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

How Has This Been Tested?

If no algorithm overrides _pre_smash_setup there should be no change of functionality compared to the current version.
Therefore to test, I successfully ran the existing tests

Checklist

My code follows the style guidelines of this project
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

Additional Notes

gsprochette

Looks almost good to me, I left a few comments:

some docstring and name related stuff
a discussion about undoing the setup after the smash.

It would be nice to have a unit test to make sure that the pre_smash_setup is executed when the aglo is activated and not executed otherwise, this can be done with a monkey patch of an existing method.

src/pruna/algorithms/pruna_base.py

src/pruna/engine/pre_smash_setup.py

gsprochette · 2025-08-20T12:56:07Z

src/pruna/engine/pre_smash_setup.py

+    for current_group in ALGORITHM_GROUPS:
+        algorithm = smash_config[current_group]
+        if algorithm is not None:
+            check_algorithm_availability(algorithm, current_group, algorithm_dict)


This call is repeated many times: 1. in this function, 2. in the smash loop and 3. in check_model_compatibility. Should we take this opportunity to define a check_active_algorithm_availabilities function and running it at the beginning of smash?

i'm not sure if this PR is the correct place for it? but generally I agree - @johannaSommer thoughts on this?

I agree it's not really the PR for it could we clean this up while we're working on this part of the code? Meaning in a follow up PR (my favorite option) or directly here. Would you be ok with that? Johanna do you have a strong opinion about this?

gsprochette · 2025-08-20T13:03:18Z

src/pruna/algorithms/pruna_base.py

        """
        return []

+    def pre_smash_setup(self, model: Any, smash_config: SmashConfig) -> None:


Is there an argument for a post_smash_? as well? For example if the pre_smash_setup computes something based on the pre-smashed model and stores it in smash_config, a post_smash could be the opportunity to destroy it and restore the smash_config to its original state before the smash function was applied. What do you think?

I think there could be an argument for it.
For e.g. recovery it could make sense to create the dataset in the pre-smash, replace the current data and then in a potential post_smash insert the original one again.
What do you think @johannaSommer?

johannaSommer · 2025-08-20T14:40:34Z

src/pruna/smash.py

            check_model_compatibility(model, smash_config)

+        # perform any necessary setup steps before the smashing process begins
+        pre_smash_setup(model, smash_config)


right now this only allows inplace operation. After discussion with Simon we can leave it like that for now but should add documentation/justification as to why

johannaSommer · 2025-08-20T14:42:05Z

src/pruna/engine/pre_smash_setup.py

+# See the License for the specific language governing permissions and
+# limitations under the License.
+
+from __future__ import annotations


i would slightly prefer not putting this into a separate file and putting it into the "compatibility checks" file as there we have all the pre-smash checks and setup (e.g. device casting). If you feel the naming of the file is a problem feel free to change it to pre_smash_setup or so

I'm fine with either options. The point is to keep both the engine dir and the pre_smash_setup file manageable. Each direction is a problem only if we have too many of those setup functions, in which case we can split them into multiple files in a pres_smash_setup directory instead. As long as we don't have that sort of problem I don't have a strong opinion :)

johannaSommer · 2025-08-20T14:47:12Z

src/pruna/engine/pre_smash_setup.py

+        algorithm = smash_config[current_group]
+        if algorithm is not None:
+            check_algorithm_availability(algorithm, current_group, algorithm_dict)
+            algorithm_dict[current_group][algorithm].pre_smash_setup(model, smash_config)


merge with existing function in compatibility check and possible adjust naming? -> pre_smash_hook?

…cation

gsprochette · 2025-08-28T14:01:33Z

src/pruna/algorithms/pruna_base.py

+
+    def _pre_smash_hook(self, model: Any, smash_config: SmashConfigPrefixWrapper) -> None:
+        """
+        Function to be overridden by an algorithm to perform a pre-smash setup.


pre-smash-hook instead of setup in the doc?

gsprochette · 2025-08-28T14:05:40Z

tests/config/test_pre_smash_hook.py

+    model, smash_config = model_fixture
+
+    pre_smash_hook_called = False
+    def mock_pre_smash_hook(self: LLMInt8Quantizer, model: Any, smash_config: SmashConfigPrefixWrapper) -> None:


gsprochette

Thanks for adressing every comment, this looks super good :) I may have found a typo in the _pre_smash_hook docstring but other than that it's ready to merge 🤩

The post_smash_hook thing, we can probably wait until we have a case where we need it, or simply add it in a follow-up PR.

johannaSommer

Thanks Simon! Agree with @gsprochette that we should keep the post smash hook in mind but since no algorithm needs it at the moment let's keep it for later :)

feat: implement pre-smash-setup for model preparation

25e8378

simlang requested review from gsprochette and johannaSommer August 20, 2025 12:15

gsprochette requested changes Aug 20, 2025

View reviewed changes

johannaSommer requested changes Aug 20, 2025

View reviewed changes

simlang added 3 commits August 25, 2025 14:40

chore: adjust docstrings, filenaming, function naming and function lo…

0363e4e

…cation

test: add pre-smash-hook test

7a05fba

test: fix device of new test

b0bcec4

simlang requested review from gsprochette and johannaSommer August 25, 2025 15:44

gsprochette reviewed Aug 28, 2025

View reviewed changes

gsprochette self-requested a review August 28, 2025 14:07

gsprochette approved these changes Aug 28, 2025

View reviewed changes

johannaSommer approved these changes Aug 29, 2025

View reviewed changes

style: adapted docs to the new more general name hook instead of setup

af0eec2

simlang changed the title ~~Add pre-smash-setup for model preparation~~ feat: add pre-smash-setup for model preparation Aug 29, 2025

simlang changed the title ~~feat: add pre-smash-setup for model preparation~~ feat: add pre-smash-hook for model preparation Aug 29, 2025

simlang merged commit 309be37 into main Aug 29, 2025
7 checks passed

simlang deleted the feat/introduce-pre-smash-setup branch August 29, 2025 08:31

Conversation

simlang commented Aug 20, 2025

Description

Related Issue

Type of Change

How Has This Been Tested?

Checklist

Additional Notes

Uh oh!

gsprochette left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

gsprochette Aug 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

gsprochette left a comment

Choose a reason for hiding this comment

Uh oh!

johannaSommer left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

gsprochette Aug 26, 2025 •

edited

Loading