E2E test for the experimental compress algorithm based on https://arxiv.org/abs/2411.19146 (#464)
using MIP-based NAS search algorithm. Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>
Codecov Report ✅ All modified and coverable lines are covered by tests.

@@          Coverage Diff           @@
##    feature/compress    #464   +/- ##
=======================================
  Coverage    73.40%   73.40%
  Files          180      180
  Lines        18077    18077
=======================================
  Hits         13270    13270
  Misses        4807     4807
tests/gpu/torch/_compress/resources/configs/bypass/bypass_distillation_defaults.yaml
tests/gpu/torch/_compress/resources/configs/bypass/llama-3_1-8b_bypass.yaml
Is resources/tokenizer used as a toy tokenizer for testing instead of the original Llama tokenizer?
We could instead reuse the toy models and tokenizers used in other tests. See the comment below in the GPU test file.
Created an internal issue to address this in the next MR: issues/12
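For illustration only, here is a minimal sketch of what truncating a tokenizer vocabulary for a toy test fixture could look like. This is hypothetical and not the repo's actual truncate_tokenizer.py, which may instead operate on a full Hugging Face tokenizer.json; the vocabulary and the `truncate_vocab` helper below are invented for the example.

```python
import json

def truncate_vocab(vocab: dict, keep: int) -> dict:
    """Keep only the first `keep` tokens (by id) of a vocabulary.

    Hypothetical helper: the real script may also rewrite merges,
    special tokens, and the model config to match the smaller vocab.
    """
    return {tok: idx for tok, idx in vocab.items() if idx < keep}

# Toy example: a 6-token vocab truncated to 4 tokens.
vocab = {"<unk>": 0, "<s>": 1, "</s>": 2, "the": 3, "quick": 4, "fox": 5}
small = truncate_vocab(vocab, keep=4)
print(json.dumps(small))
```

A fixture built this way stays a few kilobytes instead of the multi-megabyte original tokenizer, which keeps GPU test setup fast.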
tests/experimental/torch/_compress/resources/tokenizer/truncate_tokenizer.py
Unrelated to this PR, but do we also plan to simplify the YAML files as part of the roadmap? Currently too many settings are spread across too many YAML files. We could move most of them into one common base YAML hidden from users, and only require the user to provide the 4-5 most important inputs to keep things simpler.
This is captured in the NVIDIA internal roadmap.
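The consolidation suggested above could be sketched as a simple overlay of user inputs on a hidden base config. This is only an illustration of the idea; the config keys (`distillation`, `search`, `model_path`, etc.) and the `merge_configs` helper are hypothetical, not the project's actual schema.

```python
def merge_configs(base: dict, overrides: dict) -> dict:
    """Recursively overlay user-provided values on a common base config."""
    merged = dict(base)
    for key, value in overrides.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = merge_configs(merged[key], value)
        else:
            merged[key] = value
    return merged

# Hypothetical keys: a hidden base config plus the handful of user inputs.
base = {
    "distillation": {"epochs": 2, "temperature": 1.0},
    "search": {"algorithm": "mip", "budget": 100},
}
user = {"model_path": "llama-3_1-8b", "search": {"budget": 10}}
config = merge_configs(base, user)
print(config)
```

With this shape, the many per-stage YAML files would collapse into one base document that users never edit, and the user-facing YAML would hold only the few overrides they care about.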
…ation. Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>
Signed-off-by: Keval Morabia <28916987+kevalmorabia97@users.noreply.github.com>
…tmp_path. Signed-off-by: Daniel Korzekwa <dkorzekwa@nvidia.com>
kevalmorabia97 left a comment: Looks good to merge. Thanks for addressing my comments.
What does this PR do?
Type of change: new feature
Overview: E2E test for the experimental compress algorithm based on https://arxiv.org/abs/2411.19146
Usage
See tests/gpu/torch/_compress/test_compress.py
Testing
See tests/gpu/torch/_compress/test_compress.py
Before your PR is "Ready for review"
Additional Information