docs: update Flux tutorial as a more general image generation tutorial by davidberenstein1957 · Pull Request #127 · PrunaAI/pruna

davidberenstein1957 · 2025-05-16T06:40:12Z

Deleted the flux_small.ipynb tutorial as it is no longer relevant.
Introduced image_generation.ipynb tutorial to demonstrate optimizing and evaluating image generation models using the pruna package, including details on optimization algorithms and evaluation metrics.
Updated the tutorial index to reflect these changes.

Description

Related Issue

Fixes #(issue number)

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

How Has This Been Tested?

Checklist

My code follows the style guidelines of this project
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

Additional Notes

davidberenstein1957 · 2025-05-22T09:59:24Z

@nifleisch, that looks good! I would leave the printed statements in the tutorials so people can actually read the output and understand better what is happening.

nifleisch · 2025-05-26T14:19:49Z

@nifleisch, that looks good! I would leave the printed statements in the tutorials so people can actually read the output and understand better what is happening.

I see your point. This way, users could follow the tutorial without running it on their own machines. I just spoke with Johanna, and we’re now clearing the outputs of all tutorial notebooks by default. So even if outputs remain in the tutorial, they won’t be included in the documentation.

davidberenstein1957 · 2025-05-26T14:30:48Z

That's good to know. Let's remove the output clearing, then. I think the output helps understanding, so it is fine to keep them, but feel free to verify with @johannaSommer since we initially decided to remove them.

nifleisch · 2025-05-27T11:43:45Z

That's good to know. Let's remove the output clearing, then. I think the output helps understanding, so it is fine to keep them, but feel free to verify with @johannaSommer since we initially decided to remove them.

Good point. I just checked with @johannaSommer, and she’s on board with this as well. Will add them back to the tutorial.

sharpenb

THanks for the tutorial. Here are my comments on the ntoebook :)

I like the idea of a tutorial card at the top but it could be made cleaner (e.g. in a table?). I would also not mention the distinciton between metrics. The term "base/statefull" is more a implementation relevant rather than a user relevant. I would also remove the libraries from there.
Do we need to show how ot install pruna? We do not do this in the other tutorialx and have a doc page for it.
We could merge the set device cell with theload model cell
Could we have "1., 2. 3." in subtitles like in other tutorials?
We could mention how long is the smashing for this smash config so that peopl know what to expect.
Define all metrics in a single list.
first_results - > base_model_results (and smashed_model_results.
Could we print all metrics in a nice way in a single cell? i.e. merge cell 9. 10. 11.?
Is this tutorial only with SD2.1? Make sense since it is quick to run but also unclear since a bit outdated.

sharpenb

Thanks for the udpate! Here are still some left comments:

Let's remove the library row from the table
Do we need to show how ot install pruna? We do not do this in the other tutorialx and have a doc page for it. I would suggest o redicrect to the installation page if people need.
We could merge the "set device" cell with the "load model" cell
Let's have numbers for sections like in other tutorials. Could we have "1., 2. 3." in subtitles like in other tutorials? :)
I feel that we could find a nicer way to print the metrics e.g. the python print a small table with metrics before/after and relative difference in 3 columns.
I feel that a small intro at the start of the tutorial to explain what we try to do and that it can work with other model than SD2.1 would be nice to give context
We should also update the title of the notebook itself for "Compress and evaluate"

davidberenstein1957 · 2025-06-06T07:44:02Z

Hi @sharpenb, I think I once typed a response already but did not post it. Underneath are some of my reasons for the choices.

Thanks for the udpate! Here are still some left comments:

Let's remove the library row from the table

sure

Do we need to show how ot install pruna? We do not do this in the other tutorialx and have a doc page for it. I would suggest o redicrect to the installation page if people need.

Yes, but if people want to run this as a standalone notebook, let's say in Jupyter or Google Colab or whatever other IDE, we need to install Pruna.

We could merge the "set device" cell with the "load model" cell

I kept this separate as it is a bit more explicit and the device is used in multiple places throughout the notebook

Let's have numbers for sections like in other tutorials. Could we have "1., 2. 3." in subtitles like in other tutorials? :)

I left this out because I don't feel it adds value, and it makes it more challenging to maintain as it is a manual process.

I feel that we could find a nicer way to print the metrics, e.g. the Python print a small table with metrics before/after and relative difference in 3 columns.

I agree it could be nicer, but do you feel this is worth spending 30 minutes on (re-running the notebook evals and such)?

I feel that a small intro at the start of the tutorial to explain what we try to do and that it can work with other models than SD2.1 would be nice to give context

If you feel it is important to use a more modern model, I am happy to set that up, but I think it makes it
less approachable to run end-to-end. Perhaps we can use stabilityai/stable-diffusion-3.5-medium or Efficient-Large-Model/SANA1.5_4.8B_1024px_diffusers?

We should also update the title of the notebook itself for "Compress and evaluate"

I think it is better to keep it more general, although the specific techniques we apply focus on compression.

davidberenstein1957 · 2025-06-10T16:41:43Z

@nifleisch @sharpenb @SaboniAmine could you review this PR, I think it would be nice to merge before the release.

…rial - Deleted the `flux_small.ipynb` tutorial as it is no longer relevant. - Introduced `image_generation.ipynb` tutorial to demonstrate optimizing and evaluating image generation models using the `pruna` package, including details on optimization algorithms and evaluation metrics. - Updated the tutorial index to reflect these changes.

…setup instructions - Added installation instructions for the `pruna` package. - Included a section on setting the device for optimization, ensuring users can utilize the best available hardware.

…ation details - Changed language ID in the notebook metadata for better compatibility. - Added sections for loading the model and tokenizer, including a specific example using the CompVis/stable-diffusion-v1-4 model. - Included code snippets for model optimization and evaluation, enhancing the tutorial's comprehensiveness. - Updated tutorial index to reflect changes in the image generation process.

…ts and output enhancements - Updated execution counts for several code cells to ensure proper sequencing. - Modified output messages for improved clarity and accuracy during model evaluation. - Adjusted configuration parameters for the `hqq_diffusers` quantizer to optimize performance. - Added new image outputs to enhance the visual representation of results.

- Reformatted the introduction section into a table for better readability. - Enhanced explanations regarding model optimization and evaluation metrics. - Updated code snippets for consistency and improved execution flow. - Removed unnecessary output cells to streamline the tutorial experience.

…ents - Updated model reference to [segmind/Segmind-Vega] for improved performance. - Revised evaluation metrics to include `throughput` and `total time`. - Adjusted execution counts for code cells to maintain proper flow. - Enhanced output display for clearer metric comparisons during evaluation.

- Changed model saving and loading references from "stable-diffusion-2-1-smashed" to "segmind-vega-smashed" for consistency with recent updates. - Updated corresponding comments to reflect the new model name.

- Clarified the output quality discussion by removing redundant phrasing regarding the CLIP score, enhancing the overall readability of the section.

- Removed `elapsed_time` from the evaluation metrics to focus on key performance indicators. - Cleared unnecessary output cells to enhance the tutorial's clarity and flow.

- Changed the tutorial title from "Optimize and Evaluate Image Generation Models" to "Compress and Evaluate Image Generation Models" for clarity. - Added numbered sections to improve navigation and organization within the tutorial. - Removed unnecessary metadata from code cells to streamline the content.

sharpenb

Thanks for the update! A last update before merging would be to indicate the units with througput (samples/s), latency (s/samples) and Base Model, Compressed Model, Relative Difference in the row/columns names to be complete :)

- Replaced `TotalTimeMetric` with `LatencyMetric` to better reflect performance evaluation. - Updated markdown table headers and content for clarity, changing "Original" and "Optimized" to "Base Model" and "Compressed Model" respectively. - Added units to the markdown table for improved understanding of metric values.

davidberenstein1957 · 2025-06-17T12:46:28Z

hi @sharpenb I resolved your comments.

sharpenb

Looks good! Similar to the LLM docs, it would be great to put latency (see here).

nifleisch

I love the new tutorial, it feels very complete. Through it I learned about the amazing segmind/Segmind-Vega model that we probably should use more often for tutorials as it looks stunning!
Two minor things:

At the beginning of the tutorial in section 1 there is a typo "howwever"
In the last part you use both the throughput and the latency metric, which felt a bit redundant to me because throuput ~ 1 / latency. Instead I would have preferred a fidelity measure. (feel free to ignore this suggestion)

davidberenstein1957 linked an issue May 16, 2025 that may be closed by this pull request

[DOC] Update end to end tutorial of using flux #120

Closed

davidberenstein1957 requested a review from nifleisch May 16, 2025 08:41

davidberenstein1957 changed the title ~~docs: remove outdated Flux tutorial and add new image generation tutorial~~ docs: update Flux tutorial as a more general image generation tutorial May 16, 2025

davidberenstein1957 removed the request for review from nifleisch May 20, 2025 13:25

davidberenstein1957 assigned nifleisch May 20, 2025

nifleisch force-pushed the docs/120-doc-update-end-to-end-tutorial-of-using-flux branch from 5b9d1de to 79e5544 Compare May 21, 2025 10:14

davidberenstein1957 marked this pull request as ready for review May 22, 2025 09:58

nifleisch requested review from nifleisch and sharpenb and removed request for nifleisch May 26, 2025 14:22

nifleisch force-pushed the docs/120-doc-update-end-to-end-tutorial-of-using-flux branch from fe9f419 to ed412f9 Compare May 28, 2025 11:29

sharpenb reviewed Jun 2, 2025

View reviewed changes

Comment thread docs/tutorials/index.rst Outdated

sharpenb requested changes Jun 2, 2025

View reviewed changes

sharpenb mentioned this pull request Jun 2, 2025

docs: update LLM tutorial to optimize and evaluate large language models #126

Merged

10 tasks

davidberenstein1957 force-pushed the docs/120-doc-update-end-to-end-tutorial-of-using-flux branch from 7137812 to c6f5b01 Compare June 6, 2025 06:17

davidberenstein1957 requested review from nifleisch and sharpenb June 6, 2025 06:18

sharpenb requested changes Jun 6, 2025

View reviewed changes

davidberenstein1957 requested review from SaboniAmine, nifleisch and sharpenb and removed request for nifleisch June 6, 2025 15:57

davidberenstein1957 added 2 commits June 12, 2025 15:37

docs: enhance image generation tutorial with installation and device …

4e1a27d

…setup instructions - Added installation instructions for the `pruna` package. - Included a section on setting the device for optimization, ensuring users can utilize the best available hardware.

davidberenstein1957 and others added 12 commits June 12, 2025 15:37

docs: update section EvaluationAgent

da9da6b

docs: extend image generation tutorial

c6e8e70

fix: fix height of algorithm diagram

b79b0c2

fix: adjust tutorial to Evaluation Agent update and keep cell outputs

2a7e362

docs: update model references in image generation tutorial

0cc0e8d

- Changed model saving and loading references from "stable-diffusion-2-1-smashed" to "segmind-vega-smashed" for consistency with recent updates. - Updated corresponding comments to reflect the new model name.

docs: refine explanation in image generation tutorial

4ef3a72

- Clarified the output quality discussion by removing redundant phrasing regarding the CLIP score, enhancing the overall readability of the section.

docs: streamline evaluation metrics in image generation tutorial

7a14da6

- Removed `elapsed_time` from the evaluation metrics to focus on key performance indicators. - Cleared unnecessary output cells to enhance the tutorial's clarity and flow.

davidberenstein1957 force-pushed the docs/120-doc-update-end-to-end-tutorial-of-using-flux branch from 2fff38d to 94302a3 Compare June 12, 2025 13:38

sharpenb requested changes Jun 15, 2025

View reviewed changes

davidberenstein1957 requested a review from sharpenb June 17, 2025 10:20

sharpenb approved these changes Jun 20, 2025

View reviewed changes

nifleisch approved these changes Jun 24, 2025

View reviewed changes

fix: correct typographical error in image generation tutorial

f818096

davidberenstein1957 merged commit 0d6e97d into main Jun 24, 2025
6 checks passed

Conversation

davidberenstein1957 commented May 16, 2025

Description

Related Issue

Type of Change

How Has This Been Tested?

Checklist

Additional Notes

Uh oh!

davidberenstein1957 commented May 22, 2025

Uh oh!

nifleisch commented May 26, 2025

Uh oh!

davidberenstein1957 commented May 26, 2025

Uh oh!

nifleisch commented May 27, 2025

Uh oh!

Uh oh!

sharpenb left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sharpenb left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

davidberenstein1957 commented Jun 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

davidberenstein1957 commented Jun 10, 2025

Uh oh!

sharpenb left a comment

Choose a reason for hiding this comment

Uh oh!

davidberenstein1957 commented Jun 17, 2025

Uh oh!

sharpenb left a comment

Choose a reason for hiding this comment

Uh oh!

nifleisch left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

sharpenb left a comment •

edited

Loading

sharpenb left a comment •

edited

Loading

davidberenstein1957 commented Jun 6, 2025 •

edited

Loading