feat: vbench datamodule by begumcig · Pull Request #397 · PrunaAI/pruna

begumcig · 2025-10-06T15:20:14Z

Description

This PR introduces the prompt suite of VBench as a pruna datamodule.

The updates in this PR:

Loading the VBench prompts from the JSON and turn them into datasets
Options to load single / multi / all dimensions of the prompt suite
Update the prompt collate to have auxiliary information that could be related to benchmarking
Small bugfix in our utilities that changed the type of the dataset

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

How Has This Been Tested?

Added a test in our datamodule tests

Checklist

My code follows the style guidelines of this project
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

Additional Notes

You can use the new datamodule as follows:

# Load specific dimension 

dm = PrunaDataModule.from_string("VBench" , category = 'background_consistency')

# Load the entire prompt suite

dm = PrunaDataModule.from_string("VBench")

Note

Adds VBench prompt suite as a dataset, introduces prompt_with_auxiliaries_collate, enables category filtering via PrunaDataModule.from_string, and integrates tests.

Datasets:
- Add text_to_video.setup_vbench_dataset to load VBench prompts from VBench_full_info.json, rename columns to category and text, optional category filtering, and return test-focused splits.
- Register "VBench" in base_datasets using prompt_with_auxiliaries_collate.
Collate:
- Introduce prompt_with_auxiliaries_collate returning List[str] prompts and auxiliary metadata dicts; expose via pruna_collate_fns.
DataModule:
- Extend from_string with category arg; auto-forward to setup functions that accept it.
Tests:
- Add VBench to test_dm_from_string matrix.

^{Written by Cursor Bugbot for commit 3d21412. This will update automatically on new commits. Configure here.}

simlang

Amazing! 🚀 next to typo and a docstring thing, just the collate_fn thing. completely up to you if you agree or not. easy approve

src/pruna/data/collate.py

simlang · 2025-10-17T15:30:16Z

src/pruna/data/pruna_datamodule.py

add to docstring

src/pruna/data/datasets/text_to_video.py

davidberenstein1957

Some small comments, feel free to merge after.

davidberenstein1957 · 2025-10-20T07:04:11Z

src/pruna/data/pruna_datamodule.py

@@ -135,6 +135,7 @@ def from_string(
        collate_fn_args: dict = dict(),
        dataloader_args: dict = dict(),
        seed: int = 42,
+        category: str | list[str] | None = None,


should we mention a list of categories also somehwere in the doctstring

I added a very simple definition of category here because there's a chance we will use this category attribute also for other datasets in the feature so we are not limited to VBench, but will be adding the categories of VBench in the documentation update how does that sound?

davidberenstein1957 · 2025-10-20T07:06:20Z

pyproject.toml

@@ -25,10 +25,16 @@ invalid-assignment = "ignore"  # mypy is more permissive with Any assignments
 call-non-callable = "ignore"   # mypy allows more dynamic method calls
 index-out-of-bounds = "ignore" # mypy is more permissive with tuple indexing
 unresolved-attribute = "ignore" # mypy is more permissive with module attributes
-possibly-unbound-attribute = "ignore" # mypy doesn't warn about this as much
+possibly-unbound-variable = "ignore" # mypy doesn't warn about this as much
 redundant-cast = "ignore"      # mypy doesn't warn about redundant casts


you had also applied these changes in another PR. should we add some comment in either PR about whhy and what happened?

Yes ofcourse! Basically, we switched to ty from mypy for static type checking and it's more restrictive than mypy. Maybe, sometime in the future we will enforce all of these rules, but for now, we are restricting them to be similar to what we had with mypy! We have this as a comment in line 22 in this file but it doesn't show since it was not added by me 🥺

cursor · 2025-10-23T08:51:10Z

Bug: Incorrect Iteration in List Comprehension

The list comprehension for prompt_list in prompt_with_auxiliaries_collate uses an incorrect iteration pattern. Instead of directly extracting the "text" field from each row, it iterates over all key-value pairs within each row, which is an overly complex and indirect way to get one text value per data item.

This comment was marked as outdated.

Sign in to view

begumcig force-pushed the feat/datamodule-for-video-benchmarking branch from 40584f7 to c3715ac Compare October 14, 2025 12:58

This comment was marked as outdated.

Sign in to view

begumcig force-pushed the feat/datamodule-for-video-benchmarking branch from c3715ac to 9e9ed1a Compare October 14, 2025 13:15

This comment was marked as outdated.

Sign in to view

begumcig force-pushed the feat/datamodule-for-video-benchmarking branch 3 times, most recently from 23a383c to a94c2ba Compare October 14, 2025 16:10

begumcig requested review from davidberenstein1957 and simlang October 15, 2025 10:00

simlang approved these changes Oct 17, 2025

View reviewed changes

davidberenstein1957 approved these changes Oct 20, 2025

View reviewed changes

begumcig added 2 commits October 22, 2025 14:58

feat: add vbench dataset

c2865d7

fix: collate columns and docstring

d8de28a

begumcig force-pushed the feat/datamodule-for-video-benchmarking branch from 00b2ea7 to 4d66c4e Compare October 23, 2025 08:49

begumcig force-pushed the feat/datamodule-for-video-benchmarking branch from 4d66c4e to ec46aa6 Compare October 23, 2025 09:04

refactor: add pass through collate function and various docstring fixes

3d21412

begumcig force-pushed the feat/datamodule-for-video-benchmarking branch from ec46aa6 to 3d21412 Compare October 23, 2025 09:15

begumcig merged commit e49b837 into main Oct 23, 2025
7 checks passed

Conversation

begumcig commented Oct 6, 2025 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of Change

How Has This Been Tested?

Checklist

Additional Notes

Uh oh!

This comment was marked as outdated.

Uh oh!

This comment was marked as outdated.

Uh oh!

This comment was marked as outdated.

Uh oh!

This comment was marked as outdated.

Uh oh!

simlang left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

simlang Oct 17, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

davidberenstein1957 left a comment

Choose a reason for hiding this comment

Uh oh!

davidberenstein1957 Oct 20, 2025

Choose a reason for hiding this comment

Uh oh!

begumcig Oct 22, 2025

Choose a reason for hiding this comment

Uh oh!

davidberenstein1957 Oct 20, 2025

Choose a reason for hiding this comment

Uh oh!

begumcig Oct 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cursor bot commented Oct 23, 2025

Bug: Incorrect Iteration in List Comprehension

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

begumcig commented Oct 6, 2025 •

edited by cursor bot

Loading

begumcig Oct 22, 2025 •

edited

Loading