Skip to content

Conversation

@aliencaocao
Copy link
Contributor

@aliencaocao aliencaocao commented Sep 16, 2024

What does this PR do?

Fixes #5444

The current impl disregards whatever user might have set in the yaml config during evaluations that happen during training (as specified by user e.g. every 100 step eval once)
Instead, it is only applied correctly at the final eval after training is done.
This should not be the case as that would make any intermediate eval results inaccurate.

Before submitting

@aliencaocao
Copy link
Contributor Author

Hi @hiyouga mind reviewing this PR?

@hiyouga hiyouga added the pending This problem is yet to be addressed label Oct 7, 2024
@hiyouga hiyouga self-requested a review February 12, 2025 18:04
Copy link
Owner

@hiyouga hiyouga left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@hiyouga hiyouga merged commit 11eac71 into hiyouga:main Feb 12, 2025
12 checks passed
@hiyouga hiyouga added solved This problem has been already solved and removed pending This problem is yet to be addressed labels Feb 12, 2025
@aliencaocao aliencaocao deleted the fix-gen-kwarg branch February 12, 2025 18:40
1587causalai pushed a commit to 1587causalai/llama_factory that referenced this pull request Feb 18, 2025
* Correctly pass gen_kwarg to eval during model runs

* fix

* fix

---------

Co-authored-by: hiyouga <[email protected]>
stephen-nju pushed a commit to stephen-nju/Llmtrain that referenced this pull request Mar 24, 2025
* Correctly pass gen_kwarg to eval during model runs

* fix

* fix

---------

Co-authored-by: hiyouga <[email protected]>
Former-commit-id: 11eac71
yoonseok312 pushed a commit to pensieve-ai/LLaMA-Factory-vlm that referenced this pull request Apr 29, 2025
* Correctly pass gen_kwarg to eval during model runs

* fix

* fix

---------

Co-authored-by: hiyouga <[email protected]>
Former-commit-id: 11eac71
liu-qingyuan pushed a commit to liu-qingyuan/LLaMA-Factory-Megafake that referenced this pull request Jun 6, 2025
* Correctly pass gen_kwarg to eval during model runs

* fix

* fix

---------

Co-authored-by: hiyouga <[email protected]>
Former-commit-id: 11eac71
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

solved This problem has been already solved

Projects

None yet

2 participants