Skip to content

保存后的Reward Model用来inference #4379

@Zhenwen-NLP

Description

@Zhenwen-NLP

Reminder

  • I have read the README and searched the existing issues.

System Info

导出之后的reward model用这个方式加载:

model = AutoModelForCausalLMWithValueHead.from_pretrained('...')

弹出一个warning: no v_head weight is found. This IS expected if you are not resuming PPO training.

请问这是正常可以忽略的吗?我想用保存后的reward model做inference输出value

Reproduction

model = AutoModelForCausalLMWithValueHead.from_pretrained('...')

Expected behavior

No response

Others

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    solvedThis problem has been already solved

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions