Conversation

@mMrBun (Contributor) commented on Jun 9, 2024

What does this PR do?

This PR implements function calling for GLM4. Following the previous implementation method, the function-call prompt is added through the TOOL_FORMAT processing function, applied with the same PROMPT_FORMAT approach.

  • tool_input
tools = [
        {
            "type": "function",
            "function": {
                "name": "get_current_weather",
                "description": "Get the current weather",
                "parameters": {
                    "type": "object",
                    "properties": {
                        "location": {
                            "type": "string",
                            "description": "The city and state, e.g. San Francisco, CA",
                        },
                        "format": {
                            "type": "string",
                            "enum": ["celsius", "fahrenheit"],
                            "description": "The temperature unit to use. Infer this from the users location.",
                        },
                    },
                    "required": ["location", "format"],
                },
            }
        },
        {
            "type": "function",
            "function": {
                "name": "calculate_gpa",
                "description": "Calculate the Grade Point Average (GPA) based on grades and credit hours",
                "parameters": {
                    "type": "object",
                    "properties": {
                        "grades": {"type": "array", "items": {"type": "string"}, "description": "The grades"},
                        "hours": {"type": "array", "items": {"type": "integer"}, "description": "The credit hours"},
                    },
                    "required": ["grades", "hours"],
                },
            },
        }
    ]
  • glm4 tokenizer.apply_chat_template
'[gMASK]<sop><|system|>\n你是一个名为 GLM-4 的人工智能助手。你是基于智谱AI训练的语言模型 GLM-4 模型开发的,你的任务是针对用户的问题和要求提供适当的答复和支持。\n\n## get_current_weather\n\n{\n    "name": "get_current_weather",\n    "description": "Get the current weather",\n    "parameters": {\n        "type": "object",\n        "properties": {\n            "location": {\n                "type": "string",\n                "description": "The city and state, e.g. San Francisco, CA"\n            },\n            "format": {\n                "type": "string",\n                "enum": [\n                    "celsius",\n                    "fahrenheit"\n                ],\n                "description": "The temperature unit to use. Infer this from the users location."\n            }\n        },\n        "required": [\n            "location",\n            "format"\n        ]\n    }\n}\n在调用上述函数时,请使用 Json 格式表示调用的参数。\n\n## calculate_gpa\n\n{\n    "name": "calculate_gpa",\n    "description": "Calculate the Grade Point Average (GPA) based on grades and credit hours",\n    "parameters": {\n        "type": "object",\n        "properties": {\n            "grades": {\n                "type": "array",\n                "items": {\n                    "type": "string"\n                },\n                "description": "The grades"\n            },\n            "hours": {\n                "type": "array",\n                "items": {\n                    "type": "integer"\n                },\n                "description": "The credit hours"\n            }\n        },\n        "required": [\n            "grades",\n            "hours"\n        ]\n    }\n}\n在调用上述函数时,请使用 Json 格式表示调用的参数。'
  • TOOL_FORMAT
'[gMASK]<sop><|system|>\n你是一个名为 GLM-4 的人工智能助手。你是基于智谱AI训练的语言模型 GLM-4 模型开发的,你的任务是针对用户的问题和要求提供适当的答复和支持,\n\n## get_current_weather\n\n{\n    "name": "get_current_weather",\n    "description": "Get the current weather",\n    "parameters": {\n        "type": "object",\n        "properties": {\n            "location": {\n                "type": "string",\n                "description": "The city and state, e.g. San Francisco, CA"\n            },\n            "format": {\n                "type": "string",\n                "enum": [\n                    "celsius",\n                    "fahrenheit"\n                ],\n                "description": "The temperature unit to use. Infer this from the users location."\n            }\n        },\n        "required": [\n            "location",\n            "format"\n        ]\n    }\n}\n在调用上述函数时,请使用 Json 格式表示调用的参数。\n\n## calculate_gpa\n\n{\n    "name": "calculate_gpa",\n    "description": "Calculate the Grade Point Average (GPA) based on grades and credit hours",\n    "parameters": {\n        "type": "object",\n        "properties": {\n            "grades": {\n                "type": "array",\n                "items": {\n                    "type": "string"\n                },\n                "description": "The grades"\n            },\n            "hours": {\n                "type": "array",\n                "items": {\n                    "type": "integer"\n                },\n                "description": "The credit hours"\n            }\n        },\n        "required": [\n            "grades",\n            "hours"\n        ]\n    }\n}\n在调用上述函数时,请使用 Json 格式表示调用的参数。'
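For reference, below is a minimal standalone sketch of how the per-tool sections in the TOOL_FORMAT output above can be rendered from the OpenAI-style tool list: a "## <name>" heading, the pretty-printed JSON schema, and the Chinese instruction line (which translates to "When calling the function above, please express its arguments in JSON format"). The function name glm4_tool_formatter and the GLM4_TOOL_SUFFIX constant are illustrative assumptions inferred from the output shown above, not the exact code merged in this PR; the [gMASK]<sop><|system|> tokens and the system preamble are still added by the chat template itself.

import json

# Illustrative sketch inferred from the TOOL_FORMAT output above, not the merged implementation.
GLM4_TOOL_SUFFIX = "在调用上述函数时,请使用 Json 格式表示调用的参数。"  # "Please express the arguments in JSON format when calling the function above."

def glm4_tool_formatter(tools: list) -> str:
    """Render OpenAI-style tool definitions into GLM4 tool sections."""
    sections = []
    for tool in tools:
        function = tool.get("function", tool)  # accept both wrapped and bare definitions
        schema = json.dumps(function, indent=4, ensure_ascii=False)
        sections.append(f"## {function['name']}\n\n{schema}\n{GLM4_TOOL_SUFFIX}")
    return "\n\n".join(sections)

# Example: print(glm4_tool_formatter(tools)) with the tools list above should match
# the per-tool portion of both outputs.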

For Qwen2, given the same question
"What's the weather like in San Francisco, Tokyo, and Paris? use Celsius"
the response is
'Action: get_current_weather\nAction Input: {"location": "San Francisco, CA", "format": "celsius"}\nAction: get_current_weather\nAction Input: {"location": "Tokyo, JP", "format": "celsius"}\nAction: get_current_weather\nAction Input: {"location": "Paris, FR", "format": "celsius"}'
Here, the return type used to identify tool calls has been changed from a single tuple to a list of tuples. For GLM4, I have run fewer tests and have not yet encountered a case where multiple tool calls are returned, so for now the glm4_tool_extractor simply wraps its tuple in a list. If I make any new discoveries, I will update glm4_tool_extractor accordingly.
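As a rough sketch of what parsing such a response involves, a regex-based extractor over the "Action / Action Input" format might look like the following. It returns a list of (name, arguments) tuples so that multiple tool calls in one response are preserved, which is the behaviour change described above; the names ACTION_PATTERN and tool_extractor are illustrative, and this is not the exact Qwen2/GLM4 extractor merged in this PR.

import json
import re
from typing import List, Tuple, Union

# Matches each "Action: <name>\nAction Input: <json>" pair in the response.
ACTION_PATTERN = re.compile(
    r"Action:\s*([a-zA-Z0-9_]+)\s*Action Input:\s*(\{.*?\})(?=\s*Action:|\s*$)",
    re.DOTALL,
)

def tool_extractor(content: str) -> Union[str, List[Tuple[str, str]]]:
    """Return a list of (tool_name, json_arguments) tuples, or the raw content if no valid call is found."""
    results = []
    for name, arguments in ACTION_PATTERN.findall(content):
        try:
            json.loads(arguments)  # keep only syntactically valid JSON arguments
        except json.JSONDecodeError:
            continue
        results.append((name, arguments))
    return results if results else content

Under this shape, a GLM4 extractor that currently recovers a single (name, arguments) pair would simply return [(name, arguments)], matching the temporary list-wrapping described above.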

@mMrBun changed the title from "Implemented the tool_formatter and tool_extractor for glm4 tool_format" to "Implemented the tool_formatter and tool_extractor for glm4 and Qwen2 tool_format" on Jun 9, 2024
@hiyouga added the "pending (This problem is yet to be addressed)" label on Jun 10, 2024
@hiyouga mentioned this pull request on Jun 17, 2024
@hiyouga (Owner) left a comment:
LGTM

@hiyouga requested review from hiyouga and removed the review request for hiyouga on Jun 18, 2024, 19:17
@hiyouga merged commit c0ca425 into hiyouga:main on Jun 18, 2024
@hiyouga added the "solved (This problem has been already solved)" label and removed the "pending (This problem is yet to be addressed)" label on Jun 18, 2024
This pull request was later referenced by commits pushed to downstream forks: stephen-nju/Llmtrain (Mar 24, 2025), pensieve-ai/LLaMA-Factory-vlm (Apr 29, 2025), liu-qingyuan/LLaMA-Factory-Megafake (Jun 6, 2025), and zhongwei1968/LLaMA-Factory (Aug 1, 2025), each carrying the commit "Implemented the tool_formatter and tool_extractor for glm4 and Qwen2 tool_format" (Former-commit-id: c0ca425).
