Add support for Safetensors format and update model specifications #2

Open
ilopezluna wants to merge 1 commit into main from support-safetensors


@ilopezluna ilopezluna commented Oct 2, 2025

This pull request updates the documentation to add support for the safetensors model format alongside gguf, clarifies how sharded models are represented, and provides example manifests for both formats. The changes introduce new media types, explain sharding conventions, and show how to specify metadata for safetensors models.

Key changes:

Support for Safetensors Format:

  • The config.md documentation now lists both gguf and safetensors as supported model file formats, and describes how to include metadata specific to the safetensors format, such as architecture and parameter count.

Documentation and Examples for Sharded Models:

  • Adds a section in spec.md explaining how sharded models are represented, including conventions for multiple layers and file naming.

New Media Types:

  • Introduces new media types in spec.md for application/vnd.docker.ai.safetensors (for safetensors files) and application/vnd.docker.ai.vllm.config.tar (for config/metadata tarballs).

Expanded Example Manifests:

  • Provides new example manifests in both config.md and spec.md for sharded safetensors models, demonstrating how to specify multiple shards and associated config/license files.

General Documentation Improvements:

  • Updates headings and clarifies sections to distinguish between GGUF and Safetensors examples, improving readability and usability for users working with different model formats.
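To make the changes listed above concrete, here is a minimal sketch of how the layers of a sharded safetensors manifest might be assembled. The three media types are the ones named in this PR; everything else is assumed for illustration and is not taken from spec.md: the digests and sizes are placeholders, the `example.filepath` annotation key is hypothetical, the shard names follow the Hugging Face-style `model-00001-of-0000N.safetensors` pattern, and placing the config tarball and license after the shards is just one possible ordering.

```python
import json

# Media types named in this PR.
SAFETENSORS = "application/vnd.docker.ai.safetensors"
VLLM_CONFIG_TAR = "application/vnd.docker.ai.vllm.config.tar"
LICENSE = "application/vnd.docker.ai.license"

# Hypothetical annotation key, used only to show where a file name could live.
FILEPATH_ANNOTATION = "example.filepath"


def layer(media_type, index, name):
    """Return an OCI-style descriptor with a placeholder digest and size."""
    return {
        "mediaType": media_type,
        "digest": "sha256:" + f"{index:064x}",  # placeholder, not a real digest
        "size": 0,  # placeholder
        "annotations": {FILEPATH_ANNOTATION: name},
    }


def sharded_safetensors_layers(shard_count):
    """One layer per shard, in shard order, then the config tarball and license."""
    layers = [
        layer(SAFETENSORS, i, f"model-{i:05d}-of-{shard_count:05d}.safetensors")
        for i in range(1, shard_count + 1)
    ]
    layers.append(layer(VLLM_CONFIG_TAR, shard_count + 1, "config.tar"))
    layers.append(layer(LICENSE, shard_count + 2, "LICENSE"))
    return layers


if __name__ == "__main__":
    print(json.dumps({"layers": sharded_safetensors_layers(2)}, indent=2))
```

Each shard gets its own layer with the same media type, which is how the PR describes a sharded model being represented.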

Summary by Sourcery

Update model documentation to support the safetensors format alongside GGUF, introduce new media types, clarify sharding conventions, and provide example manifests for both formats

New Features:

  • Document safetensors as a supported model format in config with optional metadata fields for architecture and parameter count
  • Introduce new media types for safetensors files and VLLM config tarballs

Enhancements:

  • Clarify how sharded models are represented and ordered in manifests with naming conventions
  • Provide example manifests for sharded safetensors and GGUF models

@ilopezluna ilopezluna requested a review from ekcasey October 2, 2025 11:18

sourcery-ai bot commented Oct 2, 2025

Reviewer's Guide

This PR enhances documentation by adding safetensors model format support, detailing sharded model conventions, introducing related media types, and providing updated example manifests in config.md and spec.md.

Entity relationship diagram for updated model config structure (Safetensors support)

erDiagram
    MODEL_CONFIG {
        string format
        string format_version
        object gguf
        object safetensors
        string size
    }
    SAFETENSORS {
        string architecture
        string parameter_count
    }
    MODEL_CONFIG ||--|{ SAFETENSORS : includes

Class diagram for updated model config and file types

classDiagram
    class ModelConfig {
        +string format
        +string format_version
        +object gguf
        +object safetensors
        +string size
    }
    class SafetensorsMetadata {
        +string architecture
        +string parameter_count
    }
    class ModelFile {
        +string diffID
        +string type
    }
    ModelConfig o-- SafetensorsMetadata : contains
    ModelConfig o-- ModelFile : has
    ModelFile <|-- SafetensorsFile
    ModelFile <|-- ConfigTarFile
    ModelFile <|-- LicenseFile
    class SafetensorsFile {
        +type = "application/vnd.docker.ai.safetensors"
    }
    class ConfigTarFile {
        +type = "application/vnd.docker.ai.vllm.config.tar"
    }
    class LicenseFile {
        +type = "application/vnd.docker.ai.license"
    }
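The config structure in the class diagram above can be sketched as Python dataclasses. The field names come from the diagram; treating `gguf` and `safetensors` as optional objects of which only the relevant one is set is an assumption, and all field values below are placeholders rather than values from config.md.

```python
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass
class SafetensorsMetadata:
    architecture: str
    parameter_count: str


@dataclass
class ModelConfig:
    format: str
    format_version: str
    size: str
    # Assumption: only the metadata object matching `format` is populated.
    gguf: Optional[dict] = None
    safetensors: Optional[SafetensorsMetadata] = None

    def to_dict(self):
        """Serialize to a plain dict, dropping metadata objects that are unset."""
        return {k: v for k, v in asdict(self).items() if v is not None}


# Placeholder values for illustration only.
config = ModelConfig(
    format="safetensors",
    format_version="example-version",
    size="example-size",
    safetensors=SafetensorsMetadata(
        architecture="ExampleArchitecture",
        parameter_count="example-count",
    ),
)
```

Calling `config.to_dict()` yields a dictionary with a nested `safetensors` object and no `gguf` key.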

File-Level Changes

Change: Support for safetensors format in config documentation
Details:
  • Include safetensors in the supported formats list
  • Define safetensors-specific metadata fields (architecture, parameter_count)
  • Add sharded safetensors example manifest
Files: config.md

Change: Defined sharded model conventions and examples
Details:
  • Explain shard representation and sequence ordering
  • Specify shard file naming conventions
  • Provide both GGUF and safetensors sharded manifest examples
Files: spec.md

Change: Introduced new media types in spec documentation
Details:
  • Add application/vnd.docker.ai.safetensors media type
  • Add application/vnd.docker.ai.vllm.config.tar media type
Files: spec.md

Change: General documentation and readability improvements
Details:
  • Update headings to separate GGUF and safetensors sections
  • Clarify examples and section labels for better usability
Files: config.md, spec.md
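As an illustration of the shard ordering and naming conventions mentioned above, the following sketch checks that a set of shard file names is complete and returns them in sequence. It assumes the widely used `<stem>-NNNNN-of-NNNNN.<ext>` pattern (as produced by Hugging Face tooling for safetensors and by llama.cpp's split tool for GGUF); the exact convention in spec.md may differ, and `check_shards` is a hypothetical helper, not part of any tool referenced in this PR.

```python
import re

# Assumed naming pattern: <stem>-00001-of-00003.<gguf|safetensors>
SHARD_RE = re.compile(
    r"^(?P<stem>.+)-(?P<index>\d{5})-of-(?P<total>\d{5})\.(?P<ext>gguf|safetensors)$"
)


def check_shards(names):
    """Return names sorted by shard index; raise ValueError if the set is inconsistent."""
    parsed = []
    for name in names:
        m = SHARD_RE.match(name)
        if m is None:
            raise ValueError(f"not a shard name: {name!r}")
        parsed.append((int(m["index"]), int(m["total"]), m["stem"], m["ext"], name))

    if not parsed:
        raise ValueError("no shards given")

    # All shards must agree on total count, stem, and extension.
    if len({(total, stem, ext) for _, total, stem, ext, _ in parsed}) != 1:
        raise ValueError("shards disagree on total, stem, or extension")

    # Indices must be exactly 1..total with no gaps or duplicates.
    total = parsed[0][1]
    if sorted(p[0] for p in parsed) != list(range(1, total + 1)):
        raise ValueError(f"expected shards 1..{total} exactly once each")

    return [p[4] for p in sorted(parsed)]
```

A caller could run this over the file names attached to the weight layers before assembling a manifest, so that a missing or duplicated shard is caught early.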

@gemini-code-assist

Summary of Changes

Hello @ilopezluna, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly expands the model specification documentation by integrating support for the "safetensors" format alongside the existing "gguf" format. It introduces new media types, clarifies the representation of sharded models, and provides detailed examples, thereby enhancing the clarity and utility of the documentation for users working with diverse model packaging formats.

Highlights

  • Safetensors Format Support: The documentation now officially supports the "safetensors" model format, detailing how to include specific metadata like architecture and parameter count in the model configuration.
  • Sharded Model Representation: New sections explain the conventions for representing sharded models, including how multiple files of the same media type constitute a single sharded model and typical file naming patterns.
  • New Media Types: Two new media types, "application/vnd.docker.ai.safetensors" and "application/vnd.docker.ai.vllm.config.tar", have been introduced to categorize safetensors files and vLLM configuration tarballs, respectively.
  • Expanded Example Manifests: Comprehensive example manifests have been added for sharded "safetensors" models, illustrating how to correctly specify multiple shards and associated configuration/license files.
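The rule that several files of the same media type make up one sharded model can be sketched as a small helper that filters a manifest's layers by media type. This is an illustrative sketch only: `weight_shards` and `is_sharded` are hypothetical names, the layer dictionaries are assumed to follow the usual OCI descriptor shape (`mediaType`, `digest`), and the digests in any usage are placeholders.

```python
SAFETENSORS = "application/vnd.docker.ai.safetensors"


def weight_shards(layers, media_type=SAFETENSORS):
    """Return the digests of layers with the given media type, in manifest order."""
    return [l["digest"] for l in layers if l.get("mediaType") == media_type]


def is_sharded(layers, media_type=SAFETENSORS):
    """A model is treated as sharded when more than one layer has the weight media type."""
    return len(weight_shards(layers, media_type)) > 1
```

Because the helper preserves manifest order, the shard sequence is whatever order the layers were listed in.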
@sourcery-ai sourcery-ai bot left a comment

Hey there - I've reviewed your changes and they look great!


@ilopezluna ilopezluna requested a review from a team October 2, 2025 11:19
@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request does a great job of adding support for the safetensors model format. The documentation updates in config.md and spec.md are comprehensive, including new media types, explanations for sharded models, and consistent examples across both files. My feedback includes a couple of suggestions for config.md to improve the clarity and consistency of the specification for fields like parameter_count and size, which should make the specification easier for implementers to follow.
