model: add glm-asr support #17901
base: master
Conversation
convert_hf_to_gguf.py (Outdated)
```python
if isinstance(self.hparams.get("eos_token_id"), list):
    from transformers import AutoTokenizer
    tokenizer = AutoTokenizer.from_pretrained(self.dir_model, trust_remote_code=True)
    special_vocab = gguf.SpecialVocab(self.dir_model, load_merges=True)
    special_vocab._set_special_token("eos", tokenizer.get_added_vocab()["<|endoftext|>"])
    special_vocab._set_special_token("eot", tokenizer.get_added_vocab()["<|user|>"])
    special_vocab._set_special_token("unk", tokenizer.get_added_vocab()["<|endoftext|>"])
    special_vocab._set_special_token("bos", tokenizer.get_added_vocab()["<|endoftext|>"])
    special_vocab.add_to_gguf(self.gguf_writer)
    special_vocab.chat_template = "glmedge"
```
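For context on the lookups in the diff above: `tokenizer.get_added_vocab()` is a standard `transformers` method that returns a dict mapping added token strings to their integer ids, which is why indexing it with a token string yields the id passed to `_set_special_token`. A minimal illustration (the token ids shown in the comment are made up):

```python
from transformers import AutoTokenizer

# Load the tokenizer for the model this PR validates against.
tokenizer = AutoTokenizer.from_pretrained("zai-org/GLM-ASR-Nano-2512", trust_remote_code=True)

# Maps added special-token strings to integer ids,
# e.g. {"<|endoftext|>": 59246, "<|user|>": 59247, ...} (ids illustrative).
added = tokenizer.get_added_vocab()
eos_id = added["<|endoftext|>"]  # integer id, as used by _set_special_token above
```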
This is not OK; check for the root architecture instead (see Qwen3MoeModel). You also should not need to set any of those special tokens.
Also, setting the template name like that doesn't work anymore, I think, and it's a dirty hack to begin with; if the model creators can't be bothered, neither should we.
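For readers unfamiliar with the pattern the reviewer is pointing at, here is a minimal sketch of branching on the root HF architecture rather than on whether `eos_token_id` happens to be a list. It assumes the converter exposes the top-level `architectures` list from config.json via `self.hparams` (as convert_hf_to_gguf.py does); the class name `GlmAsrForConditionalGeneration` is an assumption, and the actual check in Qwen3MoeModel may differ.

```python
def set_vocab(self):
    # Sketch: dispatch on the root HF architecture instead of on the
    # shape of eos_token_id.
    arch = self.hparams.get("architectures", [""])[0]
    if arch == "GlmAsrForConditionalGeneration":  # assumed name; check the model's config.json
        # GLM-ASR-specific vocab handling would go here.
        ...
    else:
        super().set_vocab()
```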
…build_stack for padding and review
```python
def set_vocab(self):
    super().set_vocab()
    from transformers import AutoTokenizer
    tokenizer = AutoTokenizer.from_pretrained(self.dir_model, trust_remote_code=True)
    special_vocab = gguf.SpecialVocab(self.dir_model, load_merges=True)
    special_vocab._set_special_token("eos", tokenizer.get_added_vocab()["<|endoftext|>"])
    special_vocab._set_special_token("eot", tokenizer.get_added_vocab()["<|user|>"])
    special_vocab._set_special_token("unk", tokenizer.get_added_vocab()["<|endoftext|>"])
    special_vocab._set_special_token("bos", tokenizer.get_added_vocab()["<|endoftext|>"])
    special_vocab.add_to_gguf(self.gguf_writer)
```
It seems like the original comment was not addressed: #17901 (comment)
Make sure to read the contributing guidelines before submitting a PR
This PR adds support for the GLM-ASR architecture, validated with the zai-org/GLM-ASR-Nano-2512 model.
Key Changes:
Updated convert_hf_to_gguf.py to handle dynamic configuration keys (GLM-ASR uses "lm_config" instead of "text_config"). The converter now identifies the correct config section by checking:

```python
llm_config_key = "lm_config" if "lm_config" in self.hparams else "text_config"
```

Result