ztz
aabbccddwasd
AI & ML interests
LLM
Recent Activity
new activity about 6 hours ago: nvidia/Qwen3.5-397B-A17B-NVFP4: Support SM120
new activity 3 days ago: Sehyo/Qwen3.5-397B-A17B-NVFP4: not working on vllm
new activity 3 days ago: Sehyo/Qwen3.5-397B-A17B-NVFP4: missing think tag
Organizations
None yet
Support SM120
❤️ 👍 10
2
#2 opened about 20 hours ago by darkstar3537
not working on vllm
11
#1 opened 4 days ago by aabbccddwasd
missing think tag
8
#2 opened 3 days ago by fouvy
Anyone try this on 4x RTX 6000 Pro yet?
39
#1 opened 6 days ago by zenmagnets
vllm/sglang issue?
1
#1 opened 4 days ago by Halbin
vllm deployment OOM
5
#22 opened 6 days ago by Chris2me
W4A16 quant
👍 2
3
#1 opened 5 days ago by timroethig
fake knowledge
6
#21 opened 6 days ago by aabbccddwasd
we still need some Air
🚀 👍 65
15
#1 opened 2 months ago by jacek2024
Is it possible to make smaller NVFP4 quant at 340-360GB to fit in 4x96gb?
👍 1
68
#1 opened 3 months ago by Fernanda24
Aww Man!
20
#1 opened 3 months ago by mtcl
FP8 and FP4 please?
➕ 1
#4 opened 6 months ago by aabbccddwasd
An Improvement, But Q3 30b Still Has Very Little General Knowledge
❤️ 👍 3
11
#2 opened 7 months ago by phil111
🚀[Fine-tuning] 8x80GiB GPUs LoRA finetuning Qwen3-235B-A22B-Instruct-2507
🤗 4
1
#25 opened 7 months ago by study-hjt
Just deployed the full deepseek r1 0528 version. Is inference performance really improved this much? Isn't the architecture unchanged?
12
#75 opened 9 months ago by jakyer
quantize deepseek-r1-0528 please
👍 2
3
#14 opened 9 months ago by aabbccddwasd
benchmark please?
👍 1
#1 opened 9 months ago by aabbccddwasd
In complex reasoning tasks Qwen3 is far behind QwQ
12
#32 opened 10 months ago by AdamF92
Qwen/Qwen2.5-VL-72B-Instruct-AWQ and Qwen/Qwen2.5-VL-40B-Instruct-AWQ please
➕ ❤️ 18
6
#1 opened about 1 year ago by devops724