A series of advanced vision-language models built on our SpecVisionForCausalLM architecture, designed for seamless multimodal AI. ✨