视听大语言模型:一阶段问题引导对齐与动态专家融合方法
刘晨熙, 苗晴, 任张毓, 丁一非
Visual-Audio Large Language Model: Single-Stage Question-Guided Alignment and Dynamic Experts Fusion Method
LIUChen-xi, MIAOQing, RENZhang-yu, DINGYi-fei
制造业自动化 . 2026, (4): 28 -40 .  DOI: 10.3969/j.issn.1009-0134.2026.04.004