XBridge 能够将 LLM 低资源语言甚至未见语言的理解和生成能力提升到接近组合的 NMT 模型的水平,在下游任务上显著缩小高资源、低资源语言间性能差距,同时保持或提升高资源语言能力,全程无需训练 LLM。
打破多模态视觉+语言拼接套路! 腾讯开源Penguin-VL,直接用纯文本LLM训视觉编码器。 这项研究跳出了先有传统视觉 backbone,再接语言模型的常规路径,直接从text-only LLM初始化vision encoder。 并在2B/8B紧凑参数规模下的文档理解、长视频时序定位等复杂任务中表现出 ...
研究团队张犬俊,房春荣,谢杨,张雅欣,虞圣呈,陈振宇:南京大学孙伟松:南洋理工大学杨耘:斯威本科技大学#大语言模型,#软件工程,#人工智能,#智能软件工程引用本文: Zhang Q J, Fang C R, Xie Y, et al. A ...
SK Telecom has unveiled a universal document interpretation technology for vision-language model (VLM) and large language model (LLM) training, based on its proprietary large language model, A.Dot X ...
Moonshot AI today released Kimi-K2.6, the latest addition to its popular Kimi series of open-source large language models.
Transformer-based models have rapidly spread from text to speech, vision, and other modalities. This has created challenges for the development of Neural Processing Units (NPUs). NPUs must now ...
Artificial intelligence is becoming an increasingly significant asset for companies worldwide, especially as they integrate generative AI features like chatbots into their services. However, deploying ...
一些您可能无法访问的结果已被隐去。
显示无法访问的结果