Abstract: With the rapid development of remote sensing technology, effectively managing and retrieving vast quantities of remote sensing images has become an urgent challenge. For complex remote ...
Abstract: Leveraging powerful semantic understanding and generation capabilities, Vision-Language Pre-trained (VLP) large models have demonstrated remarkable potential in cross-modal retrieval.
一些您可能无法访问的结果已被隐去。
显示无法访问的结果