LHRS-Bot: Empowering Remote Sensing with VGI-Enhanced Large Multimodal Language Model
Published in ECCV, 2024
Recommended citation: Muhtar, D., Li, Z., Gu, F., Zhang, X., & Xiao, P. (2024). LHRS-Bot: Empowering Remote Sensing with VGI-Enhanced Large Multimodal Language Model. arXiv preprint arXiv:2402.02544. https://arxiv.org/abs/2402.02544
Develop a large-scale pre-training and instruction dataset, and construct a multimodal large language model, LHRS-Bot, with an innovative alignment strategy. LHRS-Bot showcases superior performance on holistic understanding of remote sensing images and complex visual reasoning.