Abstract: 3D Visual Question Answering (3D-VQA), which focuses on answering user questions based on a given 3D scene, has attracted increasing attention from researchers. As far as we know, most ...