affiliation not provided to SSRN
Keywords: 3D Scene Understanding, Industrial Asset Detection, Foundation Models, Point Cloud, Visual Grounding