Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models Paper • 2603.18002 • Published about 1 month ago • 13
nvidia/segformer-b5-finetuned-cityscapes-1024-1024 Image Segmentation • Updated Aug 9, 2022 • 37.5k • 43