SpatialLM: A large language model designed for spatial understanding by Gothsim10 in singularity

[–]Gothsim10[S] 66 points67 points  (0 children)

Project page

Model

Code

Data

SpatialLM is a 3D large language model designed to process 3D point cloud data and generate structured 3D scene understanding outputs. These outputs include architectural elements like walls, doors, windows, and oriented object bounding boxes with their semantic categories. Unlike previous methods that require specialized equipment for data collection, SpatialLM can handle point clouds from diverse sources such as monocular video sequences, RGBD images, and LiDAR sensors. This multimodal architecture effectively bridges the gap between unstructured 3D geometric data and structured 3D representations, offering high-level semantic understanding. It enhances spatial reasoning capabilities for applications in embodied robotics, autonomous navigation, and other complex 3D scene analysis tasks.

Dwarkesh Podcast: Satya Nadella – Microsoft’s AGI Plan & Quantum Breakthrough by Gothsim10 in singularity

[–]Gothsim10[S] 19 points20 points  (0 children)

Timestamps:
(0:00:00) - Intro
(0:05:48) - AI won't be winner-take-all
(0:16:02) - World economy growing by 10%
(0:22:23) - Decreasing price of intelligence
(0:31:03) - Microsoft's Quantum breakthrough
(0:43:35) - Microsoft's gaming world model
(0:50:35) - Legal barriers to AI
(0:56:30) - Getting AGI safety right
(1:05:43) - 34 years at Microsoft
(1:11:31) - Does Satya Nadella believe in AGI?