RoboMamba: Multimodal State Space Model for Robot Reasoning
RoboMamba integrates a vision encoder with a state space model to improve robotic reasoning and action capabilities efficiently. In experiments, RoboMamba demonstrates strong results on both reasoning and action prediction.