Leaked details regarding xAI's Grok 5 suggest a massive platform shift, boasting six trillion parameters and native video ...
Abstract: Visual perception, as a core component of Intelligent Transportation Systems (ITS), plays a key role in enhancing safety and efficiency in urban mobility. While single-task visual perception ...
Abstract: Vision-and-Language Navigation in Continuous Environments (VLN-CE) requires agents to navigate 3D environments based on visual observations and natural language instructions. Existing ...
MSVMamba is a visual state space model that introduces a hierarchy in hierarchy design to the VMamba model. This repository contains the code for training and evaluating MSVMamba models on the ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results