Abstract: To empower mobile robots with usable maps as well as highest state estimation accuracy and robustness, we present OKVIS2-X: a state-of-the-art multisensor simultaneous localization and ...
Abstract: Sign language recognition (SLR) is a multidisciplinary research topic in pattern recognition and computer vision. Due to the large amount of data from the continuous frames of sign language ...
This is the code for the paper Keyframe-Focused Visual Imitation Learning. You can use this repo to reproduce the results of BC-SO (behavioral cloning with single observation), BC-OH (behavioral ...
This is the official implementation of the paper 'Adaptive Keyframe Sampling for Long Video Understanding', which was accepted at CVPR 2025. Multimodal large language models (MLLMs) have enabled open-world ...