Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL 发表评论 / Research / 作者: Fang Peng
SlotLifter: Slot-guided Feature Lifting for Learning Object-centric Radiance Fields 发表评论 / Research / 作者: Fang Peng
F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions 发表评论 / Research / 作者: Fang Peng
VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding 发表评论 / Research / 作者: Fang Peng
SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding 发表评论 / Research / 作者: Fang Peng