Editor: Zhang Qian
Long-context large models help robots understand the world.
You are a robot operating in a building and your task is to respond to the user command about going to a specific location by finding the closest frame in the tour video to navigate to. These frames are from the tour of the building last year. [Frame 1 Image f1] Frame 1. [Frame narrative n1] ... [Frame k Image fk] Frame k. [Frame narrative nk] This image is what you see now. You may or may not see the user in this image. [Image Instruction I]
The user says: Where should I return this? How would you respond? Can you find the closest frame?
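The prompt template above interleaves tour-frame images, their narratives, the robot's current view, and the user's spoken request. The following is a minimal sketch of how such a multimodal prompt might be assembled as an ordered list of text and image parts. The names `ImageRef`, `TourFrame`, and `build_prompt` are hypothetical and introduced only for illustration; the source does not say how the prompt is built in code or which model API receives it.

```python
from dataclasses import dataclass
from typing import List, Union


@dataclass
class ImageRef:
    """Hypothetical stand-in for an image; a real system would carry pixels or a path."""
    name: str


@dataclass
class TourFrame:
    """One frame of the tour video plus its narrative text (hypothetical type)."""
    image: ImageRef
    narrative: str


# A prompt part is either plain text or an image placeholder.
Part = Union[str, ImageRef]

# Opening instruction, copied from the template quoted in the article.
PREAMBLE = (
    "You are a robot operating in a building and your task is to respond to "
    "the user command about going to a specific location by finding the "
    "closest frame in the tour video to navigate to. These frames are from "
    "the tour of the building last year."
)


def build_prompt(frames: List[TourFrame], current_view: ImageRef, user_says: str) -> List[Part]:
    """Assemble the interleaved prompt: preamble, frames, current view, user request."""
    parts: List[Part] = [PREAMBLE]
    for i, frame in enumerate(frames, start=1):
        parts.append(frame.image)                        # [Frame i Image fi]
        parts.append(f"Frame {i}. {frame.narrative}")    # Frame i. [Frame narrative ni]
    parts.append(
        "This image is what you see now. "
        "You may or may not see the user in this image."
    )
    parts.append(current_view)                           # [Image Instruction I]
    parts.append(
        f"The user says: {user_says} "
        "How would you respond? Can you find the closest frame?"
    )
    return parts
```

Under these assumptions, a call such as `build_prompt(frames, ImageRef("current.jpg"), "Where should I return this?")` returns a list that a multimodal model client could consume in order. Keeping images and text as separate parts preserves the frame-to-narrative pairing that the template relies on.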