VLA & Multimodal Learning

Embodied Intelligence

Advance the frontier of embodied intelligence by redefining how data, models, and real-world interaction co-evolve.

Harnessing the power of artificial intelligence to revolutionize industries and enhance human experiences.

Beijing / Shanghai / Shenzhen / Hong Kong

Full-time

You will work on end-to-end data–model closed loops that quantify effective data gain in embodied learning systems, and on next-generation human-centric data acquisition pipelines spanning hardware, simulation, and dexterous teleoperation.
You will also design VLA models and push them to state-of-the-art performance, evolving multimodal fusion architectures that unlock the value of force and tactile signals in policy generation.

First-principles thinker with strong experimental instincts, hands-on experience in real-world data collection, and deep familiarity with vision, force, and tactile sensing systems.
Research experience at top labs or publications at leading conferences (NeurIPS, ICLR, CVPR, CoRL, RSS) is highly valued.

Apply Now

Newsletter

Stay ahead with Originflow updates and trend briefs.
