I am a Research Scientist at Meta Superintelligence Lab (MSL), previously at FAIR.
I was a Robotics PhD student at CMU, advised by Kris Kitani. I received my bachelor's degree from Shanghai Jiao Tong University, advised by Cewu Lu. I visited UC Berkeley in 2019, advised by Trevor Darrell, and Yang Gao's group at Tsinghua University in 2020. I have interned at NVIDIA Research, Adobe Research, Meta Reality Labs, Tencent, and ByteDance AI Lab, among others. My research was supported by the Meta PhD Fellowship.
Github |
Google Scholar |
Linkedin |
DBLP |
Resume (usually outdated)
Email: jinkuncao [AT] gmail.com / jinkunc [AT] andrew.cmu.edu
Research Interest
I focus on studying the real2sim and sim2real problems to model and imitate human body and hand motions, especially in interaction scenarios.
- Estimation and generation of human hand pose and motion from noisy visual input.
- Combining physics-based simulations and vision for more plausible human hand interaction imitation.
- Applications in AR/VR, animation and robotic tasks to replicate 3D and physical interactions.
News
- [2025/10]: GENMO was accepted to ICCV as a Highlight, and I will be at ICCV in Honolulu to present it.
- [2025/7]: I passed my PhD thesis defense on "Estimating and Generating Human Motions from Interactions". The thesis is here.
Selected Publications
* indicates equal contribution | I am a main contributor of the highlighted projects.
GENMO: A GENeralist Model for Human MOtion
ICCV 2025 [Highlight][project]
Grasping Diverse Objects with Simulated Humanoids
NeurIPS 2024 [project]
Mixed Gaussian Flow for Diverse Trajectory Prediction
NeurIPS 2024 [arxiv]
Harmony4D: A Video Dataset for In-The-Wild Close Human Interactions
NeurIPS 2024 (Dataset and Benchmark Track) [project]
Multi-Object Tracking by Hierarchical Visual Representations
ICRA 2024 [arxiv]
Perpetual Humanoid Control for Real-time Simulated Avatars
ICCV 2023 [project page] [arxiv] [code]
Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking
CVPR 2023 [arxiv] [code] [mmtracking]
Track Targets by Dense Spatio-Temporal Position Encoding
BMVC 2022 [Oral] [arxiv]
DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion
CVPR 2022 [arxiv] [code] [project page] [codalab]
Manuscripts
* indicates equal contribution | I am a main contributor of the highlighted projects.
Humanoidlympics: Sports Environments for Physically Simulated Humanoids
Technical Report, 2024 [project]
Multi-Modal Hand-Object Interaction Generation
Technical Report, 2024 [arxiv]
Services
- Conference Reviewer:
  - Computer Vision: ICCV (21, 23), ECCV (20, 22, 24), CVPR (22, 23, 24), ISMAR (23, 24)
  - Robotics: ICRA (22, 23, 24), IROS (22)
  - Machine Learning: NeurIPS (22, 23), AAAI (22, 23, 24), ICML (23, 24), ICLR (24)
- Journal Reviewer: RA-L, IEEE Trans. Multimedia, TNNLS, TCSVT, Pattern Recognition, TMLR, IJCV
- Workshop Organizer: