I am a Research Fellow at Nanyang Technological University. I completed my PhD at the National Engineering Research Center of Robot Visual Perception and Control Technology of Hunan University, supervised by Prof. Wei Sun. From Aug. 2023 to Sep. 2024, I was a Visiting PhD Student at the Machine Intelligence Group of the University of Western Australia, supervised by Prof. Ajmal Mian. I also work closely with Prof. Nicu Sebe and Prof. Hossein Rahmani.

My research focuses on 3D machine vision and its applications for robotic manipulation. Specifically, I have worked on object pose estimation and tracking. Subsequent research focuses include label-efficient learning for generalized robotic multimodal perception, manipulation, and embodied AI. I was motivated to conduct this research due to my passion for realizing generalizable perception and manipulation of robots in 3D physical space.

🔥 News

2025.05: 🎉🎉 We proposed a computation framework called Neural Brain for embodied agents through the lens of neuroscience, providing an innovative research perspective for embodied AI. Feel free to get in touch if you have any ideas.
2025.05: 🎉🎉 One paper get accepted by IEEE RAL.
2025.04: 🎉🎉 One paper get accepted by IEEE RAL.
2025.03: 🎉🎉 One paper get accepted by IEEE TPAMI!
2025.03: 🎉🎉 One paper get accepted by IEEE TCyber.
2025.02: 🎉🎉 One paper get accepted by IEEE ICRA’25.
2024.12: 🎉🎉 PhD defended.
2024.09: 🎉🎉 One paper get accepted by IEEE IoT-J.
2024.05: 🎉🎉 A comprehensive survey of deep learning-based object pose estimation was posted on arXiv. Feel free to contact us if you have any suggestions.
2024.02: 🎉🎉 One paper get accepted by IEEE TII.
2024.01: 🎉🎉 One paper get accepted by IEEE TNNLS.
2024.01: 🎉🎉 One paper get accepted by IEEE TMC.
2023.11: 🎉🎉 One paper get accepted by IEEE TIM.
2023.02: 🎉🎉 One paper get accepted by IEEE TII.
2022.06: 🎉🎉 One paper get accepted by IEEE TCSVT.

📝 Selected Publications

IEEE TPAMI 2025

Diff9D: Diffusion-Based Domain-Generalized Category-Level 9DoF Object Pose Estimation (Code)
Jian Liu, Wei Sun, Hui Yang, Pengchao Deng, Chongpei Liu, Nicu Sebe, Hossein Rahmani, Ajmal Mian

We propose an effective diffusion model to redefine 9DoF object pose estimation from a generative perspective. Diff9D is a simple yet effective prior-free domain-generalized (sim2real) category-level 9DoF object pose generator. By employing the denoising diffusion implicit model, we demonstrate that the reverse diffusion process can be executed in as few as 3 steps, achieving near real-time performance.

arXiv 2025

Novel Object 6D Pose Estimation with a Single Reference View (Code)
Jian Liu, Wei Sun, Kai Zeng, Jin Zheng, Hui Yang, Lin Wang, Hossein Rahmani, Ajmal Mian

We propose a single reference view-based CAD model-free novel object 6D pose estimation method. SinRef-6D is simple yet effective and can simultaneously eliminate the need for object CAD models, dense reference views, and model retraining, offering enhanced efficiency and scalability while demonstrating strong generalization to potential real-world robotic applications.

IEEE ICRA'25

MonoDiff9D: Monocular Category-Level 9D Object Pose Estimation via Diffusion Model (Code)
Jian Liu, Wei Sun, Hui Yang, Jin Zheng, Zichen Geng, Hossein Rahmani, Ajmal Mian

MonoDiff9D is an extension of Diff9D, aiming to achieve monocular category-level 9D object pose estimation via diffusion model conditioning on large vision model-based zero-shot depth recovery, without the need for shape priors or CAD models at any stage.

arXiv 2024

Deep Learning-Based Object Pose Estimation: A Comprehensive Survey
Project Page
Jian Liu, Wei Sun, Hui Yang, Zhiwen Zeng, Chongpei Liu, Jin Zheng, Xingyu Liu, Hossein Rahmani, Nicu Sebe, Ajmal Mian

We present a comprehensive survey of deep learning-based object pose estimation methods. This survey covers all three problem formulations in the domain, including instance-level, category-level, and unseen object pose estimation. We hope to provide readers with a complete picture of the research progress of deep learning-based object pose estimation.

IEEE TNNLS 2024

MH6D: Multi-Hypothesis Consistency Learning for Category-Level 6D Object Pose Estimation (Code)
Jian Liu, Wei Sun, Chongpei Liu, Hui Yang, Xing Zhang, Ajmal Mian

We propose a multi-hypothesis consistency learning framework for category-level 6D object pose estimation, which utilizes a parallel consistency learning structure, alleviating the uncertainty problem of single-shot feature extraction and promoting self-adaptation of domain to reduce the synthetic-to-real domain gap.

IEEE TII 2023

Robotic Continuous Grasping System by Shape Transformer-Guided Multi-Object Category-Level 6D Pose Estimation (Code)
Jian Liu, Wei Sun, Chongpei Liu, Xing Zhang, Qiang Fu

A transformer-guided shape reconstruction network is proposed to reconstruct the NOCS shape of intra-class known objects, which can fully use the prior feature, current observation feature, and their feature difference by internal self-attention, as well as strengthen their correlation by mutual cross-attention. By doing so, the shape variation can be explicitly highlighted.

IEEE TCSVT 2022

HFF6D: Hierarchical Feature Fusion Network for Robust 6D Object Pose Tracking
Jian Liu, Wei Sun, Chongpei Liu, Xing Zhang, Shimeng Fan, Wei Wu

We propose a lightweight and robust hierarchical feature fusion network for 6D object pose tracking. It establishes sufficient spatial-temporal information interaction between adjacent frames and explicitly highlights the feature differences between adjacent frames, thus improving the robustness of relative pose estimation in challenging scenes.

💻 Real-World Robotic Projects

Robotic Continuous Grasping System (Demo can be seen through link1 or link2)

We build an end-to-end robotic continuous grasping system, which achieves high-accuracy 6D pose estimation for multiple intra-class unknown objects and high-efficiency robotic grasping in 3D space. For continuous grasping, we propose a low-computation and effective grasping strategy based on the pre-defined vector orientation, and develop a GUI for monitoring and control.

Robotic Continuous Picking System (Demo can be seen through link)

We develop an object pose-guided robotic picking system comprising both hardware and software components. The hardware for the robotic picking system is composed of an Intel RealSense L515 RGB-D camera, a Yaskawa robot MOTOMAN-MH12, an electric parallel gripper DH-PGI140-80, and a host computer. For software, we develop a GUI comprising three parts: calibration module, server-client TCP communication module, and robotic picking module.

🎖 Honors and Awards

2024.11 Ph.D. National Scholarship.
2018.11 The National First Prize in “Higher Education Society Cup” National Undergraduate Mathematical Contest in Modeling (Top 1.5 %).
2019.08 The National Second Prize in “RoboMaster2019” National Undergraduate Robotics Competition (Hosted by DJI-Innovations).
2018.06 Hong Kong “Zhong Huiming” Social Scholarship.

📖 Review Services

I serve as a reviewer for more than 20 journals/conferences, mainly including:

IEEE Transactions on Pattern Analysis and Machine Intelligence
IEEE Transactions on Image Processing
IEEE Transactions on Neural Networks and Learning Systems
IEEE Transactions on Industrial Informatics
IEEE Transactions on Circuits and Systems for Video Technology
IEEE/ASME Transactions on Mechatronics
IEEE Transactions on Automation Science and Engineering
IEEE Transactions on Systems, Man, and Cybernetics: Systems
IEEE Transactions on Circuits and Systems I: Regular Papers
IEEE Transactions on Instrumentation and Measurement
IEEE Robotics and Automation Letters
Pattern Recognition
Neural Networks
IEEE/CVF International Conference on Computer Vision (ICCV)
IEEE International Conference on Robotics and Automation (ICRA)
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Jian Liu (刘剑)

🔥 News

📝 Selected Publications

💻 Real-World Robotic Projects

🎖 Honors and Awards

📖 Review Services