Linqing Zhao 赵林清

Linqing Zhao（赵林清）

I'm currently a Tenure-track Assistant Professor at the School of Intelligence Engineering and Automation, Beijing University of Posts and Telecommunications (BUPT). Previously, I was a Postdoctoral fellow of Department of Automation, Tsinghua University, affiliated with Intelligent Vision Group (IVG), supervised by Prof. Jiwen Lu.

I received the B.Eng. and Ph.D. degrees in information and communication engineering from the School of Electrical and Information Engineering, Tianjin University, China, in 2017 and 2024, respectively, supervised by Prof. Zhanjie Song. During my PhD studies, I was honored to be a visiting student at Intelligent Vision Group (IVG), supervised by Prof. Jie Zhou and Prof. Jiwen Lu.

My research interests lie in computer vision, especially robot vision, autonomous driving perception, and deep learning.

Email / Google Scholar / Github / 中文主页

News

2025-08: Supported by the National Natural Science Foundation of China (Youth Program).

2025-06: Supported by the China Postdoctoral Science Foundation (General Program).

2025-06: Two papers on RGB SLAM and Camera Pose Estimation, and one co-author paper on Indoor 3D Object Detection are accepted to IROS 2025.

2025-05: Achieved an 'Excellent' rating in the postdoctoral midterm assessment, ranking top 3 of 24.

2025-04: One co-author paper on 3D Object Detection is accepted to TCSVT.

2025-04: One paper on Multi-modal Semantic Segmentation is accepted to TMM.

2025-03: One co-author paper on Zero-shot Object Navigation is accepted to CVPR 2025.

2025-01: One co-author paper on Online 3D Instance Segmentation is accepted to ICLR 2025 (oral paper).

2024-12: Selected into Shuimu Tsinghua Scholar Program .

2024-05: One paper on Lane Detection is accepted to TIP.

2024-02: Two papers on 3D Occupancy and Online 3D Scene Perception are accepted to CVPR 2024.

2024-01: One paper on Depth Completion is accepted to TIP.

2023-09: One paper on Unsupervised Depth Completion is accepted to TCSVT.

Recent Selected Publications

(*Equal Contribution, #Corresponding Author)

	Pseudo Depth Meets Gaussian: A Feed-forward RGB SLAM Baseline Linqing Zhao, Xiuwei Xu, Yirui Wang, Hao Wang, Wenzhao Zheng, Yansong Tang, Haibin Yan#, Jiwen Lu IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2025. We propose an online RGB SLAM method that utilizes only monocular RGB input, eliminating the need for depth sensors or expensive iterative pose optimization.
	iGaussian: Real-Time Camera Pose Estimation via Feed-Forward 3D Gaussian Splatting Inversion Hao Wang, Linqing Zhao*, Xiuwei Xu, Jiwen Lu, Haibin Yan# IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2025.* We propose iGaussian, a two-stage feed-forward framework that achieves real-time camera pose estimation through direct 3D Gaussian inversion.
	Similarity-Aware Fusion Network for Robust Multi-Modal Semantic Segmentation Linqing Zhao, Jiwen Lu#, Jie Zhou IEEE Transactions on Multimedia (TMM), 2025. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021. We propose a similarity-aware fusion network (SAFNet) to adaptively fuse 2D images and 3D point clouds for Multi-Modal semantic segmentation.
	StructLane: Leveraging Structural Relations for Lane Detection Linqing Zhao, Wenzhao Zheng, Yunpeng Zhang, Jie Zhou, Jiwen Lu# IEEE Transactions on Image Processing (TIP), 2024. We propose the StructLane method to enhance lane detection accuracy and robustness by harnessing the structural relationships among lanes.
	LowRankOcc: Tensor Decomposition and Low-Rank Recovery for Vision-based 3D Semantic Occupancy Prediction Linqing Zhao, Xiuwei Xu, Ziwei Wang, Yunpeng Zhang, Borui Zhang, Wenzhao Zheng, Dalong Du, Jie Zhou, Jiwen Lu# IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024. We propose LowRankOcc to address spatial redundancy in 3D semantic occupancy prediction, leveraging the inherent low-rank property of occupancy data.
	Structure-aware Cross-Modal Transformer for Depth Completion Linqing Zhao, Yi Wei, Jiaxin Li, Jie Zhou, Jiwen Lu# IEEE Transactions on Image Processing (TIP), 2024. We disentangle the hierarchical 3D scene-level structure from the RGB-D input and construct a pathway to make sharp depth boundaries and object shape outlines accessible to 2D features.
	SPTR: Structure-Preserving Transformer for Unsupervised Indoor Depth Completion Linqing Zhao, Wenzhao Zheng, Yueqi Duan, Jie Zhou, Jiwen Lu# IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2023. We propose to reformulate depth completion as the process of 3D structure generation, where the generated structure should recover the complete scene and also consist with the known partial structure.
	SurroundOcc: Multi-Camera 3D Occupancy Prediction for Autonomous Driving Yi Wei, Linqing Zhao*, Wenzhao Zheng, Zheng Zhu, Jie Zhou , Jiwen Lu# IEEE International Conference on Computer Vision (ICCV), 2023* We design a pipeline to generate dense occupancy ground truths without expensive occupancy annotations, which enables the training of more dense 3D occupancy prediction models.
	Dense Hybrid Proposal Modulation for Lane Detection Yuejian Wu, Linqing Zhao, Jiwen Lu, Haibin Yan# IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2023. We densely modulate all proposals to generate topologically and spatially high-quality lane predictions with discriminative representations.
	SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation Yi Wei, Linqing Zhao*, Wenzhao Zheng, Zheng Zhu, Yongming Rao, Guan Huang, Jiwen Lu#, Jie Zhou Conference on Robot Learning (CoRL), 2022* We propose a SurroundDepth method to incorporate the information from multiple surrounding views to predict depth maps across cameras.
	Learning Hybrid Semantic Affinity for Point Cloud Segmentation Zhanjie Song, Linqing Zhao, Jie Zhou# IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2021. We present a hybrid semantic affinity learning method (HSA) to capture and leverage the dependencies of categories for 3D semantic segmentation, which aims to learn the label dependencies between 3D points from a hybrid perspective.