Unsplashed background img 1


Biography

Hu Cao is now a postdoctoral research associate in the Chair of Robotics, AI, and Real-Time Systems, at the Technical University of Munich. I obtained my Ph.D. degree from TUM, supervised by Prof. Alois Knoll. Hu Cao's research interests focus on vision and language models for scene understanding, including autonomous driving, robotic grasping, and dense prediction (classification, detection, and segmentation).

I am looking for highly self-motivated collaborators who are interested in autonomous driving, robotic grasping, and dense prediction. Please send me an up-to-date resume via email.
News

  • • 11 / 2024:   I am happy to join the editorial board of the journal Artificial Intelligence and Autonomous Systems (AIAS), Link
  • • 09 / 2024:   I am glad to share my recent talk about "Robotic Perception Based on Attention Mechanisms," presented at the Visual Intelligence International Scholars Academic Frontier Seminar. Video is here, Link
  • • 09 / 2024:   One paper on "Strong but simple: A Baseline for Domain Generalized Dense Perception by CLIP-based Transfer Learning" is accepted by ACCV 2024
  • • 08 / 2024:   One paper on "BiSeg-SAM: Weakly-Supervised Post-Processing Framework for Boosting Binary Segmentation in Segment Anything Models" is accepted by IEEE BIBM 2024
  • • 08 / 2024:   Our Swin-Unet ranks top 3 most cited ECCV papers in five years in Google Metrics, Link
  • • 08 / 2024:   One paper on 4-DOF point cloud registration is accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
  • • 07 / 2024:   Two papers on RGB-Event fusion object detection and dataset distillation are accepted by ECCV 2024
  • • 06 / 2024:   One paper on lightweight fisheye object detection is accepted by IROS 2024
  • • 06 / 2024:   We are organizing a special issue on "Advanced Perception and Planning Technology in Robotics" for Frontiers in Robotics and AI (SCI), and you are welcome to submit your original research work, Please click here for more information
  • • 05 / 2024:   One paper on dimension-pooling transformer for semantic segmentation is accepted by IEEE Transactions on Intelligent Transportation Systems (TITS)
  • • 05 / 2024:   One paper on vision language models in autonomous driving is accepted by IEEE Transactions on Intelligent Vehicles (TIV), PDF is here, Link
  • • 05 / 2024:   One paper on point cloud registration is accepted by IEEE Transactions on Intelligent Vehicles (TIV), PDF is here, Link
  • • 04 / 2024:   I am glad to share my recent talk about "Application of embodied intelligence to generalized object grasping," presented at the Joint Forum of the All German Chinese Mechatronics Engineering Society and the Computer Society 2024. Video is here, Link
  • • 02 / 2024:   One paper on collaborative semantic occupancy prediction is accepted by CVPR 2024
  • • 10 / 2023:   I am happy that I have been awarded as an IEEE TMI Distinguished Reviewer for IEEE Transactions on Medical Imaging (TMI)
  • • 10 / 2023:   One paper on efficient vision transformers is accepted by IEEE Transactions on Artificial Intelligence (TAI)
  • • 03 / 2023:   One paper on point set registration is accepted by ISPRS Journal of Photogrammetry and Remote Sensing (ISPRS), PDF is here, Link
  • • 03 / 2023:   One paper on robust face alignment and landmarks is accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
  • • 02 / 2023:   One paper on robust radar calibration is accepted by IEEE Transactions on Intelligent Transportation Systems (TITS)
  • • 12 / 2022:   We are organizing a special issue on Neurointelligence for Frontiers in Neuroscience (SCI), and you are welcome to submit your original research work, Please click here for more information
  • • 10 / 2022:   One paper on Efficient-Grasping is accepted by IEEE/ASME Transactions on Mechatronics (T-Mech)
  • • 09 / 2022:   Remote visiting the Department of Computer Science, University of Hong Kong, working with Prof. Hengshuang Zhao
  • • 09 / 2022:   We released a git to collect papers about event-based robotics (autonomous driving & robotic grasping), Link
  • • 08 / 2022:   Two papers are accepted by IEEE International Conference on Multisensor Fusion and Integration (MFI) 2022
  • • 08 / 2022:   SwinUnet is accepted by European Conference on Computer Vision-Medical Computer Vision Workshop (ECCV-MCVW), Video is here, Link, 2022
  • • 07 / 2022:   Our paper (Improving Autonomous Driving with Event-Based Neuromorphic Vision) is showcased on IEEE Xplore Innovation Spotlight, which highlights the most innovative and creative research directions, Link
  • • 05 / 2022:   One paper on event-based robotic grasping is accepted by IEEE Transactions on Instrumentation and Measurement (TIM)
  • • 03 / 2022:   I joined Computer Engineering and Networks Laboratory, ETH Zurich, as an academic guest, supervised by Prof. Lothar Thiele
  • • 09 / 2021:   One paper on vehicle detection is accepted by IEEE Sensors Journal
  • • 05 / 2021:   A validation for u-shaped swin transformer is published as a tech report on arxiv, codes released at SwinUnet
  • • 02 / 2021:   One paper on robotic grasping detection is accepted by ICRA 2021
  • • 03 / 2020:   Our work on event-based vision for autonomous driving is accepted by IEEE SPM
  • • 01 / 2020:   I joined the chair of robotics, AI and real-time systems, TUM, as a Ph.D. student, supervised by Prof. Alois Knoll
Scene understanding [Full List]

( indicates equal contribution, * indicates corresponding author, # indicates project lead)

Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection
Hu Cao, Zehua Zhang, Yan Xia, Xinyi Li, Jiahao Xia, Guang Chen, Alois Knoll
European Conference on Computer Vision (ECCV) (Tsinghua-A), 2024
Dataset Distillation by Automatic Training Trajectories
Dai Liu, Jindong Gu, Hu Cao, Trinitis Carsten, Schulz Martin
European Conference on Computer Vision (ECCV) (Tsinghua-A), 2024
Transformation Decoupling Strategy based on Screw Theory for Deterministic Point Cloud Registration with Gravity Prior
Xinyi Li, Zijian Ma, Yinlong Liu, Walter Zimmer, Hu Cao, Feihu Zhang, Alois Knoll
IEEE Transactions on Pattern Analysis and Machine Intelligence (JCR Q1, SCI-I, CCF-A), 2024
[4DOF-Registration]
Lightweight Fisheye Object Detection Network with Transformer-based Feature Enhancement for Autonomous Driving
Hu Cao, Yanpeng Li, Yinlong Liu, Xinyi Li, Guang Chen, Alois Knoll
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024
SDPT: Semantic-aware Dimension-Pooling Transformer for Image Segmentation
Hu Cao, Guang Chen, Hengshuang Zhao, Dongsheng Jiang, Xiaopeng Zhang, Qi Tian, Alois Knoll
IEEE Transactions on Intelligent Transportation Systems (JCR Q1, SCI-I), 2024
Vision Language Models in Autonomous Driving: A Survey and Outlook
Xingcheng Zhou, Mingyu Liu, Ekim Yurtsever, Bare Luka Zagar, Walter Zimmer, Hu Cao, Alois Knoll
IEEE Transactions on Intelligent Vehicles (JCR Q1, SCI-I), 2024
Efficient and Deterministic Search Strategy Based on Residual Projections for Point Cloud Registration with Correspondences
Xinyi Li, Hu Cao, Yinlong Liu, Xueli Liu, Feihu Zhang, Alois Knoll
IEEE Transactions on Intelligent Vehicles (JCR Q1, SCI-I), 2024
Collaborative Semantic Occupancy Prediction with Hybrid Feature Fusion in Connected Automated Vehicles
Rui Song, Chenwei Liang, Hu Cao, Zhiran Yan, Walter Zimmer, Markus Gross, Andreas Festag, Alois Knoll
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (CCF-A), 2024
Robust Face Alignment via Inherent Relation Learning and Uncertainty Estimation
Jiahao Xia, Min Xu, Haimin Zhang, Jianguo Zhang, Wenjian Huang, Hu Cao, Shiping Wen
IEEE Transactions on Pattern Analysis and Machine Intelligence (JCR Q1, SCI-I, CCF-A), 2023
[DSLPT]
GhostViT: Expediting Vision Transformers via Cheap Operations
Hu Cao, Zhongnan Qu, Guang Chen, Xinyi Li, Lothar Thiele, Alois Knoll
IEEE Transactions on Artificial Intelligence, 2023
[paper]
Globally Optimal Robust Radar Calibration in Intelligent Transportation Systems
Xinyi Li, Yinlong Liu, Venkat, Hu Cao, Feihu Zhang and Alois Knoll
IEEE Transactions on Intelligent Transportation Systems (JCR Q1, SCI-I), 2023
Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation
Hu Cao, Yueyue Wang, Joy Chen, Dongsheng Jiang, Xiaopeng Zhang, Qi Tian, Manning Wang
European Conference on Computer Vision Workshops (ECCVW) [Top 3 Most Cited ECCV Papers in Five Years] , 2022
[paper]
Orientation-aware People Detection and Counting Method based on Overhead Fisheye Camera
Hu Cao, Boyang Peng, Linxuan Jia, Bin Li, Alois Knoll, Guang Chen
IEEE International Conference on Multisensor Fusion and Integration (MFI), 2022
Fusion-based Feature Attention Gate Component for Vehicle Detection based on Event Camera
Hu Cao, Guang Chen, Jiahao Xia, Genghang Zhuang, Alois Knoll
IEEE Sensors Journal (JCR Q1), 2021
Event-based neuromorphic vision for autonomous driving: a paradigm shift for bio-inspired visual sensing and perception
Guang Chen, Hu Cao, Jorg Conradt, Huajin Tang, Florian Rohrbein, Alois Knoll
IEEE Signal Processing Magazine (JCR Q1, SCI-I), [IEEE Xplore Innovation Spotlight] , 2020

Robotic grasping [Full List]

( indicates equal contribution, * indicates corresponding author, # indicates project lead)

Efficient Grasp Detection Network with Gaussian-based Grasp Representation for Robotic Manipulation
Hu Cao, Guang Chen, Zhijun Li, Qian Feng, Jianjie Lin, Alois Knoll
IEEE/ASME Transactions on Mechatronics (JCR Q1, SCI-I), 2022
NeuroGrasp: Multi-modal Neural Network with Euler Region Regression for Neuromorphic Vision-based Grasp Pose Estimation
Hu Cao, Guang Chen, Zhijun Li, Yingbai Hu, Alois Knoll
IEEE Transactions on Instrumentation and Measurement, (JCR Q1), 2022
Residual Squeeze-and-Excitation Network with Multi-scale Spatial Pyramid Module for Fast Robotic Grasping Detection
Hu Cao, Guang Chen, Zhijun Li, Jianjie Lin, Alois Knoll
IEEE International Conference on Robotics and Automation (ICRA) (Tsinghua-A), 2021

Preprints [Full List]

( indicates equal contribution, * indicates corresponding author, # indicates project lead)

VLTSeg: Simple Transfer of CLIP-Based Vision-Language Representations for Domain Generalized Semantic Segmentation
Hümmer Christoph, Schwonberg Manuel, Zhou Liangwei, Hu Cao*, Alois Knoll, Gottschalk Hanno

Academic Activities

Editor Services
  •   • Topic Editors for Frontiers in Robotics and AI (SCI), 2024
  •   • Topic Coordinator for Frontiers in Neuroscience (SCI), 2022
Conference Reviewers/Program Committee member (PC)
  •   • IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2024/2025
  •   • IEEE International Conference on Robotics and Automation (ICRA), 2021/2022/2024
  •   • IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021/2024
  •   • ACM Multimedia (ACM MM), 2023/2024
  •   • Annual Meeting of the Association for Computational Linguistics (ACL), 2024
  •   • Medical Image Computing and Computer Assisted Intervention(MICCAI), 2024
  •   • AAAI Conference on Artificial Intelligence (AAAI), 2023/2024/2025
  •   • European Conference on Computer Vision (ECCV), 2022/2024
Journal Reviewers
  •   • IEEE Transactions on Image Processing (TIP)
  •   • IEEE Transactions on Artificial Intelligence (TAI)
  •   • IEEE Transactions on Neural Networks and Learning Systems (TNNLS)
  •   • IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)
  •   • IEEE Transactions on Cybernetics (TCYB)
  •   • IEEE Transactions on Automation Science and Engineering (T-ASE)
  •   • IEEE/ASME Transactions on Mechatronics (T-Mech)
  •   • IEEE Robotics and Automation Letters (RAL)
  •   • IEEE Transactions on Intelligent Vehicles (TIV)
  •   • IEEE Transactions on Industrial Electronics (TIE)
  •   • IEEE Transactions on Consumer Electronics (TCE)
  •   • IEEE Transactions on Industrial Informatics (TII)
  •   • IEEE Transactions on Computational Imaging (TCI)
  •   • IEEE Transactions on Geoscience and Remote Sensing (TGRS)
  •   • IEEE Transactions on Instrumentation & Measurement (TIM)
  •   • IEEE Instrumentation & Measurement Magazine (TIMM)
  •   • IEEE Journal of Biomedical and Health Informatics (JBHI)
  •   • IEEE Transactions on Medical Imaging (TMI)
  •   • Medical Image Analysis (MIA)
  •   • Pattern Recognition (PR)
  •   • Scientific Reports
  •   • Frontiers in Neurorobotics
  •   • The Visual Computer
  •   • Signal, Image and Video Processing
  •   • Automotive Innovation
Internship and Cooperation

                   
Awards

  •   • Distinguished Reviewer of IEEE Transactions on Medical Imaging (TMI), 2022-2023
  •   • Outstanding Graduate of Hunan Province, 2019
  •   • Outstanding Graduate of Hunan University, 2019
  •   • Outstanding Graduate Student of Hunan University, 2017-2018
  •   • Outstanding Graduate Student Leader of Hunan University, 2017-2018
  •   • Most media Attention Award, The 3rd Lushan Cup Innovation Challenge Competition, 2018
  •   • Second Prize, 2018 ”Haosen Pharmaceutical Cup” 5th China Graduate Smart City Technology and Creative Design Competition , 2018
  •   • Second Prize, The 4th National ”TRIZ” Cup College Students Innovation Competition, 2016
  •   • First Prize, The 7th National College Students Mechanical Innovation Design Competition, 2016
💻 Demos