Publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2026
- AAAIAdaDepth: Exploiting Inherent Scene Information for Self-Supervised Depth Estimation in Dynamic ScenesIn AAAI Conference on Artificial Intelligence (AAAI), 2026
2025
- NNInter-Modality Feature Representation Learning-based Fusion Network for 3D Industrial Defect DetectionNeural Networks (NN), 2025
- TIPSRS: Siamese Reconstruction-Segmentation Network based on Dynamic-Parameter ConvolutionIEEE Transactions on Image Processing (TIP), 2025
- TCSVTCMF-IoU: Multi-Stage Cross-Modal Fusion 3D Object Detection with IoU Joint PredictionIEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2025
- ECAIPMR: Physical Model-Driven Multi-Stage Restoration of Turbulent Dynamic VideosIn European Conference on Artificial Intelligence (ECAI), 2025
- ACM MMMobile U-ViT: Revisiting large kernel and U-shaped ViT for efficient medical image segmentationIn Proceedings of the 33rd ACM International Conference on Multimedia (ACM MM), 2025
- ICCVFix-CLIP: Dual-Branch Hierarchical Contrastive Learning via Synthetic Captions for Better Understanding of Long TextIn International Conference on Computer Vision (ICCV), 2025
- TCSVT2M3DF: Advancing 3D industrial defect detection with multi perspective multimodal fusion networkIEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2025
- TCSVTConsistency-guided adaptive alternating training for semi-supervised salient object detectionIEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2025
- ICASSPMixed Gaussian Splatting for High-Quality Rendering and ReconstructionIn ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2025
- MIAMambaMIM: Pre-training Mamba with state space token interpolation and its application to medical image segmentationMedical Image Analysis (MIA), 2025
- AEI3D-MMFN: Multi-level multimodal fusion network for 3D industrial image anomaly detectionAdvanced Engineering Informatics, 2025
- CVIUSTDepth: Leveraging semantic-textural information in transformers for self-supervised monocular depth estimationComputer Vision and Image Understanding (CVIU), 2025
- AAAIGim: A million-scale benchmark for generative image manipulation detection and localizationIn Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2025
2024
- TIPImage Super-Resolution via Efficient Transformer Embedding Frequency Decomposition With RestartIEEE Transactions on Image Processing (TIP), 2024
- IJCVFast Image Smoothing via Quasi Weighted Least SquaresInternational Journal of Computer Vision (IJCV), 2024
- TMMVb-kgn: Variational bayesian kernel generation networks for motion image deblurringIEEE Transactions on Multimedia (TMM), 2024
- NeurIPSOpus: occupancy prediction using a sparse setIn Advances in Neural Information Processing Systems (NeurIPS), 2024
2023
- ACM MMRecurrent multi-scale transformer for high-resolution salient object detectionIn Proceedings of the 31st ACM International Conference on Multimedia (ACM MM), 2023
- AAAISoftclip: Softer cross-modal alignment makes clip strongerIn Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2023
- RALTowards Better Data Exploitation in Self-Supervised Monocular Depth EstimationIEEE Robotics and Automation Letters (RAL), 2023
- ECAIFastc: A fast attentional framework for semantic traversability classification using point cloudIn European Conference on Artificial Intelligence (ECAI), 2023
2022
- TPAMIA generalized framework for edge-preserving and structure-preserving image smoothingIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
2021
- TIPLooking for the detail and context devils: High-resolution salient object detectionIEEE Transactions on Image Processing (TIP), 2021
2020
- AAAIA Generalized Framework for Edge-Preserving and Structure-Preserving Image SmoothingIn Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2020
- TOG
- TIPRAPNet: Residual atrous pyramid network for importance-aware street scene parsingIEEE Transactions on Image Processing (TIP), 2020
- TIP
- TITSSemantic scene labeling via deep nested level setIEEE Transactions on Intelligent Transportation Systems (TITS), 2020
- PRNon-rigid object tracking via deep multi-scale spatial-temporal discriminative saliency mapsPattern Recognition (PR), 2020
2019
- TIPSalient object detection with lossless feature reflection and weighted structural lossIEEE Transactions on Image Processing (TIP), 2019
- PRHyperfusion-Net: Hyper-densely reflective feature fusion for salient object detectionPattern Recognition (PR), 2019
- PRDeep gated attention networks for large-scale street-level scene segmentationPattern Recognition (PR), 2019
2018
- TCSVTEmbedding bilateral filter in least squares for efficient edge-preserving image smoothingIEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2018
- ICCVCascaded context pyramid for full-resolution 3D semantic scene completionIn Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2018
- IJCAISalient object detection by lossless feature reflectionIn Proceedings of the 27th International Joint Conference on Artificial Intelligence (IJCAI), 2018
2017
- TIP
- ICCVSemi-global weighted least squares in image filteringIn Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2017
- TCSVTVariable bandwidth weighting for texture copy artifact suppression in guided depth upsamplingIEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2017
2016
- ICIPRobust weighted least squares for guided depth upsamplingIn IEEE International Conference on Image Processing (ICIP), 2016
2015
- SPLAn MRF-based depth upsampling: Upsample the depth map with its own propertyIEEE Signal Processing Letters (SPL), 2015
- ICIPUpsampling the depth map with its own propertiesIn IEEE International Conference on Image Processing (ICIP), 2015