Publications
2025
- Evaluating Cognitive-Behavioral Fixation via Multimodal User Viewing Patterns on Social Media - Yujie Wang, Yunwei Zhao, Jing Yang, Han Han, Shiguang Shan, Jie Zhang. Evaluating Cognitive-Behavioral Fixation via Multimodal User Viewing Patterns on Social Media. The Conference on Empirical Methods in Natural Language Processing (EMNLP) [pdf] 
- SHALE: A Scalable Benchmark for Fine-grained Hallucination Evaluation in LVLMs - Bei Yan, Zhiyuan Chen, Yuecong Min, Jie Zhang, Jiahao Wang, Xiaozhen Wang, Shiguang Shan. SHALE: A Scalable Benchmark for Fine-grained Hallucination Evaluation in LVLMs. ACM International Conference on Multimedia (ACM MM). [pdf] 
- FullLoRA: Efficiently Boosting the Robustness of Pretrained Vision Transformers - Zheng Yuan, Jie Zhang, Shiguang Shan, Xilin Chen. FullLoRA: Efficiently Boosting the Robustness of Pretrained Vision Transformers. IEEE Transactions on Image Processing (TIP). [pdf] 
- Confidence aware learning for reliable face anti-spoofing - Xingming Long, Jie Zhang, Shiguang Shan. Confidence aware learning for reliable face anti-spoofing. IEEE Transactions on Information Forensics and Security (TIFS). [pdf] 
- Face Forgery Video Detection via Temporal Forgery Cue Unraveling - Zonghui Guo, Yingjie Liu, Jie Zhang, Haiyong Zheng, Shiguang Shan. Face Forgery Video Detection via Temporal Forgery Cue Unraveling. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2025. [pdf] 
- Real face foundation representation learning for generalized deepfake detection - Liang Shi, Jie Zhang, Zhilong Ji, Jinfeng Bai, Shiguang Shan. Real face foundation representation learning for generalized deepfake detection. Pattern Recognition (PR) 2025. [pdf] 
- Collaboratively Self-supervised Video Representation Learning for Action Recognition - Jie Zhang, Zhifan Wan, Lanqing Hu, Stephen Lin, Shuzhe Wu, Shiguang Shan. Collaboratively Self-supervised Video Representation Learning for Action Recognition. IEEE Transactions on Information Forensics and Security (TIFS) 2025. [pdf] 
- Dysca: A Dynamic and Scalable Benchmark for Evaluating Perception Ability of LVLMs - Jie Zhang, Zhongqi Wang, Mengqi Lei, Zheng Yuan, Bei Yan, Shiguang Shan, Xilin Chen. Dysca: A Dynamic and Scalable Benchmark for Evaluating Perception Ability of LVLMs. International Conference on Learning Representations (ICLR) 2025. [pdf] 
2024
- Rethinking the Evaluation of Out-of-Distribution Detection: A Sorites Paradox - Xingming Long, Jie Zhang, Shiguang Shan, Xilin Chen. Rethinking the Evaluation of Out-of-Distribution Detection: A Sorites Paradox. Advances in Neural Information Processing Systems (NeurIPS) 2024. [pdf] 
- Generalized Face Liveness Detection via De-fake Face Generator - Xingming Long, Jie Zhang, Shiguang Shan. Generalized Face Liveness Detection via De-fake Face Generator. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2024. [pdf] 
- Towards Robust Semantic Segmentation against Patch-Based Attack via Attention Refinement - Zheng Yuan, Jie Zhang, Yude Wang, Shiguang Shan, Xilin Chen. Towards Robust Semantic Segmentation against Patch-Based Attack via Attention Refinement. International Journal of Computer Vision (IJCV) 2024. [pdf] 
- T2IShield: Defending against backdoors on text-to-image diffusion models - Zhongqi Wang, Jie Zhang, Shiguang Shan, Xilin Chen. T2IShield: Defending against backdoors on text-to-image diffusion models. European Conference on Computer Vision (ECCV) 2024. [pdf] 
- Hierarchical compositional representations for few-shot action recognition - Changzhen Li, Jie Zhang, Shuzhe Wu, Xin Jin, Shiguang Shan. Hierarchical compositional representations for few-shot action recognition. Computer Vision and Image Understanding (CVIU) 2024. [pdf] 
- Adaptive perturbation for adversarial attack - Zheng Yuan, Jie Zhang, Zhaoyan Jiang, Liangliang Li, Shiguang Shan. Adaptive perturbation for adversarial attack. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2024. [pdf] 
- Enhancing Face Recognition with Detachable Self-Supervised Bypass Networks - Mingjie He, Jie Zhang, Shiguang Shan, Xilin Chen. Enhancing Face Recognition with Detachable Self-Supervised Bypass Networks. IEEE Transactions on Image Processing (TIP) 2024. [pdf] 
- Video Harmonization with Triplet Spatio-Temporal Variation Patterns - Zonghui Guo, Xinyu Han, Jie Zhang, Shiguang Shan, Haiyong Zheng. Video Harmonization with Triplet Spatio-Temporal Variation Patterns. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024. [pdf] 
- Pre-trained model guided fine-tuning for zero-shot adversarial robustness - Sibo Wang, Jie Zhang, Zheng Yuan, Shiguang Shan. Pre-trained model guided fine-tuning for zero-shot adversarial robustness. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR) 2024. [pdf] 
2023
- Data-efficient masked video modeling for self-supervised action recognition - Qiankun Li, Xiaolong Huang, Zhifan Wan, Lanqing Hu, Shuzhe Wu, Jie Zhang, Shiguang Shan, Zengfu Wang. Data-efficient masked video modeling for self-supervised action recognition. Proceedings of the 31st ACM International Conference on Multimedia (ACM MM) 2023. [pdf] 
- Dual sampling based causal intervention for face anti-spoofing with identity debiasing - Xingming Long, Jie Zhang, Shuzhe Wu, Xin Jin, Shiguang Shan. Dual sampling based causal intervention for face anti-spoofing with identity debiasing. IEEE Transactions on Information Forensics and Security (TIFS) 2023. [pdf] 
- An automated optical inspection (AOI) platform for three-dimensional (3D) defects detection on glass micro-optical components (GMOC) - Yinchao Du, Jiangpeng Chen, Han Zhou, Xiaoling Yang, Zhongqi Wang, Jie Zhang, Yuechun Shi, Xiangfei Chen, Xuezhe Zheng. An automated optical inspection (AOI) platform for three-dimensional (3D) defects detection on glass micro-optical components (GMOC). Optics Communications 2023. [pdf] 
- Adaptive Adversarial Patch Attack on Face Recognition Models - Bei Yan, Jie Zhang, Zheng Yuan, Shiguang Shan. Adaptive Adversarial Patch Attack on Face Recognition Models. IEEE International Joint Conference on Biometrics (IJCB) 2023. [pdf] 
- BLPSeg: Balance the label preference in scribble-supervised semantic segmentation - Yude Wang, Jie Zhang, Meina Kan, Shiguang Shan, Xilin Chen. BLPSeg: Balance the label preference in scribble-supervised semantic segmentation. IEEE Transactions on Image Processing (TIP) 2023. [pdf] 
- Cclap: Controllable chinese landscape painting generation via latent diffusion model - Zhongqi Wang, Jie Zhang, Zhilong Ji, Jinfeng Bai, Shiguang Shan. Cclap: Controllable chinese landscape painting generation via latent diffusion model. IEEE International Conference on Multimedia and Expo (ICME) 2023. [pdf] 
- Self-supervised Learning for Fine-grained Ethnicity Classification under Limited Labeled Data - Kunyan Li, Jie Zhang, Shiguang Shan. Self-supervised Learning for Fine-grained Ethnicity Classification under Limited Labeled Data. IEEE 17th International Conference on Automatic Face and Gesture Recognition (FG) 2023. [pdf] 
2022
- Learning pseudo labels for semi-and-weakly supervised semantic segmentation - Yude Wang, Jie Zhang, Meina Kan, Shiguang Shan. Learning pseudo labels for semi-and-weakly supervised semantic segmentation. Pattern Recognition (PR) 2022. [pdf] 
- Adaptive image transformations for transfer-based adversarial attack - Zheng Yuan, Jie Zhang, Shiguang Shan. Adaptive image transformations for transfer-based adversarial attack. European Conference on Computer Vision (ECCV) 2022. [pdf] 
- Polynomial stacked-attention network for nationality classification - Kunyan Li, Jie Zhang, Shiguang Shan. Polynomial stacked-attention network for nationality classification. Frontiers of Computer Science 2022. [pdf] 
- Video-Based Two-Stage Network for Optical Glass Sub-Millimeter Defect Detection - Han Zhou, Xiaoling Yang, Zhongqi Wang, Jie Zhang, Yinchao Du, Jiangpeng Chen, Xuezhe Zheng. Video-Based Two-Stage Network for Optical Glass Sub-Millimeter Defect Detection. AI 2022. [pdf] 
- Enhancing face recognition with self-supervised 3d reconstruction - Mingjie He, Jie Zhang, Shiguang Shan, Xilin Chen. Enhancing face recognition with self-supervised 3d reconstruction. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022. [pdf] 
2021
- Locality-aware Channel-wise Dropout for Occluded Face Recognition - Mingjie He, Jie Zhang, Shiguang Shan, Xiao Liu, Zhongqin Wu, Xilin Chen. Locality-aware Channel-wise Dropout for Occluded Face Recognition. IEEE Transactions on Image Processing (TIP) 2021. [pdf] 
- Dual-Branch Meta-learning Network with Distribution Alignment for Face Anti-spoofing - Yunpei Jia, Jie Zhang, Shiguang Shan. Dual-Branch Meta-learning Network with Distribution Alignment for Face Anti-spoofing. IEEE Transactions on Information Forensics and Security (TFIS) 2021. [pdf] 
- Learning Shape-Appearance Based Attributes Representation for Facial Attribute Recognition with Limited Labeled Data - Kunyan Li, Jie Zhang, Shiguang Shan. Learning Shape-Appearance Based Attributes Representation for Facial Attribute Recognition with Limited Labeled Data. IEEE International Conference on Automatic Face and Gesture Recognition (FG) 2021. [pdf] 
- Unknown Aware Feature Learning for Face Forgery Detection - Liang Shi, Jie Zhang, Chenyue Liang, Shiguang Shan. Unknown Aware Feature Learning for Face Forgery Detection. IEEE International Conference on Automatic Face and Gesture Recognition (FG) 2021. [pdf] 
- Meta Gradient Adversarial Attack - Zheng Yuan, Jie Zhang, Yunpei Jia, Chuanqi Tan, Tao Xue, Shiguang Shan. Meta Gradient Adversarial Attack. IEEE International Conference on Computer Vision (ICCV) 2021. [pdf][code] 
- Unified unsupervised and semi-supervised domain adaptation network for cross-scenario face anti-spoofing - Yunpei Jia, Jie Zhang, Shiguang Shan, Xilin Chen. Unified unsupervised and semi-supervised domain adaptation network for cross-scenario face anti-spoofing. Pattern Recognition (PR) 2021. [pdf] 
2020
- Leveraging Auxiliary Tasks for Height andWeight Estimation by Multi Task Learning - Dan Han, Jie Zhang, Shiguang Shan. Leveraging Auxiliary Tasks for Height and Weight Estimation by Multi Task Learning. International Joint Conference on Biometrics (IJCB), 2020. [pdf] 
- Attributes Aware Face Generation with Generative Adversarial Networks - Zheng Yuan, Jie Zhang, Shiguang Shan, Xilin Chen. Attributes Aware Face Generation with Generative Adversarial Networks. International Conference on Pattern Recognition (ICPR), 2020. [pdf] 
- Deformable Face Net for Pose Invariant Face Recognition - Mingjie He, Jie Zhang, Shiguang Shan, Meina Kan, Xilin Chen. Deformable Face Net for Pose Invariant Face Recognition. Pattern Recognition (PR), 2020. [pdf] 
- Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation - Yude Wang, Jie Zhang, Meina Kan, Shiguang Shan, Xilin Chen. Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020. (Oral) [pdf] [code] 
- Single-Side Domain Generalization for Face Anti-Spoofing - Yunpei Jia, Jie Zhang, Shiguang Shan, Xilin Chen. Single-Side Domain Generalization for Face Anti-Spoofing. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020. [pdf] [code] 
- Noise Robust Hard Example Mining for Human Detection with Efficient Depth-Thermal Fusion - Zijian Zhao, Jie Zhang, Shiguang Shan. Noise Robust Hard Example Mining for Human Detection with Efficient Depth-Thermal Fusion. IEEE International Conference on Automatic Face and Gesture Recognition Workshops (FGW), 2020. (2nd Winner of Human Detection) [pdf] 
- PAS-Net: Pose-based and Appearance-based Spatiotemporal Networks Fusion for Action Recognition - Changzhen Li, Jie Zhang, Shiguang Shan, Xilin Chen. PAS-Net: Pose-based and Appearance-based Spatiotemporal Networks Fusion for Action Recognition. IEEE International Conference on Automatic Face and Gesture Recognition (FG), 2020. [pdf] 
2019
- Locality-constrained Framework for Face Alignment - Jie Zhang, Xiaowei Zhao, Meina Kan, Shiguang Shan, Xiujuan Chai, Xilin Chen. Locality-constrained Framework for Face Alignment. Frontiers of Computer Science (FCS), 2019. [pdf] 
- Deformable Face Net: Learning Pose Invariant Feature with Pose Aware Feature Alignment for Face Recognition - Mingjie He, Jie Zhang, Shiguang Shan, Meina Kan, Xilin Chen. Deformable Face Net: Learning Pose Invariant Feature with Pose Aware Feature Alignment for Face Recognition. IEEE International Conference on Automatic Face and Gesture Recognition (FG), 2019. (Oral) [pdf] [code] 
- DFT-Net: Disentanglement of Face Deformation and Texture Synthesis for Expression Editing - Jinghui Wang, Jie Zhang, Zijia Lu, Shiguang Shan. DFT-Net: Disentanglement of Face Deformation and Texture Synthesis for Expression Editing. IEEE International Conference on Image Processing (ICIP), 2019. [pdf] 
2018
- A Three-Category Face Detector with Contextual Information on Finding Tiny Faces - Feng Jiang, Jie Zhang, Liping Yan, Yuanqing Xia, Shiguang Shan. A Three-Category Face Detector with Contextual Information on Finding Tiny Faces. IEEE International Conference on Image Processing (ICIP), 2018. [pdf] 
- Efficient Weighted Kernel Sharing Convolutional Neural Networks - Helong Zhou, Yie-Tarng Chen, Jie Zhang, Wen-Hsien Fang. Efficient Weighted Kernel Sharing Convolutional Neural Networks. IEEE International Conference on Visual Communications and Image Processing (VCIP), 2018. [pdf] 
2017
- Robust Fec-cnn: A High Accuracy Facial Landmark Detection System - Zhenliang He, Jie Zhang, Meina Kan, Shiguang Shan, Xilin Chen. Robust Fec-cnn: A High Accuracy Facial Landmark Detection System. IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2017. (2nd Winner of Face Alignment) [pdf] 
- KinNet: Fine-to-Coarse Deep Metric Learning for Kinship Verification - Yong Li, Jiabei Zeng, Jie Zhang, Anbo Dai, Meina Kan, Shiguang Shan, Xilin Chen. Kinnet: Fine-to-coarse deep metric learning for kinship verification. ACM Conference on Multimedia Workshops (ACM MMW) 2017. (1st Winner of Kinship Verification) [pdf] 
- A Fully End-to-End Cascaded CNN for Facial Landmark Detection - Zhenliang He, Meina Kan, Jie Zhang, Xilin Chen, Shiguang Shan. A Fully End-to-End Cascaded CNN for Facial Landmark Detection. IEEE International Conference on Face and Gesture Recognition (FG), 2017. [pdf] 
2016
- Occlusion-free Face Alignment: Deep Regression Networks Coupled with De-corrupt AutoEncoders - Jie Zhang, Meina Kan, Shiguang Shan, Xilin Chen. Occlusion-free Face Alignment: Deep Regression Networks Coupled with De-corrupt AutoEncoders. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016. [pdf] 
2015
- Leveraging Datasets with Varying Annotations for Face Alignment via Deep Regression Network - Jie Zhang, Meina Kan, Shiguang Shan, Xilin Chen. Leveraging Datasets with Varying Annotations for Face Alignment via Deep Regression Network. IEEE International Conference on Computer Vision (ICCV), 2015. [pdf] 
- AgeNet:Deeply Learned Regressor and Classifier for Robust Apparent Age Estimation - Xin Liu, Shaoxin Li, Meina Kan, Jie Zhang, Shuzhe Wu, Wenxian Liu, hu Han, Shiguang Shan, Xilin Chen. AgeNet:Deeply Learned Regressor and Classifier for Robust Apparent Age Estimation. IEEE International Conference on Computer Vision Workshops (ICCVW), 2015. (2nd Winner of Apparent Age Estimation) [pdf] 
2014
- Coarse-to-Fine Auto-encoder Networks (CFAN) for Real-time Face Alignment - Jie Zhang, Shiguang Shan, Meina Kan, Xilin Chen. Coarse-to-Fine Auto-encoder Networks (CFAN) for Real-time Face Alignment. European Conference on Computer Vision (ECCV), 2014 [pdf] [code] 
- Topic-aware Deep Auto-encoders (TDA) for Face Alignment - Jie Zhang, Meina Kan, Shiguang Shan, Xiaowe Zhao, Xilin Chen. Topic-aware Deep Auto-encoders (TDA) for Face Alignment. Asian Conference on Computer Vision (ACCV), 2014. [pdf] 
