Publications
2025
Face Forgery Video Detection via Temporal Forgery Cue Unraveling
Zonghui Guo, Yingjie Liu, Jie Zhang, Haiyong Zheng, Shiguang Shan. Face Forgery Video Detection via Temporal Forgery Cue Unraveling. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2025.
Real face foundation representation learning for generalized deepfake detection
Liang Shi, Jie Zhang, Zhilong Ji, Jinfeng Bai, Shiguang Shan. Real face foundation representation learning for generalized deepfake detection. Pattern Recognition (PR) 2025. [pdf]
Collaboratively Self-supervised Video Representation Learning for Action Recognition
Jie Zhang, Zhifan Wan, Lanqing Hu, Stephen Lin, Shuzhe Wu, Shiguang Shan. Collaboratively Self-supervised Video Representation Learning for Action Recognition. IEEE Transactions on Information Forensics and Security (TIFS) 2025. [pdf]
Dysca: A Dynamic and Scalable Benchmark for Evaluating Perception Ability of LVLMs
Jie Zhang, Zhongqi Wang, Mengqi Lei, Zheng Yuan, Bei Yan, Shiguang Shan, Xilin Chen. Dysca: A Dynamic and Scalable Benchmark for Evaluating Perception Ability of LVLMs. International Conference on Learning Representations (ICLR) 2025. [pdf]
2024
Rethinking the Evaluation of Out-of-Distribution Detection: A Sorites Paradox
Xingming Long, Jie Zhang, Shiguang Shan, Xilin Chen. Rethinking the Evaluation of Out-of-Distribution Detection: A Sorites Paradox. Advances in Neural Information Processing Systems (NeurIPS) 2024. [pdf]
Generalized Face Liveness Detection via De-fake Face Generator
Xingming Long, Jie Zhang, Shiguang Shan. Generalized Face Liveness Detection via De-fake Face Generator. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2024. [pdf]
Towards Robust Semantic Segmentation against Patch-Based Attack via Attention Refinement
Zheng Yuan, Jie Zhang, Yude Wang, Shiguang Shan, Xilin Chen. Towards Robust Semantic Segmentation against Patch-Based Attack via Attention Refinement. International Journal of Computer Vision (IJCV) 2024. [pdf]
T2IShield: Defending against backdoors on text-to-image diffusion models
Zhongqi Wang, Jie Zhang, Shiguang Shan, Xilin Chen. T2IShield: Defending against backdoors on text-to-image diffusion models. European Conference on Computer Vision (ECCV) 2024. [pdf]
Hierarchical compositional representations for few-shot action recognition
Changzhen Li, Jie Zhang, Shuzhe Wu, Xin Jin, Shiguang Shan. Hierarchical compositional representations for few-shot action recognition. Computer Vision and Image Understanding (CVIU) 2024. [pdf]
Adaptive perturbation for adversarial attack
Zheng Yuan, Jie Zhang, Zhaoyan Jiang, Liangliang Li, Shiguang Shan. Adaptive perturbation for adversarial attack. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2024. [pdf]
Enhancing Face Recognition with Detachable Self-Supervised Bypass Networks
Mingjie He, Jie Zhang, Shiguang Shan, Xilin Chen. Enhancing Face Recognition with Detachable Self-Supervised Bypass Networks. IEEE Transactions on Image Processing (TIP) 2024. [pdf]
Video Harmonization with Triplet Spatio-Temporal Variation Patterns
Zonghui Guo, Xinyu Han, Jie Zhang, Shiguang Shan, Haiyong Zheng. Video Harmonization with Triplet Spatio-Temporal Variation Patterns. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024. [pdf]
Pre-trained model guided fine-tuning for zero-shot adversarial robustness
Sibo Wang, Jie Zhang, Zheng Yuan, Shiguang Shan. Pre-trained model guided fine-tuning for zero-shot adversarial robustness. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR) 2024. [pdf]
2023
Data-efficient masked video modeling for self-supervised action recognition
Qiankun Li, Xiaolong Huang, Zhifan Wan, Lanqing Hu, Shuzhe Wu, Jie Zhang, Shiguang Shan, Zengfu Wang. Data-efficient masked video modeling for self-supervised action recognition. Proceedings of the 31st ACM International Conference on Multimedia (ACM MM) 2023. [pdf]
Dual sampling based causal intervention for face anti-spoofing with identity debiasing
Xingming Long, Jie Zhang, Shuzhe Wu, Xin Jin, Shiguang Shan. Dual sampling based causal intervention for face anti-spoofing with identity debiasing. IEEE Transactions on Information Forensics and Security (TIFS) 2023. [pdf]
An automated optical inspection (AOI) platform for three-dimensional (3D) defects detection on glass micro-optical components (GMOC)
Yinchao Du, Jiangpeng Chen, Han Zhou, Xiaoling Yang, Zhongqi Wang, Jie Zhang, Yuechun Shi, Xiangfei Chen, Xuezhe Zheng. An automated optical inspection (AOI) platform for three-dimensional (3D) defects detection on glass micro-optical components (GMOC). Optics Communications 2023. [pdf]
Adaptive Adversarial Patch Attack on Face Recognition Models
Bei Yan, Jie Zhang, Zheng Yuan, Shiguang Shan. Adaptive Adversarial Patch Attack on Face Recognition Models. IEEE International Joint Conference on Biometrics (IJCB) 2023. [pdf]
BLPSeg: Balance the label preference in scribble-supervised semantic segmentation
Yude Wang, Jie Zhang, Meina Kan, Shiguang Shan, Xilin Chen. BLPSeg: Balance the label preference in scribble-supervised semantic segmentation. IEEE Transactions on Image Processing (TIP) 2023. [pdf]
Cclap: Controllable chinese landscape painting generation via latent diffusion model
Zhongqi Wang, Jie Zhang, Zhilong Ji, Jinfeng Bai, Shiguang Shan. Cclap: Controllable chinese landscape painting generation via latent diffusion model. IEEE International Conference on Multimedia and Expo (ICME) 2023. [pdf]
Self-supervised Learning for Fine-grained Ethnicity Classification under Limited Labeled Data
Kunyan Li, Jie Zhang, Shiguang Shan. Self-supervised Learning for Fine-grained Ethnicity Classification under Limited Labeled Data. IEEE 17th International Conference on Automatic Face and Gesture Recognition (FG) 2023. [pdf]
2022
Learning pseudo labels for semi-and-weakly supervised semantic segmentation
Yude Wang, Jie Zhang, Meina Kan, Shiguang Shan. Learning pseudo labels for semi-and-weakly supervised semantic segmentation. Pattern Recognition (PR) 2022. [pdf]
Adaptive image transformations for transfer-based adversarial attack
Zheng Yuan, Jie Zhang, Shiguang Shan. Adaptive image transformations for transfer-based adversarial attack. European Conference on Computer Vision (ECCV) 2022. [pdf]
Polynomial stacked-attention network for nationality classification
Kunyan Li, Jie Zhang, Shiguang Shan. Polynomial stacked-attention network for nationality classification. Frontiers of Computer Science 2022. [pdf]
Video-Based Two-Stage Network for Optical Glass Sub-Millimeter Defect Detection
Han Zhou, Xiaoling Yang, Zhongqi Wang, Jie Zhang, Yinchao Du, Jiangpeng Chen, Xuezhe Zheng. Video-Based Two-Stage Network for Optical Glass Sub-Millimeter Defect Detection. AI 2022. [pdf]
Enhancing face recognition with self-supervised 3d reconstruction
Mingjie He, Jie Zhang, Shiguang Shan, Xilin Chen. Enhancing face recognition with self-supervised 3d reconstruction. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022. [pdf]
2021
Locality-aware Channel-wise Dropout for Occluded Face Recognition
Mingjie He, Jie Zhang, Shiguang Shan, Xiao Liu, Zhongqin Wu, Xilin Chen. Locality-aware Channel-wise Dropout for Occluded Face Recognition. IEEE Transactions on Image Processing (TIP) 2021. [pdf]
Dual-Branch Meta-learning Network with Distribution Alignment for Face Anti-spoofing
Yunpei Jia, Jie Zhang, Shiguang Shan. Dual-Branch Meta-learning Network with Distribution Alignment for Face Anti-spoofing. IEEE Transactions on Information Forensics and Security (TFIS) 2021. [pdf]
Learning Shape-Appearance Based Attributes Representation for Facial Attribute Recognition with Limited Labeled Data
Kunyan Li, Jie Zhang, Shiguang Shan. Learning Shape-Appearance Based Attributes Representation for Facial Attribute Recognition with Limited Labeled Data. IEEE International Conference on Automatic Face and Gesture Recognition (FG) 2021. [pdf]
Unknown Aware Feature Learning for Face Forgery Detection
Liang Shi, Jie Zhang, Chenyue Liang, Shiguang Shan. Unknown Aware Feature Learning for Face Forgery Detection. IEEE International Conference on Automatic Face and Gesture Recognition (FG) 2021. [pdf]
Meta Gradient Adversarial Attack
Zheng Yuan, Jie Zhang, Yunpei Jia, Chuanqi Tan, Tao Xue, Shiguang Shan. Meta Gradient Adversarial Attack. IEEE International Conference on Computer Vision (ICCV) 2021. [pdf][code]
Unified unsupervised and semi-supervised domain adaptation network for cross-scenario face anti-spoofing
Yunpei Jia, Jie Zhang, Shiguang Shan, Xilin Chen. Unified unsupervised and semi-supervised domain adaptation network for cross-scenario face anti-spoofing. Pattern Recognition (PR) 2021. [pdf]
2020
Leveraging Auxiliary Tasks for Height andWeight Estimation by Multi Task Learning
Dan Han, Jie Zhang, Shiguang Shan. Leveraging Auxiliary Tasks for Height and Weight Estimation by Multi Task Learning. International Joint Conference on Biometrics (IJCB), 2020. [pdf]
Attributes Aware Face Generation with Generative Adversarial Networks
Zheng Yuan, Jie Zhang, Shiguang Shan, Xilin Chen. Attributes Aware Face Generation with Generative Adversarial Networks. International Conference on Pattern Recognition (ICPR), 2020. [pdf]
Deformable Face Net for Pose Invariant Face Recognition
Mingjie He, Jie Zhang, Shiguang Shan, Meina Kan, Xilin Chen. Deformable Face Net for Pose Invariant Face Recognition. Pattern Recognition (PR), 2020. [pdf]
Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation
Yude Wang, Jie Zhang, Meina Kan, Shiguang Shan, Xilin Chen. Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020. (Oral) [pdf] [code]
Single-Side Domain Generalization for Face Anti-Spoofing
Yunpei Jia, Jie Zhang, Shiguang Shan, Xilin Chen. Single-Side Domain Generalization for Face Anti-Spoofing. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020. [pdf] [code]
Noise Robust Hard Example Mining for Human Detection with Efficient Depth-Thermal Fusion
Zijian Zhao, Jie Zhang, Shiguang Shan. Noise Robust Hard Example Mining for Human Detection with Efficient Depth-Thermal Fusion. IEEE International Conference on Automatic Face and Gesture Recognition Workshops (FGW), 2020. (2nd Winner of Human Detection) [pdf]
PAS-Net: Pose-based and Appearance-based Spatiotemporal Networks Fusion for Action Recognition
Changzhen Li, Jie Zhang, Shiguang Shan, Xilin Chen. PAS-Net: Pose-based and Appearance-based Spatiotemporal Networks Fusion for Action Recognition. IEEE International Conference on Automatic Face and Gesture Recognition (FG), 2020. [pdf]
2019
Locality-constrained Framework for Face Alignment
Jie Zhang, Xiaowei Zhao, Meina Kan, Shiguang Shan, Xiujuan Chai, Xilin Chen. Locality-constrained Framework for Face Alignment. Frontiers of Computer Science (FCS), 2019. [pdf]
Deformable Face Net: Learning Pose Invariant Feature with Pose Aware Feature Alignment for Face Recognition
Mingjie He, Jie Zhang, Shiguang Shan, Meina Kan, Xilin Chen. Deformable Face Net: Learning Pose Invariant Feature with Pose Aware Feature Alignment for Face Recognition. IEEE International Conference on Automatic Face and Gesture Recognition (FG), 2019. (Oral) [pdf] [code]
DFT-Net: Disentanglement of Face Deformation and Texture Synthesis for Expression Editing
Jinghui Wang, Jie Zhang, Zijia Lu, Shiguang Shan. DFT-Net: Disentanglement of Face Deformation and Texture Synthesis for Expression Editing. IEEE International Conference on Image Processing (ICIP), 2019. [pdf]
2018
A Three-Category Face Detector with Contextual Information on Finding Tiny Faces
Feng Jiang, Jie Zhang, Liping Yan, Yuanqing Xia, Shiguang Shan. A Three-Category Face Detector with Contextual Information on Finding Tiny Faces. IEEE International Conference on Image Processing (ICIP), 2018. [pdf]
Efficient Weighted Kernel Sharing Convolutional Neural Networks
Helong Zhou, Yie-Tarng Chen, Jie Zhang, Wen-Hsien Fang. Efficient Weighted Kernel Sharing Convolutional Neural Networks. IEEE International Conference on Visual Communications and Image Processing (VCIP), 2018. [pdf]
2017
Robust Fec-cnn: A High Accuracy Facial Landmark Detection System
Zhenliang He, Jie Zhang, Meina Kan, Shiguang Shan, Xilin Chen. Robust Fec-cnn: A High Accuracy Facial Landmark Detection System. IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2017. (2nd Winner of Face Alignment) [pdf]
KinNet: Fine-to-Coarse Deep Metric Learning for Kinship Verification
Yong Li, Jiabei Zeng, Jie Zhang, Anbo Dai, Meina Kan, Shiguang Shan, Xilin Chen. Kinnet: Fine-to-coarse deep metric learning for kinship verification. ACM Conference on Multimedia Workshops (ACM MMW) 2017. (1st Winner of Kinship Verification) [pdf]
A Fully End-to-End Cascaded CNN for Facial Landmark Detection
Zhenliang He, Meina Kan, Jie Zhang, Xilin Chen, Shiguang Shan. A Fully End-to-End Cascaded CNN for Facial Landmark Detection. IEEE International Conference on Face and Gesture Recognition (FG), 2017. [pdf]
2016
Occlusion-free Face Alignment: Deep Regression Networks Coupled with De-corrupt AutoEncoders
Jie Zhang, Meina Kan, Shiguang Shan, Xilin Chen. Occlusion-free Face Alignment: Deep Regression Networks Coupled with De-corrupt AutoEncoders. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016. [pdf]
2015
Leveraging Datasets with Varying Annotations for Face Alignment via Deep Regression Network
Jie Zhang, Meina Kan, Shiguang Shan, Xilin Chen. Leveraging Datasets with Varying Annotations for Face Alignment via Deep Regression Network. IEEE International Conference on Computer Vision (ICCV), 2015. [pdf]
AgeNet:Deeply Learned Regressor and Classifier for Robust Apparent Age Estimation
Xin Liu, Shaoxin Li, Meina Kan, Jie Zhang, Shuzhe Wu, Wenxian Liu, hu Han, Shiguang Shan, Xilin Chen. AgeNet:Deeply Learned Regressor and Classifier for Robust Apparent Age Estimation. IEEE International Conference on Computer Vision Workshops (ICCVW), 2015. (2nd Winner of Apparent Age Estimation) [pdf]
2014
Coarse-to-Fine Auto-encoder Networks (CFAN) for Real-time Face Alignment
Jie Zhang, Shiguang Shan, Meina Kan, Xilin Chen. Coarse-to-Fine Auto-encoder Networks (CFAN) for Real-time Face Alignment. European Conference on Computer Vision (ECCV), 2014 [pdf] [code]
Topic-aware Deep Auto-encoders (TDA) for Face Alignment
Jie Zhang, Meina Kan, Shiguang Shan, Xiaowe Zhao, Xilin Chen. Topic-aware Deep Auto-encoders (TDA) for Face Alignment. Asian Conference on Computer Vision (ACCV), 2014. [pdf]