Xiongkuo Min – Publication

Journal Papers

2024

  • Perceptual Video Quality Assessment: A Survey
    Xiongkuo Min, Huiyu Duan, Wei Sun, Yucheng Zhu, and Guangtao Zhai
    SCIENCE CHINA Information Sciences, vol. 67, no. 11, pp. 211301:1–211301:57, 2024.

  • Study of Subjective and Objective Naturalness Assessment of AI-Generated Images
    Zijian Chen, Wei Sun, Haoning Wu, Zicheng Zhang, Jun Jia, Ru Huang, Xiongkuo Min, Guangtao Zhai, and Wenjun Zhang
    IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2024.
    [Database & Code]

  • AGIQA-3K: An Open Database for AI-Generated Image Quality Assessment
    Chunyi Li, Zicheng Zhang, Haoning Wu, Wei Sun, Xiongkuo Min, Xiaohong Liu, Guangtao Zhai, and Weisi Lin
    IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol. 34, no. 8, pp. 6833-6846, 2024.
    [Database]

  • Explain Vision Focus: Blending Human Saliency Into Synthetic Face Images
    Kaiwei Zhang, Dandan Zhu, Xiongkuo Min, Huiyu Duan, and Guangtao Zhai
    IEEE Transactions on Multimedia (TMM), 2024.

  • Unified Audio-visual Saliency Model for Omnidirectional Videos with Spatial Audio
    Dandan Zhu, Kaiwei Zhang, Nana Zhang, Qiangqiang Zhou, Xiongkuo Min, Guangtao Zhai, and Xiaokang Yang
    IEEE Transactions on Multimedia (TMM), vol. 26, pp. 764-775, 2024.

  • No-Reference Image Quality Assessment: Obtain MOS from Image Quality Score Distribution
    Yixuan Gao, Xiongkuo Min, Yuqin Cao, Xiaohong Liu, and Guangtao Zhai
    IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2024.

  • Blind Image Quality Assessment: A Fuzzy Neural Network for Opinion Score Distribution Prediction
    Yixuan Gao, Xiongkuo Min, Yucheng Zhu, Xiao-Ping Zhang, and Guangtao Zhai
    IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol. 34, no. 3, pp. 1641-1655, 2024.
    [Code]

  • Synergetic Assessment of Quality and Aesthetic: Approach and Comprehensive Benchmark Dataset
    Kaiwei Zhang, Dandan Zhu, Xiongkuo Min, Zhongpai Gao, and Guangtao Zhai
    IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol. 34, no. 4, pp. 2536-2549, 2024.

  • Un-Gaze: A Unified Transformer for Joint Gaze-Location and Gaze-Object Detection
    Danyang Tu, Wei Shen, Wei Sun, Xiongkuo Min, Guangtao Zhai, and Chang Wen Chen
    IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol. 34, no. 5, pp. 3271-3285, 2024.

  • Time-Smooth Wireless Transmission of Probabilistic Slicing VR 360 Video in MISO-OFDM Systems
    Long Teng, Guangtao Zhai, Yongpeng Wu, Xiongkuo Min, Biqian Feng, Yucheng Zhu, and Wenjun Zhang
    IEEE Transactions on Communications (TCOM), 2024.

  • Hidden Barcode in Sub-Images with Invisible Locating Marker
    Jun Jia, Zhongpai Gao, Yiwei Yang, Wei Sun, Dandan Zhu, Xiaohong Liu, Xiongkuo Min, and Guangtao Zhai
    ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), vol. 20, no. 10, pp. 302:1-302:24, 2024.

  • Pixel-Learnable 3DLUT With Saturation-Aware Compensation for Image Enhancement
    Jing Liu, Qingying Li, Xiongkuo Min, Yuting Su, Guangtao Zhai, and Xiaokang Yang
    IEEE Transactions on Multimedia (TMM), vol. 26, pp. 11219-11231, 2024.

  • Continuous and Overall Quality of Experience Evaluation for Streaming Video Based on Rich Features Exploration and Dual-Stage Attention
    Ziheng Jia, Xiongkuo Min, Wei Sun, and Guangtao Zhai
    IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol. 34, no. 11, pp. 11709-11723, 2024.

  • Quality-guided Skin Tone Enhancement for Portrait Photography
    Shiqi Gao, Huiyu Duan, Xinyue Li, Kang Fu, Yicong Peng, Qihang Xu, Yuanyuan Chang, Jia Wang, Xiongkuo Min, and Guangtao Zhai
    IEEE Transactions on Multimedia (TMM), 2024.

  • Analysis of Video Quality Datasets via Design of Minimalistic Video Quality Models
    Wei Sun, Wen Wen, Xiongkuo Min, Long Lan, Guangtao Zhai, and Kede Ma
    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol. 46, no. 11, pp. 7056-7071, 2024.
    [Code]

  • How is Visual Attention Influenced by Text Guidance? Database and Model
    Yinan Sun, Xiongkuo Min, Huiyu Duan, and Guangtao Zhai
    IEEE Transactions on Image Processing (TIP), vol. 33, pp. 5392-5407, 2024.
    [Database & Code]

  • GMS-3DQA: Projection-Based Grid Mini-patch Sampling for 3D Model Quality Assessment
    Zicheng Zhang, Wei Sun, Haoning Wu, Yingjie Zhou, Chunyi Li, Zijian Chen, Xiongkuo Min, Guangtao Zhai, and Weisi Lin
    ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), vol. 20, no. 6, pp. 178:1-178:19, 2024.
    [Code]

  • BAND-2k: Banding Artifact Noticeable Database for Banding Detection and Quality Assessment
    Zijian Chen, Wei Sun, Jun Jia, Fangfang Lu, Zicheng Zhang, Jing Liu, Ru Huang, Xiongkuo Min, and Guangtao Zhai
    IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol. 34, no. 7, pp. 6347-6362, 2024.
    [Database & Code]

  • MTCAM: A Novel Weakly-Supervised Audio-Visual Saliency Prediction Model With Multi-Modal Transformer
    Dandan Zhu, Kun Zhu, Weiping Ding, Nana Zhang, Xiongkuo Min, Guangtao Zhai, and Xiaokang Yang
    IEEE Transactions on Emerging Topics in Computational Intelligence (TETCI), vol. 8, no. 2, pp. 1756-1771, 2024.

  • Quality-of-Experience Evaluation for Digital Twins in 6G Network Environments
    Zicheng Zhang, Yingjie Zhou, Long Teng, Wei Sun, Chunyi Li, Xiongkuo Min, Xiao-Ping Zhang, and Guangtao Zhai
    IEEE Transactions on Broadcasting (TBC), vol. 70, no. 3, pp. 995-1007, 2024.

2023

  • Attention-Guided Neural Networks for Full-Reference and No-Reference Audio-Visual Quality Assessment
    Yuqin Cao, Xiongkuo Min, Wei Sun, and Guangtao Zhai
    IEEE Transactions on Image Processing (TIP), vol. 32, pp. 1882-1896, 2023.
    [Code]

  • Subjective and Objective Audio-Visual Quality Assessment for User Generated Content
    Yuqin Cao, Xiongkuo Min, Wei Sun, and Guangtao Zhai
    IEEE Transactions on Image Processing (TIP), vol. 32, pp. 3847-3861, 2023.
    [Database & Code]

  • Blind Image Quality Assessment for Pathological Microscopic Image under Screen and Immersion Scenarios
    Yifei Guo, Menghan Hu, Xiongkuo Min, Yan Wang, Min Dai, Guangtao Zhai, Xiao-Ping Zhang, and Xiaokang Yang
    IEEE Transactions on Medical Imaging (TMI), vol. 42, no. 11, pp. 3295-3306, 2023.
    [Database & Code]

  • Blind Quality Assessment for in-the-Wild Images via Hierarchical Feature Fusion and Iterative Mixed Database Training
    Wei Sun, Xiongkuo Min, Danyang Tu, Siwei Ma, and Guangtao Zhai
    IEEE Journal of Selected Topics in Signal Processing (JSTSP), vol. 17, no. 6, pp. 1178-1192, 2023.
    [Code]

  • Attentive Deep Image Quality Assessment for Omnidirectional Stitching
    Huiyu Duan, Xiongkuo Min, Wei Sun, Yucheng Zhu, Xiao-Ping Zhang, and Guangtao Zhai
    IEEE Journal of Selected Topics in Signal Processing (JSTSP), vol. 17, no. 6, pp. 1150-1164, 2023.
    [Database(TeraBox)] [Database(BaiduCloud)]

  • Develop then Rival: A Human Vision-Inspired Framework for Superimposed Image Decomposition
    Huiyu Duan, Wei Shen, Xiongkuo Min, Yuan Tian, Jae-Hyun Jung, Xiaokang Yang, and Guangtao Zhai
    IEEE Transactions on Multimedia (TMM), vol. 25, pp. 4267-4281, 2023.

  • A Novel Lightweight Audio-visual Saliency Model for Videos
    Dandan Zhu, Xuan Shao, Qiangqiang Zhou, Xiongkuo Min, Guangtao Zhai, and Xiaokang Yang
    ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), vol. 19, no. 4, pp. 147:1-147:22, 2023.

  • Toward Visual Behavior and Attention Understanding for Augmented 360 Degree Videos
    Yucheng Zhu, Xiongkuo Min, Dandan Zhu, Guangtao Zhai, Xiaokang Yang, Wenjun Zhang, Ke Gu, and Jiantao Zhou
    ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), vol. 19, no. 2s, pp. 99:1-99:24, 2023.

  • Subjective and Objective Quality Assessment for in-the-Wild Computer Graphics Images
    Zicheng Zhang, Wei Sun, Yingjie Zhou, Jun Jia, Zhichao Zhang, Jing Liu, Xiongkuo Min, and Guangtao Zhai
    ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), vol. 20, no. 4, pp. 96:1-96:22, 2023.
    [Database]

  • Image Quality Score Distribution Prediction via Alpha Stable Model
    Yixuan Gao, Xiongkuo Min, Wenhan Zhu, Xiao-Ping Zhang, and Guangtao Zhai
    IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol. 33, no. 6, pp. 2656-2671, 2023.
    [Database & Code]

  • Deep Neural Network for Blind Visual Quality Assessment of 4K Content
    Wei Lu, Wei Sun, Xiongkuo Min, Wenhan Zhu, Quan Zhou, Jun He, Qiyuan Wang, Zicheng Zhang, Tao Wang, and Guangtao Zhai
    IEEE Transactions on Broadcasting (TBC), vol. 69, no. 2, pp. 406-421, 2023.

  • Toward a No-Reference Quality Metric for Camera-Captured Images
    Runze Hu, Yutao Liu, Ke Gu, Xiongkuo Min, and Guangtao Zhai
    IEEE Transactions on Cybernetics (TCYB), vol. 53, no. 6, pp. 3651-3664, 2023.

  • Blind Image Quality Assessment Via Cross-View Consistency
    Yucheng Zhu, Yunhao Li, Wei Sun, Xiongkuo Min, Guangtao Zhai, and Xiaokang Yang
    IEEE Transactions on Multimedia (TMM), vol. 25, pp. 7364-7377, 2023.

  • RIVIE: Robust Inherent Video Information Embedding
    Jun Jia, Zhongpai Gao, Dandan Zhu, Xiongkuo Min, Menghan Hu, and Guangtao Zhai
    IEEE Transactions on Multimedia (TMM), vol. 25, pp. 7607-7620, 2023.

  • Evaluating Point Cloud from Moving Camera Videos: A No-Reference Metric
    Zicheng Zhang, Wei Sun, Yucheng Zhu, Xiongkuo Min, Wei Wu, Ying Chen, and Guangtao Zhai
    IEEE Transactions on Multimedia (TMM), 2023.
    [Code]

  • A Deep Learning Based Multi-Dimensional Aesthetic Quality Assessment Method for Mobile Game Images
    Tao Wang, Wei Sun, Wei Wu, Ying Chen, Xiongkuo Min, Wei Lu, Zicheng Zhang, and Guangtao Zhai
    IEEE Transactions on Games (ToG), vol. 15, no. 4, pp. 658-668, 2023.

  • Implicit Neural Representation Learning for Hyperspectral Image Super-Resolution
    Kaiwei Zhang, Dandan Zhu, Xiongkuo Min, and Guangtao Zhai
    IEEE Transactions on Geoscience and Remote Sensing (TGRS), vol. 61, p. 5500212, 2023.

2022

  • Screen Content Quality Assessment: Overview, Benchmark, and Beyond
    Xiongkuo Min, Ke Gu, Guangtao Zhai, Xiaokang Yang, Wenjun Zhang, Patrick Le Callet, and Chang Wen Chen
    ACM Computing Surveys (CSUR), vol. 54, no. 9, pp. 1-36, 2022.
    ESI Highly Cited Paper

  • Confusing Image Quality Assessment: Toward Better Augmented Reality Experience
    Huiyu Duan, Xiongkuo Min, Yucheng Zhu, Guangtao Zhai, Xiaokang Yang, and Patrick Le Callet
    IEEE Transactions on Image Processing (TIP), vol. 31, pp. 7206-7221, 2022.
    [Code & Database]

  • No-Reference Quality Assessment for 3D Colored Point Cloud and Mesh Models
    Zicheng Zhang, Wei Sun, Xiongkuo Min, Tao Wang, Wei Lu, and Guangtao Zhai
    IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol. 32, no. 11, pp. 7618-7631, 2022.
    [Code]

  • Viewing Behavior Supported Visual Saliency Predictor for 360 Degree Videos
    Yucheng Zhu, Guangtao Zhai, Yiwei Yang, Huiyu Duan, Xiongkuo Min, and Xiaokang Yang
    IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol. 32, no. 7, pp. 4188-4201, 2022.
    Grand Prize of IEEE ICME 2018 Salient360! Grand Challenge

  • QoE Driven VR 360° Video Massive MIMO Transmission
    Long Teng, Guangtao Zhai, Yongpeng Wu, Xiongkuo Min, Wenjun Zhang, Zhi Ding, and Chengshan Xiao
    IEEE Transactions on Wireless Communications (TWC), vol. 21, no. 1, pp. 18-33, 2022.

  • RIHOOP: Robust Invisible Hyperlinks in Offline and Online Photographs
    Jun Jia, Zhongpai Gao, Kang Chen, Menghan Hu, Xiongkuo Min, Guangtao Zhai, and Xiaokang Yang
    IEEE Transactions on Cybernetics (TCYB), vol. 52, no. 7, pp. 7094-7106, 2022.

  • SMGEA: A New Ensemble Adversarial Attack Powered by Long-Term Gradient Memories
    Zhaohui Che, Ali Borji, Guangtao Zhai, Suiyi Ling, Jing Li, Xiongkuo Min, Guodong Guo, and Patrick Le Callet
    IEEE Transactions on Neural Networks and Learning Systems (TNNLS), vol. 33, no. 3, pp. 1051-1065, 2022.
    [Code]

  • Dynamic Backlight Scaling Considering Ambient Luminance for Mobile Videos on LCD Displays
    Wei Sun, Xiongkuo Min, Guangtao Zhai, Ke Gu, Siwei Ma, and Xiaokang Yang
    IEEE Transactions on Mobile Computing (TMC), vol. 21, no. 1, pp. 110-124, 2022.
    [Database]

  • HazDesNet: An End-to-End Network for Haze Density Prediction
    Jiahe Zhang, Xiongkuo Min, Yucheng Zhu, Guangtao Zhai, Jiantao Zhou, Xiaokang Yang, and Wenjun Zhang
    IEEE Transactions on Intelligent Transportation Systems (TITS), vol. 23, no. 4, pp. 3087-3102, 2022.
    [Code & Database]

2021

  • Perceptual Quality Assessment of Low-light Image Enhancement
    Guangtao Zhai, Wei Sun, Xiongkuo Min, and Jiantao Zhou
    ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), vol. 17, no. 4, pp. 130:1-130:24, 2021.
    [Code: LIEQA] [Database: LIEQ]

  • Enhancing Decoding Rate of Barcode Decoders in Complex Scenes for IoT Systems
    Adnan Sharif, Guangtao Zhai, Xiongkuo Min, Jun Jia, and Kashif Munir
    IEEE Internet of Things Journal (IoT-J), vol. 8, no. 24, pp. 17495-17507, 2021.

  • Comparative Perceptual Assessment of Visual Signals Using Free Energy Features
    Guangtao Zhai, Yucheng Zhu, and Xiongkuo Min
    IEEE Transactions on Multimedia (TMM), vol. 23, pp. 3700-3713, 2021.

  • Quality Assessment of Free-Viewpoint Videos by Quantifying the Elastic Changes of Multi-Scale Motion Trajectories
    Suiyi Ling, Jing Li, Zhaohui Che, Xiongkuo Min, Guangtao Zhai, and Patrick Le Callet
    IEEE Transactions on Image Processing (TIP), vol. 30, pp. 517-531, 2021.

  • Video Frame Interpolation and Enhancement via Pyramid Recurrent Framework
    Wang Shen, Wenbo Bao, Guangtao Zhai, Li Chen, Xiongkuo Min, and Zhiyong Gao
    IEEE Transactions on Image Processing (TIP), vol. 30, pp. 277-292, 2021.
    [Project & Code]

  • Subjective and Objective Quality Assessment of Compressed Screen Content Videos
    Teng Li, Xiongkuo Min, Heng Zhao, Guangtao Zhai, Yiling Xu, and Wenjun Zhang
    IEEE Transactions on Broadcasting (TBC), vol. 67, no. 2, pp. 438-449, 2021.
    [Database]

  • An Accurate and Efficient 1-D Barcode Detector for Medium of Deployment in IoT Systems
    Adnan Sharif, Guangtao Zhai, Jun Jia, Xiongkuo Min, Xiangyang Zhu, and Jiahe Zhang
    IEEE Internet of Things Journal (IoT-J), vol. 8, no. 2, pp. 889-900, 2021.

2020

  • Study of Subjective and Objective Quality Assessment of Audio-Visual Signals
    Xiongkuo Min, Guangtao Zhai, Jiantao Zhou, Mylene C.Q. Farias, and Alan Conrad Bovik
    IEEE Transactions on Image Processing (TIP), vol. 29, pp. 6054–6068, 2020.
    [Code] [LIVE-SJTU A/V-QA Database]

  • A Multimodal Saliency Model for Videos With High Audio-Visual Correspondence
    Xiongkuo Min, Guangtao Zhai, Jiantao Zhou, Xiao-Ping Zhang, Xiaokang Yang, and Xinping Guan
    IEEE Transactions on Image Processing (TIP), vol. 29, pp. 3805-3819, 2020.
    [Code] [AVA Database]

  • A Metric for Light Field Reconstruction, Compression, and Display Quality Evaluation
    Xiongkuo Min, Jiantao Zhou, Guangtao Zhai, Patrick Le Callet, Xiaokang Yang, and Xinping Guan
    IEEE Transactions on Image Processing (TIP), vol. 29, pp. 3790-3804, 2020.
    [Code]

  • Learning a Deep Agent to Predict Head Movement in 360-Degree Images
    Yucheng Zhu, Guangtao Zhai, Xiongkuo Min, and Jiantao Zhou
    ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), vol. 16, no. 4, pp. 130:1-130:23, 2020.

  • The Prediction of Saliency Map for Head and Eye Movements in 360 Degree Images
    Yucheng Zhu, Guangtao Zhai, Xiongkuo Min, and Jiantao Zhou
    IEEE Transactions on Multimedia (TMM), vol. 22, no. 9, pp. 2331-2344, 2020.

  • How is Gaze Influenced by Image Transformations? Dataset and Model
    Zhaohui Che, Ali Borji, Guangtao Zhai, Xiongkuo Min, Guodong Guo, and Patrick Le Callet
    IEEE Transactions on Image Processing (TIP), vol. 29, pp. 2287-2300, 2020.
    [Code] [Database]

  • MC360IQA: The Multi-Channel CNN for Blind 360-Degree Image Quality Assessment
    Wei Sun, Xiongkuo Min, Guangtao Zhai, Ke Gu, Huiyu Duan, and Siwei Ma
    IEEE Journal of Selected Topics in Signal Processing (JSTSP), vol. 14, no. 1, pp. 64-77, 2020.
    [Code] [Database]

  • A Wavelet-Predominant Algorithm Can Evaluate Quality of THz Security Image and Identify Its Usability
    Menghan Hu, Guangtao Zhai, Rong Xie, Xiongkuo Min, Qingli Li, and Xiaokang Yang
    IEEE Transactions on Broadcasting (TBC), vol. 66, no. 1, pp. 140-152, 2020.
    [Database]

  • Perceptual Image Quality Assessment: A Survey
    Guangtao Zhai and Xiongkuo Min
    SCIENCE CHINA Information Sciences, vol. 63, no. 11, pp. 211301, 2020.
    Hot Paper Award, ESI Highly Cited Paper

  • Tiny-BDN: An Efficient and Compact Barcode Detection Network
    Jun Jia, Guangtao Zhai, Ping Ren, Jiahe Zhang, Zhongpai Gao, Xiongkuo Min, and Xiaokang Yang
    IEEE Journal of Selected Topics in Signal Processing (JSTSP), vol. 14, no. 4, pp. 688-699, 2020.

  • DevsNet: Deep Video Saliency Network using Short-term and Long-term Cues
    Yuming Fang, Chi Zhang, Xiongkuo Min, Hanqin Huang, Yuegen Yi, Guangtao Zhai, and Chia-Wen Lin
    Pattern Recognition (PR), vol. 103, pp. 107294, 2020.

2019

  • Quality Evaluation of Image Dehazing Methods Using Synthetic Hazy Images
    Xiongkuo Min, Guangtao Zhai, Ke Gu, Yucheng Zhu, Jiantao Zhou, Guodong Guo, Xiaokang Yang, Xinping Guan, and Wenjun Zhang
    IEEE Transactions on Multimedia (TMM), vol. 21, no. 9, pp. 2319-2333, 2019.
    [Code: DEHAZEfr] [Database: SHRQ]

  • Objective Quality Evaluation of Dehazed Images
    Xiongkuo Min, Guangtao Zhai, Ke Gu, Xiaokang Yang, and Xinping Guan
    IEEE Transactions on Intelligent Transportation Systems (TITS), vol. 20, no. 9, pp. 2879-2892, 2019.
    [Code: DHQI] [Database: DHQ] [Database: rDHAZY] [Database: rFRIDA]

  • Multi-Channel Decomposition in Tandem With Free-Energy Principle for Reduced-Reference Image Quality Assessment
    Wenhan Zhu, Guangtao Zhai, Xiongkuo Min, Menghan Hu, Jing Liu, Guodong Guo, and Xiaokang Yang
    IEEE Transactions on Multimedia (TMM), vol. 21, no. 9, pp. 2334-2346, 2019.

  • EMBDN: An Efficient Multiclass Barcode Detection Network for Complicated Environments
    Jun Jia, Guangtao Zhai, Jiahe Zhang, Zhongpai Gao, Zehao Zhu, Xiongkuo Min, Xiaokang Yang, and Guodong Guo
    IEEE Internet of Things Journal (IoT-J), vol. 6, no. 6, pp. 9919-9933, 2019.

  • Visual Attention Analysis and Prediction on Human Faces for Children with Autism Spectrum Disorder
    Huiyu Duan, Xiongkuo Min, Yi Fang, Lei Fan, Xiaokang Yang, and Guangtao Zhai
    ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), vol. 15, no. 3s, pp. 90:1-90:23, 2019.

2018

  • Blind Quality Assessment Based on Pseudo Reference Image
    Xiongkuo Min, Ke Gu, Guangtao Zhai, Jing Liu, Xiaokang Yang, and Chang Wen Chen
    IEEE Transactions on Multimedia (TMM), vol. 20, no. 8, pp. 2049-2062, 2018.
    [Code: BPRI]
    Best Paper Runner-up Award, ESI Highly Cited Paper

  • Blind Image Quality Estimation via Distortion Aggravation
    Xiongkuo Min, Guangtao Zhai, Ke Gu, Yutao Liu, and Xiaokang Yang
    IEEE Transactions on Broadcasting (TBC), vol. 64, no. 2, pp. 508-517, 2018.
    [Code: BMPRI]

  • Saliency-Induced Reduced-Reference Quality Index for Natural Scene and Screen Content Images
    Xiongkuo Min, Ke Gu, Guangtao Zhai, Menghan Hu, and Xiaokang Yang
    Signal Processing (SP), vol. 145, pp. 127-136, 2018.
    [Code]

  • Evaluating Quality of Screen Content Images Via Structural Variation Analysis
    Ke Gu, Junfei Qiao, Xiongkuo Min, Guanghui Yue, Weisi Lin, and Daniel Thalmann
    IEEE Transactions on Visualization and Computer Graphics (TVCG), vol. 24, no. 10, pp. 2689-2701, 2018.
    [Code]

  • Partial-Reference Sonar Image Quality Assessment for Underwater Transmission
    Weilin Chen, Ke Gu, Xiongkuo Min, Fei Yuan, En Chen, and Wenjun Zhang
    IEEE Transactions on Aerospace and Electronic Systems (TAES), vol. 54, no. 6, pp. 2776-2787, 2018.

  • The Prediction of Head and Eye Movement for 360 Degree Images
    Yucheng Zhu, Guangtao Zhai, and Xiongkuo Min
    Signal Processing: Image Communication (SPIC), vol. 69, pp. 15-25, 2018.
    Special Award of IEEE ICME 2017 Salient360! Grand Challenge

2017

  • Unified Blind Quality Assessment of Compressed Natural, Graphic, and Screen Content Images
    Xiongkuo Min, Kede Ma, Ke Gu, Guangtao Zhai, Zhou Wang, and Weisi Lin
    IEEE Transactions on Image Processing (TIP), vol. 26, no. 11, pp. 5462-5474, 2017.
    [Project] [Code: UCA] [Database: CCT]
    ESI Hot Paper, ESI Highly Cited Paper

  • Fixation Prediction through Multimodal Analysis
    Xiongkuo Min, Guangtao Zhai, Ke Gu, and Xiaokang Yang
    ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), vol. 13, no. 1, pp. 6:1-6:23, 2017.
    [Code] [Database]

  • A Fast Reliable Image Quality Predictor by Fusing Micro- and Macro-Structures
    Ke Gu, Leida Li, Hong Lu, Xiongkuo Min, and Weisi Lin
    IEEE Transactions on Industrial Electronics (TIE), vol. 64, no. 5, pp. 3903-3912, 2017.
    [Code]
    ESI Highly Cited Paper

  • Visual Attention Analysis and Prediction on Human Faces
    Xiongkuo Min, Guangtao Zhai, Ke Gu, Jing Liu, Shiqi Wang, Xinfeng Zhang, and Xiaokang Yang
    Information Sciences (IS), vol. 420, pp. 417-430, 2017.

Conference Papers

2025

  • Textured Mesh Saliency: Bridging Geometry and Texture for Human Perception in 3D Graphics
    Kaiwei Zhang, Dandan Zhu, Xiongkuo Min, and Guangtao Zhai
    AAAI Conference on Artificial Intelligence (AAAI), 2025.

2024

  • Subjective and Objective Quality-of-Experience Assessment for 3D Talking Heads
    Yingjie Zhou, Zicheng Zhang, Wei Sun, Xiaohong Liu, Xiongkuo Min, and Guangtao Zhai
    ACM International Conference on Multimedia (ACM MM), pp. 6033-6042, 2024.
    [Database & Code]

  • LMM-PCQA: Assisting Point Cloud Quality Assessment with LMM
    Zicheng Zhang, Haoning Wu, Yingjie Zhou, Chunyi Li, Wei Sun, Chaofeng Chen, Xiongkuo Min, Xiaohong Liu, Weisi Lin, and Guangtao Zhai
    ACM International Conference on Multimedia (ACM MM), pp. 7783-7792, 2024.
    [Code]
    Best Paper Nomination

  • Large Multi-modality Model Assisted AI-Generated Image Quality Assessment
    Puyi Wang, Wei Sun, Zicheng Zhang, Jun Jia, Yanwei Jiang, Zhichao Zhang, Xiongkuo Min, and Guangtao Zhai
    ACM International Conference on Multimedia (ACM MM), pp. 7803-7812, 2024.
    [Code]

  • Subjective-Aligned Dataset and Metric for Text-to-Video Quality Assessment
    Tengchuan Kou, Xiaohong Liu, Zicheng Zhang, Chunyi Li, Haoning Wu, Xiongkuo Min, Guangtao Zhai, and Ning Liu
    ACM International Conference on Multimedia (ACM MM), pp. 7793-7802, 2024.
    [Database & Code]

  • GAIA: Rethinking Action Quality Assessment for AI-Generated Videos
    Zijian Chen, Wei Sun, Yuan Tian, Jun Jia, Zicheng Zhang, Jiarui Wang, Ru Huang, Xiongkuo Min, Guangtao Zhai, and Wenjun Zhang
    The Thirty-Eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024.
    [Database]

  • UniProcessor: A Text-Induced Unified Low-Level Image Processor
    Huiyu Duan, Xiongkuo Min, Sijing Wu, Wei Shen, and Guangtao Zhai
    European Conference on Computer Vision (ECCV), pp. 180-199, 2024.

  • GLARE: Low Light Image Enhancement via Generative Latent Feature Based Codebook Retrieval
    Han Zhou, Wei Dong, Xiaohong Liu, Shuaicheng Liu, Xiongkuo Min, Guangtao Zhai, and Jun Chen
    European Conference on Computer Vision (ECCV), pp. 36-54, 2024.
    [Code]

  • Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels
    Haoning Wu, Zicheng Zhang, Weixia Zhang, Chaofeng Chen, Liang Liao, Chunyi Li, Yixuan Gao, Annan Wang, Erli Zhang, Wenxiu Sun, Qiong Yan, Xiongkuo Min, Guangtao Zhai, and Weisi Lin
    International Conference on Machine Learning (ICML), 2024.
    [Project]

  • NTIRE 2024 Quality Assessment of AI-Generated Content Challenge
    Xiaohong Liu, Xiongkuo Min, Guangtao Zhai, Chunyi Li, Tengchuan Kou, Wei Sun, Haoning Wu, Yixuan Gao, Yuqin Cao, Zicheng Zhang, Xiele Wu, Radu Timofte, et al.
    IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2024, pp. 6337-6362.
    [Challenge - Track 1] [Challenge - Track 2]

  • Enhancing Blind Video Quality Assessment with Rich Quality-aware Features
    Wei Sun, Haoning Wu, Zicheng Zhang, Jun Jia, Zhichao Zhang, Linhan Cao, Qiubo Chen, Xiongkuo Min, Weisi Lin, and Guangtao Zhai
    arXiv preprint arXiv:2405.08745, 2024.
    [Code]
    Winner Award of the IEEE/CVF CVPR NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment

  • Assessing UHD Image Quality from Aesthetics, Distortions, and Saliency
    Wei Sun, Weixia Zhang, Yuqin Cao, Linhan Cao, Jun Jia, Zijian Chen, Zicheng Zhang, Xiongkuo Min, and Guangtao Zhai
    arXiv preprint arXiv:2409.00749, 2024.
    [Code]
    The 1st Place Award of the ECCV AIM 2024 Challenge on UHD Blind Photo Quality Assessment

  • DSA-QoE: Quality of Experience Evaluation for Streaming Video Based on Dual-Stage Attention
    Ziheng Jia, Xiongkuo Min, and Guangtao Zhai
    IEEE International Symposium on Circuits and Systems (ISCAS), pp. 1-5, 2024.
    IEEE VSPC-TC (The Visual Signal Processing and Communications Technical Committee) Best Paper Award

2023

  • MM-PCQA: Multi-Modal Learning for No-reference Point Cloud Quality Assessment
    Zicheng Zhang, Wei Sun, Xiongkuo Min, Quan Zhou, Jun He, Qiyuan Wang, and Guangtao Zhai
    International Joint Conference on Artificial Intelligence (IJCAI), pp. 1759-1767, 2023.
    [Code]

  • StableVQA: A Deep No-Reference Quality Assessment Model for Video Stability
    Tengchuan Kou, Xiaohong Liu, Wei Sun, Jun Jia, Xiongkuo Min, Guangtao Zhai, and Ning Liu
    ACM International Conference on Multimedia (ACM MM), 2023, pp. 1066-1076.
    [Code]

  • MD-VQA: Multi-Dimensional Quality Assessment for UGC Live Videos
    Zicheng Zhang, Wei Wu, Wei Sun, Danyang Tu, Wei Lu, Xiongkuo Min, Ying Chen, and Guangtao Zhai
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, pp. 1746-1755.
    [Database]

  • VDPVE: VQA Dataset for Perceptual Video Enhancement
    Yixuan Gao, Yuqin Cao, Tengchuan Kou, Wei Sun, Yunlong Dong, Xiaohong Liu, Xiongkuo Min, and Guangtao Zhai
    IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2023, pp. 1474-1483.
    [Challenge] [Database]

  • NTIRE 2023 Quality Assessment of Video Enhancement Challenge
    Xiaohong Liu, Radu Timofte, Yunlong Dong, Zhiliang Ma, Haotian Fan, Chunzheng Zhu, Xiongkuo Min, Guangtao Zhai, Ziheng Jia, Mirko Agarla, Shiqi Zhou, Wei Sun, Yixuan Gao, Yulun Zhang, Yuqin Cao, Hongye Liu, Wenqi Wang, Kai Zhang, Tengchuan Kou, Hang Shi, Ironhead Chuang, Haoning Wu, Tengfei Shi, Yilin Li, Yu Lai, Kai Zhao, Heng Cong, Zhiwei Huang, Shiling Zhao, Hanene Brachemi Meftah, and Azadeh Mansouri
    IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2023, pp. 1551-1569.
    [Challenge]

  • The Influence of Text-guidance on Visual Attention
    Yinan Sun, Xiongkuo Min, Huiyu Duan, and Guangtao Zhai
    IEEE International Symposium on Circuits and Systems (ISCAS), 2023, pp. 1-5.
    IEEE MSA-TC Best Paper Award - Honorable Mention

  • AIGCIQA2023: A Large-scale Image Quality Assessment Database for AI Generated Images: From the Perspectives of Quality, Authenticity and Correspondence
    Jiarui Wang, Huiyu Duan, Jing Liu, Shi Chen, Xiongkuo Min, and Guangtao Zhai
    CAAI International Conference on Artificial Intelligence (CICAI), 2023, pp. 46–57.
    [Database]

  • Simple Baselines for Projection-based Full-reference and No-reference Point Cloud Quality Assessment
    Zicheng Zhang, Yingjie Zhou, Wei Sun, Xiongkuo Min, and Guangtao Zhai
    arXiv preprint arXiv:2310.17147, 2023.
    Winner Prize of the IEEE ICIP Point Cloud Visual Quality Assessment Grand Challenge

2022

  • Perceptual Attacks of No-Reference Image Quality Models with Human-in-the-Loop
    Weixia Zhang, Dingquan Li, Xiongkuo Min, Guangtao Zhai, Guodong Guo, Xiaokang Yang, and Kede Ma
    Advances in Neural Information Processing Systems (NeurIPS), 2022, vol. 35, pp. 2916-2929.
    [Code]

  • Video-based Human-Object Interaction Detection from Tubelet Tokens
    Danyang Tu, Wei Sun, Xiongkuo Min, Guangtao Zhai, and Wei Shen
    Advances in Neural Information Processing Systems (NeurIPS), 2022, vol. 35, pp. 23345-23357.

  • End-to-End Human-Gaze-Target Detection with Transformers
    Danyang Tu, Xiongkuo Min, Huiyu Duan, Guodong Guo, Guangtao Zhai, and Wei Shen
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, pp. 2192-2200.

  • Learning Invisible Markers for Hidden Codes in Offline-to-online Photography
    Jun Jia, Zhongpai Gao, Dandan Zhu, Xiongkuo Min, Guodong Guo, and Guangtao Zhai
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, pp. 2273-2282.

  • Iwin: Human-Object Interaction Detection via Transformer with Irregular Windows
    Danyang Tu, Xiongkuo Min, Huiyu Duan, Guodong Guo, Guangtao Zhai, and Wei Shen
    European Conference on Computer Vision (ECCV), 2022, pp. 87-103.

  • Image Quality Assessment: From Mean Opinion Score to Opinion Score Distribution
    Yixuan Gao, Xiongkuo Min, Yucheng Zhu, Jing Li, Xiao-Ping Zhang, and Guangtao Zhai
    ACM International Conference on Multimedia (ACM MM), 2022, pp. 997–1005.
    [Code]

  • A Deep Learning based No-reference Quality Assessment Model for UGC Videos
    Wei Sun, Xiongkuo Min, Wei Lu, and Guangtao Zhai
    ACM International Conference on Multimedia (ACM MM), 2022, pp. 856-865.
    [Code]

  • Saliency in Augmented Reality
    Huiyu Duan, Wei Shen, Xiongkuo Min, Danyang Tu, Jing Li, and Guangtao Zhai
    ACM International Conference on Multimedia (ACM MM), 2022, pp. 6549-6558.
    [Code & Database]

  • SMESwin Unet: Merging CNN and Transformer for Medical Image Segmentation
    Ziheng Wang, Xiongkuo Min, Fangyu Shi, Ruinian Jin, Saida S Nawrin, Ichen Yu, and Ryoichi Nagatomi
    International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2022, pp. 517–526.

  • Augmented Reality Image Quality Assessment Based on Visual Confusion Theory
    Huiyu Duan, Lantu Guo, Wei Sun, Xiongkuo Min, Li Chen, and Guangtao Zhai
    IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB), 2022, pp. 1-6.
    [Code & Database]
    Best Paper Award

  • Surveillance Video Quality Assessment Based on Quality Related Retraining
    Zicheng Zhang, Wei Lu, Wei Sun, Xiongkuo Min, Tao Wang, and Guangtao Zhai
    IEEE International Conference on Image Processing (ICIP), 2022, pp. 4278-4282.
    The First Prize of the IEEE ICIP Grand Challenge: Video Distortion Detection and Classification in the Context of Video Surveillance

2021

  • Self-Conditioned Probabilistic Learning of Video Rescaling
    Yuan Tian, Guo Lu, Xiongkuo Min, Zhaohui Che, Guangtao Zhai, Guodong Guo, and Zhiyong Gao
    IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 4490-4499.
    [Code]

  • Deep Learning Based Full-Reference and No-Reference Quality Assessment Models for Compressed UGC Videos
    Wei Sun, Tao Wang, Xiongkuo Min, Fuwang Yi, and Guangtao Zhai
    IEEE International Conference on Multimedia and Expo Workshops (ICMEW), 2021, pp. 1-6.
    [Code]
    First Place Award of IEEE ICME 2021 Grand Challenge on Quality Assessment of Compressed UGC Videos

  • A No-Reference Evaluation Metric for Low-Light Image Enhancement
    Zicheng Zhang, Wei Sun, Xiongkuo Min, Wenhan Zhu, Tao Wang, Wei Lu, and Guangtao Zhai
    IEEE International Conference on Multimedia and Expo (ICME), 2021, pp. 1-6.
    [Code]

2020 & Before

  • Blurry Video Frame Interpolation
    Wang Shen, Wenbo Bao, Guangtao Zhai, Li Chen, Xiongkuo Min, and Zhiyong Gao
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020, pp. 5114-5123.
    [Project & Code]

  • Fine Detection and Classification of Multi-class Barcode in Complex Environments
    Jiahe Zhang, Jun Jia, Zehao Zhu, Xiongkuo Min, Guangtao Zhai, and Xiao-Ping Zhang
    IEEE International Workshop on Mobile Multimedia Computing (in conjunction with IEEE ICME), 2019, pp. 1-6.
    Best Paper Award

  • A Dataset of Eye Movements for the Children with Autism Spectrum Disorder
    Huiyu Duan, Guangtao Zhai, Xiongkuo Min, Zhaohui Che, Yi Fang, Xiaokang Yang, Jesus Gutierrez, and Patrick Le Callet
    ACM Multimedia Systems Conference (ACM MMSys), 2019, pp. 255–260.
    [Database]

  • Perceptual Quality Assessment of Omnidirectional Images
    Huiyu Duan, Guangtao Zhai, Xiongkuo Min, Yucheng Zhu, Yi Fang, and Xiaokang Yang
    IEEE International Symposium on Circuits and Systems (ISCAS), 2018, pp. 1-5.
    [Database]

  • Blind Quality Assessment of Compressed Images via Pseudo Structural Similarity
    Xiongkuo Min, Guangtao Zhai, Ke Gu, Yuming Fang, Xiaokang Yang, Xiaolin Wu, Jiantao Zhou, and Xianming Liu
    IEEE International Conference on Multimedia and Expo (ICME), 2016, pp. 1-6.
    [Code]
    Best Student Paper Award