Kun Yuan (袁坤)
R&D Expert in Artificial Intelligence and Computer Vision
Video Technology Group, Kuaishou Technology
Research Interests: Visual Content Generation, Video Quality Assessment, Video Enhancement and Restoration, AI Infrastructure
Email: yuankunbupt at gmail dot com
[Google Scholar]
[Linkedin]
|
|
Short Bio
I am currently a Research & Development Expert at Kuaishou Technology since 2021.
I am committed to analyzing and improving the quality of Kuaishou videos and enhancing users' experience in live and on-demand scenarios.
Before joining Kuaishou, I worked in SenseTime Research as a Computer Vision and Machine Learning Researcher from 2018 to 2021,
improving the accuracy of face recognition and classification in smart city scenarios.
I received my master degree from the National Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese Academy of Science in 2018,
and my bachelor degree from the Beijing University of Posts and Telecommunications (BUPT) in 2015.
My research interests are in visual content generation, video quality assessment, video enhancement and restoration, neural architecture design and AI infrastructure.
工作介绍
自2021年加入快手,主要深耕于人工智能与音视频领域的结合落地,专注于通过算法提升快手视频的整体画质并改善用户体验:
1. 视频质量评估:基于海量的视频数据+AI大模型训练自研了 快手视频质量评价体系KVQ,量化视频生产消费链路中诸如编码、处理、传输等过程的画质损失,提供准确的客观质量评价。
通过自研 QPT系列算法,走通了基于海量无监督数据训练质量感知模型的技术路线,结合高质垂类数据微调,在快手100+垂类场景的表现超过Golden Eye;
并与多模态大模型进行结合,通过高质量描述数据指令微调,给出白盒化归因分析和画质改善建议。落地快手点播、直播场景,指导智能编码、多码率决策下发、审核风控、推荐分发、搜索排序等场景,日均调用2亿次。
2. 视频画质增强:针对快手视频的画质问题,基于Transformer设计并实现了多种视频处理算法,包括KEP (Kuaishou Enhanced Processing)/KRP (Kuaishou Restoration Processing),显著改善了视频画质,让用户看到比作者上传源更清晰的画面,
取得了显著的带宽成本节省和App使用时长提升收益。充分拥抱AIGC,进一步自研 Diffusion-based增强算法XPSR 和
业界首个Autoregressive-based增强算法VARSR,通过生成能力的改善突破画质上限,结合billion级别的训练数据,取得了令人惊艳的增强修复效果,落地服务端点播场景取得了显著用户时长提升收益,
同时赋能电商、商业化,通过清晰度的提升促进GMV、广告消耗。
3. AI Infrastructure:大模型的训练上,结合Deepspeed、Megatron等业界主流架构,深度优化DiT结构,实现高效DP/CP/TP等多机多卡分布式训练;部署上,自研多模型单引擎部署方案、Diffusion低精度量化、一致性模型蒸馏等技术,
将diffusion模型推理降低至1步,并与NVIDIA展开深度合作,基于TensorRT-LLM、FP8量化等技术大幅提升大模型在视频处理场景下的推理效率,整体加速80+倍,为AI能力的规模化应用提供了坚实的技术基础,显著降低了机器成本、提升了服务的覆盖率。
并在GTC2025上进行技术分享:重塑短视频视觉体验:智能视频质量评价与处理大模型。
News
-
[2025-05] One paper accepted by ICML 2025.
-
[2025-03] I give a talk at Nvidia GTC 2025 about "Redefining Visual Experience of Short-form Videos: Accelerating Large Models for Intelligent Video Quality Assessment and Processing by TensorRT-LLM".
-
[2025-03] One paper accepted by CVPR 2025.
-
[2024-07] Two papers accepted by ACM MM 2024.
-
[2024-07] One paper accepted by ECCV 2024.
-
[2024-03] Two papers accepted by CVPR 2024.
-
[2023-10] Two papers accepted by ACM MM 2023.
-
[2023-03] One paper accepted by CVPR 2023.
-
[2022-03] One paper accepted by CVPR 2022.
-
[2021-02] One paper accepted by ICLR 2021.
-
[2021-02] Two papers accepted by ICCV 2021.
-
[2020-08] One paper accepted by ECCV 2020.
-
[2018-07] One paper accepted by IJCAI 2018.
Publications
(* denotes equal contribution, # denotes corresponding author)
2025
Visual Autoregressive Modeling for Image Super-Resolution
Yunpeng Qu,
Kun Yuan#, Jinhua Hao, Kai Zhao, Qizhi Xie, Ming Sun, Chao Zhou
International Conference on Machine Learning (ICML), 2025.
[
Paper][
Project Page]
KVQ: Boosting Video Quality Assessment via Saliency-guided Local Perception
Yunpeng Qu,
Kun Yuan#, Qizhi Xie, Ming Sun, Chao Zhou, Jian Wang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025.
[
Paper][
Project Page]
2024
QPT V2: Masked Image Modeling Advances Visual Scoring
Qizhi Xie,
Kun Yuan#, Yunpeng Qu, Mingda Wu, Ming Sun, Chao Zhou, Jihong Zhu
ACM International Conference on Multimedia (ACM MM), 2024.
[
Paper][
Project Page]
QNCD: Quantization Noise Correction for Diffusion Models
Huanpeng Chu, Wei Wu, Chengjie Zang,
Kun Yuan
ACM International Conference on Multimedia (ACM MM), 2024.
[
Paper][
Project Page]
XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution
Yunpeng Qu*,
Kun Yuan*, Kai Zhao, Qizhi Xie, Jinhua Hao, Ming Sun, Chao Zhou
European Conference on Computer Vision (ECCV), 2024.
[
Paper][
Project Page]
KVQ: Kwai Video Quality Assessment for Short-form Videos
Yiting Lu*, Xin Li*, Yajing Pei*,
Kun Yuan#, Qizhi Xie, Yunpeng Qu, Ming Sun, Chao Zhou, Zhibo Chen#
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024.
[
Paper]
[
Supp]
[
Project Page]
PTM-VQA: Efficient Video Quality Assessment Leveraging Diverse PreTrained Models from the Wild
Kun Yuan*, Hongbo Liu*, Mading Li*, Muyi Sun, Ming Sun, Jiachao Gong, Jinhua Hao, Chao Zhou, Yansong Tang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024.
[
Paper]
2023
Capturing Co-existing Distortions in User-Generated Content for No-reference Video Quality Assessment
Kun Yuan*, Zishang Kong*, Chuanchuan Zheng, Ming Sun, Xing Wen
ACM International Conference on Multimedia (ACM MM), 2023.
[
Paper]
Ada-DQA: Adaptive Diverse Quality-aware Feature Acquisition for Video Quality Assessment
Hongbo Liu*, Mingda Wu*,
Kun Yuan*, Ming Sun, Yansong Tang, Chuanchuan Zheng, Xing Wen, Xiu Li
ACM International Conference on Multimedia (ACM MM), 2023.
[
Paper]
Quality-aware Pre-trained Models for Blind Image Quality Assessment
Kai Zhao*,
Kun Yuan*, Ming Sun, Mading Li, Xing Wen
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
[
Paper]
2022
ShowFace: Coordinated Face Inpainting with Memory-Disentangled Refinement Networks
Zhuojie Wu, Xingqun Qi, Zijian Wang, Wanting Zhou,
Kun Yuan, Muyi Sun, Zhenan Sun
British Machine Vision Conference (BMVC), 2022.
[
Paper]
Self-supervised Correlation Mining Network for Person Image Generation
Zijian Wang, Xingqun Qi,
Kun Yuan, Muyi Sun
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022.
[
Paper]
2021
Learning N:M Fine-grained Structured Sparse Neural Networks from Scratch
Aojun Zhou, Yukun Ma, Junnan Zhu, Jianbo Liu, Zhijie Zhang,
Kun Yuan, Wenxiu Sun, Hongsheng Li
International Conference on Learning Representations (ICLR), 2021.
[
Paper]
[
Project Page]
Incorporating Convolution Designs into Visual Transformers
Kun Yuan, Shaopeng Guo, Ziwei Liu, Aojun Zhou, Fengwei Yu, Wei Wu
IEEE/CVF International Conference on Computer Vision (ICCV), 2021.
[
Paper]
[
Project Page]
Differentiable Dynamic Wirings for Neural Networks
Kun Yuan, Quanquan Li, Shaopeng Guo, Dapeng Chen, Aojun Zhou, Fengwei Yu, Ziwei Liu
IEEE/CVF International Conference on Computer Vision (ICCV), 2021.
[
Paper]
Earlier
Learning Connectivity of Neural Networks from a Topological Perspective
Kun Yuan, Quanquan Li, Jing Shao, Junjie Yan
European Conference on Computer Vision (ECCV), 2020.
[
Paper]
SafeNet: Scale-normalization and Anchor-based Feature Extraction Network for Person Re-identification
Kun Yuan, Qian Zhang, Chang Huang, Shiming Xiang, Chunhong Pan
International Joint Conferences on Artificial Intelligence (IJCAI), 2018.
[
Paper]
Deep Networks for Degraded Document Image Binarization through Pyramid Reconstruction
Gaofeng Meng,
Kun Yuan, Ying Wu, Shiming Xiang, Chunhong Pan
International Conference on Document Analysis and Recognition (ICDAR), 2017.
[
Paper]
Efficient Cloud Detection in Remote Sensing Images using Edge-aware Segmentation Network and Easy-to-hard Training Strategy
Kun Yuan, Gaofeng Meng, Dongcai Cheng, Jun Bai, Shiming Xiang, Chunhong Pan
IEEE International Conference on Image Processing (ICIP), 2017.
[
Paper]
Workshops
NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement: Methods and Results
Xin Li,
Kun Yuan, Bingchen Li, Fengbin Guan, Yizhen Shao, Zihao Yu, Xijun Wang, Yiting Lu, Wei Luo, Suhang Yao, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte
IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshop (CVPRW), 2025.
[
Paper]
[
Project Page]
NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and Results
Xin Li,
Kun Yuan, Yajing Pei, Yiting Lu, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte
IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshop (CVPRW), 2024.
[
Paper]
[
Project Page]
Zoom-VQA: Patches, Frames and Clips Integration for Video Quality Assessment
Kai Zhao,
Kun Yuan, Ming Sun, Xing Wen
IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshop (CVPRW), 2023.
[
Paper]
[
Project Page]
Awards
-
快手研发线优秀项目奖:“基于Transformer的视频处理模型研究与落地”
2024
-
快手洛子峰奖:“KVQ:基于 AI 的视频质量评价”
2023
-
快手洛子峰奖:“基于主观的智能视频增强与编解码架构联合优化”
2023