Video summary generation based on multi-feature image and visual saliency



Citation:	JIN Haiyan, CAO Tian, XIAO Cong, XIAO Zhaolin. Video summary generation based on multi-feature image and visual saliency[J]. Journal of Beijing University of Aeronautics and Astronautics, 2021, 47(3): 441-450.

Authors:	JIN Haiyan, CAO Tian, XIAO Cong, XIAO Zhaolin

Institution:	1. College of Computer Science and Engineering, Xi'an University of Technology, Xi'an 710048, China; 2. Shaanxi Key Laboratory for Network Computing and Security Technology, Xi'an 710048, China

Funding:	Shaanxi Province Technology Innovation Guidance Program; Natural Science Basic Research Program of Shaanxi Province

Abstract:	Extracting video content efficiently, that is, video summarization, is a long-standing research topic in computer vision. Detecting simple image features such as color and texture is no longer enough to obtain effective and complete video summaries. Building on the visual attention pyramid model, this paper proposes an improved center-surround video summarization method with variable ratios and dual-contrast computation. First, each image in the video sequence is partitioned into pixel blocks with a superpixel method to speed up computation. Then, the differences in image contrast features against different color backgrounds are detected and fused. Finally, optical flow motion information is incorporated, and the static and dynamic saliency results are merged to extract the key frames of the video; during key frame extraction, a perceptual hash function is used to judge similarity, which completes the video summary. Simulation experiments on the Segtrack V2, ViSal and OVP datasets show that the proposed method effectively extracts regions of interest and yields a video summary represented as a sequence of key frame images.

Keywords:	video summarization; visual attention pyramid; video saliency; key frame extraction; similarity judgment

Received:	2020-08-31
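
The last step described in the abstract, judging similarity with a perceptual hash while extracting key frames, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state which hash variant or threshold the paper uses, so the sketch assumes the simple average-hash variant of perceptual hashing, an 8x8 grid, and a Hamming-distance threshold of 5. The names `average_hash`, `hamming_distance`, `select_key_frames` and `max_distance` are hypothetical, and the candidate frames are assumed to come from the saliency stage as grayscale grids.

```python
from typing import List, Sequence

# A grayscale frame as rows of pixel intensities (assumed input format).
Frame = Sequence[Sequence[float]]


def average_hash(frame: Frame, size: int = 8) -> int:
    """Return a size*size-bit average hash of a grayscale frame.

    Assumption: average hash stands in for the unspecified perceptual hash.
    """
    height, width = len(frame), len(frame[0])
    if height < size or width < size:
        raise ValueError("frame must be at least size x size pixels")
    # Downsample by averaging the pixels that fall into each grid cell.
    cells = []
    for i in range(size):
        r0, r1 = i * height // size, (i + 1) * height // size
        for j in range(size):
            c0, c1 = j * width // size, (j + 1) * width // size
            block = [frame[r][c] for r in range(r0, r1) for c in range(c0, c1)]
            cells.append(sum(block) / len(block))
    mean = sum(cells) / len(cells)
    # One bit per cell: 1 if the cell is brighter than the overall mean.
    bits = 0
    for value in cells:
        bits = (bits << 1) | (1 if value > mean else 0)
    return bits


def hamming_distance(a: int, b: int) -> int:
    """Count the bit positions in which two hashes differ."""
    return bin(a ^ b).count("1")


def select_key_frames(frames: List[Frame], max_distance: int = 5) -> List[int]:
    """Return indices of frames whose hash differs from every frame kept so far.

    Assumption: max_distance = 5 is an illustrative threshold, not a value
    taken from the paper.
    """
    kept: List[int] = []
    kept_hashes: List[int] = []
    for index, frame in enumerate(frames):
        frame_hash = average_hash(frame)
        if all(hamming_distance(frame_hash, h) > max_distance for h in kept_hashes):
            kept.append(index)
            kept_hashes.append(frame_hash)
    return kept
```

Calling `select_key_frames` on a list of candidate frames returns the indices of the frames to keep; near-duplicate frames hash to nearly the same bit pattern and are dropped. A DCT-based hash could be substituted for `average_hash` without changing the selection loop.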
This article is indexed in CNKI, Wanfang Data and other databases.
