首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于CNN的多尺寸航拍图像定位方法的研究与实现
引用本文:潘海侠,徐嘉璐,李锦涛,王赟豪,王华锋.基于CNN的多尺寸航拍图像定位方法的研究与实现[J].北京航空航天大学学报,2019,45(11):2170-2176.
作者姓名:潘海侠  徐嘉璐  李锦涛  王赟豪  王华锋
作者单位:北京航空航天大学软件学院,北京,100083;北京航空航天大学软件学院,北京100083;北方工业大学信息学院,北京100144
摘    要:图像定位常用于无人机视觉导航,传统的无人机视觉导航广泛采用景象匹配导航方式,随着计算机技术的不断发展,深度学习技术为视觉导航的实现提供了新途径。以无人机的垂直侦查为背景,将飞行区域的航拍图像划分成大小相同的若干网格,每个网格代表一类区域,用网格图像制作数据集训练卷积神经网络(CNN)。基于AlexNet设计了一种融合显著性特征的全卷积网络模型,有效实现了一个基于CNN的多尺寸输入的滑动窗口分类器,并提出了一种邻域显著性参照定位策略来筛选分类结果,从而实现多尺寸航拍图像的定位。 

关 键 词:多尺寸航拍图像定位  全卷积网络  滑动窗口  显著性  特征融合
收稿时间:2019-02-13

Research and implementation of multi-size aerial image positioning method based on CNN
Institution:1.School of Software, Beihang University, Beijing 100083, China2.School of Electrical&Information Engineering, North China University of Technology, Beijing 100144, China
Abstract:Image positioning is the key of UAV visual navigation. Scene matching navigation is widely used in traditional UAV visual navigation. With the continuous development of computer technology, deep learning technology provides a new way for the realization of visual navigation. In this context, this research mainly focuses on image localization based on convolution neural network. In this paper, based on the vertical reconnaissance of UAV, the aerial image of flight area is divided into several grids of the same size, each grid represents a class of regions, and the convolutional neural network (CNN) is trained by making data sets of grid images. This paper designs a fully convolutional network model based on AlexNet, which integrates saliency features. It effectively implements a sliding window classifier with CNN multi-size input, and proposes a neighborhood saliency reference positioning strategy to filter the classification results, so as to realize the positioning of multi-size aerial images. 
Keywords:
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《北京航空航天大学学报》浏览原始摘要信息
点击此处可从《北京航空航天大学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号