非线性的卷积神经网络原文+翻译.docx

非线性的卷积神经网络原文+翻译.docx

  1. 1、本文档共16页,可阅读全部内容。
  2. 2、原创力文档(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。
  3. 3、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载
  4. 4、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
查看更多
精品文档,知识共享! Efficient and Accurate Approximations of Nonlinear Convolutional Networks 高效率和准确的非线性的卷积神经网络逼近 Abstract This paper aims to accelerate the test-time computation of deep convolutional neural networks (CNNs). Unlike existing methods that are designed for approximating linear filters or linear responses, our method takes the nonlinear units into account. We minimize the reconstruction error of the nonlinear responses, subject to a low-rank constraint which helps to reduce the complexity of filters. We develop an effective solution to this constrained nonlinear optimization problem. An algorithm is also presented for reducing the accumulated error when multiple layers are approximated. A whole-model speedup ratio of 4× is demonstrated on a large network trained for ImageNet, while the top-5 error rate is only increased by 0.9%. Our accelerated model has a comparably fast speed as the “AlexNet” [11], but is 4.7% more accurate. 摘要: 本文旨在提高深度卷积神经网络的计算测试时间(CNNs)。与现有的近似线性滤波器或线性响应设计的方法不同,该方法考虑了非线性单位。我们将重建非线性响应的误差降到最小,一个低等级的限制有助于减少过滤器的复杂性。我们将非线性响应的重建误差降到最小,除有助于减少过滤器的复杂性的一个低等级的限制。我们研制一个有效的解决这个约束非线性优化的问题.为了减少多个图层逼近时的累积误差,提出了一个算法, 整个4×的加速比模型论证了在大型ImageNet(图像处理软件)网络训练,即使top-5(五大低价主机排名)的错误率也仅增加0.9%。我们加速模型有一个比较快的速度为"AlexNet"[11],但4.7%更准确。 Introduction This paper addresses efficient test-time computation of deep convolutional neural networks (CNNs) [12, 11]. Since the success of CNNs [11] for large-scale image classification, the accuracy of the newly developed CNNs [24, 17,8, 18, 19] has been continuously improving. However, the computational cost of these networks (especially the more accurate but larger models) also increases significantly. The expensive test-time evaluation of the models can make them impractical in real-world systems. For example, a cloud service needs to process thousands of new requests per seconds; portable devices such as phones and tablets mostly have CPUs or low-end GPUs only; some recognition tasks like object detection

文档评论(0)

哆啦 + 关注
实名认证
内容提供者

该用户很懒,什么也没介绍

1亿VIP精品文档

相关文档

相关课程推荐