AlexNet

AlexNet是一个卷积神经网络，由亚历克斯·克里泽夫斯基设计^[1]，与伊尔亚·苏茨克维和克里泽夫斯基的博士导师杰弗里·辛顿共同发表^[2]^[3]，而辛顿最初抵制他的学生的想法^[1]^[4]。

AlexNet参加了2012年9月30日举行的ImageNet大规模视觉识别挑战赛^[5]，达到最低的15.3%的Top-5错误率，比第二名低10.8个百分点。原论文的主要结论是，模型的深度对于提高性能至关重要，AlexNet的计算成本很高，但因在训练过程中使用了图形处理器（GPU）而使得计算具有可行性^[5] 。

背景编辑

AlexNet并不是卷积神经网络（CNN）第一次利用快速GPU实现而赢得图像识别竞赛。K. Chellapilla等人（2006）在GPU上的CNN比同等的CPU实现速度快4倍^[6]。Dan Ciresan等人（2011）的深层CNN在IDSIA上已经快了60倍^[7]，并在2011年8月取得了超过人类的表现^[8]。从2011年5月15日到2012年9月10日，他们的CNN赢得了不少于四场图像竞赛^[9]^[10]。他们还极大提高了文献中多个图像数据库的最佳性能^[11]。

根据AlexNet的论文^[5]，其与Ciresan的早期网络“有些相似”。两者最初都用CUDA编写，可在GPU支持下运行。实际上，两者都是杨立昆等人（1989）介绍的CNN设计的变体^[12]^[13]，他将反向传播算法应用于福岛邦彦（福島邦彦）最初提出的CNN架构“neocognitron”的一个变种^[14]^[15]。后来J. Weng提出的最大池化方法修改了该架构^[16]^[10]。

网络设计编辑

AlexNet包含八层。前五层是卷积层，之后一些层是最大池化层，最后三层是全连接层^[5]。它使用了非饱和的ReLU激活函数，显示出比tanh和sigmoid更好的训练性能^[5]。

影响编辑

AlexNet被认为是计算机视觉领域最有影响力的论文之一，它刺激了更多使用卷积神经网络和GPU来加速深度学习的论文的出现^[17]。截至2020年，AlexNet论文已被引用超过54,000次。

亚历克斯·克里泽夫斯基编辑

亚历克斯·克里泽夫斯基（出生于乌克兰，在加拿大长大）是一名计算机科学家，以在人工神经网络和深度学习方面的工作而著称。在通过AlexNet赢得ImageNet 2012挑战赛后不久，他和同事将他们的创业公司DNN研究公司（DNN Research Inc.）卖给了Google^[1]。克里泽夫斯基对这项工作失去兴趣后，于2017年9月离开了Google^[1]。在Dessa公司，克里泽夫斯基将为新的深度学习技术提供建议和帮助^[1]。研究人员经常引用他的许多有关机器学习和计算机视觉的论文^[18]。

参考资料编辑

^ ^1.0 ^1.1 ^1.2 ^1.3 ^1.4 Dave Gershgorn. The inside story of how AI got good enough to dominate Silicon Valley. Quartz. 2018-06-18 [2018-10-05]. （原始内容于2019-12-12）.
^ The data that transformed AI research—and possibly the world. [2020-01-17]. （原始内容于2017-07-27）.
^ ILSVRC2012 Results. [2020-01-17]. （原始内容于2020-01-16）.
^ Krizhevsky, Alex. ImageNet Classification with Deep Convolutional Neural Networks (PDF). [2013-11-17]. （原始内容 (PDF)于2013-05-12）.
^ ^5.0 ^5.1 ^5.2 ^5.3 ^5.4 Krizhevsky, Alex; Sutskever, Ilya; Hinton, Geoffrey E. ImageNet classification with deep convolutional neural networks (PDF). Communications of the ACM. 2017-05-24, 60 (6): 84–90 [2020-01-17]. ISSN 0001-0782. doi:10.1145/3065386. （原始内容 (PDF)于2017-05-16）.
^ Kumar Chellapilla; Sid Puri; Patrice Simard. High Performance Convolutional Neural Networks for Document Processing. Lorette, Guy (编). Tenth International Workshop on Frontiers in Handwriting Recognition. Suvisoft. 2006 [2020-01-17]. （原始内容于2020-05-18）.
^ Ciresan, Dan; Ueli Meier; Jonathan Masci; Luca M. Gambardella; Jurgen Schmidhuber. Flexible, High Performance Convolutional Neural Networks for Image Classification (PDF). Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence-Volume Volume Two. 2011, 2: 1237–1242 [2013-11-17]. （原始内容 (PDF)于2013-11-16）.
^ IJCNN 2011 Competition result table. OFFICIAL IJCNN2011 COMPETITION. 2010 [2019-01-14]. （原始内容于2019-01-21）.
^ Schmidhuber, Jürgen. History of computer vision contests won by deep CNNs on GPU. 2017-03-17 [2019-01-14]. （原始内容于2018-12-19）.
^ ^10.0 ^10.1 Schmidhuber, Jürgen. Deep Learning. Scholarpedia. 2015, 10 (11): 1527–54 [2020-01-17]. CiteSeerX 10.1.1.76.1541  . PMID 16764513. doi:10.1162/neco.2006.18.7.1527. （原始内容于2016-04-19）.
^ Ciresan, Dan; Meier, Ueli; Schmidhuber, Jürgen. Multi-column deep neural networks for image classification. New York, NY: Institute of Electrical and Electronics Engineers (IEEE). June 2012: 3642–3649. CiteSeerX 10.1.1.300.3283  . ISBN 978-1-4673-1226-4. OCLC 812295155. arXiv:1202.2745  . doi:10.1109/CVPR.2012.6248110. |journal=被忽略 (帮助)
^ Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, L. D. Jackel, Backpropagation Applied to Handwritten Zip Code Recognition （页面存档备份，存于互联网档案馆）; AT&T Bell Laboratories
^ LeCun, Yann; Léon Bottou; Yoshua Bengio; Patrick Haffner. (PDF). Proceedings of the IEEE. 1998, 86 (11): 2278–2324 [2016-10-07]. CiteSeerX 10.1.1.32.9552  . doi:10.1109/5.726791. （原始内容 (PDF)存档于2017-12-15）.
^ Fukushima, K. Neocognitron. Scholarpedia. 2007, 2 (1): 1717. doi:10.4249/scholarpedia.1717.
^ Fukushima, Kunihiko. Neocognitron: A Self-organizing Neural Network Model for a Mechanism of Pattern Recognition Unaffected by Shift in Position (PDF). Biological Cybernetics. 1980, 36 (4): 193–202 [2013-11-16]. PMID 7370364. doi:10.1007/BF00344251. （原始内容 (PDF)于2014-06-03）.
^ Weng, J; Ahuja, N; Huang, TS. Learning recognition and segmentation of 3-D objects from 2-D images. Proc. 4th International Conf. Computer Vision. 1993: 121–128.
^ Deshpande, Adit. The 9 Deep Learning Papers You Need To Know About (Understanding CNNs Part 3). adeshpande3.github.io. [2018-12-04]. （原始内容于2018-11-21）.
^ Alex Krizhevsky. Google Scholar Citations. [2020-01-17]. （原始内容于2020-04-17）.

[quartz-1] 1.0 ^1.1 ^1.2 ^1.3 ^1.4 Dave Gershgorn. The inside story of how AI got good enough to dominate Silicon Valley. Quartz. 2018-06-18 [2018-10-05]. （原始内容于2019-12-12）.

[:1-2] The data that transformed AI research—and possibly the world. [2020-01-17]. （原始内容于2017-07-27）.

[:2-3] ILSVRC2012 Results. [2020-01-17]. （原始内容于2020-01-16）.

[4] Krizhevsky, Alex. ImageNet Classification with Deep Convolutional Neural Networks (PDF). [2013-11-17]. （原始内容 (PDF)于2013-05-12）.

[:0-5] 5.0 ^5.1 ^5.2 ^5.3 ^5.4 Krizhevsky, Alex; Sutskever, Ilya; Hinton, Geoffrey E. ImageNet classification with deep convolutional neural networks (PDF). Communications of the ACM. 2017-05-24, 60 (6): 84–90 [2020-01-17]. ISSN 0001-0782. doi:10.1145/3065386. （原始内容 (PDF)于2017-05-16）.

[6] Kumar Chellapilla; Sid Puri; Patrice Simard. High Performance Convolutional Neural Networks for Document Processing. Lorette, Guy (编). Tenth International Workshop on Frontiers in Handwriting Recognition. Suvisoft. 2006 [2020-01-17]. （原始内容于2020-05-18）.

[flexible-7] Ciresan, Dan; Ueli Meier; Jonathan Masci; Luca M. Gambardella; Jurgen Schmidhuber. Flexible, High Performance Convolutional Neural Networks for Image Classification (PDF). Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence-Volume Volume Two. 2011, 2: 1237–1242 [2013-11-17]. （原始内容 (PDF)于2013-11-16）.

[8] IJCNN 2011 Competition result table. OFFICIAL IJCNN2011 COMPETITION. 2010 [2019-01-14]. （原始内容于2019-01-21）.

[9] Schmidhuber, Jürgen. History of computer vision contests won by deep CNNs on GPU. 2017-03-17 [2019-01-14]. （原始内容于2018-12-19）.

[schdeepscholar-10] 10.0 ^10.1 Schmidhuber, Jürgen. Deep Learning. Scholarpedia. 2015, 10 (11): 1527–54 [2020-01-17]. CiteSeerX 10.1.1.76.1541  . PMID 16764513. doi:10.1162/neco.2006.18.7.1527. （原始内容于2016-04-19）.

[mcdns-11] Ciresan, Dan; Meier, Ueli; Schmidhuber, Jürgen. Multi-column deep neural networks for image classification. New York, NY: Institute of Electrical and Electronics Engineers (IEEE). June 2012: 3642–3649. CiteSeerX 10.1.1.300.3283  . ISBN 978-1-4673-1226-4. OCLC 812295155. arXiv:1202.2745  . doi:10.1109/CVPR.2012.6248110. |journal=被忽略 (帮助)

[lecun1-12] Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, L. D. Jackel, Backpropagation Applied to Handwritten Zip Code Recognition （页面存档备份，存于互联网档案馆）; AT&T Bell Laboratories

[lecun98-13] LeCun, Yann; Léon Bottou; Yoshua Bengio; Patrick Haffner. (PDF). Proceedings of the IEEE. 1998, 86 (11): 2278–2324 [2016-10-07]. CiteSeerX 10.1.1.32.9552  . doi:10.1109/5.726791. （原始内容 (PDF)存档于2017-12-15）.

[fukuneoscholar-14] Fukushima, K. Neocognitron. Scholarpedia. 2007, 2 (1): 1717. doi:10.4249/scholarpedia.1717.

[intro-15] Fukushima, Kunihiko. Neocognitron: A Self-organizing Neural Network Model for a Mechanism of Pattern Recognition Unaffected by Shift in Position (PDF). Biological Cybernetics. 1980, 36 (4): 193–202 [2013-11-16]. PMID 7370364. doi:10.1007/BF00344251. （原始内容 (PDF)于2014-06-03）.

[weng1993-16] Weng, J; Ahuja, N; Huang, TS. Learning recognition and segmentation of 3-D objects from 2-D images. Proc. 4th International Conf. Computer Vision. 1993: 121–128.

[17] Deshpande, Adit. The 9 Deep Learning Papers You Need To Know About (Understanding CNNs Part 3). adeshpande3.github.io. [2018-12-04]. （原始内容于2018-11-21）.

[GoogleScholar-18] Alex Krizhevsky. Google Scholar Citations. [2020-01-17]. （原始内容于2020-04-17）.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

www.wiki2.zh-cn.nina.az

AlexNet

目录

背景编辑

网络设计编辑

影响编辑

亚历克斯·克里泽夫斯基编辑

参考资料编辑

拉莫斯愛氏角龍

拉莫斯毛鼻鯰

拉莫斯·奥尔塔

拉耶 (伊勒-维莱讷省)

拉肯亚达─弗林楚奇

拉肯亚达-弗林楚奇

拉胡爾·甘地

拉胡尔·亚达夫·奇塔博尔纳

拉脫維亞交通

拉脫維亞共和國 (1918年)

智里爾班多爾烏帕齊拉

智铤

智頭急行HOT3500型柴聯車

晁无咎

時煒

文章

背景 编辑

网络设计 编辑

影响 编辑

亚历克斯·克里泽夫斯基 编辑

参考资料 编辑

文章

背景编辑

网络设计编辑

影响编辑

亚历克斯·克里泽夫斯基编辑

参考资料编辑