Deep learning references
This section presents a timeline with important scientific papers for deep learning.
The paper date is when it was first published.
2018 | - | BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Google NLP model. [webpage] |
2018 | - | Deep contextualized word representations. ELMo, NLP. [webpage] |
2017 | - | Attention Is All You Need. Introduces the Transformer network architecture. [webpage] |
2016 | - | YOLO9000: Better, Faster, Stronger. YOLOv2, faster (and hotter) CNN for object detection. [webpage] |
2016 | - | DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution and Fully Connected CRFs. DeepLab, CNN for semantic segmentation with atrous convolution. |
2016 | - | Identity Mappings in Deep Residual Networks. ResNet v2. |
2016 | - | Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning. Google classification network. Inception v4 (Inception-ResNet). |
2016 | - | Mastering the game of Go with deep neural networks and tree search. The Go board game was one of the greatest challenges in AI. This paper presented a reinforcement learning solution capable of defeating human pro players. |
2015 | - | Deep residual learning for image recognition. ResNet, introduced residual connections. |
2015 | - | SSD: Single Shot MultiBox Detector Single CNN for object detection. [code] |
2015 | - | Rethinking the Inception Architecture for Computer Vision. Google classification network. Inception v2 and v3. |
2015 | - | SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation. Encoder-decoder image segmentation network. |
2015 | - | You Only Look Once: Unified, Real-Time Object Detection. Introduced YOLO, fast (and hot) CNN for object detection. [webpage] |
2015 | - | Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. Region proposal based solution for object detection in images. |
2015 | - | Deep learning. Survey-kind paper from deep learning bosses. |
2015 | - | Fast R-CNN. Region proposal based solution for object detection in images. Integrates a region proposal network and a classification network. |
2015 | - | Fully Convolutional Networks for Semantic Segmentation. FCN network for image segmentation. Introduced deconvolution(transposed convolution)(or transposed correlation?). |
2015 | - | Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. One of the most important regularization techniques. BN-Inception. |
2015 | - | Human-level control through deep reinforcement learning. Deep reinforcement learning to play Atari 2600 games in fancy journal. |
2014 | - | Generative Adversarial Nets. Introduced generative adversarial networks. |
2014 | - | Sequence to Sequence Learning with Neural Networks. Sequences to sequences map for machine translation. |
2014 | - | Going Deeper with Convolutions. Google classification network. Inception v1 (GoogleNet). |
2014 | - | Very Deep Convolutional Networks for Large-Scale Image Recognition. VGGNet, introduced factored convolutions. |
2014 | - | ImageNet Large Scale Visual Recognition Challenge. ImageNet paper describing the challenge and winner solutions. |
2014 | - | Dropout: A Simple Way to Prevent Neural Networks from Overfitting. Regularization technique. |
2013 | - | Intriguing properties of neural networks. Introduced adversarial examples and perturbations. |
2013 | - | Playing Atari with Deep Reinforcement Learning. Deep reinforcement learning to play Atari 2600 games. Introduced the DQN and experience replay. |
2013 | - | Network In Network. NiN, introduced 1x1 convolution. |
2013 | - | Rich feature hierarchies for accurate object detection and semantic segmentation. Region proposal based solution for object detection in images. Introduced the R-CNN. |
2012 | - | ImageNet Classification with Deep Convolutional neural networks. This paper presents the famous AlexNet network winner of the 2012 Imagenet challenge (ILSVRC2012). |
2012 | - | Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups. Review on DNNs for acoustic modeling. |
2009 | - | Learning deep architectures for AI. Review on deep architecture models. |
1998 | - | Gradient-based learning applied to document recognition. Convolutional neural networks for handwriting recognition. |
1997 | - | Long Short-Term Memory. Introduced the long short-term memory (LSTM) to store sequential data. |