readme.md 6.2 KB
Newer Older
V
Varuna Jayasiri 已提交
1
[![Twitter](https://img.shields.io/twitter/follow/labmlai?style=social)](https://twitter.com/labmlai)
V
Varuna Jayasiri 已提交
2

A
Aarsh Chaube 已提交
3
# [labml.ai Deep Learning Paper Implementations](https://nn.labml.ai/index.html)
V
readme  
Varuna Jayasiri 已提交
4

V
Varuna Jayasiri 已提交
5 6 7
This is a collection of simple PyTorch implementations of
neural networks and related algorithms.
These implementations are documented with explanations,
V
Varuna Jayasiri 已提交
8

V
urls  
Varuna Jayasiri 已提交
9
[The website](https://nn.labml.ai/index.html)
V
Varuna Jayasiri 已提交
10 11 12
renders these as side-by-side formatted notes.
We believe these would help you understand these algorithms better.

V
Varuna Jayasiri 已提交
13
![Screenshot](https://github.com/labmlai/annotated_deep_learning_paper_implementations/blob/master/images/dqn.png)
V
Varuna Jayasiri 已提交
14

V
Varuna Jayasiri 已提交
15 16
We are actively maintaining this repo and adding new 
implementations almost weekly.
V
Varuna Jayasiri 已提交
17
[![Twitter](https://img.shields.io/twitter/follow/labmlai?style=social)](https://twitter.com/labmlai) for updates.
V
Varuna Jayasiri 已提交
18

V
readme  
Varuna Jayasiri 已提交
19
## Modules
V
readme  
Varuna Jayasiri 已提交
20

V
urls  
Varuna Jayasiri 已提交
21
#### ✨ [Transformers](https://nn.labml.ai/transformers/index.html)
V
readme  
Varuna Jayasiri 已提交
22

V
links  
Varuna Jayasiri 已提交
23 24
* [Multi-headed attention](https://nn.labml.ai/transformers/mha.html)
* [Transformer building blocks](https://nn.labml.ai/transformers/models.html) 
V
Varuna Jayasiri 已提交
25 26
* [Transformer XL](https://nn.labml.ai/transformers/xl/index.html)
    * [Relative multi-headed attention](https://nn.labml.ai/transformers/xl/relative_mha.html)
V
Varuna Jayasiri 已提交
27
* [Compressive Transformer](https://nn.labml.ai/transformers/compressive/index.html)
V
urls  
Varuna Jayasiri 已提交
28 29 30 31 32
* [GPT Architecture](https://nn.labml.ai/transformers/gpt/index.html)
* [GLU Variants](https://nn.labml.ai/transformers/glu_variants/simple.html)
* [kNN-LM: Generalization through Memorization](https://nn.labml.ai/transformers/knn)
* [Feedback Transformer](https://nn.labml.ai/transformers/feedback/index.html)
* [Switch Transformer](https://nn.labml.ai/transformers/switch/index.html)
V
readme  
Varuna Jayasiri 已提交
33
* [Fast Weights Transformer](https://nn.labml.ai/transformers/fast_weights/index.html)
V
Varuna Jayasiri 已提交
34
* [FNet](https://nn.labml.ai/transformers/fnet/index.html)
V
Varuna Jayasiri 已提交
35
* [Attention Free Transformer](https://nn.labml.ai/transformers/aft/index.html)
V
Varuna Jayasiri 已提交
36
* [Masked Language Model](https://nn.labml.ai/transformers/mlm/index.html)
V
Varuna Jayasiri 已提交
37
* [MLP-Mixer: An all-MLP Architecture for Vision](https://nn.labml.ai/transformers/mlp_mixer/index.html)
V
Varuna Jayasiri 已提交
38
* [Pay Attention to MLPs (gMLP)](https://nn.labml.ai/transformers/gmlp/index.html)
V
vit  
Varuna Jayasiri 已提交
39
* [Vision Transformer (ViT)](https://nn.labml.ai/transformers/vit/index.html)
V
Varuna Jayasiri 已提交
40

V
urls  
Varuna Jayasiri 已提交
41
#### ✨ [Recurrent Highway Networks](https://nn.labml.ai/recurrent_highway_networks/index.html)
V
readme  
Varuna Jayasiri 已提交
42

V
urls  
Varuna Jayasiri 已提交
43
#### ✨ [LSTM](https://nn.labml.ai/lstm/index.html)
V
readme  
Varuna Jayasiri 已提交
44

V
urls  
Varuna Jayasiri 已提交
45
#### ✨ [HyperNetworks - HyperLSTM](https://nn.labml.ai/hypernetworks/hyper_lstm.html)
V
Varuna Jayasiri 已提交
46

V
readme  
Varuna Jayasiri 已提交
47 48
#### ✨ [ResNet](https://nn.labml.ai/resnet/index.html)

V
urls  
Varuna Jayasiri 已提交
49
#### ✨ [Capsule Networks](https://nn.labml.ai/capsule_networks/index.html)
V
readme  
Varuna Jayasiri 已提交
50

V
urls  
Varuna Jayasiri 已提交
51
#### ✨ [Generative Adversarial Networks](https://nn.labml.ai/gan/index.html)
V
readme  
Varuna Jayasiri 已提交
52 53 54 55
* [Original GAN](https://nn.labml.ai/gan/original/index.html)
* [GAN with deep convolutional network](https://nn.labml.ai/gan/dcgan/index.html)
* [Cycle GAN](https://nn.labml.ai/gan/cycle_gan/index.html)
* [Wasserstein GAN](https://nn.labml.ai/gan/wasserstein/index.html)
V
Varuna Jayasiri 已提交
56
* [Wasserstein GAN with Gradient Penalty](https://nn.labml.ai/gan/wasserstein/gradient_penalty/index.html)
V
Varuna Jayasiri 已提交
57
* [StyleGAN 2](https://nn.labml.ai/gan/stylegan/index.html)
V
readme  
Varuna Jayasiri 已提交
58

V
urls  
Varuna Jayasiri 已提交
59
#### ✨ [Sketch RNN](https://nn.labml.ai/sketch_rnn/index.html)
V
Varuna Jayasiri 已提交
60

V
Varuna Jayasiri 已提交
61 62 63
#### ✨ Graph Neural Networks

* [Graph Attention Networks (GAT)](https://nn.labml.ai/graphs/gat/index.html)
V
Varuna Jayasiri 已提交
64
* [Graph Attention Networks v2 (GATv2)](https://nn.labml.ai/graphs/gatv2/index.html)
V
Varuna Jayasiri 已提交
65

V
links  
Varuna Jayasiri 已提交
66 67 68 69 70 71
#### ✨ [Counterfactual Regret Minimization (CFR)](https://nn.labml.ai/cfr/index.html)

Solving games with incomplete information such as poker with CFR.

* [Kuhn Poker](https://nn.labml.ai/cfr/kuhn/index.html)

V
urls  
Varuna Jayasiri 已提交
72 73 74 75 76 77
#### ✨ [Reinforcement Learning](https://nn.labml.ai/rl/index.html)
* [Proximal Policy Optimization](https://nn.labml.ai/rl/ppo/index.html) with
 [Generalized Advantage Estimation](https://nn.labml.ai/rl/ppo/gae.html)
* [Deep Q Networks](https://nn.labml.ai/rl/dqn/index.html) with
 with [Dueling Network](https://nn.labml.ai/rl/dqn/model.html),
 [Prioritized Replay](https://nn.labml.ai/rl/dqn/replay_buffer.html)
V
readme  
Varuna Jayasiri 已提交
78
 and Double Q Network.
V
Varuna Jayasiri 已提交
79

V
urls  
Varuna Jayasiri 已提交
80 81 82 83 84 85 86
#### ✨ [Optimizers](https://nn.labml.ai/optimizers/index.html)
* [Adam](https://nn.labml.ai/optimizers/adam.html)
* [AMSGrad](https://nn.labml.ai/optimizers/amsgrad.html)
* [Adam Optimizer with warmup](https://nn.labml.ai/optimizers/adam_warmup.html)
* [Noam Optimizer](https://nn.labml.ai/optimizers/noam.html)
* [Rectified Adam Optimizer](https://nn.labml.ai/optimizers/radam.html)
* [AdaBelief Optimizer](https://nn.labml.ai/optimizers/ada_belief.html)
V
Varuna Jayasiri 已提交
87

V
links  
Varuna Jayasiri 已提交
88 89
#### ✨ [Normalization Layers](https://nn.labml.ai/normalization/index.html)
* [Batch Normalization](https://nn.labml.ai/normalization/batch_norm/index.html)
V
Varuna Jayasiri 已提交
90
* [Layer Normalization](https://nn.labml.ai/normalization/layer_norm/index.html)
V
Varuna Jayasiri 已提交
91 92
* [Instance Normalization](https://nn.labml.ai/normalization/instance_norm/index.html)
* [Group Normalization](https://nn.labml.ai/normalization/group_norm/index.html)
V
links  
Varuna Jayasiri 已提交
93
* [Weight Standardization](https://nn.labml.ai/normalization/weight_standardization/index.html)
V
Varuna Jayasiri 已提交
94
* [Batch-Channel Normalization](https://nn.labml.ai/normalization/batch_channel_norm/index.html)
V
links  
Varuna Jayasiri 已提交
95

V
Varuna Jayasiri 已提交
96 97
#### ✨ [Distillation](https://nn.labml.ai/distillation/index.html)

V
Varuna Jayasiri 已提交
98 99 100 101
#### ✨ [Adaptive Computation](https://nn.labml.ai/adaptive_computation/index.html)

* [PonderNet](https://nn.labml.ai/adaptive_computation/ponder_net/index.html)

102 103 104 105
#### ✨ [Uncertainty](https://nn.labml.ai/uncertainty/index.html)

* [Evidential Deep Learning to Quantify Classification Uncertainty](https://nn.labml.ai/uncertainty/evidence/index.html)

V
readme  
Varuna Jayasiri 已提交
106 107 108
### Installation

```bash
V
Varuna Jayasiri 已提交
109
pip install labml-nn
V
readme  
Varuna Jayasiri 已提交
110 111
```

V
Varuna Jayasiri 已提交
112
### Citing
V
readme  
Varuna Jayasiri 已提交
113

V
Varuna Jayasiri 已提交
114
If you use this for academic research, please cite it using the following BibTeX entry.
V
readme  
Varuna Jayasiri 已提交
115 116 117 118

```bibtex
@misc{labml,
 author = {Varuna Jayasiri, Nipun Wijerathne},
V
Varuna Jayasiri 已提交
119
 title = {labml.ai Annotated Paper Implementations},
V
readme  
Varuna Jayasiri 已提交
120
 year = {2020},
V
Varuna Jayasiri 已提交
121
 url = {https://nn.labml.ai/},
V
readme  
Varuna Jayasiri 已提交
122
}
V
Varuna Jayasiri 已提交
123
```
V
links  
Varuna Jayasiri 已提交
124 125 126 127 128 129 130 131 132 133 134 135

### Other Projects

#### [🚀 Trending Research Papers](https://papers.labml.ai/)

This shows the most popular research papers on social media. It also aggregates links to useful resources like paper explanations videos and discussions.


#### [🧪 labml.ai/labml](https://github.com/labmlai/labml)

This is a library that let's you onitor deep learning model training and hardware usage from your mobile phone. It also comes with a bunch of other tools to help write deep learning code efficiently.