I was wondering how are layer weights and biases initialized by default? E.g. if I create the linear layer torch.nn.Linear(5,100) How are weights and biases for this layer initialized by default?
See Question&Answers more detail:osI was wondering how are layer weights and biases initialized by default? E.g. if I create the linear layer torch.nn.Linear(5,100) How are weights and biases for this layer initialized by default?
See Question&Answers more detail:osPyTorch 1.0
Most layers are initialized using Kaiming Uniform method. Example layers include Linear, Conv2d, RNN etc. If you are using other layers, you should look up that layer on this doc. If it says weights are initialized using U(...)
then its Kaiming Uniform method. Bias is initialized using LeCunn init, i.e., uniform(-std, std)
where standard deviation std is 1/sqrt(fan_in)
(code).
PyTorch 0.4.1, 0.3.1
Weights and biases are initialized using LeCunn init (see sec 4.6) for conv layers (code: 0.3.1, 0.4.1).
If you want to override default initialization then see this answer.