PostNorm

Apply Post-Layer normalization to a children

PreNorm(
    nn.Sequential(
        nn.Linear(100, 200),
        nn.GELU(),
        nn.Linear(200, 100)
    ),
    dim=100
)

Forward

(x: torch.Tensor) -> torch.Tensor