DeiTClsHead¶
- class mmpretrain.models.heads.DeiTClsHead(num_classes, in_channels, hidden_dim=None, act_cfg={'type': 'Tanh'}, init_cfg={'layer': 'Linear', 'type': 'Constant', 'val': 0}, **kwargs)[源代码]¶
Distilled Vision Transformer classifier head.
Comparing with the
VisionTransformerClsHead
, this head adds an extra linear layer to handle the dist token. The final classification score is the average of both linear transformation results ofcls_token
anddist_token
.- 参数:
num_classes (int) – Number of categories excluding the background category.
in_channels (int) – Number of channels in the input feature map.
hidden_dim (int, optional) – Number of the dimensions for hidden layer. Defaults to None, which means no extra hidden layer.
act_cfg (dict) – The activation config. Only available during pre-training. Defaults to
dict(type='Tanh')
.init_cfg (dict) – The extra initialization configs. Defaults to
dict(type='Constant', layer='Linear', val=0)
.