RuntimeError: einsum(): operands do not broadcast with remapped shapes [original->remapped]
Project scenario: multihead-attention training

Problem description: the following einsum call raises the RuntimeError above.

    out = torch.einsum('b h d e, b h d n -> b h e n', context, q)
  File "/root/anaconda3/lib/python3.7/site-packages/torch/functional.py", line 327, in einsum
    return _VF.einsum(equation, operands) # type: ig
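
In the equation 'b h d e, b h d n -> b h e n', the labels b, h and d appear in both operands, so context and q must agree on those three dimensions. The sketch below is one way to find which label disagrees before calling torch.einsum. The helper name find_einsum_mismatches and the example shapes are hypothetical and do not come from the original traceback. It assumes single-letter labels with no ellipsis, and treats size-1 dimensions as broadcastable.

```python
def find_einsum_mismatches(equation, *shapes):
    """Return {label: sorted sizes} for labels whose sizes cannot broadcast.

    Hypothetical debugging helper: handles single-letter labels only,
    and does not support '...' in the equation.
    """
    # Only the input side (left of '->') determines which sizes must agree.
    inputs = equation.split("->")[0].split(",")
    if len(inputs) != len(shapes):
        raise ValueError("number of operands does not match the equation")

    sizes = {}
    for subscripts, shape in zip(inputs, shapes):
        labels = subscripts.replace(" ", "")  # whitespace carries no meaning
        if len(labels) != len(shape):
            raise ValueError(
                f"'{subscripts.strip()}' has {len(labels)} labels "
                f"but shape {tuple(shape)} has {len(shape)} dims"
            )
        for label, size in zip(labels, shape):
            sizes.setdefault(label, set()).add(size)

    # A label conflicts when it has two or more distinct sizes other than 1.
    return {
        label: sorted(seen)
        for label, seen in sizes.items()
        if len(seen - {1}) > 1
    }


# Illustrative shapes only (assumed, not taken from the original post):
# 'd' is 64 in context but 32 in q, so the two operands cannot be contracted.
context_shape = (2, 8, 64, 64)   # b h d e
q_shape = (2, 8, 32, 1024)       # b h d n
mismatches = find_einsum_mismatches(
    "b h d e, b h d n -> b h e n", context_shape, q_shape
)
print(mismatches)
```

With real tensors, the same check can be run as find_einsum_mismatches(equation, tuple(context.shape), tuple(q.shape)). Any label it reports is a dimension that needs to be reshaped, transposed or fixed upstream before the einsum call can succeed.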