我是靠谱客的博主 称心黑裤,这篇文章主要介绍RuntimeError: einsum(): operands do not broadcast with remapped shapes [original->remapped]项目场景:问题描述,现在分享给大家,希望可以做个参考。
项目场景:
multihead-attention训练
复制代码
1
2
3
4out = torch.einsum('b h d e, b h d n -> b h e n', context, q) File "/root/anaconda3/lib/python3.7/site-packages/torch/functional.py", line 327, in einsum return _VF.einsum(equation, operands) # type: ignore[attr-defined] RuntimeError: einsum(): operands do not broadcast with remapped shapes [original->remapped]: [3, 4, 32, 32]->[3, 4, 32, 1, 32] [2, 4, 32, 65536]->[2, 4, 1, 65536, 32]
问题描述
完整报错:
复制代码
1
2
3
4
5
6Traceback (most recent call last): File "multi_train.py", line 38, in <module> trainer.train() File "/root/sketchMultimodal/denoising-diffusion-pytorch-master/denoising_diffusion_pytorch/denoising_diffusion_pytorch.py", line 661, in train all_images_list = list(map(lambda n: self.ema_model.sample(condition=data_c, batch_size=n), batches)) File "/root/sket
最后
以上就是称心黑裤最近收集整理的关于RuntimeError: einsum(): operands do not broadcast with remapped shapes [original->remapped]项目场景:问题描述的全部内容,更多相关RuntimeError:内容请搜索靠谱客的其他文章。
本图文内容来源于网友提供,作为学习参考使用,或来自网络收集整理,版权属于原作者所有。
发表评论 取消回复