torch.tensordot

torch.tensordot(a, b, dims=2, out=None)[源代码]

返回 a 和 b 在多维空间中的收缩结果。

tensordot 实现了通用矩阵乘法。

参数

a (张量) – 左侧待收缩的张量
b (Tensor) – 用于收缩操作的右侧张量
dims (int 或 Tuple[List[int], List[int]] 或 List[List[int]]) – 要收缩的维度数量，或者分别为a和b的显式维度列表

当使用非负整数参数 dims = $$d$$ 调用时，如果张量 a 和 b 的维度分别是 $$m$$ 和 $$n$$ ，则tensordot() 进行计算

r_{i_0,...,i_{m-d}, i_d,...,i_n} = \sum_{k_0,...,k_{d-1}} a_{i_0,...,i_{m-d},k_0,...,k_{d-1}} \times b_{k_0,...,k_{d-1}, i_d,...,i_n}.

当使用dims列表形式调用时，给定的维度将在a的最后 $$d$$ 个维度和 $$b$$ 的第一个 $$d$$ 个维度上进行收缩。这些维度的大小必须匹配，但tensordot()会处理广播维度。

示例:

>>> a = torch.arange(60.).reshape(3, 4, 5)
>>> b = torch.arange(24.).reshape(4, 3, 2)
>>> torch.tensordot(a, b, dims=([1, 0], [0, 1]))
tensor([[4400., 4730.],
        [4532., 4874.],
        [4664., 5018.],
        [4796., 5162.],
        [4928., 5306.]])

>>> a = torch.randn(3, 4, 5, device='cuda')
>>> b = torch.randn(4, 5, 6, device='cuda')
>>> c = torch.tensordot(a, b, dims=2).cpu()
tensor([[ 8.3504, -2.5436,  6.2922,  2.7556, -1.0732,  3.2741],
        [ 3.3161,  0.0704,  5.0187, -0.4079, -4.3126,  4.8744],
        [ 0.8223,  3.9445,  3.2168, -0.2400,  3.4117,  1.7780]])

>>> a = torch.randn(3, 5, 4, 6)
>>> b = torch.randn(6, 4, 5, 3)
>>> torch.tensordot(a, b, dims=([2, 1, 3], [1, 2, 0]))
tensor([[  7.7193,  -2.4867, -10.3204],
        [  1.5513, -14.4737,  -6.5113],
        [ -0.2850,   4.2573,  -3.5997]])