Question

我想计算以下三个元素的两种病媒的dot Products 。我正在研究 wasm矢量指示 ,只有一个dot指示:

i32x4.dot_i16x8_s

它的类型为2个<代码>v128,作为输入和回归代码。 http://webassembly.github.io/spec/core/valid/instructions.html

据传说:

胎盘产品是一种具有两种相同数量序列(通常协调病媒)的薄膜作业,并且只回收一个数字

但是, was号指示的用意是,它希望把这两种投入作为<条码>i32x4和<代码>i16x8加以组织。

对我来说,这并不真正有意义,因为投入媒介的数量并不相同。此外,我不理解返回的<代码>v128/code>的编排方式,即<代码>i32x4或i16x8。或者说什么?

另外,用4D计算方法计算2个3D元元元元件的正确点是,我能否将矢量打上“条形”1 。

如果我想阐述一下《世界金枪鱼养委会法》,我会想像:

(module
  (func $my_function (result i32)
    v128.const i32x4 1 3 -5 0
    v128.const i16x8 4 -2 -1 0 0 0 0 0
    i32x4.dot_i16x8_s
    i32x4.extract_lane 0
  )
)

阴道文章的例子有:

dot([1 3 -5], [4 -2 -1]) returns 3

利用网上评估这样做的恰当方式是什么?

Answer 1

The dot product in wasm must be interpreted as i32x4 being the output, and i16x8 being the input. This indeed corresponds to Intel pmaddwd or pair-wise multiply add words into double words. The intrinsic is also implementation specific, as -32768 **2 * 2 overflows int32_t.

To compute dot product of two 3-element vectors, one must just unroll it as a[0]*b[0]+a[1]*b[1].... Allocating a full v128 for just those three values might help the JIT in optimising.

Answer 2

如@harold评论和@Aki提到,i32x4.dot_i16x8_ Webassembly号指令似乎与 pmaddwd 相对。

我确实认为在

www.un.org/Depts/DGACM/index_spanish.htm Integer dot Products

i32x4.dot_i16x8_s(a: v128, b: v128) -> v128

两个矢志中道道合器在16轨道上签字,并加上整个32倍结果的相邻。

虽然不能将该指示直接用作“dot产品操作者,但正如我所希望的那样,这仍然有助于这种执行。

<代码>pmaddwd的描述是:

1. 通过源歌剧(第二次歌剧)的相应签名词,将个人签名的目的地歌剧(第一演剧)的词句多出,产生临时签名的双词结果。随后,在目的地歌剧中总结并储存了相邻的双词结果。

And this illustration is helpful:

自2006年以来我想使用3个元素的病媒,我可以把矢量输入到0s上,然后从厕所中提取资金,最后再补充。

Dot product implementation for 3D vectors

考虑到wikipedia article :

[1 3 -5] dot [4 -2 -1] = 3

我履行了一项职能,即计算这些病媒的dot Products:

(module
  (func (export "calc_dot") (result i32)
    i32.const 1
    i32.const 3
    i32.const -5

    i32.const 4
    i32.const -2
    i32.const -1

    call $dot3
    return
  )

  (func $dot3 (param i32) (param i32) (param i32) (param i32) (param i32) (param i32) (result i32)
    (local v128)

    ;; create vector from first 3 params
    (i16x8.splat (i32.const 0))
    (i16x8.replace_lane 0 (local.get 0))
    (i16x8.replace_lane 1 (local.get 1))
    (i16x8.replace_lane 2 (local.get 2))

    ;; create vector from last 3 params
    (i16x8.splat (i32.const 0))
    (i16x8.replace_lane 0 (local.get 3))
    (i16x8.replace_lane 1 (local.get 4))
    (i16x8.replace_lane 2 (local.get 5))

    ;; integer dot product
    (local.set 6 (i32x4.dot_i16x8_s))

    (i32x4.extract_lane 0 (local.get 6))
    (i32x4.extract_lane 1 (local.get 6))
    i32.add

    return
  )
)

Dot product implementation for 3D vectors

友情链接