| Parameter | Description |
|---|---|
| a | Vector a |
v128 Vector
Compute the square root of the lower single-precision (32-bit) floating-point element in "a", store the result in the lower element of "dst", and copy the upper 3 packed elements from "a" to the upper elements of "dst".