1.9.5. Half Precision Conversion And Data Movement

[Half Precision Intrinsics]

Functions

__CUDA_FP16_DECL__ __half2 __float22half2_rn ( const float2 a )
Converts both components of float2 number to half precision in round-to-nearest mode and returns half2 with converted values.
__CUDA_FP16_DECL__ __half __float2half ( const float  a )
Converts float number to half precision in round-to-nearest mode and returns half with converted value.
__CUDA_FP16_DECL__ __half2 __float2half2_rn ( const float  a )
Converts input to half precision in round-to-nearest mode and populates both halves of half2 with converted value.
__CUDA_FP16_DECL__ __half2 __floats2half2_rn ( const float  a, const float  b )
Converts both input floats to half precision in round-to-nearest mode and returns half2 with converted values.
__CUDA_FP16_DECL__ float2 __half22float2 ( const __half2 a )
Converts both halves of half2 to float2 and returns the result.
__CUDA_FP16_DECL__ float __half2float ( const __half a )
Converts half number to float.
__CUDA_FP16_DECL__ __half2 __half2half2 ( const __half a )
Returns half2 with both halves equal to the input value.
__CUDA_FP16_DECL__ __half2 __halves2half2 ( const __half a, const __half b )
Combines two half numbers into one half2 number.
__CUDA_FP16_DECL__ float __high2float ( const __half2 a )
Converts high 16 bits of half2 to float and returns the result.
__CUDA_FP16_DECL__ __half __high2half ( const __half2 a )
Returns high 16 bits of half2 input.
__CUDA_FP16_DECL__ __half2 __high2half2 ( const __half2 a )
Extracts high 16 bits from half2 input.
__CUDA_FP16_DECL__ __half2 __highs2half2 ( const __half2 a, const __half2 b )
Extracts high 16 bits from each of the two half2 inputs and combines into one half2 number.
__CUDA_FP16_DECL__ float __low2float ( const __half2 a )
Converts low 16 bits of half2 to float and returns the result.
__CUDA_FP16_DECL__ __half __low2half ( const __half2 a )
Returns low 16 bits of half2 input.
__CUDA_FP16_DECL__ __half2 __low2half2 ( const __half2 a )
Extracts low 16 bits from half2 input.
__CUDA_FP16_DECL__ __half2 __lowhigh2highlow ( const __half2 a )
Swaps both halves of the half2 input.
__CUDA_FP16_DECL__ __half2 __lows2half2 ( const __half2 a, const __half2 b )
Extracts low 16 bits from each of the two half2 inputs and combines into one half2 number.

Functions

__CUDA_FP16_DECL__ __half2 __float22half2_rn ( const float2 a )
Converts both components of float2 number to half precision in round-to-nearest mode and returns half2 with converted values.
Returns

Returns half2 which has corresponding halves equal to the converted float2 components.

Description

Converts both components of float2 to half precision in round-to-nearest mode and combines the results into one half2 number. Low 16 bits of the return value correspond to a.x and high 16 bits of the return value correspond to a.y.

__CUDA_FP16_DECL__ __half __float2half ( const float  a )
Converts float number to half precision in round-to-nearest mode and returns half with converted value.
Returns

Returns half result with converted value.

Description

Converts float number a to half precision in round-to-nearest mode.

__CUDA_FP16_DECL__ __half2 __float2half2_rn ( const float  a )
Converts input to half precision in round-to-nearest mode and populates both halves of half2 with converted value.
Returns

Returns half2 with both halves equal to the converted half precision number.

Description

Converts input a to half precision in round-to-nearest mode and populates both halves of half2 with converted value.

__CUDA_FP16_DECL__ __half2 __floats2half2_rn ( const float  a, const float  b )
Converts both input floats to half precision in round-to-nearest mode and returns half2 with converted values.
Returns

Returns half2 which has corresponding halves equal to the converted input floats.

Description

Converts both input floats to half precision in round-to-nearest mode and combines the results into one half2 number. Low 16 bits of the return value correspond to the input a, high 16 bits correspond to the input b.

__CUDA_FP16_DECL__ float2 __half22float2 ( const __half2 a )
Converts both halves of half2 to float2 and returns the result.
Returns

Returns converted float2.

Description

Converts both halves of half2 input a to float2 and returns the result.

__CUDA_FP16_DECL__ float __half2float ( const __half a )
Converts half number to float.
Returns

Returns float result with converted value.

Description

Converts half number a to float.

__CUDA_FP16_DECL__ __half2 __half2half2 ( const __half a )
Returns half2 with both halves equal to the input value.
Returns

Returns half2 with both halves equal to the input a.

Description

Returns half2 number with both halves equal to the input ahalf number.

__CUDA_FP16_DECL__ __half2 __halves2half2 ( const __half a, const __half b )
Combines two half numbers into one half2 number.
Returns

Returns half2 number which has one half equal to a and the other to b.

Description

Combines two input half number a and b into one half2 number. Input a is stored in low 16 bits of the return value, input b is stored in high 16 bits of the return value.

__CUDA_FP16_DECL__ float __high2float ( const __half2 a )
Converts high 16 bits of half2 to float and returns the result.
Returns

Returns high 16 bits of a converted to float.

Description

Converts high 16 bits of half2 input a to 32 bit floating point number and returns the result.

__CUDA_FP16_DECL__ __half __high2half ( const __half2 a )
Returns high 16 bits of half2 input.
Returns

Returns half which contains high 16 bits of the input.

Description

Returns high 16 bits of half2 input a.

__CUDA_FP16_DECL__ __half2 __high2half2 ( const __half2 a )
Extracts high 16 bits from half2 input.
Returns

Returns half2 with both halves equal to high 16 bits from the input.

Description

Extracts high 16 bits from half2 input a and returns a new half2 number which has both halves equal to the extracted bits.

__CUDA_FP16_DECL__ __half2 __highs2half2 ( const __half2 a, const __half2 b )
Extracts high 16 bits from each of the two half2 inputs and combines into one half2 number.
Returns

Returns half2 which contains high 16 bits from a and b.

Description

Extracts high 16 bits from each of the two half2 inputs and combines into one half2 number. High 16 bits from input a is stored in low 16 bits of the return value, high 16 bits from input b is stored in high 16 bits of the return value.

__CUDA_FP16_DECL__ float __low2float ( const __half2 a )
Converts low 16 bits of half2 to float and returns the result.
Returns

Returns low 16 bits of a converted to float.

Description

Converts low 16 bits of half2 input a to 32 bit floating point number and returns the result.

__CUDA_FP16_DECL__ __half __low2half ( const __half2 a )
Returns low 16 bits of half2 input.
Returns

Returns half which contains low 16 bits of the input.

Description

Returns low 16 bits of half2 input a.

__CUDA_FP16_DECL__ __half2 __low2half2 ( const __half2 a )
Extracts low 16 bits from half2 input.
Returns

Returns half2 with both halves equal to low 16 bits from the input.

Description

Extracts low 16 bits from half2 input a and returns a new half2 number which has both halves equal to the extracted bits.

__CUDA_FP16_DECL__ __half2 __lowhigh2highlow ( const __half2 a )
Swaps both halves of the half2 input.
Returns

Returns half2 with halves swapped.

Description

Swaps both halves of the half2 input and returns a new half2 number with swapped halves.

__CUDA_FP16_DECL__ __half2 __lows2half2 ( const __half2 a, const __half2 b )
Extracts low 16 bits from each of the two half2 inputs and combines into one half2 number.
Returns

Returns half2 which contains low 16 bits from a and b.

Description

Extracts low 16 bits from each of the two half2 inputs and combines into one half2 number. Low 16 bits from input a is stored in low 16 bits of the return value, low 16 bits from input b is stored in high 16 bits of the return value.