clamp_min Function - pytorch Architecture
Architecture documentation for the clamp_min function template specialization for Vectorized<Half> in vec256_half.h from the pytorch codebase.
Source Code
aten/src/ATen/cpu/vec/vec256/vec256_half.h lines 178–189
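template <>
Vectorized<Half> inline clamp_min(
    const Vectorized<Half>& a,
    const Vectorized<Half>& min) {
  __m256 a_lo, a_hi;
  __m256 min_lo, min_hi;
  // Widen the 16 half-precision lanes of each input into two 8-lane fp32 vectors.
  cvtfp16_fp32(__m256i(a), a_lo, a_hi);
  cvtfp16_fp32(__m256i(min), min_lo, min_hi);
  // Lane-wise max in fp32. With this operand order, a lane where either
  // input is NaN takes its value from a (the second operand).
  auto o1 = _mm256_max_ps(min_lo, a_lo);
  auto o2 = _mm256_max_ps(min_hi, a_hi);
  // Narrow both halves back to fp16 and repack them into one vector.
  return cvtfp32_fp16(o1, o2);
}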
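To make the lane-wise behavior easier to reason about, here is a minimal scalar sketch of what the specialization computes. It is not PyTorch code: the names clamp_min_lane, clamp_min_ref, and clamp_min_self_check are hypothetical, and plain float stands in for Half. The stand-in is reasonable because widening fp16 to fp32 is exact and the max only ever selects one of its inputs. The comparison is written to mirror the operand order of _mm256_max_ps(min, a), which returns its second operand when either lane is NaN.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <limits>

// Hypothetical scalar model of one lane of _mm256_max_ps(min, a):
// picks min only when min > a; otherwise (including any NaN) picks a.
inline float clamp_min_lane(float a, float min) {
    return (min > a) ? min : a;
}

// Hypothetical reference over 16 lanes, the lane count of a 256-bit
// vector of 16-bit halves. Floats stand in for Half values here.
inline std::array<float, 16> clamp_min_ref(
    const std::array<float, 16>& a,
    const std::array<float, 16>& min) {
    std::array<float, 16> out{};
    for (std::size_t i = 0; i < out.size(); ++i) {
        out[i] = clamp_min_lane(a[i], min[i]);
    }
    return out;
}

// Small self-check covering the ordinary and NaN cases.
inline void clamp_min_self_check() {
    const float nan = std::numeric_limits<float>::quiet_NaN();
    assert(clamp_min_lane(1.0f, 2.0f) == 2.0f);  // below the bound: raised to min
    assert(clamp_min_lane(3.0f, 2.0f) == 3.0f);  // above the bound: unchanged
    const float r = clamp_min_lane(nan, 2.0f);
    assert(r != r);                              // NaN in a propagates
    assert(clamp_min_lane(1.0f, nan) == 1.0f);   // NaN in min leaves a unchanged
}
```

Calling clamp_min_self_check() exercises the four cases. The sketch covers only the selection logic; it does not model the AVX2 conversions performed by cvtfp16_fp32 and cvtfp32_fp16.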