MR Class — pytorch Architecture
Architecture documentation for MR, a template parameter of the HGEMM_L1 class in hgemm.cc from the PyTorch codebase.
Source Code
aten/src/ATen/native/quantized/cpu/qnnpack/bench/hgemm.cc lines 146–161
template <uint32_t MR, uint32_t NR, uint32_t KR>
class HGEMM_L1 : public HGEMM {
 public:
  inline HGEMM_L1() : HGEMM(MR, NR, KR) {
    cpuinfo_initialize();
    const size_t l1d_size = cpuinfo_get_l1d_cache(0)->size;
    const size_t l1d_reserve = 512;
    kc_ = ((l1d_size - l1d_reserve) / sizeof(uint16_t) - mr() * nr()) /
        (mr() + nr());
    if (kr() != 1) {
      kc_ = kc_ / kr() * kr();
    } else {
      kc_ = kc_ / nr() * nr();
    }
  }
};
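The constructor's kc_ arithmetic can be followed more easily with the cache size passed in as a plain number. The sketch below is not part of hgemm.cc: compute_kc is a hypothetical helper that repeats the same integer arithmetic without calling cpuinfo. One plausible reading, offered as an assumption rather than something the source states, is that the formula picks the largest kc for which an mr x kc panel, a kc x nr panel, and an mr x nr output tile of 16-bit elements fit in the L1 data cache minus the 512-byte reserve, and then rounds kc down to a multiple of kr (or of nr when kr is 1).

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical helper (not in hgemm.cc): mirrors the kc_ computation from the
// HGEMM_L1 constructor, taking the L1 data cache size in bytes as an argument.
inline std::size_t compute_kc(std::size_t l1d_size, std::uint32_t mr,
                              std::uint32_t nr, std::uint32_t kr) {
  const std::size_t l1d_reserve = 512;  // bytes held back, as in the constructor
  assert(l1d_size > l1d_reserve);

  // Number of 16-bit elements available after the reserve.
  const std::size_t budget = (l1d_size - l1d_reserve) / sizeof(std::uint16_t);
  const std::size_t tile = static_cast<std::size_t>(mr) * nr;
  assert(budget >= tile);

  // Largest kc with kc * (mr + nr) + mr * nr <= budget.
  const std::size_t kc = (budget - tile) / (static_cast<std::size_t>(mr) + nr);

  // Round down to a multiple of kr, or of nr when kr == 1.
  const std::size_t step = (kr != 1) ? kr : nr;
  return kc / step * step;
}
```

For example, with an assumed 32 KiB L1 data cache and MR = NR = 8, KR = 1, the budget is (32768 - 512) / 2 = 16128 elements, the unrounded value is (16128 - 64) / 16 = 1004, and rounding down to a multiple of 8 gives compute_kc(32768, 8, 8, 1) == 1000.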