QLinearInt8 Class — pytorch Architecture
Architecture documentation for the QLinearInt8 class in Linear.cpp from the pytorch codebase.
Entity Profile
Source Code
aten/src/ATen/native/quantized/cudnn/Linear.cpp lines 343–357
class QLinearInt8 final {
public:
static at::Tensor run(
at::Tensor act,
const c10::intrusive_ptr<LinearPackedParamsBase>& packed_weight,
double output_scale,
int64_t output_zero_point) {
// TODO: check all zero_points are zero/all tensors are symmetrically quantized
if constexpr (kReluFused) {
return packed_weight->apply_relu(std::move(act), output_scale, output_zero_point);
} else {
return packed_weight->apply(std::move(act), output_scale, output_zero_point);
}
}
};
Source
Analyze Your Own Codebase
Get architecture documentation, dependency graphs, and domain analysis for your codebase in minutes.
Try Supermodel Free