compute_cpu Class — pytorch Architecture
Architecture documentation for the compute_cpu class in Repeat.cpp from the pytorch codebase.
Entity Profile
Source Code
aten/src/ATen/native/Repeat.cpp lines 17–38
template <typename index_t>
static void compute_cpu(
const index_t* repeat_ptr,
const int64_t* cumsum_ptr,
index_t* result_ptr,
int64_t size,
int64_t result_size) {
TORCH_CHECK(
(result_size == cumsum_ptr[size - 1]),
"allocated size does not match required size");
at::parallel_for(0, size, 1, [&](int64_t i_begin, int64_t i_end) {
for (const auto i : c10::irange(i_begin, i_end)) {
int64_t end = cumsum_ptr[i];
index_t size = repeat_ptr[i];
TORCH_CHECK((size >= 0), "repeats can not be negative");
int64_t start = end - size;
for (const auto j : c10::irange(start, end)) {
result_ptr[j] = i;
}
}
});
}
Source
Analyze Your Own Codebase
Get architecture documentation, dependency graphs, and domain analysis for your codebase in minutes.
Try Supermodel Free