add_dense_sparse_worker_hybrid_cpu Class — pytorch Architecture
Architecture documentation for the add_dense_sparse_worker_hybrid_cpu class in SparseTensorMath.cpp from the pytorch codebase.
Entity Profile
Source Code
aten/src/ATen/native/sparse/SparseTensorMath.cpp lines 618–647
template <typename scalar_t>
static inline void add_dense_sparse_worker_hybrid_cpu(Tensor& r, const Scalar& value, const SparseTensor& sparse, const Tensor& indices, const Tensor& values) {
// Get the dense dimension element numbers of hybrid sparse tensor
int64_t values_dense_size = values.stride(0);
TORCH_CHECK(values.is_contiguous());
scalar_t* v_ptr = values.data_ptr<scalar_t>();
scalar_t* r_ptr = r.data_ptr<scalar_t>();
TORCH_CHECK(r_ptr != nullptr);
auto indices_accessor = indices.accessor<int64_t, 2>();
scalar_t cast_value = value.to<scalar_t>();
auto sparse_dim = sparse.sparse_dim();
std::vector<int64_t> result_stride(sparse_dim);
for (auto d : c10::irange(sparse_dim)) {
result_stride[d] = r.stride(d);
}
at::parallel_for(0, sparse._nnz(), 0, [&](int64_t start, int64_t end) {
for (auto k: c10::irange(start, end)) {
auto r_index = r_ptr;
for (auto d: c10::irange(sparse_dim)) {
r_index += result_stride[d] * indices_accessor[d][k];
}
auto v_index = v_ptr + k * values_dense_size;
at::native::cpublas::axpy<scalar_t>(values_dense_size, cast_value, v_index, 1, r_index, 1);
}
});
}
Source
Analyze Your Own Codebase
Get architecture documentation, dependency graphs, and domain analysis for your codebase in minutes.
Try Supermodel Free