Apache SINGA
A distributed deep learning platform.
Implementation of GPU host code.
#include "tensor.h"
Namespaces
mshadow
namespace for mshadow
Functions
void mshadow::InitTensorEngine (int device_id=0)
Initializes the tensor engine by calling the initialization functions of dependent libraries. Call this function before any GPU tensor operation; it is not needed when tensors are used only on the CPU.
void mshadow::ShutdownTensorEngine (void)
Shuts down the tensor engine. Call this function after all GPU tensor operations have finished; it is not needed when tensors are used only on the CPU.
template<int dim>
void mshadow::AllocSpace (Tensor< gpu, dim > &obj, bool pad=MSHADOW_ALLOC_PAD)
Refer to the comment on the CPU version.
template<int dim>
void mshadow::FreeSpace (Tensor< gpu, dim > &obj)
Refer to the comment on the CPU version.
template<typename A, typename B, int dim>
void mshadow::Copy (Tensor< A, dim > _dst, Tensor< B, dim > _src, cudaMemcpyKind kind)
template<int dim>
void mshadow::Copy (Tensor< cpu, dim > dst, const Tensor< gpu, dim > &src)
Refer to the comment on the CPU version.
template<int dim>
void mshadow::Copy (Tensor< gpu, dim > dst, const Tensor< gpu, dim > &src)
Refer to the comment on the CPU version.
template<int dim>
void mshadow::Copy (Tensor< gpu, dim > dst, const Tensor< cpu, dim > &src)
Refer to the comment on the CPU version.
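The ordering requirement on InitTensorEngine and ShutdownTensorEngine (initialize before any GPU tensor operation, shut down after the last one) can be enforced with a small RAII guard. The following is only a sketch: TensorEngineGuard, CallLog and RunGuardedWorkload are hypothetical names that are not part of mshadow, and the two engine functions are replaced by stand-ins that merely record the call order, so the code compiles without mshadow or CUDA.

```cpp
#include <string>
#include <vector>

namespace sketch {

// Records the order of calls so the sketch can be checked without a GPU.
inline std::vector<std::string>& CallLog() {
  static std::vector<std::string> log;
  return log;
}

// Stand-in for mshadow::InitTensorEngine (assumption: same signature).
inline void InitTensorEngine(int device_id = 0) {
  CallLog().push_back("init:" + std::to_string(device_id));
}

// Stand-in for mshadow::ShutdownTensorEngine.
inline void ShutdownTensorEngine() {
  CallLog().push_back("shutdown");
}

// Hypothetical helper: initializes the engine on construction and shuts it
// down on destruction, so the shutdown call cannot be forgotten.
class TensorEngineGuard {
 public:
  explicit TensorEngineGuard(int device_id = 0) { InitTensorEngine(device_id); }
  ~TensorEngineGuard() { ShutdownTensorEngine(); }
  TensorEngineGuard(const TensorEngineGuard&) = delete;
  TensorEngineGuard& operator=(const TensorEngineGuard&) = delete;
};

// Runs a placeholder workload inside the guarded scope and returns the log.
inline std::vector<std::string> RunGuardedWorkload(int device_id) {
  CallLog().clear();
  {
    TensorEngineGuard guard(device_id);
    CallLog().push_back("gpu-work");  // GPU tensor operations would go here
  }  // guard goes out of scope here, which calls ShutdownTensorEngine()
  return CallLog();
}

}  // namespace sketch
```

With the real library, the guard's constructor and destructor would call the mshadow functions instead of the stand-ins. Code that uses tensors only on the CPU does not need the guard at all.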
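The Copy overloads listed above differ only in the device tags of their arguments, which suggests that each typed overload selects a transfer direction and forwards to the generic overload that takes a cudaMemcpyKind. The sketch below illustrates that tag-dispatch pattern under this assumption; it is not the library's code. The names CopyKind, LastKind and RoundTrip are hypothetical, the tensor type is reduced to a flat buffer without the dim parameter, and "device" memory is simulated with host memory so that the example builds without CUDA.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <vector>

namespace sketch {

struct cpu {};  // device tags, named after the tags in the signatures above
struct gpu {};

// Stand-in for cudaMemcpyKind (hypothetical; the real enum comes from CUDA).
enum class CopyKind { HostToDevice, DeviceToHost, DeviceToDevice };

// Minimal flat tensor: a raw pointer plus an element count.
template <typename Device>
struct Tensor {
  float* dptr;
  std::size_t size;
};

// Remembers the direction of the most recent copy, for inspection only.
inline CopyKind& LastKind() {
  static CopyKind kind = CopyKind::HostToDevice;
  return kind;
}

// Generic copy: the direction is passed explicitly. A real GPU version would
// hand `kind` to a CUDA memcpy call; here the bytes are copied on the host.
template <typename A, typename B>
void Copy(Tensor<A> dst, Tensor<B> src, CopyKind kind) {
  assert(dst.size == src.size);
  std::memcpy(dst.dptr, src.dptr, src.size * sizeof(float));
  LastKind() = kind;
}

// Typed overloads: the device tags determine the direction.
inline void Copy(Tensor<cpu> dst, const Tensor<gpu>& src) {
  Copy(dst, src, CopyKind::DeviceToHost);
}
inline void Copy(Tensor<gpu> dst, const Tensor<gpu>& src) {
  Copy(dst, src, CopyKind::DeviceToDevice);
}
inline void Copy(Tensor<gpu> dst, const Tensor<cpu>& src) {
  Copy(dst, src, CopyKind::HostToDevice);
}

// Sends data host -> device -> device -> host and returns the result.
inline std::vector<float> RoundTrip(const std::vector<float>& input) {
  std::vector<float> host_in(input);
  std::vector<float> dev_a(input.size()), dev_b(input.size());
  std::vector<float> host_out(input.size());
  Tensor<cpu> h_in{host_in.data(), host_in.size()};
  Tensor<gpu> d_a{dev_a.data(), dev_a.size()};
  Tensor<gpu> d_b{dev_b.data(), dev_b.size()};
  Tensor<cpu> h_out{host_out.data(), host_out.size()};
  Copy(d_a, h_in);   // host to device
  Copy(d_b, d_a);    // device to device
  Copy(h_out, d_b);  // device to host
  return host_out;
}

}  // namespace sketch
```

The benefit of this layout is that callers never pass the direction themselves: the compiler picks the overload from the tensor types, so a mismatched direction cannot be written by accident.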