Layer¶

Python API¶

Python layers wrap the C++ layers to provide simpler construction APIs.

Example usages:

from singa import layer
from singa import tensor
from singa import device

layer.engine = 'cudnn'  # to use cudnn layers
dev = device.create_cuda_gpu()

# create a convolution layer
conv = layer.Conv2D('conv', 32, 3, 1, pad=1, input_sample_shape=(3, 32, 32))
conv.to_device(dev)  # move the layer data onto a CudaGPU device
x = tensor.Tensor((3, 32, 32), dev)
x.uniform(-1, 1)
y = conv.foward(True, x)

dy = tensor.Tensor()
dy.reset_like(y)
dy.set_value(0.1)
# dp is a list of tensors for parameter gradients
dx, dp = conv.backward(kTrain, dy)

singa.layer.engine = 'cudnn'¶

engine is the prefix of layer identifier.

The value could be one of [‘cudnn’, ‘singacpp’, ‘singacuda’, ‘singacl’], for layers implemented using the cudnn library, Cpp, Cuda and OpenCL respectively. For example, CudnnConvolution layer is identified by ‘cudnn_convolution’; ‘singacpp_convolution’ is for Convolution layer; Some layers’ implementation use only Tensor functions, thererfore they are transparent to the underlying devices. For threse layers, they would have multiple identifiers, e.g., singacpp_dropout, singacuda_dropout and singacl_dropout are all for the Dropout layer. In addition, it has an extra identifier ‘singa’, i.e. ‘singa_dropout’ also stands for the Dropout layer.

engine is case insensitive. Each python layer would create the correct specific layer using the engine attribute.

class singa.layer.Layer(name, conf=None, **kwargs)¶

Bases: object

Base Python layer class.

Typically, the life cycle of a layer instance includes:

construct layer without input_sample_shapes, goto 2; construct layer with input_sample_shapes, goto 3;
call setup to create the parameters and setup other meta fields
call forward or access layer members
call backward and get parameters for update

Parameters:	name (str) – layer name

setup(in_shapes)¶

Call the C++ setup function to create params and set some meta data.

Parameters:	in_shapes – if the layer accepts a single input Tensor, in_shapes is a single tuple specifying the inpute Tensor shape; if the layer accepts multiple input Tensor (e.g., the concatenation layer), in_shapes is a tuple of tuples, each for one input Tensor

caffe_layer()¶: Create a singa layer based on caffe layer configuration.

get_output_sample_shape()¶

Called after setup to get the shape of the output sample(s).

Returns:	a tuple for a single output Tensor or a list of tuples if this layer has multiple outputs

param_names()¶

Returns:	a list of strings, one for the name of one parameter Tensor

param_values()¶

Return param value tensors.

Parameter tensors are not stored as layer members because cpp Tensor could be moved onto diff devices due to the change of layer device, which would result in inconsistency.

Returns:	a list of tensors, one for each paramter

forward(flag, x)¶

Forward propagate through this layer.

Parameters:	flag – True (kTrain) for training (kEval); False for evaluating; other values for furture use. x (Tensor or list<Tensor>) – an input tensor if the layer is connected from a single layer; a list of tensors if the layer is connected from multiple layers.
Returns:	a tensor if the layer is connected to a single layer; a list of tensors if the layer is connected to multiple layers;

backward(flag, dy)¶

Backward propagate gradients through this layer.

Parameters:	flag (int) – for future use. dy (Tensor or list<Tensor>) – the gradient tensor(s) y w.r.t the objective loss
Returns:	<dx, <dp1, dp2..>>, dx is a (set of) tensor(s) for the gradient of x , dpi is the gradient of the i-th parameter

to_device(device)¶

Move layer state tensors onto the given device.

Parameters:	device – swig converted device, created using singa.device

as_type(dtype)¶

class singa.layer.Dummy(name, input_sample_shape=None)¶

Parameters:	grad (Tensor) –
Returns:	A list of replicated grad, one per source layer

Layer¶

Python API¶

CPP API¶