0.1 Sep 2015
|
Neural Network
|
Feed forward neural network, including CNN, MLP
|
done
|
|
|
RBM-like model, including RBM
|
done
|
|
|
Recurrent neural network, including standard RNN
|
done
|
|
Architecture
|
One worker group on single node (with data partition)
|
done
|
|
|
|
done |
|
|
Distributed Hogwild
|
done |
|
|
|
done |
|
|
|
done |
|
|
Load-balance among servers
|
done
|
|
Failure recovery
|
Checkpoint and restore
|
done |
|
Tools
|
Installation with GNU auto tools
|
done
|
0.2 Jan 2016 |
Neural Network
|
Feed forward neural network, including AlexNet, cuDNN layers, etc.
|
done
|
|
|
Recurrent neural network, including GRULayer and BPTT
|
done |
|
|
Model partition and hybrid partition
|
done |
|
Tools
|
Integration with Mesos for resource management
|
done |
|
|
Prepare Docker images for deployment
|
done |
|
|
Visualization of neural net and debug information
|
done |
|
Binding
|
Python binding for major components
|
done |
|
GPU
|
Single node with multiple GPUs
|
done |
0.3 April 2016 |
GPU
|
Multiple nodes, each with multiple GPUs
|
done |
|
|
|
done |
|
|
Support cuDNN v4
|
done
|
|
Installation
|
Remove dependency on ZeroMQ, CZMQ, Zookeeper for single node training
|
done |
|
Updater
|
Add new SGD updaters including Adam, AdamMax and AdaDelta
|
done |
|
Binding
|
Enhance Python binding for training
|
done |
1.0 Aug 2016 |
Programming abstraction
|
Tensor with linear algebra, neural net and random operations |
|
|
|
Updater for distributed parameter updating |
|
|
Optimization
|
Execution and memory optimization
|
|
|
Hardware
|
Use Cuda and Cudnn for Nvidia GPU
|
|
|
|
Use OpenCL for AMD GPU or other devices
|
|
|
Cross-platform
|
To extend from Linux to MacOS and Windows
|
|
|
Examples
|
Speech recognition example
|
|
|
|
Large image models, e.g., [VGG](https://arxiv.org/pdf/1409.1556.pdf) and [Residual Net](http://arxiv.org/abs/1512.03385) |
|