Frameworks for Training

Training of ML models in Kubeflow

Chainer Training

See Kubeflow v0.6 docs for instructions on using Chainer for training

MPI Training

Instructions for using MPI for training

MXNet Training

Instructions for using MXNet

PyTorch Training

Instructions for using PyTorch

Job Scheduling

How to schedule a job with gang-scheduling

TensorFlow Training (TFJob)

Using TFJob to train a model with TensorFlow