Distributed Neural Network Training and Prediction
This project was completed by a student accepted on to the 2023 HPCC Systems Intern Program.
Project Description
Neural Networks have become a key mechanism for the analysis of many types of data. In particular they have been found to be very effective for the analysis of complex datasets such as images, video, and time-series, where classical methods have proven inadequate. The Generalized Neural Network Bundle (GNN) allows the ECL programmer to combine the parallel processing power of HPCC Systems with the powerful Neural Network capabilities of Keras and Tensorflow. The GNN bundle attaches each node in the HPCC Systems cluster to an independent Keras/Tensorflow environment and coordinates among those environments to provide a distributed environment that can parallelize all phases of Keras/Tensorflow usage. Most importantly, this coordination is transparent to the GNN user, who can program as if running on a single node.
Despite the GNN implementation, endless variations and combinations of the neural network techniques continue to be proposed in order to push the state of the art in machine learning and optimization. This project will research, implement, and evaluate alternative methods for distributed training of Neural Networks. Then, implement the most promising methods on the HPCC Systems Platform using the GNN bundle.
This work involves the design and implementation of alternative distribution models for parallelized training and evaluation of neural networks using the HPCC Systems GNN bundle and Tensorflow. The student will evaluate the various methods, and document the new capabilities, including guidance as to which algorithms are preferable for different scenarios.
If you are interested in this project, please contact the mentor shown below.
Mentor | Lili Xu Backup Mentor: Roger Dev |
Skills needed |
|
Other resources |
All pages in this wiki are subject to our site usage guidelines.