Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 3 Next »

This project is available as a student work experience opportunity with HPCC Systems this summer. Curious about other projects we are offering? Take a look at our Ideas List

Find out about the HPCC Systems Summer Internship Program.

The project proposal application period for 2020 summer internships is now open. Please see our list of Available Projects. Contact the project mentor for more information and to discuss your ideas. You may suggest a project idea of your own but it must leverage HPCC Systems in some way. Contact us for support from an HPCC Systems mentor with experience in your chosen project area.

Project Description

Add more details as needed

Create tools in ECL to be added to the HPCC Systems machine learning library in the form of bundle to prepare data. Some examples of tools to be added are

  • One-hot encoding
  • Normalization
  • Scaling
  • Sampling

The project is open to accept other suggested tools that users of the HPCC Systems ML library may find useful.

Completion of this project involves:

  • Implementation of proposed pre-processing tools in ECL
  • Unit Testing
  • Code check in on Github
  • Documentation
  • White Paper

By the mid term review we would expect you to have:

  • TBC. 
  • <What must be completed to pass the evaluation and continue on to complete the project>
Mentor

TBD
Contact Details

Backup Mentor: TBD
Contact Details 

Skills needed
  • Knowledge of ECL. Training manuals and online courses are available on the HPCC Systems website.
  • Knowledge of distributed computing techniques
  • Familiar with HPCC Systems Machine Learning Library
  • Familiar with Data Pre-Processing
  • Familiar with Github
Deliverables
  • Midterm

    • Implement at least 60% of the proposed tools

    End of project

    • Implement 100% of the proposed tools
    • Unit Testing
    • Code check in on Github
    • Documentation
    • White Paper
Other resources
  • No labels