Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 7 Next »

Find out more about the HPCC Systems Summer Intern Program including how to apply and read this blog introducing the students and their projects.

13 students joined our intern program in 2024. Our students presented about their projects to the team during the year and 12 of them entered our 2024 Poster Contest held at the virtual HPCC Systems Community Day Summit in October 2024.

Meet the Class of 2024

Name

Project Title

Description

Mentor(s)

Resources

Charan Nagaraj     

Bachelor of Computer Science RV College of Engineering, India

Migrate and Improve Regression Testing in GitHub actions

At HPCC Systems, we use two main test systems: Overnight Build and Test (OBT) and Smoketest. Regression testing of ECL bundles, initially handled by OBT, is now integrated into Continuous Integration (CI) using GitHub Actions, automatically testing bundles when a pull request (PR) is raised. Additionally, I implemented automated testing of hyperlinks in our documentation files, also using GitHub Actions. This ensures that broken links are detected early, keeping the documentation accurate without requiring manual verification.

Attila Vamos

View Poster

View Blog

Eatesam Khan            

Masters in Computer Science California State University, USA

Create a New HPCC Command Line Tool

As part of my internship, I developed a command-line tool that simplifies interaction with HPCC Systems ESDL services, offering powerful features for describing and testing services. The describe command provides detailed information about available services, methods, and request-response structures, while the test command allows users to send test requests, supporting various formats like XML and JSON. Key options include setting authentication credentials and server details. A standout feature is dynamic tab auto-completion, which helps users input commands accurately and efficiently.

Terrence Asselin

Tim Klemm

View Poster

View Blog

El Arbi Belfarsi                 

PhD in Computer Science Kennesaw State University, USA

Update and Improve the Generation of Platform Artifacts for HPCC Systems Builds

This project focuses on transitioning HPCC Systems CI/CD workflow from Jenkins to GitHub Actions, automating platform artifact generation using Python. A Python script replaces an existing web service, handling tasks like fetching assets, extracting metadata, and saving data as JSON. The workflow automates setup of AWS credentials, Docker image management, and uploads to GitHub and AWS S3, with security provided by GitHub secrets. This project streamlines the build process, reduces manual effort, and improves automation, benefiting the HPCC Systems platform and the open-source community.

Michael Gardner

Ming Wang

View Poster

View Blog

Elizabeth Lorti       

Bachelor of International Development,
King's College, UK

Technology Marketing and Branding

For this year's Tech Summit, I coordinated communication with stakeholders, collected speaker bios and abstracts for uploads, and worked closely with the project management team. I also managed all social media channels and key event aspects. Leveraging two years of prior experience, including last year's Summit, I efficiently referenced past spreadsheets to streamline bio and content management.

Jessica Lorti

View Poster

View Blog

Gagana Premnath            

Masters in Computer Science Syracuse University, USA

Integration of HPCC Systems Terraform CI with GitHub Actions

This project integrates HPCC Systems Terraform-based infrastructure management with GitHub Actions to streamline the deployment of HPCC Systems clusters. Terraform modules - vnet, storage, aks, and HPCC Systems - are deployed sequentially using GitHub Actions workflows. Key steps include configuring Terraform, managing Azure authentication, handling data persistence, and securing sensitive information with GitHub Secrets. By automating deployments through GitHub Actions, the project ensures consistency, reduces manual intervention, and improves deployment efficiency, while fostering collaborative development and maintaining reliable, version-controlled infrastructure across environments.

Godji Fortil

Ming Wang

View Poster

View Blog

Girikratna Premnath            

Bachelor of Tech Data Science
Mukesh Patel School of Technology, Management and Engineering, India

Integration of PowerBI with HPCC Systems platform

My project established a connection between Power BI and HPCC Systems using WsSQL for SQL-based data retrieval. I automated SOAP requests from Power BI to HPCC Systems, enhancing data analytics and visualization workflows. Using a Bare Metal System on WSL, I handled the Power BI integration with M code/Power Query and successfully tested it on various data sample sizes, ensuring smooth functionality.

View Poster

Harsh Raj          

Bachelor of Tech Data Science
Mukesh Patel School of Technology, Management and Engineering, India

Vehicle Build Contributory System

View Poster

Ilhan Gelle            

Bachelor of Computer Science University of Texas, USA

Test Suite for the HPCC Systems Parquet Plugin

Jack del Vecchio

View Poster

View Blog

Nisha Bagdwal              

Masters in Computer Science Kennesaw State University, USA

Develop an Automated ECL Watch Test Suite

Attila Vamos

Chris Lo

View Poster

View Blog

Rohith Surya Podugu                

Masters in Computer Science California State University, USA

Refactoring and Releasing PyHPCC

Amila de Silva

View Poster

View Blog

Sabrina Harris                             

Masters in Applied Data Science New College of Florida, FL, USA

HPCC Systems Machine Learning Tutorials

Bob Foreman

View Poster

View Blog

Scarlett Huang 

High School Student at Dreyfoos School of the Arts West Palm Beach, FL, USA 

Investigate Third-Party Environments (Google Big Query)

Ming Wang

Terrence Asselin

View Poster

View Blog

Shounak Joshi 

Bachelor of Computer Science University of Florida, USA    

Investigate Third-Party Environments (Azure Synapse Analytics)

Ming Wang

Michael Gardner

View Poster

View Blog

  • No labels