...
Name | Project Title | Description | Mentor(s) | Resources |
---|---|---|---|---|
Aryaman Gautam Bachelor of Tech Data Science | HPCC Systems local deployment on K3D cluster | The goal of this project was to establish an initial setup for a local deployment of HPCC Systems on K3D. K3D is a lightweight wrapper to run K3S (Rancher Lab's minimal Kubernetes distribution) in docker which makes it very easy to create single and multi-node K3S clusters in docker. | Xiaoming Wang Godji Fortil Chinmay Desai Sidharth Ganesan | |
Boqiang Li Ph.D. in Computer Science, Clemson University, USA | Convert Generalized Neural Network bundle (GNN) to native Tensorflow 2.0 | Neural Networks have emerged as a powerful tool for analyzing complex datasets like images, video, and time-series data, surpassing classical methods in their effectiveness. To leverage this potential, HPCC Systems offers the Generalized Neural Network Bundle (GNN), which combines the parallel processing capabilities of HPCC Systems with the robust Neural Network functionalities of Keras and TensorFlow. This project upgraded the GNN bundle to utilize the native Tensorflow 2 interface. The upgraded GNN with Tensorflow 2 demonstrated several significant advantages over its previous version. | Lili Xu Roger Dev | View Poster |
Carlos Caceres High School Student | Practical Application of Generative AI Technology | During this project a generalized interface was created for HPCC Systems to access GPT and ChatGPT. From there the steps were taken to use HPCC Systems to train a neural network model capable of classifying faces into different emotions. These emotions would then be processed by the interface to create a call to OpenAI’s API from which an appropriate response would be generated. | Lili Xu Roger Dev | |
Davi Charvi Bachelor of Tech Data Science | Resume analyzer in NLP++ | A Resume Analyzer is the implementation of an approach to apply various techniques for analyzing the resumes a company receives and retrieving the main sections. This project has leveraged the NLP++ plugin to process resumes and extract the main headers and sections of the resume, such as skills, work experience, email, and education. | David de Hilster Umesh Mahind Nandhini Velu | |
Elizabeth Lorti Bachelor of International Development, | HPCC Systems Marketing and Branding | As a returning HPCC Systems intern and one that has worked year-round on maintaining social media, this year, I completed a review of my own social media contributions and strategy to see what could be done to improve, as well as will conducted interviews among stakeholders and recorded minutes to best understand and communicate the needs of the Technology Summit and Community Day stakeholders. | Jessica Lorti | |
Hiroki Sato Masters in Computer Science | Automation of HPCC Systems Cloud Native Deployment to AWS with Terraform | This project leveraged Terraform to explore the deployment of the HPCC Systems containerized application onto AWS Elastic Kubernetes Service cluster (EKS). During the internship, we developed a hpcc-aws-terraform module. This consisted of building a necessary AWS infrastructure such as virtual private cloud (VPC), subnets, necessary security group, EKS cluster and node group. | Wayne Carty Godson Fortil | |
Jessie Mao High School Student | HPCC Systems Deployment with Various Helm Chart Configurations | This project provided two solutions for HPCC Systems deployments. The overrides solution utilizes the default values.yaml file while using other files to modify it. Overrides can be used to make small changes to the values.yaml, and mainly concentrates on Roxie and Thor. The HPCC-lite, on the other hand, does not require a custom values.yaml file, so can be used with other files to create more scenarios. | Xiaoming Wang Godson Fortil | |
Johnny Huang Bachelor of Computer Science | Improve Error Handling and Reporting for Automated Test Systems | This project concentrated primarily on refining the GitHub Actions scripts, a vital tool for automated testing within the HPCC Systems environment. These scripts analyze the logs generated from tests, providing a granular breakdown of the executed tests. I also introduced enhancements to the scripts to improve the fault tolerance of our testing systems. These included adding logic to retry failed actions, increasing the resilience of the system to transient issues, reducing test failures, and decreasing the need for manual interventions. | Attila Vamos | |
K Dheemonth Bachelor of Computer Science and Engineering | Sentiment Analysis in English | |||
Kruthika Pinnada Bachelor of Computer Science and Engineering | Resume Analyzer | |||
Logan Patterson Masters in Data Science | Designing Test Algorithms for Causal Model Discovery Within the HPCC Systems Causality Framework | |||
Narayan Kandel Ph.D. in Computer Science, Clemson University, USA | Enhancing Performance of Distributed Neural Network with GNN Bundle | |||
Nivedha Sivakumar Bachelor of Computer Science | Test Suite for a Roxie Cluster on Kubernetes | |||
Noah Seligson Bachelor of Computer Science | Convert Automated Test Systems from Python2 to Python3 | |||
Ryan Rao High School Student | HPCC Systems Storage Support With Container Storage Interface (CSI) | |||
Sarah Nash Masters in Data Science | Causal Discovery and Validation with Categorical Data | |||
Shyamaa Karthik High School Student Saint Andrew's School Boca Raton, FL, USA | Processing the Tamil Wiktionary Pages into a NLP++ Dictionary |