Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Watch recordings of all presentations made during our Community Day Summit held during October 2024. Find out more about this event and read our blog review of the event.

...

Presentation Tracks and Content

Plenary Sessions

Join us as Gavin Halliday, SVP and Head of Platform Engineering, LexisNexis® Risk Solutions, kicks off for the opening and closing sessions from the 11th annual HPCC Systems Community Virtual Summit.

...

  • Awards Ceremony & Plenary
    Trish McCall, Sr Director Program Management, Hugo Watanuki, Manager Community Tech Programs, George S Foreman, Business Analyst II, Bob Foreman, Software Engineering Lead LexisNexis Risk Solutions
    Join us for the long-awaited announcement of the 2024 Community Awards and Poster Competition winners, followed by our afternoon keynote honoring our academic community and engagement.

Platform Evolution

Take a look at the latest improvements and innovative features in the platform.

...

  • Streamlining Business Tasks with HPCC Systems Clusters and Tombolo

    Yadhap Dahal & Matthew Fancher, LexisNexis Risk Solutions
    Introducing Tombolo, an innovative open-source project designed to enhance the already robust features, speed, and capabilities of HPCC Systems clusters. Our aim is to introduce a user-friendly web application that caters to both technical and non-technical users, enabling seamless interaction with HPCC Systems clusters. During our presentation, we’ll explore the capabilities of Tombolo, including creating workflows and monitoring assets. Additionally in this session, we’ll highlight the application’s ability to proactively send timely notifications. Moreover, we’ll provide a comprehensive overview of Tombolo’s intuitive dashboard.

  • Power BI Integration with HPCC Systems

    Harsh Raj & Srinivasan Kothandam, LexisNexis Risk Solutions
    Learn more about this integration package that enables analysts and business users to read HPCC Systems native files directly from Power BI.

  • Using NLP++ to Build a Brazilian Address Cleaner in HPCC Systems

    Guilherme da Silva, LexisNexis Risk Solutions 
    ADD FROM YOUTUBE

  • Enhancing Legal Assistance Through Data Enrichment with HPCC Systems

    Nihar Mandahas, Skanda P R, Manvith L B, Pratheek Rao MP, Arya Hariharan, & Dr. Jyoti Shetty,RVCE
    ADD FROM YOUTUBE

  • Integrating Microsoft Fabric and HPCC Systems for Security Analytics

    Sowmya Myneni & Kushi Kiran, LexisNexis Risk Solutions
    Integrating HPCC Systems with Microsoft Fabric and Power BI offers a seamless workflow for transforming, linking, and visualizing data. HPCC Systems, with its user-friendly ECL language, efficiently handles complex data transformations, which can then be imported into Microsoft Fabric. From there, Power BI can be used to create interactive and insightful visualizations and reports. This integration simplifies the process of turning raw data into actionable insights, making it an effective solution for data analysis and presentation.

...

  • We’ve Come a Long Way From README.txt – Improvements in the Platform Documentation
    Jim DeFabia, LexisNexis Risk Solutions
    ADD FROM YOUTUBE

  • LLM for ECL Code Generation Using Llama3

    Connor Davis, DataSeers
    ADD FROM YOUTUBE

  • Implementing Conditional Cleanup after Regression Testing in HPCC Systems

    Goutami Sooda, Arya Vinod, Ahana Patil, & Chandana S, RVCE
    ADD FROM YOUTUBE

  • Data 360° View Using HPCC Systems

    S Dhanush & Shreyas Shankar, RVCE
    ADD FROM YOUTUBE

  • Testing Best Practices of the HPCC Systems

...

Real World Use Cases with HPCC Systems

  • An Understanding of HPCC Systems Platform Metrics
    Ken Rowland, LexisNexis Risk Solutions
    HPCC Systems provides a metric framework for collecting low level platform specific metrics. This presentation covers the framework, available metrics, component instrumentation, and metric collection. It also covers the new ESP method execution profiling. All of it is tied together with examples from a running cluster.

  • Robots, Drones and Generative AI: Exploring cutting edge technologies with HPCC Systems in a school campus environment
    Tai Donovan, Carlos Caceres, Nick Schwartz, American Heritage School 
    School safety security, and wellbeing have been a driving force for our most recent academic partnership with HPCC Systems. The first phase of our security project began in 2020 with an autonomous mobile sentry designed, built and programmed with facial recognition software developed by our students. This project was very effective, and we wanted to enhance the capabilities by increasing response times for any security concern with the use of drones. This latest version uses autonomous drone technology to enhance the security features of our school by detecting unauthorized personnel on campus, assisting security in response to active shooter situations, gathering information during lockdowns and to track the student/staff evacuation process if initiated. In parallel to these latest efforts, a recent HPCC Systems summer internship project investigated the possibility on whether generative AI could be used to generate human-like responses in our autonomous security robot robots. During the internship, an interface through HPCC Systems was constructed to interface interact with GPT and ChatGPT. Therefore, we expect to be able to use a standard neural network to identify different emotions given video of a person and these these emotions would then be processed by the interface to create a call to OpenAI’s API from which an appropriate response would be generated. This response would be designed to change based on the emotions classified by the neural network and most importantly due to the use of generative AI would be different even with the same emotional classification. Even though this is all still in the prototype phase, our projects have proven to be a great proof of concept developed by high school students to deal with the reality of our times.

  • Tombolo - Opensource Tools for Interacting with HPCC Systems Clusters
    Matt Fancher & Yadhap Dahal, LexisNexis Risk Solutions
    Introducing Tombolo version 2.0, an innovative open-source project initiated by LexisNexis Risk Solutions, aimed at providing data catalog capabilities for HPCC Systems clusters. A web application that caters to both technical and non-technical users, facilitating seamless interaction with HPCC Systems clusters. Join us in this enlightening session as we demonstrate the impressive capabilities of Tombolo. Discover how users can effortlessly create workflows by leveraging assets from HPCC Systems clusters, enhanced with our newly introduced versioning tool. This tool empowers users with increased confidence when editing existing workflows. Witness the application's exceptional ability to monitor assets and proactively send timely notifications via different medias, ensuring crucial data management tasks are completed promptly. Moreover, we'll showcase Tombolo's intuitive dashboard and its extensive range of API endpoints, offering a comprehensive overview of asset performance. During the presentation, our team will also share insights into the future development goals of Tombolo, providing a glimpse into the exciting features currently in development and planned for future additions.

  • Vulnerable Victim Monitoring Connecting the Dots to Find Missing Children Quickly
    Pasqua Ruggiero, LexisNexis Risk Solutions
    According to Child Find of America, an estimated 2,300 children go missing each day. That equates to over half a million missing children per year. Most of these children are runaways, a population that is quite vulnerable to trafficking. Trafficked victims can be exploited for their PII in relation to account take over, stolen or synthetic identity fraud. LexisNexis Risk Solutions Inquiries are comprised of searches for fraud and credit seeking consumers which are stored in HPCC Systems.  These hundreds of billions of inquiry data can be searched retroactively and monitored proactively for matches on missing children within a reasonable geographic area. Additional topics of discussion will include data visualization capabilities in Python with data obtained from both the National Center for Missing & Exploited Children’s public RSS feed (current missing cases) as well as all missing cases from the past 20 years (obtained from the ADAM Program). We will also discuss proposals for additional future studies utilizing the power of HPCC Systems to extract other types of big data and provide further insights and information to provide law enforcement and social services to better aid them in the search and prevention of human trafficking. *We would like to issue a content warning for this presentation for topics like human (and especially child) trafficking. We understand the difficulty of presenting information on this sensitive topic and appreciate your patience and understanding as we present our research. All data will be anonymized to not reveal any personal information.

...

  • Platform (Now and in the Future)
    Christopher Lo, LexisNexis Risk Solutions
    ADD FROM YOUTUBE

  • Navigating the Platform Build System

    Michael Gardner, LexisNexis Risk Solutions
    ADD FROM YOUTUBE

  • From In-House to Open Source: The Journey of PyHPCC

    Amila de Silva & Rohith Podugu, LexisNexis Risk Solutions
    PyHPCC is a Python package and wrapper built around the HPCC Systems web services that facilitates communication between Python and HPCC Systems. It was originally developed as a LexisNexis Risk Solutions internal-only tool to automate repetitive work done on HPCC Systems. Since the evangelization of PyHPCC began in 2022, there has been growing interest across the organization and the broader HPCC Systems community to adopt PyHPCC.

Fueling Success

Here are the real world use cases and success stories leveraging HPCC Systems.

  • Building an NLP Pipeline for Electronic Health Records and Brain MRI Classification

    Vishalakshi Prabhu, Eshaan Mathur, Nikhil Vasu, & Prashant Ronad, RVCE
    ADD FROM YOUTUBE

  • Learned Cache Size Setting for Roxie Clusters

    Yifan Wang, University of Hawaii
    ADD FROM YOUTUBE

  • Machine Learning and Cybersecurity Analytics Using the NSL-KDD Dataset

    Zularbine Kamal, Kennesaw State University

  • Model Inversion Attacks with the HPCC Systems Platform

    Andrew Polisetty, Kennesaw State University

  • School Safety and Security Using RFID and Drones

    Taiowa Donovan & Nick Schwartz, American Heritage School

  • Exploring the Capabilities of HPCC Systems in Facilitating Inter Fog Communication

    Henrique Antonio Buzin Vargas, Federal University of Santa Catarina (UFSC)

  • Internship to Impact: Real Life Success in the HPCC Systems Community

    George S Foreman, Christopher Connelly, Jack del Vecchio, Yash Mishra, Nathalia Ribas, & Fulvio Favilla Filho, LexisNexis Risk Solutions
    Join us for a moderated panel style interview with former interns who have emerged from the HPCC Systems Summer Internship Program. In this session, the next generation of technologists will share their experiences and successes working with the HPCC Systems community and learn from their experiences during their internship and how they transitioned to full time RELX employees.

ECL Training Workshop

The HPCC Systems Training and Support team has been very busy this year visiting universities and trade show events to promote HPCC Systems and ECL through Code Days, Hackathons, and Workshops. This Here is this year’s Community Summit workshop presents an in-depth look at these events in three 1-hour sessions:

  • Part 1: The “Music is Life!” Workshop
    Bob Foreman, Software Engineering Lead, LexisNexis Risk Solutions
    Who doesn’t love music? Take a break from your daily routine datasets and join us in this first hour. We break down a popular open-source music dataset, explore normalizing the dataset and its effects, and look at a variety of data evaluation and query techniques.

  • Part 2: The “Find Your Paradise!” Hackathon
    Bob Foreman, Software Engineering Lead, LexisNexis Risk Solutions
    Have you ever thought about building an application that can help people find places to live that maximize their quality of life and happiness? The goal of this challenge is to analyze different datasets across different categories and correlate them using the HPCC Systems platform. After analyzing, the participants will be asked to design an interface to query this data and assign it a scoring system, then deliver it to the user via ROXIE and show the user where they should most likely want to live. Users should be given choices in an easy-to-use form that when submitted will generate a unique set of scores based on locations.

  • Part 3: Better Customer Insights Through Relational Dataset Queries
    Bob Foreman, Software Engineering Lead, LexisNexis Risk Solutions
    In this final hour, we look at the power of the open data model, transforming normalized data into a denormalized relational dataset, and use the implicit relationality power to analyze relationships to provide better customer insights.