This project is available as a student work experience opportunity with HPCC Systems this summer. Curious about other projects we are offering? Take a look at our Ideas List. was completed by Farah Al Shanik, a PhD student studying Computer Science at Clemson University. Farah Joined the HPCC Systems intern program in 2018.
There are many variants to take into account for this project such as matching plural and singular forms, language variants, punctuation evident in acronyms and the use of initials and alternative spellings. Such as color with and without the ‘u’.
Find out about the HPCC Systems Summer Internship Program.
Deadline for project proposals - Saturday 22nd April 2017
Project Description
There is a detailed description of the work in the JIRA issue TS1, which includes an attachment to the the Open Source Text Search document. This JIRA also details a series of sub-tasks describing the work.
...
- Initial build version: See https://track.hpccsystems.com/browse/TS-2
- Initial search version: See https://track.hpccsystems.com/browse/TS-3
- Regression tests: See https://track.hpccsystems.com/browse/TS-4
Mentor | John Holt Backup Mentor: Roger Dev |
Skills needed |
|
Deliverables |
|
Other resources |