Auto-Count Symbols in Portable Document Format (PDF)
- ECU Author/Contributor (non-ECU co-authors, if there are any, appear on document)
- Andrew Florian (Creator)
- Institution
- East Carolina University (ECU )
- Web Site: http://www.ecu.edu/lib/
Abstract: Estimating electrical costs often involves counting symbols in a PDF document. Existing\r\nsoftware has sped up this process compared to manual counting, but there is room for\r\nfurther improvement. The proposed solution builds on open source components to e?ciently\r\nsearch a PDF document for the outlines of all symbols, including letters or numbers, used\r\nby electrical engineers to di?erentiate between otherwise similar symbols. It then sorts these\r\noutlines into groups and counts each occurrence. Symbol for symbol, it takes less than half\r\nthe time required by two leading competitors. Unfortunately, current settings often produce\r\nnumerous sub-groups which need to be combined to provide meaningful totals. K-means and\r\nother improved clustering methods are being explored. The proposed concept could also be\r\nhelpful in other similar applications that identify symbols or text in images.
Additional Information
- Publication
- Honors Project
- Language: English
- Date: 2023
- Subjects
- pdf;symbol;auto;count
Title | Location & Link | Type of Relationship |
Auto-Count Symbols in Portable Document Format (PDF) | http://hdl.handle.net/10342/8954 | The described resource references, cites, or otherwise points to the related resource. |