Auto-Count Symbols in Portable Document Format (PDF)

ECU Author/Contributor (non-ECU co-authors, if there are any, appear on document)
Andrew Florian (Creator)
Institution
East Carolina University (ECU )
Web Site: http://www.ecu.edu/lib/

Abstract: Estimating electrical costs often involves counting symbols in a PDF document. Existing\r\nsoftware has sped up this process compared to manual counting, but there is room for\r\nfurther improvement. The proposed solution builds on open source components to e?ciently\r\nsearch a PDF document for the outlines of all symbols, including letters or numbers, used\r\nby electrical engineers to di?erentiate between otherwise similar symbols. It then sorts these\r\noutlines into groups and counts each occurrence. Symbol for symbol, it takes less than half\r\nthe time required by two leading competitors. Unfortunately, current settings often produce\r\nnumerous sub-groups which need to be combined to provide meaningful totals. K-means and\r\nother improved clustering methods are being explored. The proposed concept could also be\r\nhelpful in other similar applications that identify symbols or text in images.

Additional Information

Publication
Honors Project
Language: English
Date: 2023
Subjects
pdf;symbol;auto;count

Email this document to

This item references:

TitleLocation & LinkType of Relationship
Auto-Count Symbols in Portable Document Format (PDF)http://hdl.handle.net/10342/8954The described resource references, cites, or otherwise points to the related resource.