QA4R: A QUESTION ANSWERING SYSTEM FOR R PACKAGES
- ECU Author/Contributor (non-ECU co-authors, if there are any, appear on document)
- Ganesh Babu (Creator)
- Institution
- East Carolina University (ECU )
- Web Site: http://www.ecu.edu/lib/
Abstract: There is a massive amount of data from various sources available today, and querying meaningful information from those datasets would be valuable. Question Answering Systems (QAS) implement information retrieval (IR) and Natural Language Processing (NLP) that can automatically answer the questions posed in a natural language. There are three different types of QAS as Open Domain, Closed Domain, and Restricted Domain. Following are the various types of questions: fact-based, definition, how, why, hypothetical, semantically constrained, and cross-lingual. R is a dynamic programming language widely used for statistical computing that combines functional and object-oriented programming. The R development community maintains thousands of R packages through its Comprehensive R Archive Network CRAN. However, while websites like rdrr.io, rseek.org, and search.r-project.org provide search results for R packages, no intelligent question-answering system is currently available for R.\n\nThis study examines Question Answering Systems (QAS), current developments and academic research areas in the QAS field, and QAS implementations. In this research, we propose a prototype question answering system for R packages that returns R packages relevant to the user query in natural language. We created a question answering dataset (QAD4R) for R packages using web scraping and developed a question generation model. Pre-trained BERT-based language models were used to create the question-answering system for R. All the code files are available publicly at this GitHub location https://github.com/GanB/QA4R-A-Question-AnsweringSystem-for-R-Packages.
Additional Information
- Publication
- Thesis
- Language: English
- Date: 2023
- Subjects
- Question Answering System, Question Generation, R Packages, Natural Language Processing, Word Tokenization, N-Grams, Transformers, BERT
Title | Location & Link | Type of Relationship |
QA4R: A QUESTION ANSWERING SYSTEM FOR R PACKAGES | http://hdl.handle.net/10342/12311 | The described resource references, cites, or otherwise points to the related resource. |