DSpace Repository

Utilizing a RAG-powered LLM for information retrieval in a research repository

Show simple item record

dc.contributor.author Tan, Timothy Marcus
dc.date.accessioned 2025-08-18T04:46:38Z
dc.date.available 2025-08-18T04:46:38Z
dc.date.issued 2025-06
dc.identifier.uri http://dspace.cas.upm.edu.ph:8080/xmlui/handle/123456789/3146
dc.description.abstract Though widely use across many research repositories, keyword search may not be sufficient for people who are becoming more familiar with the use of chatbots like ChatGPT. The proposed system will serve as a search engine for the UPM IRS which is a repository for the university’s theses. The system will utilize the vector space model in retrieving documents by directly embedding the user’s query into a vector to be compared to the vectors stored in a vector store by cosine similarity. Retrieval Augmented Generation (RAG) will then be used as the top documents will be given to a large language model (LLM) to create an overview of the top documents. The combination of a semantic retrieval method and a LLM was able to yield a good user experience and relevant results to the users. en_US
dc.subject Large Language Model en_US
dc.subject Information Retrieval en_US
dc.subject Document Retrieval en_US
dc.subject Vector Space Model en_US
dc.subject Natural Language Processing en_US
dc.subject Text Generation en_US
dc.subject Research Repository en_US
dc.subject Keyword Search en_US
dc.subject Chatbots en_US
dc.title Utilizing a RAG-powered LLM for information retrieval in a research repository en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account