TY - GEN
T1 - A Retrieval-Augmented Framework For Meeting Insight Extraction
AU - Bhuvaji, Sartaj
AU - Chouhan, Prachitee
AU - Irukulla, Madhuroopa
AU - Singhvi, Jay
AU - Bae, Wan D.
AU - Alkobaisi, Shayma
N1 - Publisher Copyright:
Copyright © 2025 held by the owner/author(s).
PY - 2025/5/14
Y1 - 2025/5/14
N2 - Meetings are vital for collaboration and decision-making in professional environments, yet recalling key details from past discussions can be challenging and this impacts productivity. In this paper, we address this issue by developing a solution that extracts crucial insights from historical meeting records using Retrieval Augmented Generation (RAG) techniques. Users can easily upload meeting records and query for relevant information. A core feature of our proposed system is grouping meetings based on abstractive summaries, using state-of-the-art clustering algorithms extensively trained for accuracy. Upon user inquiry, the system identifies the most relevant cluster and retrieves related conversations from the Pinecone vector store database. These conversations, paired with custom prompts, are processed through a Large Language Model (LLM) to generate precise responses. Our optimization efforts focus on exploring various encoders and LLMs, with fine-tuning to ensure seamless integration and high performance. This approach tackles challenges in meeting summarization, content discovery, and user-friendly information retrieval.
AB - Meetings are vital for collaboration and decision-making in professional environments, yet recalling key details from past discussions can be challenging and this impacts productivity. In this paper, we address this issue by developing a solution that extracts crucial insights from historical meeting records using Retrieval Augmented Generation (RAG) techniques. Users can easily upload meeting records and query for relevant information. A core feature of our proposed system is grouping meetings based on abstractive summaries, using state-of-the-art clustering algorithms extensively trained for accuracy. Upon user inquiry, the system identifies the most relevant cluster and retrieves related conversations from the Pinecone vector store database. These conversations, paired with custom prompts, are processed through a Large Language Model (LLM) to generate precise responses. Our optimization efforts focus on exploring various encoders and LLMs, with fine-tuning to ensure seamless integration and high performance. This approach tackles challenges in meeting summarization, content discovery, and user-friendly information retrieval.
KW - BART
KW - LLM
KW - abstractive summarization
KW - meeting data retrieval
KW - pinecone
KW - speech to text conversion
KW - text summarization
UR - http://www.scopus.com/inward/record.url?scp=105006465517&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=105006465517&partnerID=8YFLogxK
U2 - 10.1145/3672608.3707915
DO - 10.1145/3672608.3707915
M3 - Conference contribution
AN - SCOPUS:105006465517
T3 - Proceedings of the ACM Symposium on Applied Computing
SP - 899
EP - 906
BT - 40th Annual ACM Symposium on Applied Computing, SAC 2025
PB - Association for Computing Machinery
T2 - 40th Annual ACM Symposium on Applied Computing, SAC 2025
Y2 - 31 March 2025 through 4 April 2025
ER -