Abstract
This thesis presents the design and development of a modular voice assistant for real-time, speech-based interaction in academic environments. The system integrates four key components, voice activity detection (VAD), automatic speech recognition (ASR), language-model response generation, and text-to-speech (TTS) synthesis, into a seamless, end-to-end conversational pipeline. Unlike generic voice assistants, it prioritizes domain-specific accuracy and low-latency communication, with a focus on university-level student support. Interaction begins with browser-based VAD, which detects when the user starts and stops speaking, enabling hands-free operation. The captured audio is transcribed with OpenAI's Whisper-Large-V3-Turbo, chosen for its robustness to varied accents and noisy environments. The transcription is then processed by a LLaMA 3.2B model fine-tuned with Low-Rank Adaptation (LoRA). Trained on custom Q&A pairs from the University of Massachusetts Dartmouth's College of Engineering, the model generates responses specific to academic advising, curricula, and campus resources, while context tracking enables coherent multi-turn dialogue. Responses are synthesized with Kokoro 82M, a lightweight neural TTS model known for expressive, low-latency audio output. A Gradio interface manages the conversation, combining voice and text input and output, visual feedback, and message history. This work demonstrates how modern AI tools can be composed into a practical, domain-specific educational assistant. With its modular architecture and open-source foundation, the system provides a scalable framework adaptable to other departments and institutions.
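The pipeline described above (VAD-gated capture, then ASR, context-aware generation, and TTS) can be sketched in a few lines. This is a minimal, hypothetical illustration: the class and function names are invented for this sketch, and simple stubs stand in for the real Whisper, LoRA-fine-tuned LLaMA, and Kokoro components.

```python
# Hypothetical sketch of the modular assistant pipeline; all names are
# illustrative stand-ins, not the thesis's actual implementation.
from dataclasses import dataclass


@dataclass
class Turn:
    """One user/assistant exchange, kept for multi-turn context tracking."""
    user: str
    assistant: str


class VoiceAssistantPipeline:
    """Chains ASR -> LLM -> TTS for each VAD-detected utterance."""

    def __init__(self, asr, llm, tts):
        self.asr = asr          # stand-in for Whisper-Large-V3-Turbo
        self.llm = llm          # stand-in for the LoRA-fine-tuned LLaMA model
        self.tts = tts          # stand-in for Kokoro 82M
        self.history = []       # conversation context for multi-turn dialogue

    def handle_utterance(self, audio):
        text = self.asr(audio)                  # speech -> text
        reply = self.llm(text, self.history)    # text + context -> response
        self.history.append(Turn(text, reply))  # record the exchange
        return self.tts(reply)                  # response -> synthesized audio


# Stub components so the sketch runs end to end:
asr = lambda audio: audio.decode()
llm = lambda text, history: f"[advising reply to: {text}]"
tts = lambda reply: reply.encode()

pipeline = VoiceAssistantPipeline(asr, llm, tts)
out = pipeline.handle_utterance(b"When is the add/drop deadline?")
```

Because each stage is injected as a callable, any one component (say, a different TTS model) can be swapped without touching the rest, which mirrors the modularity the abstract emphasizes.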