Master's Theses

Off-campus SJSU users: To download campus access theses, please use the following link to log into our proxy server with your SJSU library user name and PIN.

Enhancing Medical Reasoning in Small Language Models Through Feedback-Guided Refinement

Monish Sai Lakamraju, San Jose State UniversityFollow

Publication Date

Fall 2025

Degree Type

Thesis - Campus Access Only

Degree Name

Master of Science (MS)

Department

Computer Engineering

Advisor

KaiKai Liu; Bernardo Flores; Mahima Agumbe Suresh

Abstract

Large Language Models exhibit remarkable reasoning capabilities in mathematical and logical tasks. However, their application in healthcare is constrained by computational demands exceeding 100 billion parameters. Conversely, Small Language Models (<10B) present a feasible alternative for environments with limited resources. Nonetheless, they exhibit reduced capacity for multi-step reasoning and logical coherence, both of which are critical for clinical decision-making. Although recent advancements in prompt engineering and fine-tuning are promising, existing Chain-of-Thought methods still necessitate large training datasets, often comprising tens of thousands of samples, coupled with complex fine-tuning procedures, thereby limiting practical implementation. To address these challenges, this thesis investigates feedback-guided refinement strategies to enhance medical reasoning using minimal training data. We developed a multi-stage pipeline wherein prompts function as navigational beacons for reasoning path generation, actively guiding the reasoning process rather than merely verifying results. Through iterative prompt refinement and preservation-aware feedback mechanisms, our approach identifies and maintains valid reasoning while rectifying logical inconsistencies. This method effectively curates high-quality training samples for fine-tuning. Experimental evaluation on standard medical benchmarks indicates that the approach achieves performance comparable to methods requiring substantially larger datasets. This demonstrates that feedback-guided refinement offers a practical framework for medical reasoning in resource-constrained settings.

Recommended Citation

Lakamraju, Monish Sai, "Enhancing Medical Reasoning in Small Language Models Through Feedback-Guided Refinement" (2025). Master's Theses. 5739.
DOI: https://doi.org/10.31979/etd.36hk-cmch
https://scholarworks.sjsu.edu/etd_theses/5739

Download

COinS

DOI

https://doi.org/10.31979/etd.36hk-cmch

Master's Theses

Enhancing Medical Reasoning in Small Language Models Through Feedback-Guided Refinement

Publication Date

Degree Type

Degree Name

Department

Advisor

Abstract

Recommended Citation

DOI

Search

Browse All

Links

Master's Theses

Enhancing Medical Reasoning in Small Language Models Through Feedback-Guided Refinement

Author

Publication Date

Degree Type

Degree Name

Department

Advisor

Abstract

Recommended Citation

Share

DOI

Search

Browse All

Links