Abstract: Recently, researchers in the field of math word problem (MWP) solving have reported performance metrics for various large language models (LLMs) on benchmark datasets, with some models ...
GSM8K-V is a purely visual multi-image mathematical reasoning benchmark that systematically maps each GSM8K math word problem into its visual counterpart to enable a clean, within-item comparison ...
A proposed revision of the state’s math standards would dump existing upper-level requirements and replace them with a menu ...
Abstract: The accessibility and quality of education in Sri Lanka face significant disparities, particularly between rural and urban areas. This research developed a personalized Intelligent Tutoring ...