Content area

Abstract

Chart reasoning, a critical task for automating data interpretation in domains such as aiding scientific data analysis and medical diagnostics, leverages large-scale vision language models (VLMs) to interpret chart images and answer natural language questions, enabling semantic understanding that enhances knowledge accessibility and supports data-driven decision making across diverse domains. In this work, we formalize chart reasoning as a sequential decision-making problem governed by a Markov Decision Process (MDP), thereby providing a mathematically grounded framework for analyzing visual question answering tasks. While recent advances such as multi-step reasoning with Monte Carlo tree search (MCTS) offer interpretable and stochastic planning capabilities, these methods often suffer from redundant path exploration and inefficient reward propagation. To address these challenges, we propose a novel algorithmic framework that integrates a pheromone-guided search strategy inspired by Ant Colony Optimization (ACO). In our approach, chart reasoning is cast as a combinatorial optimization problem over a dynamically evolving search tree, where path desirability is governed by pheromone concentration functions that capture global phenomena across search episodes and are reinforced through trajectory-level rewards. Transition probabilities are further modulated by local signals, which are evaluations derived from the immediate linguistic feedback of large language models. This enables fine grained decision making at each step while preserving long-term planning efficacy. Extensive experiments across four benchmark datasets, ChartQA, MathVista, GRAB, and ChartX, demonstrate the effectiveness of our approach, with multi-agent reasoning and pheromone guidance yielding success rate improvements of +18.4% and +7.6%, respectively.

Details

1009240
Title
A Tree-Based Search Algorithm with Global Pheromone and Local Signal Guidance for Scientific Chart Reasoning
Author
Zhou, Min 1   VIAFID ORCID Logo  ; Qi Zhiheng 2 ; Zhu Tianlin 3 ; Vijg, Jan 4   VIAFID ORCID Logo  ; Huang Xiaoshui 1   VIAFID ORCID Logo 

 School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai 200080, China 
 School of Basic Medical Sciences and Forensic Medicine, Hangzhou Medical College, Hangzhou 310013, China 
 School of Computer Science and Artificial Intelligence, The Jiangxi University of Finance and Economics, Nanchang 330013, China 
 Department of Genetics, Albert Einstein College of Medicine, 1301 Morris Park Avenue, Bronx, NY 10461, USA 
Publication title
Volume
13
Issue
17
First page
2739
Number of pages
19
Publication year
2025
Publication date
2025
Publisher
MDPI AG
Place of publication
Basel
Country of publication
Switzerland
Publication subject
e-ISSN
22277390
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-08-26
Milestone dates
2025-07-28 (Received); 2025-08-23 (Accepted)
Publication history
 
 
   First posting date
26 Aug 2025
ProQuest document ID
3249691724
Document URL
https://www.proquest.com/scholarly-journals/tree-based-search-algorithm-with-global-pheromone/docview/3249691724/se-2?accountid=208611
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-09-12
Database
ProQuest One Academic