Content area

Abstract

Large language models (LLMs) have demonstrated remarkable capabilities in document processing, data analysis, and code generation. However, the generation of spatial information in a structured and unified format remains a challenge, limiting their integration into production environments. In this paper, we introduce a benchmark for generating structured and formatted spatial outputs from LLMs with a focus on enhancing spatial information generation. We present a multi-step workflow designed to improve the accuracy and efficiency of spatial data generation. The steps include generating spatial data (e.g., GeoJSON) and implementing a novel method for indexing R-tree structures. In addition, we explore and compare a series of methods commonly used by developers and researchers to enable LLMs to produce structured outputs, including fine-tuning, prompt engineering, and retrieval-augmented generation (RAG). We propose new metrics and datasets along with a new method for evaluating the quality and consistency of these outputs. Our findings offer valuable insights into the strengths and limitations of each approach, guiding practitioners in selecting the most suitable method for their specific use cases. This work advances the field of LLM-based structured spatial data output generation and supports the seamless integration of LLMs into real-world applications.

Details

1009240
Business indexing term
Title
Large Language Model-Driven Structured Output: A Comprehensive Benchmark and Spatial Data Generation Framework
Author
Li, Diya 1   VIAFID ORCID Logo  ; Zhao, Yue 2 ; Wang, Zhifang 2   VIAFID ORCID Logo  ; Jung, Calvin 2 ; Zhang, Zhe 3   VIAFID ORCID Logo 

 Survey123, Esri, Redlands, CA 92374, USA; [email protected] (D.L.); ; Department of Geography, Texas A&M University, College Station, TX 77840, USA 
 Survey123, Esri, Redlands, CA 92374, USA; [email protected] (D.L.); 
 Department of Geography, Texas A&M University, College Station, TX 77840, USA 
Volume
13
Issue
11
First page
405
Publication year
2024
Publication date
2024
Publisher
MDPI AG
Place of publication
Basel
Country of publication
Switzerland
Publication subject
e-ISSN
22209964
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2024-11-10
Milestone dates
2024-09-01 (Received); 2024-11-07 (Accepted)
Publication history
 
 
   First posting date
10 Nov 2024
ProQuest document ID
3133059795
Document URL
https://www.proquest.com/scholarly-journals/large-language-model-driven-structured-output/docview/3133059795/se-2?accountid=208611
Copyright
© 2024 by the authors. Published by MDPI on behalf of the International Society for Photogrammetry and Remote Sensing. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2024-11-27
Database
ProQuest One Academic