Content area

Abstract

Computational tools have become increasingly prevalent in the analysis and evaluation of various linguistic dimensions in second language (L2) writing pedagogy and research. Despite their widespread use, there is limited research investigating the alignment between computationally derived linguistic features and human assessments of academic writing quality. To fill this gap, this study probed the extent to which computational indices of syntactic and lexical features predict human-judged assessments of narrative writing quality. A total of 104 essays written by Iranian undergraduate learners of English as a Foreign Language (EFL) were analyzed using three computational tools: Coh-Metrix, VocabProfiler, and the Tool for the Automatic Analysis of Cohesion (TAACO). The results from correlation and regression analyses revealed that the computational indices of lexical features were significant predictors of human-judged writing quality, with lexical diversity and sophistication emerging as the most significant predictors. Manual coding of syntactic complexity proved to be a stronger predictor of writing quality than computational measures of this text feature. These findings underscore the value of computational tools in L2 writing assessment, while simultaneously highlighting their limitations in capturing the multifaceted nature of writing quality. Furthermore, the results point to an overemphasis on infrequent and diverse vocabulary in current analytic writing rubrics, suggesting that these rubrics should be revised to adopt a more comprehensive perspective on lexical proficiency in L2 writing pedagogy and evaluation.

Details

1009240
Title
Computationally derived linguistic features of L2 narrative essays and their relations to human-judged writing quality
Author
Janebi Enayat, Mostafa 1 

 University of Maragheh, Marāgheh, Iran, Islamic Republic of (GRID:grid.449862.5) (ISNI:0000 0004 0518 4224) 
Publication title
Volume
15
Issue
1
Pages
35
Publication year
2025
Publication date
Dec 2025
Publisher
Springer Nature B.V.
Place of publication
Heidelberg
Country of publication
Netherlands
Publication subject
e-ISSN
22290443
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-07-01
Milestone dates
2025-06-10 (Registration); 2025-02-06 (Received); 2025-06-10 (Accepted)
Publication history
 
 
   First posting date
01 Jul 2025
ProQuest document ID
3226092122
Document URL
https://www.proquest.com/scholarly-journals/computationally-derived-linguistic-features-l2/docview/3226092122/se-2?accountid=208611
Copyright
Copyright Springer Nature B.V. Dec 2025
Last updated
2025-12-04
Database
2 databases
  • Education Research Index
  • ProQuest One Academic