
Abstract

This dataset captures responses from a lexical generation task designed to examine word production under structural constraints. Native Spanish speakers were presented with three-consonant strings and instructed to generate valid five-to-seven-letter Spanish words by inserting only vowels, maintaining the consonants in their original relative order. The task was conducted under time pressure and without semantic cues, allowing researchers to explore lexical access, phonotactic preferences, and the role of consonants and vowels in word formation processes. The dataset includes both item-level and participant-level files. Item-level data comprise individual responses with lexical frequency, word length, and response time. Participant-level data summarize age, gender, and aggregate lexical metrics per individual. This resource enables a range of investigations, including analyses of syllabic structures, relative consonant positioning, lexical diversity, and frequency effects. The dataset is encoded in UTF-8 CSV format and is directly compatible with standard data analysis environments. It offers a valuable tool for researchers studying lexical creativity and orthographic processing in Spanish.
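As an illustration of the task constraint described above, the following minimal sketch checks whether a response keeps the prompt's consonants in their original order, adds only vowels, and falls within the five-to-seven-letter range. It is not the author's scoring procedure. The function name `is_valid_response`, the example prompt `cmn`, and the vowel inventory (a, e, i, o, u plus accented forms and ü, with `y` treated as a consonant) are assumptions made for this example, and the sketch does not verify that the response is an actual Spanish word, which would require a lexicon.

```python
# Hypothetical helper: structural check only, no lexicon lookup.
# Assumption: vowels are a, e, i, o, u and their accented/diaeresis forms;
# every other letter (including "y" and "ñ") counts as a consonant.
VOWELS = set("aeiouáéíóúü")


def is_valid_response(prompt, word, min_len=5, max_len=7):
    """Return True if `word` is `prompt` with only vowels inserted.

    `prompt` is the consonant string shown to the participant and
    `word` is the participant's response.
    """
    prompt = prompt.lower()
    word = word.lower()
    # The response must be purely alphabetic and within the length range.
    if not word.isalpha() or not (min_len <= len(word) <= max_len):
        return False
    # Removing the vowels must leave exactly the prompt consonants, in order.
    consonants = [ch for ch in word if ch not in VOWELS]
    return consonants == list(prompt)


if __name__ == "__main__":
    for candidate in ["camino", "camión", "cama", "carmen", "mecano"]:
        print(candidate, is_valid_response("cmn", candidate))
```

Under these assumptions, "camino" and "camión" pass for the prompt `cmn`, whereas "cama" is too short, "carmen" adds an extra consonant, and "mecano" changes the consonant order.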

Details

Title
Spanish word generation dataset from structured consonant prompts
Author
Duñabeitia, Jon Andoni 1

 Centro de Investigación Nebrija en Cognición (CINC), Universidad Nebrija, Madrid, Spain (ROR: https://ror.org/03tzyrt94) (GRID: grid.464701.0) (ISNI: 0000 0001 0674 2310) 
Publication title
Volume
12
Issue
1
Pages
1402
Number of pages
6
Publication year
2025
Publication date
2025
Section
Data Descriptor
Publisher
Nature Publishing Group
Place of publication
London
Country of publication
United States
Publication subject
e-ISSN
20524463
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
Online publication date
2025-08-11
Milestone dates
2025-07-30 (Registration); 2025-05-15 (Received); 2025-07-29 (Accepted)
   First posting date
11 Aug 2025
ProQuest document ID
3238566511
Document URL
https://www.proquest.com/scholarly-journals/spanish-word-generation-dataset-structured/docview/3238566511/se-2?accountid=208611
Copyright
© The Author(s) 2025. This work is published under http://creativecommons.org/licenses/by/4.0/ (the "License"). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-11-07
Database
ProQuest One Academic