It appears you don't have support to open PDFs in this web browser. To view this file, Open with your PDF reader
Abstract
This dissertation documents the development and validation of a standardized large-scale language test on second language (L2) interactional competence (IC). More specifically, this dissertation investigates L2-Chinese as its target language for an IC test that is delivered in the computer-mediated environment to enhance the practicality of IC assessment. Throughout the process of test development and validation, the researcher defined IC as a theoretically multidimensional construct, which encompasses a speaker’s ability to manage interaction at the sequential, emotional, moral, logical and categorial dimensions. From the assessment perspective, the researcher demonstrated that IC is concurrently theoretically multidimensional and psychometrically unidimensional, which corroborates the assessability of IC as a test construct.
Adopting an argument-based approach to test validation, the researcher designed the test following a task-based needs analysis (TBNA), eliciting the perspectives from L2-Chinese speakers, first language (L1) Chinese teachers, and L1-Chinese interactants on what L2-Chinese speakers struggle the most with interpersonal interaction in L2 Chinese. Findings from the TBNA informed the design of a nine-item IC test that targets test-takers’ ability to manage social actions that are disaffiliative in nature, which are actions that can threaten social harmony (e.g., making a complaint about workplace fairness to your employer). The test is delivered on a mobile-phone application, covers three sub-language use domains (everyday life, work, and study), and includes three degrees of interactiveness in terms of task methods (1st pair part voice messaging, 2nd pair part voice messaging, and live video chat).
The specification of the IC test construct and development of the rating scale were based on everyday-life linguistic laypersons’ criteria of IC. The use of everyday members’ criteria to define the test construct represents an attempt to democratize research by complementing the perspective of applied linguists with the one of linguistic laypersons, who use the language on a daily basis and are the final arbiters of successful interaction. 36 linguistic laypersons listened to and commented on 22 pilot test-takers’ performances on the test. Thematic analysis on linguistic laypersons’ interview transcripts and written comments returned five indigenous categories that formed the five rating categories in an indigenous IC scale. When analysing pilot test-taker discourse in terms of IC, the researcher proposed the use of Sequential-Categorical Analysis, which combines Conversation Analysis and Membership Categorization Analysis to allow for the investigation of both the sequential and categorical aspects of interaction. The analysis of test-taker discourse through Sequential-Categorical Analysis assisted the researcher to theorize the a-theoretical indigenous rating scale into a theorized IC scale, which has 1) disaffiliation control, 2) affiliation promotion, 3) morality, 4) reasoning, and 5) social role management as its five rating categories. The IC construct developed in this dissertation moves beyond constructs in existing IC scholarship as it illustrates the multidimensional nature of IC, encompassing not only the sequential dimension of interaction, but also the emotional, moral, logical and categorical dimensions.
105 test-takers from 26 different countries participated in the main testing study, whose performances were rated in a fully-crossed design by two raters using the theorized IC rating scale. Many Facet Rasch Analysis on rating results showed that item performance, rater reliability and rating scale functioning were satisfactory. Rasch Principal Component Analysis demonstrated the unidimensionality of IC as a test construct. Correlational analyses on test-takers’ IC test performance and an external measure of proficiency revealed that proficiency has limited predictive strength on IC: r (104)= .42, p<0.05. Lower proficiency L2 speakers could outperform higher proficiency L2 speakers and even L1 speakers. This finding challenges the longstanding native-speakerism in language teaching and testing as it shows that L1 speakers are not the gold standard of IC. It also highlights that IC does not develop automatically as L2 speakers’ proficiency increases. Therefore, IC is an ability that needs to be taught and assessed separately from proficiency for L1 and L2 speakers alike.
The extrapolative power of the IC test was ascertained through a self-assessment questionnaire and a peer-assessment questionnaire. Test-takers’ self-assessment and peer-assessment were correlated with their test performance, the results of which showed that the IC test can reasonably predict test-takers’ IC in non-assessment, real-life settings. Test-takers’ attitude towards the test was also elicited in the questionnaire, which was favourable and supportive of the use of IC items in general speaking assessment.
Although the role-play IC test in this dissertation is based on L2 Chinese, findings from this dissertation are potentially applicable to other test tasks and target languages due to the highly theorized nature of the IC construct in this dissertation, which is not specific to any context, language, or task type. The theorized IC rating scale embodies a holistic IC construct that goes beyond the mechanics of interaction and ventures into the assessment of speakers’ ability at managing affect, logic, morality and categorization in and for interaction. This IC model is theoretically robust as it is consistent with other more holistic models on interpersonal interaction such as Dell Hymes’s original conceptualization of communicative competence and Aristotle’s three artistic proofs. Future research can adapt the current IC test and localize the theorized IC scale to see if findings from this study still obtain when applied to other target languages, task types and assessment contexts. The use of computer-mediated communication platforms for test delivery in this dissertation increases the practicality, accessibility and affordability of IC assessment, especially during times when the COVID-19 pandemic made face-to-face assessment unfeasible. The computer-delivered nature of this IC test also allowed test-takers from a wide range of backgrounds to participate, which differed from the typical participants in applied linguistics research, who tend to be affluent middle-class university students. This design in the dissertation helped to promote greater fairness, democracy and inclusivity in applied linguistics and language testing research.