Headnote
Abstract
In academic writing, conjunctions are crucial because they promote coherence, cohesion, and logical connections between ideas. The current study scrutinizes the frequencies of the ten most widespread B2 level conjunctions in the British Academic Written English Corpus as found in published PhD theses written in the English Language Teaching field by native English, native Turkish, and native Spanish researchers. The aim of this comparative study is to learn more about the similarities and differences in conjunction usage among researchers with various linguistic backgrounds. The comparison of English language users with Turkish and Spanish researchers is a novel feature of this study. A plausible dataset of published PhD dissertations was subjected to a corpus-based analysis in order to identify and quantify the frequencies of the target conjunctions. The results of this study offer insightful information on how researchers with various linguistic backgrounds use conjunctions at the B2 level in academic writing. The findings aid in the comprehension of language transfer effects and could provide researchers and language educators with information on potential language-specific difficulties faced by non-native English speakers while writing academically. The study also gives information on how native language influences conjunction usage, laying the groundwork for future studies in contrastive linguistics and second language teaching.
Keywords: Corpus-based analysis, conjunctions, English Language Teaching, contrastive linguistics, academic writing
Öz
Akademik yazımda, bağlaçlar, fikirler arasındaki tutarlılığı, bütünlüğü ve mantıksal bağlantıları destekledikleri için çok önemlidir. Bu çalışma, İngiliz Dili Eğitimi (ELT) alanında anadili İngilizce, Türkçe ve İspanyolca olan araştırmacılar tarafından yazılmış doktora tezlerinde bulunan İngiliz Akademik Yazılı İngilizce Derlemindeki (BAWE) en yaygın on B2 seviyesi bağlacın frekanslarını incelemektedir. Bu karşılaştırmalı çalışmanın amacı, farklı dilsel geçmişlere sahip araştırmacılar arasında bağlaç kullanımındaki benzerlikler ve farklılıklar hakkında daha fazla bilgi edinmektir. İngiliz dil kullanıcılarının Türk ve İspanyol araştırmacılarla karşılaştırılması bu çalışmanın alana önemli bir katkısıdır. Yayınlanmış doktora tezlerinden oluşan makul bir veri kümesi, hedef bağlaçların kullanım sıklıklarını belirlemek ve ölçmek için derlem tabanlı bir analize tabi tutulmuştur. Bu çalışmanın sonuçları, farklı dilsel geçmişlere sahip araştırmacıların akademik yazımda B2 düzeyinde bağlaçları nasıl kullandıklarına dair aydınlatıcı bilgiler sunmaktadır. Bulgular, dil transferi etkilerinin anlaşılmasına yardımcı olurken, araştırmacılara ve dil eğitimcilerine, ana dili İngilizce olmayanların akademik yazım sırasında karşılaştıkları dile özgü olası zorluklar hakkında bilgi sağlayabilir. Çalışma aynı zamanda anadilin bağlaç kullanımını nasıl etkilediğine dair bilgi vererek karşılaştırmalı dilbilim ve ikinci dil öğretimi alanlarında yapılacak çalışmalara zemin hazırlamaktadır.
Anahtar kelimeler: Derlem tabanlı analiz, bağlaçlar, İngilizce öğretimi, karşılaştırmalı dilbilim, akademik yazma
INTRODUCTION
With the advancement of technology and computer use, a number of academics have made significant use of corpus-based studies in recent years. A structured collection of written or spoken texts for qualitative and quantitative studies is referred to as a corpus (Biber, Conrad, & Reppen, 1998). Corpus linguistic techniques are used to evaluate and grasp the frequency and usage of linguistic characteristics to determine how they are utilized in the corpus under consideration. Utilizing corpora offers the chance to examine organically occurring language in texts, which is one of its benefits. Additionally, researchers can overcome subjective reflection, limited knowledge, and preconceived notions with the aid of corpora by moving beyond obvious linguistic trends and noted traits (Mehl, 2016). Along with linguistic analyses, corpus studies also aid in the study of foreign and second languages through data-driven activities that build students' learning skills and give them access to relevant and real-world language data (Trebits, 2009). Additionally, concordance software enables students to recognize linguistic trends (Hunston, 1995). In any case, corpus-based studies will show researchers, teachers, and students new avenues for learning more about the language(s) under a microscope.
Researchers have discovered a chance to identify trends in language use in spoken and written texts through the study of corpora. A number of studies (Adel & Erman, 2012; Byrd & Coxhead, 2010; Chen & Baker, 2010; Conrad & Biber, 2005; Nesi & Basturkmen, 2006; O'Flynn, 2022; Ozturk & Durmu§oglu-K6se, 2016) have concentrated on the use of lexical bundles in English in academic writing. Contrarily, there are numerous corpus-based research on conjunction usage. One of these studies, conducted by Trebits (2009), uses the British National Corpus (BNC) database to examine the use of conjunctions in texts connected to the EU texts written in general English. Many other studies have also questioned the usage of conjunctions by non-native English speakers in academic contexts, such as in their theses and/or research publications (Granger & Tyson, 1996; Martin, 2003; Partridge, 1995; Zhao, 2011; Zou, 2015). The majority of this research concentrates on English learners from China or Japan. To our knowledge, Cosjain's (2011) study, in which he contrasts Uzbek and Turkish learners and their employment of cohesive devices in writing, is the only research work that focuses on Turkish learners of English in terms of their use of conjunctions. In light of this, it is necessary to look into the use of conjunctions and the degree to which non-native researchers use them on par with native researchers, as it is crucial for non-native writers to sound as native in their dissertations as possible (Pallotti, 2009).
The present study delves into the role of conjunctions in academic writing, focusing on the frequencies of the ten most prevalent B2 level conjunctions in the British Academic Written English Corpus (BAWE). The research focuses on PhD theses from researchers of diverse linguistic backgrounds, including LI English, LI Turkish, and LI Spanish. The methodological approach involves a corpus-based analysis, identifying and quantifying the frequencies of target conjunctions.
To frame this research theoretically, "usage-based linguistics" provides a comprehensive aspect by exploring the significance of the role of frequency in language use. In usage-based linguistics, which is grounded in the role of language use in creating linguistic knowledge, the position is that all linguistic structures are functions of language use, and thus represented in the mind as constructions (Bybee, 2008; Diessel, 2017). Therefore, higher frequency constructions will be more quickly processed and will have a greater impact on linguistic knowledge, which is observed both in native and non-native language users. Similarly, the "constructionist approach" emphasizes how to learn and use linguistic constructions in diverse contexts (Ellis, 2006; Gries & Ellis, 2015), and how crucial to analyze conjunction use in varied linguistic backgrounds is (Goldberg, 2006).
In the present study, these two frameworks are combined to outline conjunction usage and explain it within the general framework of language acquisition, as suggested by the usage-based theory. This approach will increase sensitivity to the process of how L2 English speakers acquire and use linguistic structures, thereby providing both theoretical enrichment and better practical applications in language pedagogy. The study aims to examine whether the frequency of conjunction use differs in academic writing, specifically whether the given frequencies stay constant with general patterns of use found in the BAWE corpus and how L1 English, L1 Turkish, and LI Spanish researchers compare.
The findings provide valuable insights into the usage of conjunctions by researchers from different linguistic backgrounds, enhancing language transfer effects and addressing language-specific challenges faced by non-native English speakers. The study also highlights the influence of native language on conjunction usage, offering theoretical and practical advancements in academic writing and language pedagogy. More specifically, according to Gries and Ellis (2015), the hierarchical organization in a network of linguistic knowledge means that conjunctions, like any other constructions, are acquired and used as a function of their frequency and the contexts of their occurrence. In this regard, the current study seeks to identify the ten CEFR (Common European Frameworkof Reference for Languages)-B2-level conjunctions most frequently employed by LI English, LI Turkish, and LI Spanish scholars in their PhD dissertations in the subject of English Language Teaching (ELT). This will be a unique study to compare English language users with Turkish and Spanish, two non-native English speaker groups, both among themselves and with a group of native English speakers. The research questions listed below will be put out in order to fully describe the investigation:
1. What are the frequencies of the ten most frequent B2 level conjunctions in the British Academic Written English Corpus (B AWE) as a reference corpus?
2. How do these frequencies compare to the conjunctive preferences of LI English, LI Turkish, and LI Spanish researchers in the field of ELT as observed in published PhD Dissertations?
3. Do the frequencies of these conjunctions differ significantly between:
a. Turkish and Spanish authors' corpora?
b. Turkish and English authors' corpora?
c. Spanish and English authors' corpora?
Overall, the current study will unearth subtle insights into conjunction use in academic writing through an application of a usage-based linguistic framework. The findings generate better views on how conjunctions are used by scholars from different linguistic backgrounds, increasing our realization of the effects of language transfer and challenges specific to a language for speakers of English as a non-native language. The findings also prove instrumental in developing effective language teaching strategies and assisting non-native speakers in overcoming language-specific pitfalls in academic writing.
LITERATURE REVIEW
Corpus Studies Among Different Language Variables
Although corpus-based research has spread internationally in recent years, the corpus most frequently compared to other corpora or to itself has primarily been the corpus of native English speakers. Many researchers have developed an interest in how non-native speakers of English use the language and the differences between their use and that of native speakers in a particular linguistic item after noticing the research gap between native and non-native speakers of English. In addition to studies on learner corpora, several scholars have focused on academic corpora, or how non-native English speakers use the language in their dissertations and/or research publications (Kuswoyo et al., 2020; Ucar, 2017).
The usage of lexical bundles in various linguistic variables has been the main focus of the majority of corpus-based studies. As is acknowledged, lexical bundles are an essential resource in determining academic discourse because the successful use of lexical bundles is vital for writers in order to sound fluent and native-like (Oztiirk & Durmu§oglu-K6se, 2016). Some of recent studies (Adel & Erman, 2012; Chen & Baker, 2010) claim that non-native writers of English produce less varied lexical bundles and overuse some of them compared to native writers of English. In this regard, numerous studies (Adel & Erman, 2012; Byrd & Coxhead, 2010; Chen& Baker, 2010; Conrad & Biber, 2005; Nesi & Basturkmen, 2006; O'Flynn, 2022; Oztiirk & Durmu§oglu-K6se, 2016) have tried to identify the frequency of using lexical bundles in different language variables and examine to what extent non-native writers of English sound native-like in their writings in terms of lexical bundles use.
The Use of Conjunctions in Academic Writing Among Different Language Variables
A conjunction is a word that joins sentences or groups words together in a sentence. Conjunctions serve as a semantic link between two ideas, and it is necessary to comprehend the first notion before using conjunctions to interpret the second (Halliday & Hasan, 1976). Conjunctions have a significant role in achieving coherence and cohesion, two of the most important aspects of academic writing. The former emphasizes connections at the argument level, while the latter emphasizes connections at the sentence level. Conjunctions can wholly or partially change the meaning if they are not employed in a way that promotes coherence and/or cohesiveness. Thus, it is crucial for speakers of a foreign language to be familiar with and employ the majority of conjunctions, if not all of them, in suitable contexts. It is considerably more important for academic researchers who are writing a dissertation and/or a research article in a foreign language to have mastered the right uses of conjunctions because academic writing calls for a far more thorough and advanced linguistic use of the language.
Though there has been some research on the use of English conjunctions, many studies have mostly concentrated on native English speakers (Christie & Derewianka, 2008; Diessel, 2004; Schleppegrell, 2004)specifically focusing on how youngsters learn conjunctions rather than how adults use them in that language. Because adults may weigh causes and present arguments for and against diverse perspectives, academic academics who are by nature adults were chosen for this study (Slobin, 1996). Granger and Tyson (1996) conducted one of the earliest studies on the use of conjunctions by native and non-native speakers, and their initial premise was that LI English speakers tend to overuse, and L2 English speakers tend to underuse specific conjunctions in their essays. Although the qualitative study demonstrated that there are semantic, stylistic, and syntactic errors between these two groups of English speakers, the quantitative analysis demonstrated that the hypothesis was incorrect. Martin (2003) performed a genre analysis on Spanish and English research article abstracts, the two languages that make up two of the three variables in the present study. He discovered rhetorical factors in both of these linguistic variables and connected these variations to the various demands placed on Spanish and English writers. Michel (2013), one of the researchers to work on a spoken corpus in this area, compared the use of conjunctions by native and non-native English speakers in cognitively simple versus complex oral tasks and tested the Cognition Hypothesis (Robinson, 2007) on a potential difference between these two groups, but she was unable to find a statistically significant difference between the two groups.
There is a dearth of research that compares how pre-determined conjunctions are used by English-speaking writers who are native speakers and non-native speakers in academic writing. The research focuses on conjunctions at the B2 level because this level denotes a high proficiency where learners can comfortably interact with native speakers, demonstrating a level of language competency that reduces stress during communication for both the learner and the native speaker. Below this level of proficiency is mostly limited to daily interactions rather than academic writing (Council of Europe, 2001).
METHOD
The study uses a corpus-based research design, employing the AntConc software, which enables a thorough investigation of the frequencies of B2 level conjunctions in published PhD dissertations written by academics with LI English, LI Turkish, and LI Spanish backgrounds in the ELT area.
Research Corpus
The three independent corpora employed in this study were drawn from 20 L2 English Ph.D. dissertations by Turkish writers, 20 L2 English Ph.D. dissertations by Spanish authors, and 20 L1 Ph.D. dissertations by English native speakers working in the ELT area. The primary justification for selecting doctoral theses is that it is assumed that English writers at the doctoral level have a high degree of proficiency in the language and have likely produced several research articles, making them knowledgeable in conjunction usage.
Dissertations from the ELT field were chosen not only because it is the researcher's field and she is interested in seeing how her colleagues use conjunctions, but also because it is important to maintain field consistency when choosing the theses in order to avoid any misjudgments brought on using conjunctions in various fields. To maintain relevance to contemporary academic standards and research interests, the study focuses on recent theses from 2014. Moreover, methodological consistency is ensured, and biases are prevented by using a constant time frame. For the purpose of gathering and analyzing data, it is feasible to examine theses from more recent times. Concentrating on current theses, in addition, helps the readers better understand the evolution of academic discourse in the selected study area and identify shifts in theoretical frameworks, research goals, and social significance.
Turkish writers' theses were acquired from the Turkish Council of Higher Education Thesis Center's official website at https://tez.yok.gov.tr/UlusalTezMerkezi/giris.jsp. Then, from an official website (https://dialnet.unirioja.es/tesis), which contains Ph.D. dissertations from 40 Spanish universities, the theses written by Spanish authors were taken. The final step was to retrieve the theses authored by English native speakers from ProQuest, a database of dissertations and theses (http://search.proquest.com/pqdtglobal?accountid=l 1248).
Descriptive Statistics of the Corpus
After the theses were downloaded, they were uploaded into the software AntConc 3.4.4.m. The descriptive data for each corpus used in this investigation are shown below.
As can be seen from Table 1, the total tokens of Turkish and LI English authors' corpora are close to each other in terms of number, whereas the Spanish authors' corpus almost doubles both corpora in terms of tokens and types. In order to avoid the issues that can arise from this huge difference in the number of tokens between the Spanish authors' corpus and the others, the log-likelihood and effect size calculator (http://ucrel.lancs.ac.uk/llwizard.html) was used.
Identifying Conjunctions
After the preliminary analyses, the BAWE Corpus was uploaded to the program AntConc 3.4.4.m, where a frequency analysis was conducted to arrange the words according to how frequently they appeared throughout the entire corpus. The researcher then used the Cambridge Dictionaries Online website (http://dictionary.cambridge.org) to manually choose the conjunctions starting at the top of the frequency list and determine where those conjunctions belonged in the CEFR.
Frequency Analysis
After the most frequently used ten CEFR-B2-level conjunctions in the BAWE Corpus were determined, the second phase of the study was conducted and frequencies both within and across the three language variables were analyzed. Determining whether a conjunction is overused or underused usually entails statistical analysis that compares the conjunction's frequency in one corpus (e.g., LI Turkish writers' theses) with another corpus (e.g., LI Spanish authors' theses). The functions of the conjunctions were examined as the third step following the frequency analysis, paying particular attention to those that could have more than one function or a different function when used with another word (for example, the conjunction though changes its function when the word as is preceding it). The data were removed from uses that did not fulfil the primary purpose of the conjuncts in question.
FINDINGS
The results revealed that the most frequent ten conjunctions at B2 level according to CEFR in BAWE Corpus are thus, though, furthermore, as a result of, whereas, moreover, consequently, nevertheless, additionally, and in conclusion; starting from the most frequent. Table 2 below shows the rank and frequency of these conjunctions in BAWE Corpus.
The Rank and Frequency Statistics of Conjunctions in Each Corpus
The first research question tries to find out the frequencies of the most frequent ten B2-level conjunctions according to BAWE Corpus in PhD theses written by English, Turkish, and Spanish researchers. After the conjunctions in question were finalized, a word search in the AntConc 3.4.4.m software was computed for each language and the rank and frequency of each conjunction in each corpus were found. Figure 1 shows the frequency statistics of the conjunctions in each corpus in a general view.
As seen in Figure 1, the use of conjunctions in native and non-native languages varies although there is no difference in the frequency of some conjunctions such as consequently, additionally, and in conclusion across three languages. After this overall evaluation, Table 3 shows the statistics for each language variable in detail to answer the second research question concerning how these frequencies compare to the conjunctive preferences of LI English, LI Turkish, and LI Spanish researchers in ELT as observed in published PhD dissertations.
The reference conjunctions could, at this point, be ranked according to the frequency with which they appear in each corpus, as shown in Table 3. In order to determine whether there are any parallels or differences, we will score the conjunctions from one to ten in descending order, with one being the most frequently used CEFR-B2-level conjunction and ten being the least frequently used.
The Frequencies of the Conjunctions Across Three Language Variables
The third research question examines whether the use of conjunctions differs significantly across the PhD dissertations written by LI Turkish, LI Spanish, andLl English researchers. In this respect, first, the ranks of the conjunctions in each corpus were described. The ranks of the conjunctions in three different corpora are displayed in Table 4.
Three conjunctions are ranked the same in all three corpora, as seen in Table 4: additionally, ranking seventh; consequently, ranking eighth; and in conclusion, ranking tenth. Additionally, Rayson's online calculator was utilized to see statistical differentiation between the conjunction frequency in each corpus. In this analysis, the Bayes Factor and Effect Size for Log-likelihood (ELL) are taken into account. The assumption that two words to be evaluated are used at non-significantly different rates in two corpora is the null hypothesis in this calculation (Wilson, 2013). This null hypothesis is stronger the further the Bayes Factor deviates from the point of 2. If so, there would be no use in considering ELL. The ELL, however, becomes significant when the Bayes Factor is greater than 2 because it establishes the magnitude of the statistical difference between the two words under study.
Comparison between Turkish and Spanish Authors' Corpora
To examine the conjunctions employed in two non-native authors' PhD theses in the field of ELT, the researcher compared their corpora. Since English is a language that neither the Spanish nor the Turkish researchers are native speakers of, it was expected that there would be some overlap between these two corpora in terms of the frequency of conjunctions employed in academic writing. Although there were some variances in the frequency of the other conjunctions as being over- and underused, the researcher discovered parallels in the use of though, consequently, and additionally. The comparison of the corpora of Turkish and Spanish authors is seen in Table 5.
The empty lines belonging to the conjunctions though, consequently and additionally were kept unfilled because there was no statistically significant variation in their frequency. Seven conjunctions are apparently used at a statistically different frequency rate. As stated above, additionally and consequently rank the same across the Turkish and Spanish authors' corpora, and probably that is why no statistical difference was found in terms of frequency. However, for the conjunction in conclusion, the case is different: it is overused in Turkish authors' corpus, though more slightly than the other conjunctions. The conjunction used with the biggest statistical difference is as a result of, which is overused in Turkish authors' corpus with a significance value lower than .0001. Other five conjunctions with statistical difference value, thus, furthermore, moreover, nevertheless, whereas are all underused in the Turkish authors' corpus, the first four of which have a significance value lower than .001.
After answering the question of how often, the question of how the Turkish and Spanish authors use the conjunction as a result of the highest difference rate arises. To have a clearer idea of the usage of as a result of the most overused conjunction in Turkish corpora relative to Spanish corpora, concordance hits on the usage of as a result of 'in both corpora are given below.
Figures 2 and 3 show that although there is no statistically significant difference in the grammatical usage of the conjunction between these two language variables, there are differences in the parts of the theses where it is employed. Spanish authors utilize the conjunction primarily for discussion or literature review parts, as can be observed by the concordance lines and what comes after it as a result, whereas Turkish authors prefer to use it primarily for data analysis or techniques sections. As a result of 'is used in a variety of sections in this particular corpus, and this variety of sections where it is used reveals the likely cause of this conjunction's overuse in Turkish authors' corpora compared to Spanish authors' corpora. The first and second hits in Turkish authors' corpora show us that they also use it for literature reviews or discussion parts to some extent.
Comparison between Turkish and English Authors' Corpora
After comparing two non-native authors' corpora, the researcher tried to understand the similarities and differences between Turkish and English authors' corpora. Therefore, another analysis was computed between LI Turkish authors' corpus and LI English authors' corpus. Table 6 below shows the statistical significance of the frequencies of the conjunctions.
Table 6 shows that there are now only five conjunctions employed at a statistically significant differential rate. This demonstrates that Turkish and LI English authors employ the aforementioned conjunctions more frequently than Turkish and Spanish authors do. The fact that all of the conjunctions in Table 6 that differ significantly fall under the category p < 0.0001 and are all used at extremely different rates from one another is particularly noteworthy. It can also be said that the conjunction in conclusion is overused in the Turkish corpus though they rank the same (10th) in both corpora. As a result of is overused and nevertheless is underused in LI Turkish authors' theses, which is consistent with the comparison of Turkish and Spanish authors' corpora and echoes the use patterns seen in LI English authors' corpus.
To provide a clearer understanding, the concordance lines for two conjunctions, thus and though, that are over- and underused in Turkish authors' corpora in comparison to LI English authors' corpora are presented in the figures below.
Figures 4 and 5 demonstrate how the conjunction's grammatical usage varies between LI Turkish and LI English author corpora, two language variables. Despite the fact that Turkish authors appear to have stuck with the sentence-connecting function of thus, LI English authors use it in two additional grammar structures: connecting sentences with the same subject by adding the gerund form of the second sentence's verb (as seen in lines 1, 3, 6, 9, 10, and 12) and connecting sentences with the same subject by adding the second sentence's conjugated verb (lines 11,15 and 16). Figure 4 illustrates how only lines 8,11 and 15 were used by Turkish authors to implement these functions. In Figure 5, lines 2 and 8 adopt the same grammar as the majority of Turkish authors, joining two independent phrases and beginning the second one with so. Turkish authors use thus less frequently than their L1 English colleagues, but an intriguing conclusion is that they appear to have overused it in their theses. Figures 6 and 7 below give us a better view of the conjunction though in Turkish and L1 English authors' corpora.
Comparison between Spanish and English Authors' Corpora
Another comparison involved the corpora of LI Spanish authors and LI English authors. Although some similarities in conjunction usage across LI Turkish, LI English, and LI Spanish; LI English corpora were anticipated, the results were considerably different, with the exception of the use of though. The results are shown in Table 7 below.
As illustrated in Table 7, the only underused conjunction in Spanish authors' corpus compared to LI English authors' corpus is though. The other five conjunctions, which are thus, furthermore, whereas, moreover and nevertheless, were used at a statistically different rate and determined as being overused in Spanish authors' corpus.
To exhibit the usage variations of the conjunction used at the highest statistical difference rate, though (p < 0.0001), concordance lines generated in LI Spanish and LI English corpora are respectively shown below in Figures 8 and 9.
When we look at how though is used grammatically in both corpora, we can see that LI Spanish authors typically stick to the even though usage and do not choose to utilize any other functions. This is likely the same factor that contributes to LI Turkish and Spanish authors' underuse of conjunctions compared to LI English authors.
DISCUSSION & CONCLUSION
Corpus linguistics is a valuable resource to enhance language teaching. By providing naturally occurring language data, it paves the door for academics to learn more about the language(s) they are interested in as well as learners to see how the language is used by native speakers. In order to make their teaching more fruitful and authentic, it also gives teachers the opportunity to develop corpus-driven activities or data-driven learning.
This study concentrated on two research questions, the first of which attempted to determine the frequency of ten conjunctions at B2-level according to CEFR in the B AWE Corpus in corpora of published PhD Theses in English written by L1 English, L1 Turkish, and L1 Spanish researchers in the ELT field. This study alsoaimed to contribute to the field by highlighting the benefits of corpus linguistics. The second question further asked whether there is a substantial difference in the frequency of these conjunctions across three language variables. The most frequent ten conjunctions in question were revealed as thus, though, furthermore, as a result of, whereas, moreover, consequently, nevertheless, additionally and in conclusion, starting from the most frequent to the least frequent in this list. Comparing LI Turkish to LI English, LI Spanish to LI English, and LI Turkish to LI Spanish revealed overuses and underuses of each applicable conjunction.
The findings of this study can be interpreted on several counts. When the corpora of two non-native English speakers - Turkish and Spanish authors— are combined and contrasted with those of LI English authors, the conjunction thus can be considered to be overused in both non-native corpora, whereas the conjunction though is underused. Although there is no statistically significant difference in the frequency of the conjunction though between Turkish and Spanish authors' corpora, the Turkish authors' corpus shows that the conjunction thus is underused. If the goal is to track how L1 English authors use conjunctions, this can be taken to mean that Spanish authors should aim to use thus less and that though usage can be increased inborn settings by exposing non-native scholars to more though usage and by encouraging them to expand their usage beyond just using it in conjunction with even though.
It may be claimed that Turkish authors utilize the conjunction nevertheless at an underused rate in both comparisons (Turkish vs L1 English and Turkish vs Spanish) when the corpus of Turkish authors is placed on one side and the Spanish and LI English corpora are placed on the other. It could be because this conjunction is taught considerably later than others at the B2 level, like thus or as a result (of, and is considered a more advanced conjunction as opposed to a CEFR-B2-level conjunction. It is highly likely to infer that lexical complexity (Palloti, 2015) may contribute to this. On the other hand, in both comparisons, the conjunctions as a result of and in conclusion are overused.
The five conjunctions -thus, furthermore, whereas, moreover, and nevertheless— were overused by Spanish authors in both comparisons (Spanish vs Turkish and Spanish vs LI English), when the corpus of Spanish authors is taken as one side and the Turkish and LI English corpora are as the other. The inclusion of furthermore and moreover on this list may be due, in part, to the fact that the Spanish word for these conjunctions, ademds, contains the English word more (mas), making it easier for learners to code these words in their second language (L2). Further analyses that explore the reasons why the other conjunctions, such as whereas, thus, and nevertheless, appear to be overused in both comparisons are required.
Overall, some conjunctions have a similar syntactic structure in different languages, e.g., furthermore and ademds (Spanish) and this possibly explains the overuse of the conjunction in question. Some conjunctions (e.g., nevertheless) are taught in Turkish curriculum of teaching English later than the others in the list (e.g., thus and in conclusion) to be consistent with the principles of language learning, which begin with basics and progressively move onto more intricate language structures as students progress (Ellis, 2015; Lightbown& Spada, 2021). For instance, in academic writing, "thus" and "in conclusion" are used to denote conclusions and logical linkages. Likewise, Demir (2019) found that the word "nevertheless" was absent from pre-test but returned in the post-test. The finding is in line with theories of language acquisition that support a progressive curriculum that introduces more sophisticated linguistic structures later on. The fact that the word "nevertheless" was absent from the pre-test and then appeared in the post-test shows how learners' comprehension and application of complicated conjunctions have evolved over time. Although there is no rigid guideline for teaching conjunctions, this approach aligns with language fundamentals, which might explain overuses and underuses.
The research can be expanded upon by adding two more corpora—LI Spanish speakers' theses in Spanish and LI Turkish speakers' theses in Turkish—and by comparing the corpora of native speakers of different languages to one another in order to determine how (and how frequently) a conjunction is utilized in each language. To further comprehend the difference between these language characteristics, L1 interference may be investigated. Instead of PhD theses, research articles might be used as a database for corpora to carry out a comparable investigation. In addition to these, conjunctions having various grammatical functions, such as though, can be thoroughly studied in other corpora used by non-native academics, LI English corpora, namely ICLE (International Corpus of Learner English), COCA (Corpus of Contemporary American English) and BASE (British Academic Spoken English).
Implications
This study has a great deal of promise to advance teaching academic writing in a number of ways. At the outset, this study uses a corpus-based approach to analyze academic writing, providing insights into linguistic patterns and preferences for LI English, LI Turkish, and LI Spanish researchers while using conjunctions, aiding language instructors in effective teaching. In addition, the study examines ten B2-level conjunction frequencies, enabling educators to identify common connective elements in academic writing, enhancing students' understanding and comprehension (Yoon & Hirvela, 2004). Furthermore, the study's investigation of conjunction usage across three language variables (L1 English, L1 Turkish, and L1 Spanish) emphasizes the impact of language variation (O'Sullivan, 2019). Teachers who are aware of these variations can better adapt their methods of instruction to meet the requirements of students from various language backgrounds. This study can also pinpoint the conjunctions and linguistic elements that help written texts acquire an academic voice through corpus analysis (Flottum et al., 2006). Last but not least, it may create an opportunity to learn how to do research and data analysis using a corpus-based approach, emphasizing evidence-based writing (Giannoni, 2010) and critical analysis in academic environments. Inspiring students to conduct their own linguistic research enhances their research skills. Overall, these contributions to both corpus linguistics and the field of ELT will aid language learners, teachers, and researchers in understanding the languages in question in a more systematic and thorough manner.
Statements of Publication Ethics
To guarantee the legitimacy and dependability of the research findings within the academic community, this study adheres to the principles of data integrity, proper attribution, and transparent reporting.
Conflict of Interest
This study does not have any conflict of interest.
References
REFERENCES Add, A., & Erman, B. (2012). Recurrent word combinations in academic writing by native and non-native speakers of English: A lexical bundles approach. English for Specific Purposes, 31(2), 81-92. https://doi.Org/10.1016/j.esp.2011.08.004 Biber, D., Conrad, S., & R. Reppen. (1998). Corpus linguistics: Investigating language structure and use. Cambridge: Cambridge University Press.
Bybee, J. (2008). Usage-based grammar and second language acquisition. In P. Robinson & N. C. Ellis (Eds.), Handbook of Cognitive Linguistics and Second Language Acquisition (pp. 226-246). Routledge. Byrd, P., & Coxhead, A. (2010). On the other hand: Lexical bundles in academic writing and in the teaching of EAP. University of Sydney Papers in TESOL, 5(5), 31-64.
Chen, Y. H., & Baker, P. (2010). Lexical bundles in LI and L2 academic writing. Language Learning & Technology 14(2), 30-49. Christie, F., & Derewianka, B. (2008). School discourse: Learning to write across the years of schooling. London: Continuum.
Conrad, S. M, & Biber, D. (2005). The frequency and use of lexical bundles in conversation and academic prose. https://doi.org/10.1515/9783484604674.56 Cosjoin, E. (2011). Cohesion in Compositions of Turkish and Immigrant Students. Kuram Ve Uygulamada Egitim Bilimleri, 11(2), 892-899.
Council of Europe. (2001). Common European framework of reference for languages: Learning, teaching, assessment. Cambridge, U.K: Press Syndicate of the University of Cambridge. Demir, C. (2019). Developing of conjunctive adverbs in the writing of English for undergraduate students. EKEV Akademi Dergisi, 23(78), 137-154. Diessel, H. (2004). The acquisition of complex sentences. New York/Cambridge: Cambridge University Press.
Diessel, H. (2017). Usage-based linguistics. InM. Aronoff (Ed.), Oxford Bibliographies in Linguistics. New York: Oxford University Press. Ellis, N. C. (2006). Language acquisition as rational contingency learning. Applied Linguistics, 27, 1-24.
Ellis, R. (2015). Understanding second language acquisition 2nd edition. Oxford University Press. Fkfttum, K., Dahl, T., & Kinn, T. (2006). Academic voices. Pragmatics & Beyond, 148. https://doi.org/10.1075/pbns. 148
Giannoni, D. S. (2010). Mapping academic values in the disciplines: A corpus-based approach, (Vol. 124). Peter Lang. Goldberg, A. E. (2006). Constructions at work: The nature of generalization in language. Oxford, UK: Oxford University Press.
Granger, S. & Tyson, S. (1996). Connector Usage in the English Essay Writing of Native and non-Native EFL Speakers of English. World Englishes, 15(1), 17-27. https://doi.Org/10.llll/j.1467-971X.1996.tb00089.x Gries, S. T., & Ellis, N. C. (2015). Statistical measures for usage-based linguistics. Language Learning, <55(S1), 228-255.
Hasan, R., & Halliday, M. A. (1976). Cohesion in English. London: Longman. Hunston, S. (1995). Grammar in teacher education: The role of a corpus. Language Awareness, 4(1), 15-31. https://doi.org/10.1080/09658416.1995.9959864
Kuswoyo, H., Sujatna, E. T. S., Indrayani, L. M, & Rido, A. (2020). Cohesive conjunctions and and so as discourse strategies in English native and non-native engineering lecturers: A corpus-based study. International Journal of Advanced Science and Technology, 29(1), 2322-2335. Lightbown, P. M, & Spada, N. (2021). How languages are learned 5th edition. Oxford University Press.
Martin, P. M. (2003). A genre analysis of English and Spanish research paper abstracts in experimental social sciences. English for Specific Purposes, 22(1), 25-43. https://doi.org/10.1016/S0889-4906(01)00033-3 Me hi, S. W. (2016). Corpus onomasiology: A study in World Englishes (Doctoral dissertation, UCL (University College London)).
Nesi, H., & Basturkmen, H. (2006). Lexical bundles and discourse signalling in academic lectures. International Journal of Corpus Linguistics, 11(3), 283-304. https://doi.org/10.1075/ijcl.lL3.04nes O'Flynn, J. (2022). Lexical bundles in the academic writing of the Arts and Humanities: from corpus to CALL. Yearbook of Phraseology, 13(1), 81-108. https://doi.org/10.1515/phras-2022-0006
O'Sullivan, J. (2019). Corpus linguistics and the analysis ofsociolinguistic change: Language variety and ideology in advertising. Routledge. Oztiirk, Y., & Durmu§oglu-K6se, G (2016). Turkish and Native English Academic Writers' Use of Lexical Bundles. Journal of Language and Linguistic Studies, 12(1), 149-165.
Partridge, B. (1995). Genre Analysis and the Identification of Textual Boundaries. Applied Linguistics, (2), 502-505. https://doi.org/10.1093/applin/15.3.288 Pallotti, G (2009). CAF: Defining, refining and differentiating constructs. Applied Linguistics, 30, 590-601. https://doi.org/10.1093/applin/amp045
Pallotti, G (2015). A simple view of linguistic complexity. Second Language Research, 31(1), 117-134. https://doi.org/10.1177/0267658314536435 Robinson, P. (2007). Task complexity, theory of mind, and intentional reasoning: Effects onL2 speech production, interaction, uptake and perceptions of task difficulty. International Review of Applied Linguistics, 45, 193-213. https://doi.org/10.1515/iral.2007.009
Schleppegrell, M. J. (2004). The language of schooling: A functional linguistics perspective. Mahwah, NJ: Lawrence Erlbaum. Trebits, A. (2009). Conjunctive cohesion in English language EU documents-A corpus-based analysis and its implications. English for Specific Purposes, 28(3), 199-210. https://doi.Org/10.1016/j.esp.2009.04.004
Ucar, S. (2017). A corpus-based study on the use of three-word lexical bundles in the academic writing by native English and Turkish non-native writers. English Language Teaching, 10(12), 28-36. http://doi.org/10.5539/elt.vl0nl2p28 Wilson, A. (2013). Embracing Bayes factors for key item analysis in corpus linguistics. In: New approaches to the study of linguistic variability. Language Competence and Language Awareness in Europe. Peter Lang: Frankfurt, 3-11.
Yoon, H., & Hirvela, A. (2004). ESL student attitudes toward corpus use in L2 writing. Journal of Second Language Writing, 13(4), 257-283. https://doi.org/10.1016/jjslw.2004.06.002 Zhao, X. (2011). The use of casual connectives in non-English majors' writings: A study based on picture compositions and the SWECCL corpus. Foreign Language Education in China, 4(2), 12-20. Zou, B. (2015). A Corpus-based Analysis of the Use of Conjunctions in an EAP Teaching Context at a Sino-British University in China. In Corpus Linguistics in Chinese Contexts (Eds. Zou, B. & Hoey, M). 134-157. Palgrave, Macmillan: UK