This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1. Introduction
The development of Internet technology has driven the networking and internationalization of education. Online learning has gained unprecedented attention during COVID-19 prevention and control, which implies it will confront greater chances and problems [1]. Unlike in traditional classrooms, teachers cannot observe the status of students studying knowledge point videos in online learning, making it impossible for teachers to accurately determine the knowledge difficulties that online learners encounter in the learning process. Although teachers can estimate the difficulty of knowledge points empirically, several studies have shown that it is difficult for teachers to determine the correct difficulty level of knowledge points based on the cognitive level of students [2]. There are three main ways to understand the difficulty of knowledge points in the traditional teaching methods: questionnaire surveys, questions, and face-to-face talking. However, these traditional methods are time-consuming and influenced by learners’ subjective emotions, resulting in a failure to accurately reflect the real difficulty of the knowledge points.
With the widespread use of online learning platforms such as Coursera, edX, and Udacity, these online learning platforms store the learners’ interactive behaviors data in the process of online learning [3, 4]. The literature [5] summarizes online interactive behaviors into three types: student-system interaction (the behavior of learners interacting with knowledge point videos, courses, and resource systems), student-teacher interaction, and student-student interaction. These interactive behaviors reflect the students’ learning situations and understanding of knowledge points [6, 7]. For example, when students study a knowledge point more frequently, it means that the knowledge point is more difficult to understand. In terms of knowledge point difficulty, we can automatically obtain the knowledge point clusters at different levels of difficulty through Educational Data Mining (EDM) based on student learning behaviors, which can assist teachers in continuously optimizing the design of teaching content to provide a reliable guarantee for improving teaching efficiency. However, due to factors such as the difficulty of the knowledge points and learning habits, the interactive behavior (student-system, student-teacher, student-student) of different students in learning the knowledge points is different. The challenge of this research paper is how to use the multi-interactive behavior data (student-system, student-teacher, and student-student) in the online education system to accurately classify the difficulty of knowledge points for learners. From the perspective of student-system interactive behavior, the increased frequency and duration of watching videos means that students study difficult knowledge points [8, 9]. From the perspective of learning paths, students’ repeated study of the difficult knowledge points forms the structure of the learning sequence of the partial knowledge point cycle [10, 11]. Furthermore, the learning path must take into account students’ forgetting behavior [12], which is because students may study knowledge point videos because they forget the content. From the perspective of student-teacher interaction and student-student interactive behaviors, students interact more with their teachers and peers when they study difficult knowledge points. Therefore, the relationship between the different categories of interactive behaviors and the difficulty of the knowledge point needs to be analyzed.
Based on the above analysis, this paper innovatively proposes a difficulty-based knowledge point clustering algorithm using students’ multi-interactive behaviors (MIBKPC), which mainly includes the following contributions:
(1) We propose a difficulty-based knowledge point classification algorithm that combines three interactive behaviors to measure the similarity of knowledge point difficulty and provides a more accurate classification of knowledge point difficulty.
(2) Based on student-system interaction, a similarity measurement approach for the difficulty of knowledge points is proposed. The approach integrates the group learning path network, the degree of student-system interaction, and the forgetting behavior of learners, which can assist in measuring the learning process more accurately.
(3) Due to the sparsity problem of student-teacher and student-student interactions, traditional methods of measuring similarity are inaccurate. We propose a similarity measurement method of knowledge point difficulty based on interaction, which resolves the problem by considering all information about student interactions with knowledge points.
The rest of the paper is organized as follows: Section 2 introduces the research related to difficulty-based knowledge point classification. Section 3 describes the definition and calculations related to the algorithm. In Section 4, a difficulty-based knowledge point clustering algorithm using students’ multi-interactive behaviors is proposed. Section 5 presents a comparative analysis of different experimental results to evaluate the performance of the proposed algorithm. Finally, Section 6 concludes the work.
2. Related Works
Online learning platforms and educational institutions store a variety of student data. According to the literature [13], Multidimensional data analysis of learning behaviors using educational data mining techniques can help teachers and researchers better understand the learners’ learning process. However, only a few studies have been conducted to cluster or classify the difficulty of knowledge points based on the interactive behavior of learners.
In the study of the classification of teaching resources based on statistical methods: Li et al. [8] found that rewatching the video, speeding down, frequent pauses, and skips implied that the videos were more difficult for learners to study Sluis et al. [14] explored the relationship between video complexity and dwelling time or dwelling rate by analyzing learners’ clickstream tracking data Brinton et al. [10] proposed an event sequence-based framework to extract repeated subsequences of student behavior to identify recurrent viewing behaviors and found that subsequences were significantly correlated with learning effects Zhu et al. [15] investigated the weight coefficients of the impact of implicit video feedback in the student-system interactive behavior on the learning effect. The implicit video feedback included the video learning frequency, the video learning duration, and the video pausing and dragging frequency. The above works only explore the relationship between the difficulty of videos and interactive behaviors; however, they do not propose a specific method for classifying the difficulty of knowledge point videos. But these works provide a reference for extracting features of interactive behavior as input for knowledge point difficulty classification.
In the study of the classification of teaching resources based on machine learning: Kastrati et al. [16] proposed a video classification framework based on video content by converting the video into text, converting the text into vector space using representation techniques, and training the video classifier using the extracted vectors Othman et al. [17] proposed a classification framework based on video metadata, whereby XML technology was used to extract metadata related to videos, such as video description information and comments information, and then the metadata was used to classify videos using data mining techniques. In the above works, the classification is based on the content of the videos. However, video information only describes the basic content of the video and cannot be used to measure the difficulty of the video. Therefore, these methods cannot be used to classify the difficulty of knowledge point videos.
Zhang et al. [11] proposed a personalized classification algorithm for MOOC videos, which clusters students by their knowledge level, mines the VLBP structure of each class based on their video viewing data by using process mining techniques, and then measures the difficulty and importance of MOOC videos Zhang et al. [9] proposed a difficulty-based clustering method for SPOC videos, using the SimRank++ algorithm to calculate the difficulty similarity between two videos, and then a spectral clustering algorithm is used to achieve video clustering. Though the above algorithms studied the mapping model between the difficulty of videos and the learning behavior, they did not consider the mechanism of the intrinsic association between the multiple interactive behaviors of the learners and the difficulty of knowledge point videos.
To sum up, there is no existing research on knowledge point video difficulty classification based on learners’ multidimensional interactive behaviors. In this paper, student-system interactive behaviors, student-teacher interactive behaviors and student-student interactive behaviors in the online learning process are modeled to obtain the knowledge point difficulty similarity matrix, which is combined with the group-directed learning path network and knowledge point difficulty similarity measurement, and then the spectral clustering algorithm is used to classify the knowledge point difficulty.
3. Correlation Definition
In this section, the proposed algorithm’s relevant definitions and computational methods are described, and some of the definitions are analyzed and illustrated.
3.1. Knowledge Point
Knowledge point videos refer to the orderly short instructional videos recorded by teachers, which are numbered by the researcher, according to the course knowledge framework. Since the videos are short instructional videos, most of the videos only contain one knowledge point. Therefore, in this paper, one video represents one knowledge point.
3.2. The Degree of Student-System Interaction
The degree of student-system interaction refers to the degree of learners watching knowledge point videos [15]. The recordings of learners watching the knowledge point videos are stored in the online learning platform. From these records, we extracted three behavioral features: the knowledge point video learning frequency, the knowledge point video learning duration, and the knowledge point video pausing and dragging frequency. The calculation process is as follows:
3.3. The Degree of Student-Teacher Interaction
The degree of student-teacher interaction refers to students communicating with the teacher in the online learning process [18]. The online learning platforms store text information that teachers answer questions about knowledge points for students. Extracting keywords from the text and matching them with the knowledge point name allows us to determine which knowledge point is being asked about, then count the effective time and frequency of communication between teachers and students. The degree of student-teacher interaction is portrayed by the frequency and duration of interaction in student-teacher interactive behaviors. The calculation process is as follows:
3.4. The Degree of Student-Student Interaction
The degree of student-student interaction refers to the communication and discussion between students about knowledge points [18]. The online learning platforms also record text records from student-to-student discussions about knowledge points. We can also determine which knowledge point is discussed, such as teacher-student interaction, then count the effective time and frequency of communication between students. It is portrayed by the frequency and duration of student-student interactive behaviors. The degree of student-student interaction is calculated from the following equations:
3.5. Directed Learning Path Network
A directed learning path network (DLPN) is a topological network generated based on time series data of student-system interactive behavior [19, 20]. Directed learning path networks can be divided into personal directed learning path networks (PDLPN) and group-directed learning path networks (GDLPN).
4. Difficulty-Based Knowledge Point Clustering Algorithm Using Multi-Interactive Behaviors (MIBKPC)
The flow block diagram of the difficulty-based knowledge point clustering algorithm using students’ multi interactive behaviors (MIBKPC) is shown in Figure 1.
[figure(s) omitted; refer to PDF]
First, the student-system interaction data are analyzed to obtain the SC matrix and GDLPN, and the GDLPN is analyzed to obtain the In-Degree centrality of knowledge node (defined by formula (9)). In-Degree centrality of knowledge node and SC are used as the input of the knowledge point difficulty similarity model based on student-system interaction to obtain the SC-based knowledge point difficulty similarity matrix.
Second, ST matrix and SS matrix are obtained by analyzing the student-teacher interaction and student-student interaction data. ST and SS are used as the input of the knowledge point difficulty similarity model based on interaction to obtain the ST-based knowledge point difficulty similarity matrix and SS-based knowledge point difficulty similarity matrix, respectively.
Finally, the knowledge point difficulty similarity matrix is obtained by linear combination analysis of SC-based knowledge point difficulty similarity matrix, ST-based knowledge point difficulty similarity matrix, and SS-based knowledge point difficulty similarity matrix. A spectral clustering algorithm is used to implement difficulty-based knowledge points clustering based on the obtained similarity matrix.
The proposed algorithm is composed of four parts: the knowledge point difficulty similarity model based on student-system interaction, the knowledge point difficulty similarity model based on interaction, measurement of the difficulty similarity of knowledge points, and spectral clustering based on the difficulty of knowledge points. We have a detailed introduction in the following sections.
4.1. Knowledge Point Difficulty Similarity Model Based on Student-System Interaction
By analyzing the student-system interactive behavior, the knowledge point difficulty similarity is measured only by the degree of student-system interaction that can cause dimension curse. The dimension of the interaction vector of knowledge points in the student-system interaction matrix
4.1.1. Knowledge Point Difficulty Similarity Based on the Degree of Student-System Interaction
In online learning, learners usually study each knowledge point video, making the SC matrix a dense matrix. Compared to other similarity approaches in dense matrices, the Adjusted Cosine similarity method can better measure the knowledge point difficulty similarity [23]. This is because learners have different learning preferences (different learning habits and learning foundations), which lead to different interactive behaviors for each learner at the same difficulty level of knowledge point. The Adjusted Cosine similarity method removes the effect of learner preference on the difficulty similarity between two knowledge points by measuring the angle between two decentered knowledge point vectors. Therefore, the knowledge point difficulty similarity measurement based on the degree of system interaction we choose is Adjusted Cosine, as given in the following formula:
4.1.2. Knowledge Point Difficulty Similarity Based on GDLPN
The GDLPN’s partial directed learning path diagram is shown in Figure 2. Nodes represent knowledge point videos. The direction of the edge represents the order in which learners study the knowledge point videos. The weight of the edge indicates the number of learning times. The In-Degree of nodes indicates the process of students repeatedly learning knowledge points in GDLPN. In addition, the forgetting curve proposed by Murre and Dros [12] suggests that the students’ interactive behaviors that are too far apart in the course learning sequence may be due to the student forgetting the content. To quantify this process, referencing the In-Degree centrality in directed weighted networks [24, 25], this paper integrates the weights of the In-Degree edges, the number of connected In-Degree nodes, and the forgetting distance of students to obtain In-Degree centrality of knowledge node, which is shown in the following formula:
[figure(s) omitted; refer to PDF]
In formula (9),
For example, in Figure 2, the set of directly connected In-Degree nodes with knowledge point 1 is
Based on the above analyses, the knowledge point difficulty similarity based on GDLPN is shown in the following formula:
4.2. Knowledge Point Difficulty Similarity Model Based on Interaction
The ST and SS matrix exist a sparsity problem due to the interaction environment and the teaching model. Traditional difficulty similarity calculation cannot be performed when two knowledge points do not have a co-learner [26, 27]. To solve this problem, we innovatively propose an improved JMSD [28] similarity model through the knowledge point popularity difference and the average interaction degree difference of knowledge points. The specific process is as follows:
Firstly, the popularity of knowledge point
Secondly, by analyzing teacher-student interaction and student-student interactive behaviors, we find that the smaller the value of
4.3. Measurement of the Difficulty Similarity of Knowledge Points
Considering this situation, communication between students and teachers and between students and students may not occur throughout the course. To solve this problem, the difficulty similarity between two knowledge points is obtained by weighting the combination of
4.4. Spectral Clustering Based on the Difficulty of Knowledge Points
Teachers determine the number of knowledge point difficulty clusters in the actual teaching. Assuming
Firstly, this paper constructs the knowledge point difficulty similarity matrix
Then, the eigenvector of the Laplacian matrix
The MIBKPC is shown in Table 1.
Table 1
Algorithm MIBKPC (pseudo-code).
Algorithm 1 MIBKPC |
Input: |
|
|
|
Output: |
(1): for each |
(2): According to Section 3.5, construct ( |
(3): for each |
(4): Calculate |
(5): Calculate |
(6): Calculate |
(7): end for |
(8):end for |
(9): for each |
(10): |
(11): end for |
(12): for each |
(13): for each |
(14): Calculate |
(15): Calculate |
(16): Calculate |
(17): Calculate |
(18): end for |
(19): Spectral clustering of |
(20): Return |
5. Experimental Results and Analysis
In this section, we evaluate the effectiveness of the proposed algorithm (MIBKPC). Firstly, MIBKPC is compared with commonly used classical methods for knowledge point difficulty classification accuracy. Secondly, we have examined the generalizability of the MIBKPC algorithm, which also obtains good classification results by relying only on the student-system interactive behavior. Thirdly, to verify the superiority of MIBKPC on knowledge point difficulty similarity calculation, other similarity methods are compared for clustering precision. Fourthly, the results of knowledge point difficulty classification for learners at different cognitive levels are analyzed. Finally, the relationship between the three interactive behaviors and the difficulty of knowledge points for learners at different cognitive levels is analyzed.
5.1. Data Sets
The data source was obtained from the interactive behavior data of 2019 students participating in the Data Structure and Algorithm course, which is a mandatory course for sophomores at a university. The experimental dataset consists of 207 knowledge point videos, 77,753 video-watching records of 362 students, 8422 text records of interactions between teachers and students, and 1463 text records of interactions between students and knowledge point test data.
5.2. Dataset Preprocessing
In part of the preprocessing of the experiment, learners who watched more than 10 seconds of each video were considered to have learned the knowledge point effectively, so we removed the records of students who watched each video for less than 10 seconds. Furthermore, after deleting the records, the watching records of students who viewed less than 1/3 of all videos are deleted, as well as the corresponding student-teacher interaction and student-student interaction records. After the data preprocessing, we retained 50,544 video-watching records of 272 students, 7683 text records of interactions between teachers and students, and 1252 text records of interactions between students. The duration of knowledge point videos is shown in Figure 3.
[figure(s) omitted; refer to PDF]
5.3. Experimental Evaluation
We use the external evaluation method to evaluate the clustering results [29], and the external evaluation method needs to obtain the real difficulty of knowledge points. We use
Students take a test after completing the course, we define the test score matrix
The average mastery degree of knowledge point is shown in the following formula:
In addition, a teacher pays more attention to the most difficult and easiest knowledge points for learners to optimize the content design. The effectiveness of the algorithm is measured by the set of the most difficult and easiest knowledge points [9]. The process is as follows:
First, all knowledge points to be ranked in order of
According to equation (19),
5.4. Experimental Settings
Based on the selected dataset and evaluation method, we used the proposed algorithm (MIBKPC) to obtain knowledge point clusters and the precision of the algorithm under different numbers of clusters
5.5. Experimental Results
In the first experiment, to verify the effectiveness of the proposed algorithm (MIBKPC), we compared it with the three commonly used classical methods. The three commonly used classical methods are defined as follows.
The first method is defined as MS, which sorts the knowledge points in ascending order by interaction degree of knowledge points
The second method is defined as MC, which uses K-means clustering algorithm to cluster the knowledge points based on
The third method is defined as MVC, which defines a 4-dimensional interaction feature vector
The knowledge points are classified by MIBKPC, MS, MC, and MVC under the conditions of K = 2, 3, and 5, respectively, and the classified results are evaluated to obtain the corresponding precision of the clustering
[figure(s) omitted; refer to PDF]
Figure 4 shows that for different
In the second experiment, we considered that some learning platforms do not provide functions for student-teacher interaction and student-student interaction, resulting in the inability to collect data on student-teacher interaction and student-student interaction. However, the MIBKPC algorithm requires three types of interactive behavior data to be applied. To verify the generalizability of the MIBKPC algorithm, we define the MIBKPC-SC algorithm which is a simplification of the MIBKPC algorithm. The MIBKPC-SC algorithm requires only the student-system interactive behavior data of learners to achieve knowledge point difficulty clustering. Firstly, MIBKPC-SC measures
[figure(s) omitted; refer to PDF]
From Figure 5, the clustering accuracy of MIBKPC-SC is similar to that of the MIBKPC algorithm when
In the third experiment, we verify the superiority of MIBKPC in the similarity calculation of knowledge point difficulty. MIBKPC constructs the similarity matrix of knowledge point difficulty from three similarity models (
Similarly, this paper compares with SimRank++ [9], RJMSD [26], ACOS [30], PCC [31], and JMSD [28] similarity methods to obtain the corresponding precision of the clustering. The experimental results are shown in Figure 6.
[figure(s) omitted; refer to PDF]
As shown in Figure 6, under the conditions of different K values, the clustering precision of MIBKPC is better than other similarity models. For the traditional similarity models, JMSD and ACOS have better accuracy, and PCC is worse. SimRank++ has better clustering accuracy than RJMSD. Due to their general application to student similarity calculations, PCC and RJMSD are less effective when calculating similarity of knowledge point difficulty. Moreover, SimRank++ and JMSD are based on the structural perspective to portray the difficulty similarity of knowledge points, which can have better clustering accuracy, but there is only co-interaction data that the two algorithms consider. Therefore, the proposed similarity model can more precisely portray the knowledge point difficulty similarity by making full use of the knowledge point interaction data and considering GDLPN and interaction degree.
In the fourth experiment, we analyze the influence of learners at different cognitive levels on the effectiveness of MIBKPC. We divide the dataset into three datasets (primary, intermediate, and advanced) according to the cognitive level of learners, and then obtain the clustering precision of MIBKPC algorithm in Figure 7.
[figure(s) omitted; refer to PDF]
In Figure 7, the clustering precision of MIBKPC decreases on the three datasets compared with the second experiment at different K values. The clustering precision of intermediate learners is the best. To better analyze the reasons for the decrease in clustering precision, this paper further combines GDLPN and knowledge point difficulty clustering results analysis of learners at different cognitive levels; GDLPN is shown in Figure 8. Table 2 shows the final knowledge point difficult clustering result.
[figure(s) omitted; refer to PDF]
Table 2
Knowledge point difficulty classification of learners at different cognitive levels.
Classification by difficult (K = 3) | |||
Number of learners | Knowledge point name | Number of knowledge points | |
Primary | 69 | 81 | |
Intermediate | 171 | 74 | |
Advanced | 32 | 71 |
In learning path networks, the red line represents the In-Degree edge and the green line represents the Out-Degree edge. From Figures 8(b) and 8(c), we find that intermediate learners and advanced learners repeatedly study the difficult knowledge points, and their interactive behavior data can reflect the difficulty of knowledge points, thus their clustering accuracy is better than that of primary learners. By comparing Figures 8(a) and 8(b), we can see that primary learners watch the knowledge video only once and rarely repeat the knowledge points, which leads to the interactive behaviors not reflecting their real learning effects, resulting in the lowest clustering accuracy. According to Table 2, we can see that advanced learners have the least difficulty with knowledge points, while primary learners have more difficulty than intermediate learners. We find that advanced learners have stronger learning abilities, so even if they encounter difficult knowledge points, they can understand them quickly, which results in fewer difficult knowledge points for advanced learners than for intermediate learners.
In the fifth experiment, based on the three data sets divided in the fourth experiment, we explore the relationship between three interactive behaviors and the difficulty of knowledge points for learners at different cognitive levels. According to formula (15),
[figure(s) omitted; refer to PDF]
From Figure 9(a), primary learners have the highest clustering accuracy when
6. Conclusions and Future Works
This paper proposes a difficulty-based knowledge point clustering algorithm using students’ multi-interactive behaviors. Firstly, we propose a knowledge point difficulty similarity model based on student-system interaction. The model innovatively combines interaction degrees and learning paths. Secondly, to solve the problem of the sparsity of student-teacher interaction and student-student interaction, we propose a knowledge point difficulty similarity model based on interactive behavior by using the full information of interaction data. Finally, the knowledge difficulty similarity matrix obtained by three types of interactive behavior is used to obtain the knowledge point difficulty classification using spectral clustering.
The proposed algorithm helps teachers understand the difficult knowledge points of learners for better optimization of teaching process design and teaching content. If an easy knowledge point is always classified into a difficult cluster and the difficulty level of the knowledge point is not the same as the teacher considered, teachers can consider whether the video of the knowledge point is not well explained or enhance the explanation of the knowledge point in class to better optimize the teaching process design and teaching content.
The proposed algorithm can be used with tiny sample datasets, which does not need data to be trained, and can also be applied to some learning platforms that store only student-system interactive behavior data. Based on the analysis of the experimental results, our proposed algorithm has better results in classifying the difficulty of knowledge points compared to other existing methods. If we can collect more student-teacher and student-student interaction data, as sometimes some students interact with each other through other platforms, the algorithm will better measure the difficulty similarity between knowledge points and thus improve the clustering accuracy.
The knowledge point difficulty classification method proposed in this paper is for groups of students. It does not provide individualized knowledge point difficulty classification for each student with different learning preferences. To achieve personalized education with multiple intelligences and improve learners’ learning effectiveness, future studies need to examine the relationship between the difficulty of knowledge points and individual learners. Therefore, we should examine whether learners’ learning behaviors contain more behavioral characteristics related to the difficulty of knowledge points. We should also develop prediction models for predicting learners’ knowledge difficulties, and recommend multiple intelligence learning strategies that meet each student’s needs.
Acknowledgments
This research was funded by the National Natural Science Foundation of China (62177012, 61967005, and 62267003), Innovation Project of GUET Graduate Education (2020YCXS022 and 2021YCXS033), and Key Laboratory of Cognitive Radio and Information Processing Ministry of Education (CRKL190107).
The symbols, their notations, and respective values are provided in Table 3 for clarity to readers.
Table 3
Symbols, semantics, and the respective values.
Symbols | Meaning and value |
Index for the learner that can have values as | |
Index of knowledge point that can have values as | |
The degree of student-system interaction associated with learner | |
The degree of student-teacher interaction associated with learner | |
The degree of student-student interaction associated with learner | |
The frequency of student-system interaction of student | |
The duration of student-system interaction of student | |
The frequency of pausing and dragging of student | |
Weighting factors associated with | |
Directed learning path network associated with learner | |
Group-directed learning path networks | |
The in-degree of nodes associated with knowledge point | |
Adjustable parameters associated with | |
The set of learners who have interacted with knowledge point | |
The popularity of knowledge point | |
The degree of interaction of learner | |
The difficulty similarity based on student-system interaction between knowledge point | |
The difficulty similarity based on the degree of student-system interaction between knowledge point | |
The difficulty similarity based on group-directed learning path networks between knowledge point | |
The difficulty similarity based on student-teacher interaction between knowledge point | |
The similarity of the difficulty based on student-student interaction between knowledge point | |
The difficulty similarity between knowledge point | |
Weighting factors associated with | |
Number of clusters. Here, we considered values as 2, 3, 5 | |
The average mastery degree of knowledge point associated with knowledge point | |
Interaction degree of knowledge point | |
Precision of the clustering |
[1] D. Turnbull, R. Chugh, J. Luck, "Transitioning to E-learning during the COVID-19 pandemic: how have higher education institutions responded to the challenge?," Education and Information Technologies, vol. 26 no. 5, pp. 6401-6419, DOI: 10.1007/s10639-021-10633-w, 2021.
[2] E. Verdú, M. J. Verdú, L. M. Regueras, J. P. de Castro, R. García, "A genetic fuzzy expert system for automatic question classification in a competitive learning environment," Expert Systems with Applications, vol. 39 no. 8, pp. 7471-7478, DOI: 10.1016/j.eswa.2012.01.115, 2012.
[3] M. Zhu, A. R. Sari, M. M. Lee, "A comprehensive systematic review of MOOC research: research techniques, topics, and trends from 2009 to 2019," Educational Technology Research & Development, vol. 68 no. 4, pp. 1685-1710, DOI: 10.1007/s11423-020-09798-x, 2020.
[4] C. Romero, S. Ventura, "Educational data mining and learning analytics: an updated survey," WIREs Data Mining and Knowledge Discovery, vol. 10 no. 3,DOI: 10.1002/widm.1355, 2020.
[5] T. Miyazoe, T. Anderson, "The interaction equivalency theorem," The Journal of Interactive Online Learning, vol. 9 no. 2, 2010.
[6] V. A. Romero-Zaldivar, A. Pardo, D. Burgos, C. Delgado Kloos, "Monitoring student progress using virtual appliances: a case study," Computers & Education, vol. 58 no. 4, pp. 1058-1067, DOI: 10.1016/j.compedu.2011.12.003, 2012.
[7] F. Ke, "Online interaction arrangements on quality of online interactions performed by diverse learners across disciplines," The Internet and Higher Education, vol. 16, pp. 14-22, DOI: 10.1016/j.iheduc.2012.07.003, 2013.
[8] N. Li, Ł. Kidziński, P. Jermann, P. Dillenbourg, "MOOC video interaction patterns: what do they tell us?," Design for Teaching and Learning in a Networked World, vol. 9307, pp. 197-210, DOI: 10.1007/978-3-319-24258-3_15, 2015.
[9] F. Zhang, D. Liu, C. Liu, "Difficulty-based SPOC video clustering using video-watching data," IEICE - Transactions on Info and Systems, vol. E104.D no. 3, pp. 430-440, DOI: 10.1587/transinf.2020edp7106, 2021.
[10] C. G. Brinton, S. Buccapatnam, M. Chiang, H. V. Poor, "Mining MOOC clickstreams: video-watching behavior vs. in-video quiz performance," IEEE Transactions on Signal Processing, vol. 64 no. 14, pp. 3677-3692, DOI: 10.1109/tsp.2016.2546228, 2016.
[11] F. Zhang, D. Liu, C. Liu, "Mooc video personalized classification based on cluster analysis and process mining," Sustainability, vol. 12 no. 7,DOI: 10.3390/su12073066, 2020.
[12] J. M. J. Murre, J. Dros, "Replication and analysis of Ebbinghaus’ forgetting curve," PLoS One, vol. 10 no. 7,DOI: 10.1371/journal.pone.0120644, 2015.
[13] A. Dutt, M. A. Ismail, T. Herawan, "A systematic review on educational data mining," IEEE Access, vol. 5, pp. 15991-16005, DOI: 10.1109/access.2017.2654247, 2017.
[14] F. V. D. Sluis, J. Ginn, T. V. D. Zee, "Explaining student behavior at scale: the influence of video complexity on student dwelling time," Proceedings of the third (2016) acm conference on learning@ scale, pp. 51-60, .
[15] H. Zhu, Y. Liu, F. Tian, Y. Ni, K. Wu, Y. Chen, Q. Zheng, "A cross-curriculum video recommendation algorithm based on a video-associated knowledge map," IEEE Access, vol. 6, pp. 57562-57571, DOI: 10.1109/access.2018.2873106, 2018.
[16] Z. Kastrati, A. S. Imran, A. Kurti, "Integrating word embeddings and document topics with deep learning in a video classification framework," Pattern Recognition Letters, vol. 128, pp. 85-92, DOI: 10.1016/j.patrec.2019.08.019, 2019.
[17] E. H. Othman, S. Abdelali, E. B. Jaber, "Education data mining: mining MOOCs videos using metadata based approach," Proceedings of the 2016 4th IEEE International Colloquium on Information Science and Technology (CiSt), pp. 531-534, .
[18] S. Zhaoyu, W. Yiru, C. Pan, Z. Huibing, "Personalized knowledge map recommendations based on interactive behavior preferences," International Journal of Performability Engineering, vol. 17 no. 1, pp. 36-49, DOI: 10.23940/ijpe.21.01.p4.3649, 2021.
[19] Z. Shou, X. Lu, Z. Wu, H. Yuan, H. Zhang, J. Lai, "On learning path planning algorithm based on collaborative analysis of learning behavior," IEEE Access, vol. 8, pp. 119863-119879, DOI: 10.1109/access.2020.3005793, 2020.
[20] J. Zhang, X. Shao, W. Zhang, J. Na, "Path-following control capable of reinforcing transient performances for networked mobile robots over a single curve," IEEE Transactions on Instrumentation and Measurement, vol. 1,DOI: 10.1109/tim.2022.3201930, 2022.
[21] D. Wang, Y. Yih, M. Ventresca, "Improving neighbor-based collaborative filtering by using a hybrid similarity measurement," Expert Systems with Applications, vol. 160,DOI: 10.1016/j.eswa.2020.113651, 2020.
[22] X. Shao, Y. Shi, W. Zhang, "Input-and-Measurement event-triggered output-feedback chattering reduction control for MEMS gyroscopes," IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 52 no. 9, pp. 5579-5590, DOI: 10.1109/tsmc.2021.3129275, 2022.
[23] B. Sarwar, G. Karypis, J. Konstan, J. Riedl, "Item-based collaborative filtering recommendation algorithms," Proceedings of the 10th international conference on World Wide Web, pp. 285-295, .
[24] T. Opsahl, F. Agneessens, J. Skvoretz, "Node centrality in weighted networks: generalizing degree and shortest paths," Social Networks, vol. 32 no. 3, pp. 245-251, DOI: 10.1016/j.socnet.2010.03.006, 2010.
[25] W. Zhang, X. Shao, W. Zhang, J. Qi, H. Li, "Unknown input observer-based appointed-time funnel control for quadrotors," Aerospace Science and Technology, vol. 126,DOI: 10.1016/j.ast.2022.107351, 2022.
[26] S. Bag, S. K. Kumar, M. K. Tiwari, "An efficient recommendation generation using relevant Jaccard similarity," Information Sciences, vol. 483, pp. 53-64, DOI: 10.1016/j.ins.2019.01.023, 2019.
[27] X. Shao, J. Zhang, W. Zhang, "Distributed cooperative surrounding control for mobile robots with uncertainties and aperiodic sampling," IEEE Transactions on Intelligent Transportation Systems, vol. 23 no. 10, pp. 18951-18961, DOI: 10.1109/tits.2022.3189201, 2022.
[28] J. Bobadilla, F. Serradilla, J. Bernal, "A new collaborative filtering metric that improves the behavior of recommender systems," Knowledge-Based Systems, vol. 23 no. 6, pp. 520-528, DOI: 10.1016/j.knosys.2010.03.009, 2010.
[29] X. Wang, L. Nie, X. Song, D. Zhang, T. S. Chua, "Unifying virtual and physical worlds: learning toward local and global consistency," ACM Transactions on Information Systems, vol. 36 no. 1,DOI: 10.1145/3052774, 2018.
[30] H. J. Ahn, "A new similarity measure for collaborative filtering to alleviate the new user cold-starting problem," Information Sciences, vol. 178 no. 1, pp. 37-51, DOI: 10.1016/j.ins.2007.07.024, 2008.
[31] Y. Shi, M. Larson, A. Hanjalic, "Collaborative filtering beyond the user-item matrix: a survey of the state of the art and future challenges," ACM Computing Surveys, vol. 47 no. 1,DOI: 10.1145/2556270, 2014.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright © 2022 Zhaoyu Shou et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0/
Abstract
To improve learners’ performance in online learning, a teacher needs to understand the difficulty of knowledge points learners of different cognitive encounter levels in the learning process. This paper proposes a difficulty-based knowledge point clustering algorithm based on collaborative analysis of multi-interactive behaviors. Firstly, combining the group-directed learning path network, forgetting factors and the degree of student-system interaction, we propose a measurement model to calculate the similarity of the difficulty between knowledge points on student-system interactive behavior. Secondly, to solve the data sparsity problem of interaction, we propose an improved similarity model to calculate the similarity of the difficulty between knowledge points on student-teacher and student-student interactive behavior. Finally, the knowledge point difficulty similarity matrix is obtained by integrating the difficulty similarity of knowledge points obtained from student-system interactive behavior, student-teacher interactive behavior, and student-student interactive behavior. The spectral clustering algorithm is used to achieve knowledge point difficulty classification based on the obtained similarity matrix. The experiments on real datasets show that the proposed method has better knowledge point difficulty classification results than the existing methods.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details





1 School of Information and Communication, Guilin University of Electronic Technology, Guilin 541004, China
2 School of Computer and Information Security, Guilin University of Electronic Technology, Guilin 541004, China