ArticlePDF Available

Learning analytics using deep learning techniques for efficiently managing educational institutes

December 2021
Materials Today Proceedings 51(4)

December 2021
51(4)

DOI:10.1016/j.matpr.2021.11.416

Authors:

Indrajit Patra

Mohd Naved

Jaipuria Institute of Management

Show all 7 authorsHide

Increasing numbers of higher education institutions see themselves as service providers, catering primarily to the needs of its students. The improvement of student performance is a top priority for universities. It is critical to first assess the present situation of the students before designing a program to improve their performance. Higher education administrators face a significant problem in predicting a student's future success. The goal of this study is to learn what factors influence college students' decision on a major. It will be possible to forecast students' behavior, attitudes, and performance with the use of predictive tools and procedures. Predicting student performance ahead of time makes it possible to take proactive measures to raise achievement levels. To obtain a high education standard, several attempts have been made to forecast student performance. However the accuracy of these predictions falls short of the desired level of excellence. Machine learning approaches including Artificial Neural Network, Nave Bayes, and SVM are being studied. A University Data Set from UCI Machinery is used in the experimental investigation.

Content uploaded by Shehab Beram

Content may be subject to copyright.

Learning analytics using deep learning techniques for efﬁciently

managing educational institutes

Ravi Kishore Veluri

, Indrajit Patra

, Mohd Naved

, Veduri Veera Prasad

, Myla M. Arcinas

Shehab Mohamed Beram

, Abhishek Raghuvanshi

Aditya Engineering College(A), Surampalem, India

An Independent Researcher, PhD from NIT Durgapur, West Bengal, India

Department of Business Analytics, Jagannath University, Delhi-NCR, India

Associate Professor, Behavioral Science Department, De La Salle University, Philippines

Research Scholar, Department of Computing and Information Systems, Sunway University, Malaysia

Department of Computer Engineering, Mahakal Institute of Technology, Ujjain, India

article info

Article history:

Available online xxxx

Keywords:

Learning analytics

Deep learning

Educational data mining

Machine learning

Student performance

Classiﬁcation

Prediction

abstract

Increasing numbers of higher education institutions see themselves as service providers, catering primar-

ily to the needs of its students. The improvement of student performance is a top priority for universities.

It is critical to ﬁrst assess the present situation of the students before designing a program to improve

their performance. Higher education administrators face a signiﬁcant problem in predicting a student’s

future success. The goal of this study is to learn what factors inﬂuence college students’ decision on a

major. It will be possible to forecast students’ behavior, attitudes, and performance with the use of pre-

dictive tools and procedures. Predicting student performance ahead of time makes it possible to take

proactive measures to raise achievement levels. To obtain a high education standard, several attempts

have been made to forecast student performance. However the accuracy of these predictions falls short

of the desired level of excellence. Machine learning approaches including Artiﬁcial Neural Network, Nave

Bayes, and SVM are being studied. A University Data Set from UCI Machinery is used in the experimental

investigation.

Selection and peer-review under responsibility of the scientiﬁc committee of the International Confer-

ence on Advances in Materials Science

1. Introduction

Data mining [1 2] aids companies in discovering and under-

standing hidden patterns in large collections by leveraging their

existing reporting skills. These patterns are then included into data

mining algorithms, which are then used to correctly forecast indi-

vidual behavior. In light of this knowledge, organizations are better

able to allocate resources and employees. By using data mining to

anticipate how many students will enroll in a course ahead of time,

an institution can take proactive measures before a student drops

out. Data mining can also help an institution in better allocation of

resources by accurately predicting the probable number of stu-

dents in a particular course.

This study examines data mining’s capabilities and potential

applications in higher education. Data mining uses a combination

of explicit knowledge, powerful analytical capabilities, and subject

experience to uncover hidden trends and patterns. Prediction mod-

els [345] build on these trends and patterns to make new observa-

tions based on the present data. To discover new patterns, trends,

and correlations in large amounts of data stored in repositories, a

method utilizing pattern recognition technology and statistical

and mathematical methodologies is required. In order to do data

mining on very big or raw datasets, either supervised or unsuper-

vised data mining methods should be used. It’s important to

remember that no data mining can be done without interacting

with unitary data ﬁrst. Machine learning is used in many real

world areas [67].

EDM [89] converts raw data into useful facts and knowledge for

various educational contexts by using it as input. This information

may assist educational policymakers, school administrators,

instructors, and students in making well-informed decisions about

how to manage and utilize educational resources. With data-

driven decision-making, current educational practices and learning

resources can be improved. As educational information systems

https://doi.org/10.1016/j.matpr.2021.11.416

Selection and peer-review under responsibility of the scientiﬁc committee of the International Conference on Advances in Materials Science

Materials Today: Proceedings xxx (xxxx) xxx

Contents lists available at ScienceDirect

Materials Today: Proceedings

journal homepage: www.elsevier.com/locate/matpr

Please cite this article as: Ravi Kishore Veluri, I. Patra, M. Naved et al., Learning analytics using deep learning techniques for efﬁciently managing educa-

tional institutesgiven names and surnames to make sure that we have identiﬁed them correctly and that they are presented in the desired order. Carefully

(EIS) have evolved, a vast amount of student data has been avail-

able. This demonstrates the necessity of employing EDM to exam-

ine the learning habits of pupils. EDM aids in the accurate

evaluation of educational institutions in order to get the most

out of the available learning resources.

EDM makes use of research data to identify educational prob-

lems and propose solutions. It tries to look for patterns in curricu-

lum, learning behavior, and student family data from various

educational institutions that have yet to be uncovered. With the

EDM, we hope to gain a better understanding of the current causes

and effects of schooling.

2. Investigation of machine learning and deep learning

techniques for analytics and prediction

Predictive models are developed using classiﬁcation and regres-

sion techniques in supervised learning. Support vector machines

(SVM), naive Bayes classiﬁcation approaches, decision trees, closest

neighbor, logistic regression, discriminate analysis, and neural net-

works are examples of traditional classiﬁcation algorithms.

To anticipate the outcome of unlabeled datasets, unsupervised

learning is used. Clustering is the most used unsupervised learning

technique. Some simple clustering approaches include the K-

means clustering algorithm, K-means clustering algorithms, hier-

archical clustering, and hidden Markova models. There are numer-

ous supervised and unsupervised learning algorithms; however

their application differs greatly from one circumstance to the next.

As a result, choosing the right machine learning algorithm yields

superior results in prediction and classiﬁcation procedures. Choos-

ing the correct algorithm, on the other hand, is frequently a chal-

lenging undertaking. As a result, the article was thoroughly

examined in order to determine the one that best suited the clas-

siﬁcation algorithm for the heart disease prediction system. Classi-

ﬁcation is the procedure in predictive analysis that accurately

classiﬁes the provided input data and maps it to their correspond-

ing classes. There are two sorts of data: labeled data and unlabeled

data. The labeled data contains a large number of predictor quali-

ties as well as a single target attribute. The class label is denoted by

each value of the target characteristics. The predictor attributes are

the only ones in the unlabeled attributes. The primary goal of the

classiﬁcation process is to accurately predict the class of unlabeled

data using classiﬁcation models developed from labeled instances

(historical data). To begin, a training model is constructed for

which the corresponding class (or target values) is known. The

training data model provides a summary of the relationship

between the data components. This training data model is used

to forecast target values when the target values are unknown.

The predicted values are then compared to the known values or

labeled data to determine the classiﬁcation process’s accuracy. This

method is known as data model testing, and the data utilized for

testing is referred to as test data or evaluation data. It assesses

the predictability of the process [9].

A root node, branches, and leaf nodes are all present in each

decision tree. The root node is at the top, while the remaining

nodes are leaf or branch nodes. The decision rule or test is imposed

by the internal node on one or more properties of the provided

data. The output is deﬁned by the branch node. Decision trees

are a well-known classiﬁcation approach since they do not require

any prior knowledge of data distribution. Furthermore, it performs

well with noisy and ambiguous data.

J. Ross Quinlan, a researcher, developed the ID-3 algorithm

(Iterative Dichotomiser-3), which is the ﬁrst evolved decision

tree-based system. This algorithm is based on entropy and infor-

mation gain measurements. The original dataset starts with a base

nodule and computes the entropy measure of the functional char-

acteristics for each iteration. The attribute with the lowest error

rate (entropy) and most information gain is chosen as a split attri-

butes, and the dataset is split to generate the subset of attributes

based on it. Unless the algorithm is accurately classiﬁed to its tar-

get classes, it is recursively repeated on every subset of data. The

decision tree is built with a non terminal node, and the terminal

nodes are deﬁned by the branch’s ﬁnal subset. The non terminal

node is deﬁned by the split attribute, while the class labels are rep-

resented by the terminal node. It employs the ID-3-based decision

tree model developed to efﬁciently classify and predict heart prob-

lems at an early stage. Using well-known decision tree techniques

such as CART and ID-3, a prediction model with a large health data-

set is developed. For validation, a 10-fold cross validation approach

is utilized. The results show that utilizing decision tree classiﬁca-

tion approaches, an accurate and efﬁcient model of prediction

models may be developed. The ID-3 algorithm gives better out-

comes with less datasets and better computation measures. How-

ever, when dealing with continuous and massive data sets, such as

electronic health records, it becomes computationally expensive,

lowering the performance metric. Another important disadvantage

of ID-3-based decision tree classiﬁcation algorithms is model over

ﬁtting, which gradually reduces the accuracy of the health data

categorization process [10].

Author in [11] describe the decision tree classiﬁcation model.

First, a classiﬁcation model is created that accurately classiﬁes data

instances. It employs a top-down approach to categorization, mov-

ing from the root node to the leaf nodes. It chooses an efﬁcient

attribute and divides the given data into a subset of datasets based

on entropy measures. For the decision-making process, the attri-

bute with the highest normalized information gain is used. It is

computationally efﬁcient since it prunes branches with unneces-

sary features. It also handles missing values and properties with

numeric values well. When dealing with numeric qualities, the

decision tree becomes more difﬁcult.

According to [12], the decision tree C-5 is an enhanced version

of the C4.5 method that was designed to alleviate the limitations of

the C4.5 algorithm. It allows for faster computing tasks while

requiring less store space and tree size. It allows for boosting and

automatically removes qualities that aren’t needed for the classiﬁ-

cation process. It employs a search constraint over the training

dataset. An independent test set is used to validate the ﬁnal classi-

ﬁcation results. The use of association rules efﬁciently links the

cardiac risk factor to illness severity assessments. The C-5 decision

tree methodology decreases the number of association rules while

increasing accuracy.

In his article [13] describe that classiﬁcation and regression tree

is a well-known method of decision tree based classiﬁer. Every

base nodule represents a distinct input as well as distinct base

points across the variable. It is assumed that the input attribute

value is a single digit. The output variable, represented by the leaf

node, is used for prediction. It is based on discriminate analysis and

creates a statistical model to categories the dataset with greater

accuracy. It is effective on both categorical and continuous attri-

butes. In his research, [14] proposed that an ensemble learning

approach conducts classiﬁcation and regression operations. During

the training phase, it builds a large number of decision trees and

uses regression methods to predict the outcomes of the individual

trees. It has a low variance and quickly links the various aspects of

the given data for prediction purposes. The reason for the initial

lack of commitment to this technique is that the random forest

classiﬁcation algorithms are difﬁcult to interpret.

Authors [13] explain in their study that information processing

is carried out by highly linked neurons. This method is widely used,

and its main applications include pattern recognition and data

classiﬁcation. It creates the network by connecting nodes, which

are referred to as neurons. The process of signal transduction from

Ravi Kishore Veluri, I. Patra, M. Naved et al. Materials Today: Proceedings xxx (xxxx) xxx

one neuron to another is accomplished through the usage of con-

necting nodes. The artiﬁcial neural networks’ input signals are real

numbers, while the output units are nonlinear intakes. The

weighted edge size gradually raises or decreases the signal

strength at the associated edges. At the nodes, a preset threshold

value is set, and neurons can only send signals if they are greater

than or equal to the ﬁxed threshold value. Artiﬁcial neurons are

typically depicted in a layered fashion. Each layer applies its own

set of transformations to the inputs provided. Signals often move

from the ﬁrst to the last layer by traversing the middle layers

numerous times. The primary notion behind artiﬁcial neural net-

work approaches is that they are used to solve issues in the same

way that human brains do. Medical diagnosis, video and image

identiﬁcation are only a few of the important uses of artiﬁcial neu-

ral networks.

A K-nearest neighbor technique [11] is the most robust algo-

rithm utilized in pattern recognition and data categorization oper-

ations. The distance functions or similarity measure are the

fundamental concept underlying K-nearest neighbor algorithms.

It saves the state of all instances and uses the similarity metric

to classify freshly deﬁned instances. For efﬁcient categorization

procedures, it employs the instance-based learning method. A

new instance of the dataset is categorized depending on the major-

ity of votes cast by its adjacent classes. For both the training and

test datasets, the distance measure is computed. The algorithm’s

initial step is to select a value for k and calculate the distance

between the instances using the k value.

SVM is a binary linear classiﬁcation algorithm that is not prob-

abilistic. It creates a training model that categorizes the samples

into one or more target classes. The data objects are represented

in space as points. The objects of different categories are separated

by a visible gap, causing its width to expand. The target classes of

the new instances are mapped based on which side of the gap they

land on. When the input datasets are not labeled, the support vec-

tor machine also supports non-linear classiﬁcation. Because there

are no target classes to which the instances can be mapped, the

support vector machine uses an unsupervised learning approach

to categories the data. After clusters are built based on functions,

new instances are added to them. The paper presents an effective

model-based recommendation system based on non-linear sup-

port vector machine [15]. Non-linear support vector machine tech-

niques are the most extensively used methodology for dealing with

unlabeled data, and they are employed in a variety of industrial

applications.

3. Framework for performing learning analytics of student data

Fig. 1 depicts a system for student data categorization and per-

formance prediction. A student data collection is utilized as input

in this approach. The data set has been preprocessed. The data his-

tograms are equalized in conjunction with wavelet de noising. The

primary advantage of this approach is that it not only equalizes the

histogram but also adjusts for information loss. Principal compo-

nent analysis is used to extract features. The classiﬁcation step’s

goal is to identify the student’s category on the basis of analytics

Preprocessing of Data Set

by Histogram Equalization

Input Data Set

Classification

SVM,

Naive Bayes

ANN

Feature Extraction using

PCA

Prediction of Student

Performance

Classification

Result

Fig. 1. Framework for Student Performance Classiﬁcation and Prediction.

100

ANN SVM Naïve Bayes

Accuracy in %

Fig. 2. Accuracy Results of Classiﬁcation Algorithms for University Data Set.

Ravi Kishore Veluri, I. Patra, M. Naved et al. Materials Today: Proceedings xxx (xxxx) xxx

of features supplied in the data set. In machine learning, there are

several classiﬁers available, including Naive Bayes, Support Vector

Machine (SVM), and Artiﬁcial neural network (ANN).

4. Results

University data set [16] is used for experimental study. This

data set consists of 285 instances. This data set contains seventeen

attributes. The accuracy of classiﬁcation achieved by different

machine learning algorithms is shown below in Fig. 2.

5. Conclusion

Predicting a student’s future success is a huge challenge for

higher education administration. The purpose of this research is

to discover what factors inﬂuence college students’ major selec-

tion. With the application of predictive tools and techniques, it will

be feasible to forecast students’ behavior, attitudes, and perfor-

mance. Predicting student performance in advance allows for

proactive efforts to enhance achievement levels. Several attempts

have been made to forecast student performance in order to

achieve a high education standard. The accuracy of these forecasts,

however, falls short of the anticipated level of excellence. This

paper offered a framework based on machine learning for doing

learning analytics on university student data. The supplied data

collection is also subjected to data preparation. Machine learning

methods such as Artiﬁcial Neural Network, Nave Bayes, and SVM

are being investigated. The experimental inquiry makes use of a

University Data Set from UCI Machinery. The experimental results

have proved that the accuracy of artiﬁcial neural network method

is better as far as classiﬁcation of student data is concerned.

Declaration of Competing Interest

The authors declare that they have no known competing ﬁnan-

cial interests or personal relationships that could have appeared

to inﬂuence the work reported in this paper.

References

[1] L. Ji, X. Zhang, L. Zhang, Research on the Algorithm of Education Data Mining

Based on Big Data, in: 2020 IEEE 2nd International Conference on Computer

Science and Educational Informatization (CSEI), 2020, pp. 344–350, https://doi.

org/10.1109/CSEI50228.2020.9142529.

[2] A. Aleem, M.M. Gore, Educational Data Mining Methods: A Survey, in: 2020

IEEE 9th International Conference on Communication Systems and Network

Technologies (CSNT), 2020, pp. 182–188, https://doi.org/10.1109/

CSNT48778.2020.9115734.

[3] A. Hicham, A. Jeghal, A. Sabri, H. Tairi, A Survey on Educational Data Mining

[2014-2019], International Conference on Intelligent Systems and Computer

Vision (ISCV) 2020 (2020) 1–6, https://doi.org/10.1109/

ISCV49265.2020.9204013.

[4] S. Kovalev, A. Kolodenkova, E. Muntyan, Educational Data Mining: Current

Problems and Solutions, V International Conference on Information

Technologies in Engineering Education (Inforino) 2020 (2020) 1–5, https://

doi.org/10.1109/Inforino48376.2020.9111699.

[5] N. Khodeir, Student Modeling Using Educational Data Mining Techniques, in:

2019 6th International Conference on Advanced Control Circuits and Systems

(ACCS) & 2019 5th International Conference on New Paradigms in Electronics

& information Technology (PEIT), 2019, pp. 7–14, https://doi.org/10.1109/

ACCS-PEIT48329.2019.9062874.

[6] Ravi Manne, Snigdha Kantheti, Sneha Kantheti, Classiﬁcation of Skin cancer

using deep learning, Convolutional Neural Networks - Opportunities and

vulnerabilities- A systematic Review, Int. J. Modern Trends Sci. Technol., ISSN:

2455-3778, 06 (11) (2020) 101–108. https://doi.org/10.46501/IJMTST061118.

[7] R. Manne, Machine learning techniques in drug discovery and development,

Int. J. Appl. Res. 7 (4) (2021) 21–28.

[8] M.M. Arcinas, A Blockchain Based Framework for Securing Students

Educational Data, Linguistica Antverpiensia 2 (2021) 4475–4484.

[9] Myla M. Arcinas, Guna Sekhar Sajja, Shazia Asif, Sanjeev Gour, Ethelbert

Okoronkwo, The Role of Data Mining in Education for Improving Students

Performance for Social Change, Turkish J. Physiotherapy Rehabil., ISSN 2651-

4451, 32 (3) 6519–6526.

[10] S. Shrestha, M. Pokharel, Machine Learning algorithm in educational data,

Artif. Intell. Transform. Busin. Soc. (AITB) 2019 (2019) 1–11, https://doi.org/

10.1109/AITB48515.2019.8947443.

[11] B. Hssina, A. Merbouha, H. Ezzikouri, M. Erritali, A comparative study of

decision tree ID3 and C4.5, Int. J. Adv. Comp. Sci. Appl. 4 (2) (2014), https://doi.

org/10.14569/issn.2156-557010.14569/SpecialIssue.2014.040203.

[12] L.M. Crivei, G. Czibula, G. Ciubotariu, M. Dindelegan, Unsupervised learning

based mining of academic data sets for students’ performance analysis, in:

2020 IEEE 14th International Symposium on Applied Computational

Intelligence and Informatics (SACI), 2020, pp. 000011–000016, https://doi.

org/10.1109/SACI49304.2020.9118835.

[13] B. Al Breiki, N. Zaki, E.A. Mohamed, Using Educational Data Mining Techniques

to Predict Student Performance, in: 2019 International Conference on

Electrical and Computing Technologies and Applications (ICECTA), 2019, pp.

1–5, https://doi.org/10.1109/ICECTA48151.2019.8959676.

[14] T.O. Olaleye, O.R. Vincent, A Predictive Model for Students’ Performance and

Risk Level Indicators Using Machine Learning, 2020 International Conference

in Mathematics, Computer Engineering and Computer Science (ICMCECS),

2020, pp. 1–7, http://doi.org/10.1109/ICMCECS47690.2020.240897.

[15] X. Yang, M. Li, Y. Zhang, J. Ning, Cost-sensitive naive bayes classiﬁcation of

uncertain data, J. Sci. World 9 (8) (2014) 1897–1904.

[16] http://archive.ics.uci.edu/ml/datasets/university.

Ravi Kishore Veluri, I. Patra, M. Naved et al. Materials Today: Proceedings xxx (xxxx) xxx

Identifying Student Behavior Patterns Based on LMS Data

Article

Full-text available

Mar 2024

Sam Levin

In contemporary education, data analysis methods to establish patterns of student behavior for subsequent optimization of the educational process are becoming increasingly relevant. This paper presents research aimed at identifying patterns of student behavior based on the processing of data obtained from Learning Management Systems (LMS). The article examines data collection and analysis issues from LMS, including activity logs, grades, participation in forums, and other interactive elements. Additionally, methods such as statistical analysis and machine learning, applied to identify patterns of student behavior, are discussed. The text describes the identified patterns of student behavior in the electronic educational environment, which are subsequently linked to students' academic performance levels. The paper's concluding section presents the research findings and potential scenarios for their application. Key Words: adaptive learning, machine learning, learning management systems, student behavior patterns.

An Efficient Deep Learning Approach for Prediction of Student Performance Using Neural Network

Article

Full-text available

Dec 2023

In recent years, schools have shown interest in utilizing data mining to improve the quality of education. To enhance academic performance, accurately predicting how students will perform in their classes is crucial, which is essential for their progress in further education. Some students encounter challenges upon entering higher education, and predicting their performance early on is vital to keeping them on the right track. Our research aims to assess student performance using various classification strategies to identify the most accurate one. We utilize a Kaggle dataset for this study. Initially, we clean up the dataset by removing duplicate records and filling in any missing information. Subsequently, we apply six different classifiers, including Neural Networks and methods such as Random Forest and Support Vector Machine, utilizing the Weka tool. Additionally, we employ Principal Component Analysis (PCA) to extract optimized features that enhance model accuracy. We evaluate all models on Training and Testing splits, as well as the 10-K Fold options provided by the Weka tool. Finally, we calculate Training Accuracy, Testing Accuracy, Precision, Recall, and F1-Score for each model and compare their results. Notably, Neural Networks and Random Forest demonstrate superior results compared to other models.

A data-driven precision teaching intervention mechanism to improve secondary school students’ learning effectiveness

Article

Full-text available

Nov 2023
Educ Inform Tech

The continuous development of Educational Data Mining (EDM) and Learning Analytics (LA) technologies has provided more effective technical support for accurate early warning and interventions for student academic performance. However, the existing body of research on EDM and LA needs more empirical studies that provide feedback interventions, and more attention should be paid to primary and secondary school students. This study proposed a data-driven precision teaching intervention mechanism combining EDM and LA technologies. The proposed mechanism aims to assist teachers in predicting students’ academic performance and implementing corresponding interventions. This approach enables early warnings and reminders for students in crisis, and offers teaching assistance and support tailored to students at different levels. A quasi-experimental design was employed to examine the impact of the data-driven precision teaching intervention mechanism on secondary school students’ learning outcomes. A total of 142 seventh-grade students participated in the intervention experiment, with an experimental group (50) receiving the data-driven precision teaching intervention, control group2 (48) receiving a group intervention stratified by teacher experience, and control group1 (44) receiving a traditional group intervention. Posttest data were collected after three rounds of intervention. Compared to the two control groups, students in the experimental group demonstrated superior academic achievement, intrinsic motivation, self-efficacy, and meta-cognitive awareness. These findings indicate that the data-driven precision teaching intervention approach positively impacted students’ academic development, and effectively promoted their personalized learning. The findings provide pedagogical insights into the application of EDM in conjunction with LA prediction and actionable interventions.

Reducing dropout rate through a deep learning model for sustainable education: long-term tracking of learning outcomes of an undergraduate cohort from 2018 to 2021

Article

Full-text available

Oct 2023

In recent years, initiatives and the resulting application of precision education have been applied with increasing frequency in Taiwan; the accompanying discourse has focused on identifying potential applications for artificial intelligence and how to use learning analytics to improve teaching quality and learning outcomes. This study used the established dropout risk prediction model to improve student learning effectiveness. The model was based on the academic portfolios of past students and built with statistical learning and deep learning methods. This study used this model to predict the dropout risk of 2205 freshmen enrolled in the fall semester of 2018 (graduated in June 2022) in the field of sustainable education. A total of 176 students with a dropout risk of more than 20% were considered high-risk students. After tracking and the appropriate guidance, the dropout risk of 91 students fell from > 20% to < 20%. To discuss the results from the perspective of gender and financial disadvantages, the improvement rate of the dropout risk for male students was 10.2% better than that of female students at 2.9%. The improvement rate in dropout risk for students with disadvantageous financial situations was as high as 12.0%, surpassing the 5.9% rate among general students. Overall, the dropout rate in the second year of the 2018 freshman cohort was lower than that of the 2016 and 2017 freshman cohorts. A predictive model established by statistical learning and deep learning methods was used as a tool to promote precision education, accurately and efficiently identifying students who are having difficulty learning, as well as leading to a better understanding of AI (artificial intelligence) in smart learning for sustainable education.

Personalized learning efficiency data analysis based on multi-scale convolution architecture and hybrid loss

Article

Full-text available

Oct 2023
NEURAL COMPUT APPL

Personalized learning has gained significant attention in education as a means to cater to the diverse needs of learners and optimize educational outcomes. However, ensuring the efficiency of personalized learning remains a challenge. It requires the ability to accurately analyze and interpret vast amounts of data collected from learners. Traditional analytical approaches often struggle to handle the complexity and heterogeneity of this data, limiting the potential for personalized learning interventions. To address these challenges, this paper proposes a personalized learning efficiency data analysis network (PLEDANet) based on machine learning. First, PLEDANet redesigns a convolutional neural network based on the ResNet structure. The network performs convolutions using multiple convolution kernels of different scales to extract diverse feature information from personalized learning efficiency data. To enhance the extraction and representation of fine-grained differentiated features, PLEDANet introduces a hybrid attention module to combine channel and spatial information among feature maps. Second, PLEDANet designs a hybrid loss function for model training, which consists of the AM-softmax loss and the Center loss. The former increases the inter-class distance of features by imposing a fixed angular margin, while the latter reduces the intra-class distance by constraining the samples and feature centers. Finally, extensive experiments are conducted on PLEDANet. The experimental results validate the superiority of PLEDANet for personalized learning efficiency analysis.

Enhanced Quantum Prediction of Molecular Crystals Based on Machine Learning Technique

Chapter

Full-text available

Apr 2024

Bibliometric and Visual Insights Into Higher Education Informatization:

Article

Full-text available

Jan 2024
Int J Inform Comm Tech Educ

Higher education informatization (HEI) is an interdisciplinary field that examines the use and integration of information and communication technologies (ICTs) in higher education. This paper provides a bibliometric and visual analysis of the research trends, patterns, and topics in this field. Using the Web of Science database, the authors selected and analyzed 199 SCI and SSCI papers on HEI published from 2000 to 2023 by VOSviewer and CiteSpace software. The results indicate that the publication volume of HEI research has grown significantly in recent years. The author network shows the collaboration and contribution of different researchers and institutions, while the journal network reveals the multidisciplinary nature and scope of the field. The keyword network and the burst keyword analysis identify the main research themes and the emerging hot topics in HEI. The co-citation network of sources illustrates the theoretical and methodological foundations and influences of the field. The paper concludes with some implications and suggestions for future HEI research.

Machine Learning Applications in Higher Education Services: Perspectives of Student Academic Performance

Chapter

Feb 2024

Artificial Intelligence (AI) offers new technical reality in driving innovations across service sectors in areas of human societies. The application of AI has shown a profound impact for transforming key practices, including in the Higher Education (HE) sector. The presence of AI in the HE sector is already widely acknowledged for playing a pivotal role and potentiality in transforming process, policy and practices of the HE landscape. This chapter presents a contemporary overview of Artificial Intelligence applications to provide top trends in current AI in HE literature. The main objective of this chapter is to identify emerging directions of current AI applications in HE from the latest and existing studies in this domain. We studied and analyzed 10 existing case studies as representative samples of AI in HE. The analysis provided insights about the current trend and revealed the topic areas and themes that would be of paramount significance for HE stakeholders.

Integrated Model for Asynchronous Learning and Predictive Analytics for Enhanced Learner Experience

Chapter

Full-text available

Dec 2023

Organizations of every size and industry are embracing data analytics to extract insightful information from the data with the goal of enhancing decision-making, boosting productivity, streamlining workflows, and reducing costs. The adoption of data, more especially learning analytics, which is the act of obtaining, measuring, and analyzing data on learners in order to better understand their requirements and learning preferences, is an excellent possibility for the education industry. Predictive learner analytics can be used to gain a variety of insights that can help institutions improve learner retention, engagement, and performance measurement, particularly in asynchronous online learning where students feel helpless and distracted. In light of this, the chapter suggests a methodology and model for integrating asynchronous learning and predictive analytics.

KNIGHT Learning Analytics Architecture for Betterment of Student Education

Chapter

Nov 2023

Artificial intelligence has been revolutionizing education analytics and improving student education continuously. Our innovative architecture allows students and educators to analyze activities from Learning Management Systems (LMS) such as Moodle and other data sources, to review their performance, get personalized learning experiences, and receive realistic predictions of their student’s performance. Furthermore, our proposed KNIGHT LA architecture can detect at-risk pupils early on, allowing for timely interventions and assisting teachers in making data-driven decisions. In addition to this, the learning model has the ability to minimize human bias and reliance on conventional measurements, resulting in a more egalitarian and accessible educational system. However, the ethical considerations of employing these technologies, such as privacy and the threat of biased outcomes, must be considered. Overall, incorporating smart methodologies into education analytics has significant raise of around 25\(\%\) of promise for improving student education and establishing a more inclusive and equitable educational system.

Machine Learning Techniques in Drug Discovery and Development

Article

Full-text available

Apr 2021

Ravi Manne

The advancement and progress in technology and related techniques have created an opportunity for progress in many scientific fields and various industries. Machine learning has become important tool for drug designs and discovery with the availability of bit data from large databases. IN this paper I analyze Machine Learning and Deep learning techniques which help Pharma industry in all stages of drug discovery which includes target validation, prognostic biomarkers, clinical trials.

Classification of Skin cancer using deep learning, ConvolutionalNeural Networks -Opportunities and vulnerabilities-A systematic Review

Article

Full-text available

Nov 2020

Background: Skin cancer classificationusing convolutional neural networks (CNNs) proved better results in classifying skin lesions compared with dermatologists which is lifesaving in terms of diagnosing. This will help people diagnosetheir cancer on their own by just installing app on mobile devices. It is estimated that 6.3 billion people will use the subscriptions by the end of year 2021[28] for diagnosing their skin cancer. Objective: This study represents review of many research articles on classifying skin lesions using CNNs. With the recent enhancement in machine learning algorithms, misclassification rate of skin lesions has reduced compared to a dermatologist classifying them.In this article we discuss how using CNNs has evolved in successfully classifying skin cancer type, and methods implemented, and the success rate. Even though Deep learning using CNN has advantages compared to a dermatologist, it also has some vulnerabilities, in terms of misclassifying images under some Criteria, and situations. We also discuss about those Vulnerabilities in this review study. Methods: We searched theScienceDirect, PubMed,Elsevier, Web of Science databases and Google Scholar for original research articles that are published. We selected papers that have sufficient data and information regarding their research, and we created a review on their approaches and methods they have used. From the articles we searched online So far no review paper has discussed both opportunities and vulnerabilities that existed in skin cancer classification using deep learning. Conclusions: The improvements in machine learning, Deep learning techniques, can avoid human mistakes that could be possible in misclassifying and diagnosing the disease. We will discuss, how Deep learning using CNN helped us and its vulnerabilities.

A Survey on Educational Data Mining [2014-2019]

Conference Paper

Full-text available

Sep 2020

Nowadays Data Mining is used in many application areas enabling large data streams and algorithms for analysis and extraction of powerful data. On their side, the Computer Environments for Human Learning (EIAH) offer TEL devices (Technology-enhanced learning) such as simulators, serious games, MOOCs (massive online open courses), or educational platforms. These devices provide data that are traces of the activities of students or teachers. The data produced are cognitive information of very fine levels (student knowledge, skills, and errors) and require specific analysis and processing tools, we talk here about educational data mining methods, Educational data processing (EDM) is rising as a notion of research and analysis with a set of machine and psychological ways and research approaches for understanding however students learn. EDM uses machine approaches to research instructional knowledge so as to review instructional queries. For this knowledge exploration, several tools were used like personal learning environments, recommender systems, Context learning, and Course management systems. These tools offer numerous edges for instructional data processing. In this survey, we have a tendency to focus and supply numerous tools of analysis trends exploitation EDM Tools to explore data and knowledge, and explaining the process of EDM application, the goal is not only to transform the data into knowledge but also to filter the extracted knowledge to know how to modify the educational environment to improve learners’ learning. This paper surveys the foremost relevant studies administrated during this field up to date.

A Predictive Model for Students’ Performance and Risk Level Indicators Using Machine Learning

Conference Paper

Full-text available

Mar 2020

Student Modeling Using Educational Data Mining Techniques

Conference Paper

Full-text available

Nov 2019

Nabila Khodeir

Research on the Algorithm of Education Data Mining Based on Big Data

Conference Paper

Jun 2020

Unsupervised learning based mining of academic data sets for students’ performance analysis

Conference Paper

May 2020

Educational Data Mining Methods: A Survey

Conference Paper

Apr 2020

Educational Data Mining (EDM) is an emerging inter-disciplinary research area that involves education and computer science. EDM employs data mining tools and techniques, on large datasets related to education, to extract meaningful and useful information. EDM works toward the improvement of educational processes by introducing better and effective learning practices. EDM methods refer to the set of methods that are used for building models/applications. This article presents an extensive literature survey of EDM methods. The article also discusses research trends and challenges in EDM. This insight into EDM attempts to provide useful and valuable information to researchers interested in furthering the field of EDM.

Educational Data Mining: Current Problems and Solutions

Conference Paper

Apr 2020

Using Educational Data Mining Techniques to Predict Student Performance

Conference Paper

Nov 2019

Learning analytics using deep learning techniques for efficiently managing educational institutes

Abstract

Recommended publications

ROLE OF DATA MINING IN EDUCATION FOR IMPROVING STUDENTS PERFORMANCE FOR SOCIAL CHANGE

Using Classification Data Mining for Predicting Student Performance

Machine learning in education, finance and management: Applications and future trends

Design of Machine Learning Based Model to Predict Students Academic Performance