
 
 
Predict Student Score using Text Mining in English for Librarian 
Course 
Febrianti Widyahastuti
1
 and Viany Utami Tjhin
2
 
1
School of Information Technology, Deakin University, 221 Burwood Hwy, Burwood, Victoria 3125, Australia 
2
Department of Information Systems, Bina Nusantara University, Jakarta, Indonesia 
Keywords:  Text  Mining,  Prediction,  Discussion  Forum,  Classification,  Information  Retrieval,  Learning  Analytics, 
Learning Management System (LMS)  
Abstract:  As we know that most information contain text document, studying such issue are promising research areas 
because many documents need deep learning to discovery new phenomena. This paper aims to identify and 
discover new knowledge through analysis of text extraction in online discussion forum that capable to predict 
students’ performance by applying text mining from an undergraduate English for Librarians course for one 
semester in Open University, Indonesia. The result of prediction model in this research can be integrated with 
the current conventional evaluation process. Additionally, prediction approach can give the best practice that 
the evaluation method can be predicted using text mining in online discussion forum. In this research, there 
are two approaches used to predict students’ performance: first, incorporating learning material documents 
and each students’ response every week. In this case, algorithm using TF-IDF approach is used to leverage 
the  information  from  students’  response  and  learning  materials  about  how  often  words  occur  in  both 
documents. Second, classifying terms into three categories: students’ answer text related to learning material, 
English  meaningful  text  related  to  learning  material  and  Indonesian  meaningful  text  related  to  learning 
material. The correlation result shows that English meaningful text related to learning material have strong 
relationship with students’ performance. 
1  INTRODUCTION 
Learning  analytics  can  provide  powerful  analytical 
tools  from  varied  sources  such  as  audit  logs  of 
students’ activities and discussion log interactions in 
Learning Management System (LMS). The idea has 
motivated us to focus on useful informational text on 
online  discussion  forum  logs  to  find  meaningful 
knowledge using text mining  and understanding of 
students  learning  progress  and  behaviour  in  the 
learning environment.  
The use of text mining in document management 
become the most promising trends in improving the 
accuracy and speed of document analysis. As a part 
of  the  artificial  intelligent  form,  text  mining 
establishes  mapping  process  of  the  artificial 
intelligent at various levels of implementation.  
The fact that majority of web data are constructed 
in unstructured text format that is not automatic and 
need processes to be understood (Li & Wu 2010); and 
the  intention  of  many  researchers  who  try  to  get 
useful information as well as meaningful knowledge 
from  tremendous  amounts  of  text  on  online 
discussion make it necessary to develop innovation of 
prediction model based on text document on online 
discussion forum. The result of such kind of research 
will help the process of acceleration of educational 
assessment and improving the quality of learning. 
The focus of this study was on the prediction of 
students’ performance based on students’ response on 
online discussion forum dataset. The experiment was 
conducted  by  involving  69  students  enrolled  in 
English for Librarian course. Additionally, the texts 
used  as  the  basis  of  analysis  were  the  mixture  of 
Indonesian and English text.  
This study aimed to find  texts  with the highest 
frequency  and  whether  those  texts  are  related  to 
learning materials. To address such purpose, the data 
collection and analysis was done by utilizing TF-IDF 
(Term Frequency and Inverse Document Frequency) 
method. 
 
 
 
Widyahastuti, F. and Tjhin, V.
Predict Student Score using Text Mining in English for Librarian Course.
DOI: 10.5220/0009016900002297
In Proceedings of the Borneo International Conference on Education and Social Sciences (BICESS 2018), pages 83-89
ISBN: 978-989-758-470-1
Copyright
c
 2022 by SCITEPRESS – Science and Technology Publications, Lda. All rights reserved
83