Fernandez, N., Ghosh, A., Liu, N., Wang, Z., Choffin, B.,
Baraniuk, R., & Lan, A. (2022). Automated Scoring for
Reading Comprehension via In-context BERT Tuning.
arXiv preprint arXiv:2205.09864.
Horbach A and Zesch T (2019) The Influence of Variance
in Learner Answers on Automatic Content Scoring.
Front. Educ. 4:28. doi: 10.3389/feduc.2019.00028.
Hussein, M. A., Hassan, H. A., & Nassef, M. (2020). A trait-
based deep learning automated essay scoring system
with adaptive feedback. International Journal of
Advanced Computer Science and Applications, 11(5).
Chen, Y., & Li, X. (2023, July). PMAES: Prompt-mapping
contrastive learning for cross-prompt automated essay
scoring. In Proceedings of the 61st Annual Meeting of
the Association for Computational Linguistics (Volume
1: Long Papers) (pp. 1489-1503).
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina
Toutanova. 2019. BERT: Pre-training of deep
bidirectional transformers for language understanding.
In Proceedings of the 2019 Conference of the North
American Chapter of the Association for Computational
Linguistics: Human Language Technologies, pages
4171–4186.
Kumar, Y., Aggarwal, S., Mahata, D., Shah, R. R.,
Kumaraguru, P., & Zimmermann, R. (2019, July). Get
it scored using autosas—an automated system for
scoring short answers. In Proceedings of the AAAI
Conference on Artificial Intelligence (Vol. 33, No. 01,
pp. 9662-9669).
Kumar, Y. et al. (2020) “Calling Out Bluff: Attacking the
Robustness of Automatic Scoring Systems with Simple
Adversarial Testing.” ArXiv abs/2007.06796.
Li, F., Xi, X., Cui, Z., Li, D., & Zeng, W. (2023). Automatic
essay scoring method based on multi-scale features.
Applied Sciences, 13(11), 6775.
Liu, J., Xu, Y., & Zhu, Y. (2019). Automated essay scoring
based on two-stage learning. arXiv preprint
arXiv:1901.07744.
Lun J, Zhu J, Tang Y, Yang M (2020) Multiple data
augmentation strategies for improving performance on
automatic short answer scoring. In: Proceedings of the
AAAI Conference on Artificial Intelligence, 34(09):
13389-13396
Do, H., Kim, Y., & Lee, G. G. (2023). Prompt- and trait
relation-aware cross-prompt essay trait scoring. arXiv
preprint arXiv:2305.16826.
Mathias S, Bhattacharyya P (2018) ASAP++: Enriching the
ASAP automated essay grading dataset with essay
attribute scores. In: Proceedings of the Eleventh
International Conference on Language Resources and
Evaluation (LREC 2018)
Mayfield, E., & Black, A. W. (2020, July). Should you fine-
tune BERT for automated essay scoring? In
Proceedings of the Fifteenth Workshop on Innovative
Use of NLP for Building Educational Applications (pp.
151-162).
Muangkammuen, P., & Fukumoto, F. (2020, December).
Multi-task Learning for Automated Essay Scoring with
Sentiment Analysis. In Proceedings of the 1st
Conference of the Asia-Pacific Chapter of the
Association for Computational Linguistics and the 10th
International Joint Conference on Natural Language
Processing: Student Research Workshop (pp. 116-123).
Ormerod, C. M., Malhotra, A., & Jafari, A. (2021).
Automated essay scoring using efficient transformer-
based language models. arXiv preprint
arXiv:2102.13136.
Künnecke, F., Filighera, A., Leong, C., & Steuer, T. (2024).
Enhancing Multi-Domain Automatic Short Answer
Grading through an Explainable Neuro-Symbolic
Pipeline. arXiv preprint arXiv:2403.01811.
Yao, L., & Jiao, H. (2023). Comparing performance of
feature extraction methods and machine learning
models in essay scoring. Chinese/English Journal of
Educational Measurement and Evaluation, 4(3), 1.
Rodriguez, P. U., Jafari, A., & Ormerod, C. M. (2019).
Language models and automated essay scoring. arXiv
preprint arXiv:1909.09482.
Riordan, B., Flor, M., & Pugh, R. (2019, August). How to
account for mispellings: Quantifying the benefit of
character representations in neural content scoring
models. In Proceedings of the Fourteenth Workshop on
Innovative Use of NLP for Building Educational
Applications (pp. 116-126).
Sawatzki, J., Schlippe, T., & Benner-Wickner, M. (2021,
July). Deep learning techniques for automatic short
answer grading: Predicting scores for English and
German answers. In International conference on
artificial intelligence in education technology (pp. 65-
75). Singapore: Springer Nature Singapore.
Poulton, A., & Eliens, S. (2021, September). Explaining
transformer-based models for automatic short answer
grading. In Proceedings of the 5th International
Conference on Digital Technology in Education (pp.
110-116).
Song, W., Zhang, K., Fu, R., Liu, L., Liu, T., & Cheng, M.
(2020, November). Multi-stage pre-training for
automated Chinese essay scoring. In Proceedings of the
2020 Conference on Empirical Methods in Natural
Language Processing (EMNLP) (pp. 6723-6733).
Süzen, N., Gorban, A. N., Levesley, J., & Mirkes, E. M.
(2020). Automatic short answer grading and feedback
using text mining methods. Procedia Computer Science,
169, 726–743
Taghipour, K., & Ng, H. T. (2016, November). A neural
approach to automated essay scoring. In Proceedings of
the 2016 conference on empirical methods in natural
language processing (pp. 1882-1891).
Tay, Y., Phan, M., Tuan, L. A., & Hui, S. C. (2018, April).
SkipFlow: Incorporating neural coherence features for
end-to-end automatic text scoring. In Proceedings of the
AAAI conference on artificial intelligence (Vol. 32, No.
1).
Wang, Y., Wang, C., Li, R., & Lin, H. (2022). On the Use
of BERT for Automated Essay Scoring: Joint Learning
of Multi-Scale Essay Representation. arXiv preprint
arXiv:2205.03835.
Wang Z, Liu J, Dong R (2018a) Intelligent Auto-grading
System. In: 2018 5th IEEE International Conference on