Automated Analysis of Job Requirements for Computer Scientists in Online Job Advertisements

Joscha Grüger, Georg J. Schneider

2019

Abstract

The paper presents a concept and a system for the automatic identification of skills in German-language job advertisements. The identification process is divided into four steps: Data Acquisition, Language Detection, Section Classification and Skill Recognition. Online job exchanges served as the data source. To identify the part of a job advertisement that contains the requirements, different machine-learning approaches were compared. Skills were extracted using part-of-speech (POS) templates. To classify the extracted skills into predefined skill classes, different similarity measures were compared. With the pre-trained LinearSVC model, the part of a job advertisement containing the requirements was identified correctly for 100% of the tested job advertisements. Extracting skills is difficult because skills can be written in different ways in German, especially since the language allows the ad-hoc creation of compounds. The POS-template approach worked for 87.33% of the skills. The combination of a fastText model and Levenshtein distance assigned 75.33% of the recognized skills to the correct skill class. The results show that extracting required skills from German-language job ads is complex.
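The last step, assigning a recognized skill to a predefined skill class, can be illustrated with a minimal sketch. It covers only the Levenshtein part of the comparison; the fastText component described in the abstract is left out, and this is not the authors' implementation. The skill classes, the normalization by the longer string, and the 0.6 threshold are assumptions chosen for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance between a and b (insertions, deletions, substitutions)."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def similarity(a: str, b: str) -> float:
    """Case-insensitive similarity in [0, 1]; 1.0 means identical strings."""
    a, b = a.lower(), b.lower()
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - levenshtein(a, b) / longest


# Hypothetical skill classes, not taken from the paper.
SKILL_CLASSES = ["Java", "JavaScript", "Python", "Datenbanken", "Projektmanagement"]


def assign_skill_class(skill, classes=SKILL_CLASSES, threshold=0.6):
    """Return the most similar class, or None if nothing reaches the threshold."""
    best = max(classes, key=lambda c: similarity(skill, c))
    return best if similarity(skill, best) >= threshold else None


if __name__ == "__main__":
    for skill in ["Javascript", "Datenbank", "Teamfähigkeit"]:
        print(skill, "->", assign_skill_class(skill))
```

A purely string-based measure like this handles spelling variants and inflected forms but cannot relate synonyms, which is one plausible reason to pair it with a word-embedding model such as fastText.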


Paper Citation


in Harvard Style

Grüger J. and Schneider G. (2019). Automated Analysis of Job Requirements for Computer Scientists in Online Job Advertisements. In Proceedings of the 15th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-989-758-386-5, pages 226-233. DOI: 10.5220/0008068202260233


in Bibtex Style

@conference{webist19,
author={Joscha Grüger and Georg Schneider},
title={Automated Analysis of Job Requirements for Computer Scientists in Online Job Advertisements},
booktitle={Proceedings of the 15th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST},
year={2019},
pages={226-233},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0008068202260233},
isbn={978-989-758-386-5},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 15th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST
TI - Automated Analysis of Job Requirements for Computer Scientists in Online Job Advertisements
SN - 978-989-758-386-5
AU - Grüger J.
AU - Schneider G.
PY - 2019
SP - 226
EP - 233
DO - 10.5220/0008068202260233