Lastly, we evaluate and discuss our performance 
scores. Here, we compare our model with self-trained 
word embeddings to BERT embeddings. In opposite 
to other scholars, our approach performs two steps in 
one: aspect phrase extraction and classification. 
Nevertheless, we outperform other approaches such 
as Pontiki et al., (2016b) though we use another 
domain and dataset. However, we regard this domain 
as a complex one using the morphologically rich 
German language.  
In the future, we not only plan to build more 
annotated datasets, but want to include the opinion 
extraction part, too. 
ACKNOWLEDGEMENTS 
This work was partially supported by the German 
Research Foundation (DFG) within the Collaborative 
Research Centre On-The-Fly Computing (SFB 901). 
REFERENCES 
Apotheke-Adhoc. 2018. Von Jameda zur Konkurrenz 
geschickt. [Sent by Jameda to the competitors] URL: 
https://www.apotheke-adhoc.de/nachrichten/detail/ 
apothekenpraxis/von-jameda-zur-konkurrenz-
geschickt-bewertungsportale/ (visited on 10/28/2019).  
Bird, S.; Klein, E.; and Loper, E. 2009. Natural Language 
Processing with Python. O’Reilly Media, 1. Edition.  
Blair-Goldensohn, S.; Hannan, K.; McDonald, R.; Neylon, 
T.; Reis, G. A.; and Reynar, J. 2008. Building A 
Sentiment Summarizer For Local Service Reviews. In 
Proceedings of the WWW Workshop NLP Challenges in 
the Information Explosion Era, 14, 339–348. ACM.  
Bäumer, F. S.; Grote, N.; Kersting, J.; and Geierhos, M. 
2017. Privacy Matters: Detecting Nocuous Patient Data 
Exposure In Online Physician Reviews. In Proceedings 
of the 23rd International Conference on Information 
and Software Technologies,, 756, 77–89. Druskininkai, 
Lithuania: Springer.  
Bäumer, F. S.; Kersting, J.; Kuršelis, V.; and Geierhos, M. 
2018. Rate Your Physician: Findings From A 
Lithuanian Physician Rating Website. In Proceedings 
of the 24th International Conference on Information 
and Software Technologies, Communications in 
Computer and Information Science, 920, 43–58. 
Vilnius, Lithuania: Springer. 
Bojanowski, P., Grave, E., Joulin, A. & Mikolov, T. (2017). 
Enriching Word Vectors with Subword Information. 
Transactions of the ACL, 5, 135–146. ACL. 
Cieliebak, M., Deriu, J. M., Egger, D. & Uzdilli, F. (2017). 
A Twitter Corpus and Benchmark Resources for 
German Sentiment Analysis. In Proceedings of the 5th 
International Workshop on Natural Language 
Processing for Social Media. 45–51. Valencia, Spain: 
ACL. 
Chinsha, T. C., and Shibily, J. 2015. A Syntactic Approach 
for Aspect Based Opinion Mining. In Proceedings of 
the 9th IEEE International Conference on Semantic 
Computing. 24–31. Anaheim, CA, USA: IEEE. 
Cohen, J. (1960). A Coefficient of Agreement for Nominal 
Scales. Educational and Psychological Measurement, 
20(1), 37–46. 
Cordes, M. 2018. Wie bewerten die anderen? Eine 
übergreifende Analyse von Arztbewertungsportalen in 
Europa. [What do the others think? An overarching 
analysis of doctor rating portals in Europe] Master 
Thesis. Universität Paderborn. 
Danda, P.; Mishra, P.; Kanneganti, S.; and Lanka, S. 2017. 
IIIT-H at IJCNLP-2017 Task 4: Customer Feedback 
Analysis Using Machine Learning and Neural Network 
Approaches. In Proceedings of the 8th International 
Joint Conference on Natural Language Processing, 
Shared Tasks, 155–160. Taipei, Taiwan: AFNLP. 
Deepset. 2019. Deepset – Open Sourcing German BERT. 
URL: https://deepset.ai/german-bert. / (visited on 
11/28/2019). 
De Clercq, O.; Lefever, E.; Jacobs, G.; Carpels, T.; and 
Hoste, V. 2017. Towards an Integrated Pipeline for 
Aspect-based Sentiment Analysis In Various Domains. 
In  Proceedings of the 8th ACL Workshop on 
Computational Approaches to Subjectivity, Sentiment 
and Social Media Analysis. 136–142, Copenhagen, 
Denmark: ACL.  
Devlin, J.; Chang, M.-W.; Lee, K.; and Toutanova, K. 2018. 
BERT: Pre-Training of Deep Bidirectional 
Transformers for Language Understanding. arXiv 
preprint.  
Emmert, M.; Sander, U.; Esslinger, A. S.; Maryschok, M.; 
and Schöffski, O. 2012. Public Reporting In Germany: 
The Content of Physician Rating Websites. Methods of 
Information in Medicine. 51(2):112–120.  
Emmert, M.; Meier, F.; Pisch, F.; and Sander, U. 2013. 
Physician Choice Making and Characteristics 
Associated With Using Physician-Rating Websites: 
Cross-Sectional Study. Journal of Medical Internet 
Research 15(8):e187.  
Emmert, M.; Sander, U.; and Pisch, F. 2013. Eight 
Questions About Physician-Rating Websites: A 
Systematic Review. Journal of Medical Internet 
Research 15(2):e24.  
ExplosionAI. 2019. Release de_core_news_sm-2.1.0 
explosion/spacy-models GitHub. URL: 
https://github.com/explosion/spacy-models/releases// 
tag/de_core_news-sm-2.1.0 (visited on 11/13/2019). 
Garcia-Pablos, A.; Cuadros, M.; and Rigau, G. 2018. 
W2VLDA: Almost Unsupervised System for Aspect-
based Sentiment Analysis. Expert Systems with 
Applications 91:127–137.  
Hochreiter, S., and Schmidhuber, J. 1997. Long Short-Term 
Memory. Neural Computation 9(8):1735–1780.  
Hu, M., and Liu, B. 2004. Mining and Summarizing 
Customer Reviews. In Proceedings of the 10th ACM 
SIGKDD International Conference on Knowledge