Sie haben Javascript deaktiviert!
Sie haben versucht eine Funktion zu nutzen, die nur mit Javascript möglich ist. Um sämtliche Funktionalitäten unserer Internetseite zu nutzen, aktivieren Sie bitte Javascript in Ihrem Browser.

Teamfoto vom 23.11.2017 (Foto: Universität Paderborn, Astrid van Kimmenade)

Markus Dollmann, M.Sc.

 Markus Dollmann, M.Sc.

Digitale Kulturwissenschaften

Ehemaliger - AG Semantische Informationsverarbeitung

01.12.2017 - heute

Wissenschaftlicher Mitarbeiter an der Professur für Digitale Kulturwissenschaften

Arbeitsgruppe Semantische Informationsverarbeitung
Universität Paderborn

01.04.2016 - 30.11.2017

Wissenschaftlicher Mitarbeiter an der Juniorprofessur für Wirtschaftsinformatik, insb. Semantische Informationsverarbeitung

Heinz Nixdorf Institut
Universität Paderborn

21.07.2016 - 28.07.2016

LxMLS 2016 - 6th Lisbon Machine Learning School

11.07.2016 - 15.07.2016

2nd ScaDS Summer School on Big Data

01.04.2014 - 31.03.2016

Studentische Hilftskraft an der Juniorprofessur für Wirtschaftsinformatik, insb. Semantische Informationsverarbeitung

Heinz Nixdorf Institut
Universität Paderborn

01.04.2014 - 31.03.2016

M.Sc. in Management Information Systems

Universität Paderborn

19.09.2015 - 25.09.2015

Autum School for Information Foraging and Retrieval 2015

01.10.2010 - 31.03.2014

B. Sc. in Wirtschaftsinformatik

Universität Paderborn

01.04.2013 - 31.07.2013

Studentische Hilfskraft am Institut für Informatik bei der Fachgruppe Softwaretechnik

Liste im Research Information System öffnen


Improving Classifiers for Semantic Annotation of Software Requirements with Elaborate Syntactic Structure

Y.. Kim, S. Lee, M. Dollmann, M. Geierhos, International Journal of Advanced Science and Technology (2018), 112, pp. 123-136

A user generally writes software requirements in ambiguous and incomplete form by using natural language; therefore, a software developer may have difficulty in clearly understanding what the meanings are. To solve this problem with automation, we propose a classifier for semantic annotation with manually pre-defined semantic categories. To improve our classifier, we carefully designed syntactic features extracted by constituency and dependency parsers. Even with a small dataset and a large number of classes, our proposed classifier records an accuracy of 0.75, which outperforms the previous model, REaCT.


Semantic Annotation of Software Requirements with Language Frame

Y. Kim, S.. Lee, M. Dollmann, M. Geierhos, International Journal of Software Engineering for Smart Device (2017), 4(2), pp. 1-6

An end user generally writes down software requirements in ambiguous expressions using natural language; hence, a software developer attuned to programming language finds it difficult to understand th meaning of the requirements. To solve this problem we define semantic categories for disambiguation and classify/annotate the requirement into the categories by using machine-learning models. We extensively use a language frame closely related to such categories for designing features to overcome the problem of insufficient training data compare to the large number of classes. Our proposed model obtained a micro-average F1-score of 0.75, outperforming the previous model, REaCT.

Studying Software Descriptions in SourceForge and App Stores for a better Understanding of real-life Requirements

F.S. Bäumer, M. Dollmann, M. Geierhos, in: Proceedings of the 2nd ACM SIGSOFT International Workshop on App Market Analytics, ACM, 2017, pp. 19-25

Users prefer natural language software requirements because of their usability and accessibility. Many approaches exist to elaborate these requirements and to support the users during the elicitation process. But there is a lack of adequate resources, which are needed to train and evaluate approaches for requirement refinement. We are trying to close this gap by using online available software descriptions from SourceForge and app stores. Thus, we present two real-life requirements collections based on online-available software descriptions. Our goal is to show the domain-specific characteristics of content words describing functional requirements. On the one hand, we created a semantic role-labeled requirements set, which we use for requirements classification. On the other hand, we enriched software descriptions with linguistic features and dependencies to provide evidence for the context-awareness of software functionalities.


On- and Off-Topic Classification and Semantic Annotation of User-Generated Software Requirements

M. Dollmann, M. Geierhos, in: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing (EMNLP), Association for Computational Linguistics (ACL), 2016, pp. 1807-1816

Users prefer natural language software requirements because of their usability and accessibility. When they describe their wishes for software development, they often provide off-topic information. We therefore present an automated approach for identifying and semantically annotating the on-topic parts of the given descriptions. It is designed to support requirement engineers in the requirement elicitation process on detecting and analyzing requirements in user-generated content. Since no lexical resources with domain-specific information about requirements are available, we created a corpus of requirements written in controlled language by instructed users and uncontrolled language by uninstructed users. We annotated these requirements regarding predicate-argument structures, conditions, priorities, motivations and semantic roles and used this information to train classifiers for information extraction purposes. The approach achieves an accuracy of 92% for the on- and off-topic classification task and an F1-measure of 72% for the semantic annotation.


Find a Physician by Matching Medical Needs described in your Own Words

F.S. Bäumer, M. Dollmann, M. Geierhos, in: The 6th International Conference on Emerging Ubiquitous Systems and Pervasive Networks (EUSPN 2015) / The 5th International Conference on Current and Future Trends of Information and Communication Technologies in Healthcare (ICTH-2015) / Affiliated Workshops, Elsevier, 2015, pp. 417-424

The individual search for information about physicians on Web 2.0 platforms can affect almost all aspects of our lives. People can directly access physician rating websites via web browsers or use any search engine to find physician reviews and ratings filtered by location resp. specialty. However, sometimes keyword search does not meet user needs because of the disagreement of users’ common terms queries for symptoms and the widespread medical terminology. In this paper, we present the prototype of a specialised search engine that overcomes this by indexing user-generated content (i.e., review texts) for physician discovery and provides automatic suggestions as well as an appropriate visualisation. On the one hand, we consider the available numeric physician ratings as sorting criterion for the ranking of query results. Furthermore, we extended existing ranking algorithms with respect to domain-specific types and physicians ratings on the other hand. We gathered more than 860,000 review texts and collected more than 213,000 physician records. A random test shows that about 19.7% of 5,100 different words in total are health- related and partly belong to consumer health vocabularies. Our evaluation results show that the query results fit user's particular health issues when seeking for physicians.


SentiBA: Lexicon-based Sentiment Analysis on German Product Reviews

M. Dollmann, M. Geierhos, in: Workshop Proceedings of the 12th Edition of the KONVENS Conference, Universitätsverlag Hildesheim, 2014, pp. 185-191

In this paper, we describe our system developed for the GErman SenTiment AnaLysis shared Task (GESTALT) for participation in the Maintask 2: Subjective Phrase and Aspect Extraction from Product Reviews. We present a tool, which identifies subjective and aspect phrases in German product reviews. For the recognition of subjective phrases, we pursue a lexicon-based approach. For the extraction of aspect phrases from the reviews, we consider two possible ways: Besides the subjectivity and aspect look-up, we also implemented a method to establish which subjective phrase belongs to which aspect. The system achieves better results for the recognition of aspect phrases than for the subjective identification.

Liste im Research Information System öffnen

Informationen zu den Lehrveranstaltungen von Markus Dollmann finden Sie hier.

Eine Übersicht über die Forschung von Markus Dollmann finden Sie hier.

Hier erfahren Sie mehr zum wissenschaftlichen Werdegang und den Auszeichnungen von Markus Dollmann.

Die Universität der Informationsgesellschaft