Options
Improving Explainability and Accuracy through Feature Engineering: A Taxonomy of Features in NLP-based Machine Learning
Journal
Forty-Second International Conference on Information Systems
Type
journal article
Date Issued
2021
Author(s)
Abstract (De)
Natural Language Processing (NLP)-based machine learning receives continuous attention in Information System (IS) research and practice. Despite the success of deep learning models, NLP feature engineering still plays a vital role in contexts where only little annotated data is available, and in which explainability is a precondition for productive deployment. However, NLP feature engineering is a labor-intensive and time-consuming endeavor, and there is still limited shared knowledge about the distinctive characteristics of NLP features from an interdisciplinary perspective. To address this gap, we draw on a systematic literature review and develop a five-dimensional NLP feature taxonomy based on 133 unique features from 211 scientific studies. This helps IS researchers and practitioners to classify, compare, and evaluate their NLP studies. Moreover, we used cluster heat mapping analysis to derive three clusters and several white spots to provide further assistance for designing new NLP solutions in IS.
Language
English
HSG Classification
contribution to scientific community
HSG Profile Area
SoM - Business Innovation
Refereed
Yes
Publisher place
Austin, Texas
Subject(s)
Division(s)
Eprints ID
264582