Improving Explainability and Accuracy through Feature Engineering: A Taxonomy of Features in NLP-based Machine Learning

Wambsganss, Thiemo; Engel, Christian; Fromm, Hansjörg

Improving Explainability and Accuracy through Feature Engineering: A Taxonomy of Features in NLP-based Machine Learning

Journal

Forty-Second International Conference on Information Systems

Type

journal article

Date Issued

2021

Author(s)

Wambsganss, Thiemo

;

Engel, Christian

;

Fromm, Hansjörg

Abstract (De)

Natural Language Processing (NLP)-based machine learning receives continuous attention in Information System (IS) research and practice. Despite the success of deep learning models, NLP feature engineering still plays a vital role in contexts where only little annotated data is available, and in which explainability is a precondition for productive deployment. However, NLP feature engineering is a labor-intensive and time-consuming endeavor, and there is still limited shared knowledge about the distinctive characteristics of NLP features from an interdisciplinary perspective. To address this gap, we draw on a systematic literature review and develop a five-dimensional NLP feature taxonomy based on 133 unique features from 211 scientific studies. This helps IS researchers and practitioners to classify, compare, and evaluate their NLP studies. Moreover, we used cluster heat mapping analysis to derive three clusters and several white spots to provide further assistance for designing new NLP solutions in IS.

Language

English

HSG Classification

contribution to scientific community

HSG Profile Area

SoM - Business Innovation

Refereed

Yes

Publisher place

Austin, Texas

URL

https://www.alexandria.unisg.ch/handle/20.500.14171/111417

Subject(s)

information managemen...

Division(s)

IWI - Institute of In...

Eprints ID

264582

File(s)

open.access

Name

ICIS2021_Feature Taxonomy_revised_v2.pdf

Size

1.02 MB

Format

Adobe PDF

Checksum (MD5)

8909c892b9109ffe77e5beb3ee1a0316