On the automatic classification of app reviews. Issue 3 (September 2016)
- Record Type:
- Journal Article
- Title:
- On the automatic classification of app reviews. Issue 3 (September 2016)
- Main Title:
- On the automatic classification of app reviews
- Authors:
- Maalej, Walid
Kurtanović, Zijad
Nabil, Hadeer
Stanik, Christoph - Abstract:
- Abstract App stores like Google Play and Apple AppStore have over 3 million apps covering nearly every kind of software and service. Billions of users regularly download, use, and review these apps. Recent studies have shown that reviews written by the users represent a rich source of information for the app vendors and the developers, as they include information about bugs, ideas for new features, or documentation of released features. The majority of the reviews, however, is rather non-informative just praising the app and repeating to the star ratings in words. This paper introduces several probabilistic techniques to classify app reviews into four types: bug reports, feature requests, user experiences, and text ratings. For this, we use review metadata such as the star rating and the tense, as well as, text classification, natural language processing, and sentiment analysis techniques. We conducted a series of experiments to compare the accuracy of the techniques and compared them with simple string matching. We found that metadata alone results in a poor classification accuracy. When combined with simple text classification and natural language preprocessing of the text—particularly with bigrams and lemmatization—the classification precision for all review types got up to 88–92 % and the recall up to 90–99 %. Multiple binary classifiers outperformed single multiclass classifiers. Our results inspired the design of a review analytics tool, which should help app vendorsAbstract App stores like Google Play and Apple AppStore have over 3 million apps covering nearly every kind of software and service. Billions of users regularly download, use, and review these apps. Recent studies have shown that reviews written by the users represent a rich source of information for the app vendors and the developers, as they include information about bugs, ideas for new features, or documentation of released features. The majority of the reviews, however, is rather non-informative just praising the app and repeating to the star ratings in words. This paper introduces several probabilistic techniques to classify app reviews into four types: bug reports, feature requests, user experiences, and text ratings. For this, we use review metadata such as the star rating and the tense, as well as, text classification, natural language processing, and sentiment analysis techniques. We conducted a series of experiments to compare the accuracy of the techniques and compared them with simple string matching. We found that metadata alone results in a poor classification accuracy. When combined with simple text classification and natural language preprocessing of the text—particularly with bigrams and lemmatization—the classification precision for all review types got up to 88–92 % and the recall up to 90–99 %. Multiple binary classifiers outperformed single multiclass classifiers. Our results inspired the design of a review analytics tool, which should help app vendors and developers deal with the large amount of reviews, filter critical reviews, and assign them to the appropriate stakeholders. We describe the tool main features and summarize nine interviews with practitioners on how review analytics tools including ours could be used in practice. … (more)
- Is Part Of:
- Requirements engineering. Volume 21:Issue 3(2016)
- Journal:
- Requirements engineering
- Issue:
- Volume 21:Issue 3(2016)
- Issue Display:
- Volume 21, Issue 3 (2016)
- Year:
- 2016
- Volume:
- 21
- Issue:
- 3
- Issue Sort Value:
- 2016-0021-0003-0000
- Page Start:
- 311
- Page End:
- 331
- Publication Date:
- 2016-09
- Subjects:
- User feedback -- Review analytics -- Software analytics -- Machine learning -- Natural language processing -- Data-driven requirements engineering
Requirements engineering -- Periodicals
Software engineering -- Periodicals
Computer software -- Development -- Periodicals
Logiciels -- Développement -- Périodiques
Génie logiciel -- Périodiques
005.12 - Journal URLs:
- http://firstsearch.oclc.org ↗
http://firstsearch.oclc.org/journal=0947-3602;screen=info;ECOIP ↗
http://link.springer-ny.com/link/service/journals/00766/index.htm ↗
http://www.springerlink.com/content/0947-3602/ ↗
http://www.springer.com/gb/ ↗ - DOI:
- 10.1007/s00766-016-0251-9 ↗
- Languages:
- English
- ISSNs:
- 0947-3602
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 7713.844000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 10044.xml