已收录 268921 条政策
 政策提纲
  • 暂无提纲
Identification of Security related Bug Reports via Text Mining using Supervised and Unsupervised Classification
[摘要] This paper is focused on automated classification of software bug reports to security and non-security related, using both supervised and unsupervised approaches. For both approaches, three types of feature vectors are used. For supervised learning, we experiment with multiple learning algorithms and training sets with different sizes. Furthermore, we propose a novel unsupervised approach based on anomaly detection. The evaluated is based on three NASA datasets. The results show that supervised classification is affected more by the learning algorithms than by feature vectors and using only 25% of the data for training provides as good results as if 90% of data are used for training. Both supervised and unsupervised learning can be used for identification of security bug reports; the former slightly outperforms the latter at the expense of labeling the testing set. In general, the performance differs across datasets, mainly due to the different amounts of security related information.
[发布日期] 2018-07-17 [发布机构] 
[效力级别]  [学科分类] 软件
[关键词] CLASSIFICATIONS;MACHINE LEARNING;ANOMALIES [时效性] 
   浏览次数:25      统一登录查看全文      激活码登录查看全文