Real Time Text Analysis
[Abstract] This paper illustrates real-time analysis of large-scale data. As a practical implementation, we perform sentiment analysis on live Twitter feeds, scoring each tweet individually. To analyze sentiment, we train our data model on SentiWordNet, a polarity-annotated lexical resource built on Princeton University's WordNet. Our main objective is to efficiently analyze large-scale data on the fly using distributed computation. The Apache Spark and Apache Hadoop ecosystem is used as the distributed computation platform, with Java as the development language.
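To make the per-tweet scoring idea concrete, here is a minimal sketch in plain Java. It is not the paper's implementation: the class name TweetSentiment, the six-word lexicon, and its polarity values are hypothetical stand-ins for a real SentiWordNet lookup, and the tokenizer is deliberately naive. In a distributed setting, a function like score would presumably be applied to each incoming tweet independently, which is what makes the task easy to parallelize.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper, not from the paper: lexicon-based sentiment for one tweet.
final class TweetSentiment {

    // Toy lexicon (assumption): word -> polarity in [-1, 1].
    // The values are illustrative and are NOT taken from SentiWordNet.
    static final Map<String, Double> LEXICON = Map.of(
            "good", 0.6,
            "great", 0.8,
            "love", 0.9,
            "bad", -0.6,
            "terrible", -0.9,
            "hate", -0.8);

    // Sum the polarity of every known word; unknown words contribute 0.
    static double score(String tweet) {
        return Arrays.stream(tweet.toLowerCase(Locale.ROOT).split("[^a-z]+"))
                .mapToDouble(word -> LEXICON.getOrDefault(word, 0.0))
                .sum();
    }

    // Map the numeric score to a coarse label.
    static String label(String tweet) {
        double s = score(tweet);
        if (s > 0) {
            return "positive";
        }
        if (s < 0) {
            return "negative";
        }
        return "neutral";
    }

    public static void main(String[] args) {
        List<String> tweets = List.of(
                "I love this great phone",
                "Terrible service, I hate waiting",
                "Meeting at noon");
        // Each tweet is scored independently of the others.
        for (String tweet : tweets) {
            System.out.println(label(tweet) + "\t" + tweet);
        }
    }
}
```

Because score depends only on its own input and a read-only lexicon, the same function can be evaluated on many tweets in parallel without coordination.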
[Publication date]  [Issuing institution] VIT University, Vellore, Tamil Nadu 632014, India
[Level of authority] Industrial technology [Subject classification] 
[Keywords] Apache Hadoop; Distributed computation; Large-scale data; On the fly; Princeton University; Real-time analysis; SentiWordNet; Text analysis [Timeliness] 