基金项目

基于文本挖掘的图书馆舆情情感分析

展开
  • 成都工业学院

网络出版日期: 2024-07-02

Analysis of Public Sentiment in Libraries Based on Text Mining

Expand
  • Chengdu Technological University

Online published: 2024-07-02

摘要

随着移动互联网在高校图书馆年轻读者的影响越来越大,新生代读者使用QQ即时通信软件的比例越来越大, 对于图书馆QQ群文本信息进行文本挖掘从而了解图书馆的舆情,可以用于图书馆舆情预警,为图书馆决策层提供更强的舆 情应对能力。采用网络爬虫技术爬取2022年9月至2022年12月份时间段内的QQ群中聊天记录作为图书馆舆情数据,接着 对原始的舆情数据进行数据去重、清洗等数据预处理操作,然后运用清华大学Thulac分词技术提取舆情数据的关键字、计算其 权重,并使用WordCloud库进行可视化展示,接下来采用spaCy库给舆情数据计算出具体的情感倾向及分值,最后通过实验来 分析图书馆舆情的情感倾向。

本文引用格式

王龙军  王 晶  李光华  陈亮 . 基于文本挖掘的图书馆舆情情感分析[J]. 电脑与电信, 2024 , 1(3) : 13 . DOI: 10.15966/j.cnki.dnydx.2024.03.020

Abstract

With the increasing influence of mobile Internet on young readers in university libraries, the proportion of new generation readers using QQ instant messaging software is increasing, text mining of library QQ group text information can be used to under‐ stand the public sentiment of the library, which can be used for library public sentiment warning and provide strong public sentiment response capabilities for library decision-makers. Web crawler technology is used to crawl chat records in QQ groups from Septem‐ ber 2022 to December 2022 as library public sentiment data, then performing data preprocessing operations such as deduplication and cleaning on the original public sentiment data. Then, Tsinghua University's Thulac segmentation technology is used to extract keywords and calculate their weights from the public sentiment data, and them the WordCloud library is used for visualization. Next, the spaCy library is used to to calculate specific emotional tendencies and scores for public opinion data, and finally the emotional tendencies of library public opinion are analyzed through experiments.
Options
文章导航

/