摘要
随着大数据时代下的数据来源和获取日趋重要,基于Python 的爬虫技术已成为获取数据工具的研究热点之
一。本文应用Python 爬虫关键技术对网易云季度歌词以及歌词相关文章的信息采集和汇总,并对其汇总后的网易歌词利用
Python 类库和数据分析技术对歌手情绪、词频统计、词云可视化以及歌手对时光和城市偏爱程度等进行数据分析。研究结果
表明,当下民谣歌手情绪稳定且有激情,能通过歌曲表达其正面情感,以及对当下时光与繁华城市的喜好。
Abstract
Data resource and procurement are getting increasingly important in big data era. Crawler technology based on Python
has become one of the research hotspots on data acquisition tool. This article applies Python crawler technologies to collect and
summarize the information of Netease Cloud's quarterly lyrics and lyrics-related articles. The singer emotion data, word frequency
statistics, word cloud visualization, and the degree of singer’preference for time and city are analyzed by Python library and data
analysis technology. The research results show that the ballad singer is stable and passionate, and can express his positive feelings
through songs, and enjoy the current time and prosperous cities.
关键词
Python /
爬虫 /
第三方库 /
词云 /
数据统计与分析
Key words
Python /
crawler /
third party library /
word cloud /
data statistics and analysis
方子菱, 匡芳君.
基于Python 的网易民谣歌词数据分析[J]. 电脑与电信. 2018, 1(4): 53-56
FANG Zi-ling, KUANG Fang-jun.
Data Analysis on NetEase Ballad Lyrics Based on Python[J]. Computer & Telecommunication. 2018, 1(4): 53-56
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}