Python超簡單分析評論提取關鍵詞製作精美詞雲流程

2022-03-26 13:00:03

一、抓取全部評論

吾的這篇文章，有 1022 次評論，一條條看，吾看不過來，於是想到 Python 詞雲，提取關鍵詞，倒也是一樁趣事。

評論情況： {'android': 545 次, 'ios': 110 次, 'pc': 44 次, 'uniapp': 1 次}

一個小細節：給我評論的裝置中，安卓蘋果比是 5:1。

Building prefix dict from the default dictionary ... Loading model cost 0.361 seconds. Prefix dict has been built successfully.

1、找到評論介面

開啟 chrome 瀏覽器，開發者模式
點選評論列表（圖示 1）
點選介面連結（圖示 2）
檢視 response 返回值（評論結果的 json 格式）

2、Python 獲取評論

def get_comments(articleId):
    # 確定評論的頁數
    main_res = get_commentId(articleId,1)
    pageCount = json.loads(main_res)['data']['pageCount']
 
    comment_list,comment_list2 = [],[]
    source_analy = {}
    for p in range(1,pageCount+1):
        res = get_commentId(articleId, p)
        try:
            commentIds = json.loads(res)['data']['list']
            for i in commentIds:
                commentId = i['info']['commentId']
                userName = i['info']['userName']
                nickName = i['info']['nickName']  ## 獲取使用者名稱
                source_dvs = i['info']['commentFromTypeResult']['key']   # 操作裝置
                content = i['info']['content']
                comment_list.append([commentId, userName, nickName, source_dvs, content])
                comment_list2.append("%s 丨 %s"%(userName, nickName))
                if source_dvs not in source_analy.keys():
                    source_analy[source_dvs] = 1
                else:
                    source_analy[source_dvs] = source_analy[source_dvs] + 1
                # print(source_analy)
        except:
            print('本頁失敗！')
    print('評論數：' + str(len(comment_list)))
    return source_analy, comment_list, comment_list2

二、文字分詞、詞雲製作

1、文字分析

西紅柿採用的是結巴分詞, 和 wordcloud。

# -*- coding:utf8 -*-
import jieba
import wordcloud

程式碼實現：

seg_list = jieba.cut(comments, cut_all=False)  # 精確模式
    word = ' '.join(seg_list)

2、生成詞雲

背景圖西紅柿採用的是心形圖片

pic = mpimg.imread('/Users/pray/Downloads/aixin.jpeg')

完整程式碼：：

def word_cloud(articleId):
    source_analy, comment_list, comment_list2 = get_comments(articleId)
    print("評論情況：", source_analy)
    comments = ''
    for one in comment_list:
        comment = one[4]
        if 'face' not in comment:
            comments = comments + comment
    seg_list = jieba.cut(comments, cut_all=False)  # 精確模式
    word = ' '.join(seg_list)
 
    pic = mpimg.imread('/Users/pray/Downloads/aixin.jpeg')
    wc = wordcloud.WordCloud(mask=pic, font_path='/Library/Fonts/Songti.ttc', width=1000, height=500,
                             background_color='white').generate(word)

3、初步效果-模糊不清

西紅柿發現文字模糊、影象曲線邊緣不清晰的問題。

於是，指定解析度，高清整起來。

# 儲存
plt.savefig('xin300.png', dpi=300) #指定解析度儲存

4、最終效果-高清無馬

到此這篇關於Python超簡單分析評論提取關鍵詞製作精美詞雲流程的文章就介紹到這了,更多相關Python 製作詞雲內容請搜尋it145.com以前的文章或繼續瀏覽下面的相關文章希望大家以後多多支援it145.com！