Please wait a minute...
Computer & Telecommunication  2019, Vol. 1 Issue (5): 1-7    DOI:
Current Issue | Archive | Adv Search |
Chinese Text Feature Classification Based on Distributed Framework
1. Inner Mongolia University of Science and Technology; 2. Qingdao Metro Group Co., Ltd. Operating Branch
Download:   PDF(0KB)
Export: BibTeX | EndNote (RIS)      
Abstract   The study uses Fudan Chinese text and Sogou Chinese document as the research object. It improves the Chinese text classification accuracy and recall rate. And it analyzes and obtains the best contribution value of the feature words. Based on naive Bayes classification method, improved TFIDF keyword extraction and weight calculation, the TNBIF model classification method is proposed and implemented on the Spark platform. The experimental results show that the Chinese text classification is applied by the TNBIF model. The accuracy is as high as 95.49%, which is 5.41% higher than the traditional text classification method and the recall rate is increased by 6.64%. This study obtains an optimal contribution of 0.95.
Key wordsTNBIF      model      massive data set      Spark      feature classification      parallel classification     
Published: 13 August 2019
ZTFLH:  TP391.1  

Cite this article:

ZHANG Hui-fang ZONG Cai-le ZHANG Xiao-lin. Chinese Text Feature Classification Based on Distributed Framework. Computer & Telecommunication, 2019, 1(5): 1-7.

URL:

http://www.computertelecom.com.cn/EN/     OR     http://www.computertelecom.com.cn/EN/Y2019/V1/I5/1

[1] ZHOU Chao-xuan LIU Xiao-ying. The Resources Construction and Discussion on the Implementation Scheme of Blended Teaching Model of Data Structure Course[J]. 电脑与电信, 2021, 1(8): 33-36.
[2] LIU Hui-fang. Research and Application of Anti-fraud Model Based on Big Data of Communication Operator[J]. 电脑与电信, 2021, 1(7): 46-52.
[3] GE Xiao-yan. Exploring on Mixed Teaching Reform of Data Mining Course[J]. 电脑与电信, 2021, 1(6): 43-46.
[4] XU Ming-yuan YANG Yan-hong. Design of Baggage Detection System Based on Deep Learning[J]. 电脑与电信, 2021, 1(4): 36-40.
[5] WANG Liang-cheng. Discussion on Innovative Entrepreneurship Education in Private Colleges and Universities[J]. 电脑与电信, 2021, 1(3): 10-12.
[6] LIU Xinhao MA Xin-rui WANG Da-wei. Simulation Modeling of Cognitive Radio Spectrum Allocation Based on Graph Theory Model[J]. 电脑与电信, 2021, 1(3): 16-20.
[7] ZHONG Wei-jianHE Qing. Research on User Value Analysis Method of Enterprise Private Cloud[J]. 电脑与电信, 2021, 1(3): 56-62.
[8] LIN Long. Model-based Robust RecognitionAlgorithm for Deep Learning Communication Signals[J]. 电脑与电信, 2021, 1(1): 20-22.
[9] LIU Feng. Research and Practice of UI Talent Training Model under the Background of Integration of Production and Education[J]. 电脑与电信, 2021, 1(1): 64-66.
[10] SONG Zhi-yang ZHOU-Chen ZHANG Juan CHEN Ying-ying. Design and Implementation of PositionAnalysis System of Recruitment Website[J]. 电脑与电信, 2020, 1(9): 6-10.
[11] GE Shi-xin. Research on Intelligent Media Industry Reform and Management Mode Innovation in 5G Era[J]. 电脑与电信, 2020, 1(9): 53-55.
[12] XIAO Cheng-wei. Design of the G.657 Single-mode Fiber Based on OptiFiber Simulation System[J]. 电脑与电信, 2020, 1(8): 36-40.
[13] YIN Hong-mei. Analysis and Construction of Intelligent Campus Based on IPv6[J]. 电脑与电信, 2020, 1(6): 63-65.
[14] YANG Jing. Innovative and Entrepreneurial Cross-border E-commerce Training Mode for Private Universities in Shaanxi Province[J]. 电脑与电信, 2020, 1(4): 1-4.
[15] ZHENG Shao-lan. Application of Flipping Class in 3DSMAX Modeling Technology Course[J]. 电脑与电信, 2020, 1(4): 27-29.
Copyright © Computer & Telecommunication, All Rights Reserved.
Powered by Beijing Magtech Co. Ltd