Computer & Telecommunication
K-Means Parallelization Algorithm Based on Canopy
Abstract  Traditional data mining methods do not scale to the massive volumes of information produced by big data. In recent years, many scholars have proposed new data mining methods or improved traditional ones, but these are still far from adequate for data at this scale. After summarizing previous methods, this paper proposes an improved K-Means algorithm based on Canopy. Compared with traditional K-Means, the improved method first determines the initial centers by density and then runs on the reduced data on a Hadoop distributed cluster. The experimental results show that the method reduces computational complexity and improves computational efficiency while maintaining accuracy.
Key words: data mining, Canopy, deserialize, Hadoop
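
The abstract does not spell out the procedure, so the following is only a minimal single-machine sketch of the general idea of seeding K-Means from a Canopy pass. It assumes the standard Canopy formulation with a loose threshold `t1` and a tight threshold `t2` (`t1 > t2`) and Euclidean distance, not the density-based variant the paper proposes, and it leaves out the Hadoop/MapReduce stage entirely. The function names, threshold values, and sample points are hypothetical.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length tuples."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def mean(points):
    """Component-wise mean of a non-empty list of tuples."""
    return tuple(sum(col) / len(points) for col in zip(*points))


def canopy_centers(points, t1, t2):
    """Standard Canopy pass (an assumption, not the paper's density variant).

    Each canopy is built around the first remaining point. Its center is
    the mean of all points within the loose threshold t1; points within
    the tight threshold t2 are dropped from the candidate list.
    """
    if t1 <= t2:
        raise ValueError("t1 must be larger than t2")
    remaining = list(points)
    centers = []
    while remaining:
        seed = remaining[0]
        members = [p for p in points if euclidean(p, seed) <= t1]
        centers.append(mean(members))
        remaining = [p for p in remaining if euclidean(p, seed) > t2]
    return centers


def kmeans(points, centers, max_iter=100):
    """Plain Lloyd iteration starting from the supplied centers."""
    centers = [tuple(c) for c in centers]
    clusters = [[] for _ in centers]
    for _ in range(max_iter):
        clusters = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)),
                          key=lambda i: euclidean(p, centers[i]))
            clusters[nearest].append(p)
        # Keep the old center if a cluster ends up empty.
        new_centers = [mean(c) if c else centers[i]
                       for i, c in enumerate(clusters)]
        if new_centers == centers:
            break
        centers = new_centers
    return centers, clusters


# Hypothetical sample data: two well-separated groups of four points.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0),
          (10.0, 10.0), (10.0, 11.0), (11.0, 10.0), (11.0, 11.0)]

initial = canopy_centers(points, t1=3.0, t2=2.0)
final, clusters = kmeans(points, initial)
```

In this sketch the Canopy pass fixes both the number of clusters and their starting positions, so K-Means needs no user-supplied k and no random initialization. In a distributed setting the assignment step is the part that would typically be spread across workers, but how the paper maps it onto Hadoop is not stated in the abstract.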
Published: 13 July 2019

Cite this article:

K-Means Parallelization Algorithm Based on Canopy. Computer & Telecommunication, 2019, 1(7): 30-.

URL:

http://www.computertelecom.com.cn/EN/     OR     http://www.computertelecom.com.cn/EN/Y2019/V1/I7/30

Copyright © Computer & Telecommunication, All Rights Reserved.