Focusing on the prediction of customers' new car model purchase behavior, this paper carries out a comparative analysis on the performance of four classification algorithms: Logistic Regression (LR), K-Nearest Neighbors (KNN), Gaussian Naive Bayes (GaussianNB), and Support Vector Machine (SVM). Based on data related to car consumption, following data preprocessing, hyperparameter tuning is performed for each model. For Logistic Regression, the regularization parameters and penalty terms are optimized. For the KNN algorithm, the number of neighbors is adjusted, and an appropriate distance measurement method is selected. For Gaussian Naive Bayes, the adaptability of the feature independence assumption is verified, and corresponding fine-tuning is carried out. Optimization is performed on key parameters of SVM, including the kernel function and regularization coefficient. After training and testing evaluation, the SVM demonstrates outstanding comprehensive performance after tuning, with an AUC reaching 0.965 and a prediction accuracy of 93%. Both KNN and Gaussian Naive Bayes achieve an AUC of 0.963 and an accuracy of 93%, while Logistic Regression has an AUC of 0.955 and an accuracy of 91%. The research shows that SVM has significant advantages in fitting nonlinear data, is suitable for the car consumption prediction scenario, and provides powerful data support and decision basis for enterprises to develop targeted marketing strategies.
Key words
classification algorithms /
automobile consumption prediction /
performance comparison /
adaptability research
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}
References
[1] StuartRussell,PeterNorvig.人工智能:现代方法(第4版)(上下册).(美)斯图尔特·罗素,彼得·诺维格著,张博雅,等译[M].北京:人民邮电出版社.2004.
[2] 张轶. 基于改进逻辑回归模型的数据预测与校验技术[J].信息技术,2025(5):57-61,68.
[3] 吕宁. 融合XGBoost和逻辑回归算法的电信客户流失预测模型[J].现代电子技术,2025,48(11):136-143.
[4] 田文娜. 基于KNN算法的电力计量异常数据检测模型优化研究[J].自动化应用,2025,66(10):165-170.
[5] 王海燕,焦增晨,赵剑,等.基于Bayes超参数优化梯度提升树的心脏病预测方法[J].吉林大学学院(理学版),2025,63(2):472-478.
[6] 卫苗苗. 基于支持向量机方法的气候变化影响下台风生成预测建模[J].东南大学学报(自然科学版) ,2025:1-10.
[7] 高海宾. 一种融合贝叶斯优化的 K 最近邻分类算法[J].绵阳师范学院学报(自然科学版),2025,44(5):79-87.