In view of the problems of insufficient knowledge accuracy, lack of real-time performance and professional limitations in the application of large language models (LLMs) in professional fields, this study proposes a retrieval-augmented generation (RAG) question-answering system based on the LangChain framework and the Zhangheng-1 satellite electric field knowledge graph. By integrating the reasoning ability of RAG technology and LLMs, and using LangChain's modular components (including LLMs access, prompt word templates and task chain orchestration) and Milvus vector database, dynamic retrieval and generation optimization of professional knowledge are achieved. The experimental data comes from 41 core papers in the field of Zhangheng-1 satellite electric field, covering research directions such as electric field anomaly detection and data processing methods. The test results show that compared with the ordinary Qwen-Plus model, the enhanced version of RAG shows better professionalism and accuracy in scientific parameter description and method applicability analysis. The study confirms that RAG technology can effectively solve the knowledge limitations of LLMs in professional fields and provide a feasible technical solution for building a highly reliable professional knowledge question-answering system, which has important theoretical value and practical significance.
Key words
LangChain /
RAG /
Zhangheng-1 satellite /
question-answering system
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}
References
[1] Huang L,Yu W,Ma W,et al.A survey on hallucination in large language models: Principles,taxonomy, challenges,and open questions[J].ACM Transactions on Information Systems,2025,43(2):1-55.
[2] Peng B,Zhu Y,Liu Y,et al.Graph retrieval-augmented generation: A survey[J].arXiv preprint arXiv:2408. 08921,2024.
[3] Asai A,Wu Z,Wang Y,et al.Self-rag:Learning to retrieve,generate,and critique through self-reflection[C]//The Twelfth International Conference on Learning Representations.2023.
[4] Mei L,Mo S,Yang Z,et al.A survey of multimodal retrieval-augmented generation[J].arXiv preprint arXiv:2504.08748,2025.
[5] Fan W,Ding Y,Ning L,et al.A survey on rag meeting LLMss:Towards retrieval-augmented large language models[C]//Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining.2024:6491-6501.
[6] Zhao P,Zhang H,Yu Q,et al.Retrieval-augmented generation for ai-generated content:A survey[J].arXiv preprint arXiv:2402.19473,2024.
[7] Gao Y,Xiong Y,Gao X,et al.Retrieval-augmented generation for large language models:A survey[J].arXiv preprint arXiv:2312.10997,2023,2(1).
[8] Fan W,Ding Y,Ning L,et al.A survey on rag meeting llms:Towards retrieval-augmented large language models[C]//Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining.2024:6491-6501.
[9] 黄建平,张富志,李忠,等.张衡一号卫星观测的震前电场数据扰动识别研究[J].中国科学:地球科学,2023,53(8):1834-1843.
[10] Zhima Z,Dapeng L,Xiaoying S,et al.Current status and scientific progress of the Zhangheng-1 satellite mission[J].Reviews of Geophysics and Planetary Physics,2023,54(4):455-465.