Geoprospect: A Domain-Specific Geological Large Language Model with Enhanced Continual Learning

14 Pages Posted: 6 May 2025

See all articles by Gang Wu

Gang Wu

National Supercomputing Center in Zhengzhou

Haitao Wang

Zhengzhou University

Kunyan Zhang

Zhejiang University

Shengguang Zhu

Zhengzhou University

Jian Cui

Henan Institute of Geological Survey

Mengzhe Fan

National Supercomputing Center in Zhengzhou

Hailiang Wang

Zhengzhou University

Hengliang Guo

National Supercomputing Center in Zhengzhou

Gubin Zhang

Henan Institute of Geological Survey

Dujuan Zhang

National Supercomputing Center in Zhengzhou

Haitao Wei

Zhengzhou University

Shan Zhao

Zhengzhou University

Abstract

Developing domain-specific large language models (LLMs) in highly specialized fields such as geology presents three challenges: the discrepancy between general LLMs’ pretrained knowledge and the specialized knowledge structures of geology, the lack of domain-specific corpora, and the risk of catastrophic forgetting during continual learning. To tackle these challenges, we introduce GeoProspect, the first LLM designed specifically for the geological domain. Our methodology includes the creation of a high-quality geological corpus and the adaptation of the general-purpose Qwen2-7B model through continued pretraining and supervised instruction fine-tuning. We present two variants, GeoProspect-LoRA, a model fine-tuned using low-rank adaption (LoRA) as an initial phase, and GeoProspect, which alleviates catastrophic forgetting during continual learning via a novel task-driven grouped mixture of LoRA experts (TG-MoLE) architecture. TG-MoLE integrates a dual-layer grouped routing strategy and a task-driven cross-group routing adjustment mechanism to optimize weight distribution and expert collaboration. To assess model performance, we develop GeoEval, the first benchmark tailored to geological LLMs. Experimental results demonstrate that our model significantly improves performance in geological knowledge understanding and mineral exploration question answering. Moreover, GeoProspect, equipped with TG-MoLE, maintains general task performance while achieving superior results in geology-specific applications.

Keywords: Domain-Specific LLMs, Geology, Continual Learning, Mixture of Experts

Suggested Citation

Wu, Gang and Wang, Haitao and Zhang, Kunyan and Zhu, Shengguang and Cui, Jian and Fan, Mengzhe and Wang, Hailiang and Guo, Hengliang and Zhang, Gubin and Zhang, Dujuan and Wei, Haitao and Zhao, Shan, Geoprospect: A Domain-Specific Geological Large Language Model with Enhanced Continual Learning. Available at SSRN: https://ssrn.com/abstract=5242692 or http://dx.doi.org/10.2139/ssrn.5242692

Gang Wu

National Supercomputing Center in Zhengzhou ( email )

China

Haitao Wang

Zhengzhou University ( email )

100 Science Avenue
Zhengzhou, CO 450001
China

Kunyan Zhang

Zhejiang University ( email )

38 Zheda Road
Hangzhou, 310058
China

Shengguang Zhu

Zhengzhou University ( email )

100 Science Avenue
Zhengzhou, CO 450001
China

Jian Cui

Henan Institute of Geological Survey ( email )

Mengzhe Fan

National Supercomputing Center in Zhengzhou ( email )

China

Hailiang Wang

Zhengzhou University ( email )

100 Science Avenue
Zhengzhou, CO 450001
China

Hengliang Guo

National Supercomputing Center in Zhengzhou ( email )

Gubin Zhang

Henan Institute of Geological Survey ( email )

China

Dujuan Zhang

National Supercomputing Center in Zhengzhou ( email )

China

Haitao Wei

Zhengzhou University ( email )

Shan Zhao (Contact Author)

Zhengzhou University ( email )

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
20
Abstract Views
43
PlumX Metrics