Page 188 - AI for Good Innovate for Impact
P. 188

AI for Good Innovate for Impact



                      (continued)

                                Item                                    Details
                             Category         Climate Change/Natural Disaster
                       Model Training and  The GeoGPT models are built upon a diversified portfolio, includ-
                       Fine-Tuning            ing Llama (USA), Qwen (China), Mixtral (Europe), and DeepSeek
                                              R1 (China), offering users the flexibility to choose the model best
                                              suited to their region and needs. These models undergo a series of
                                              post-training processes, which include Continual Pre-training (CPT),
                                              Supervised Fine-Tuning (SFT), and Direct Preference Optimization
                                              (DPO). [3][4][5][6] The training process is carried out in three stages:
                                              •  Continual Pre-training (CPT): This stage utilizes a diverse set of
                                                 geoscience-related corpora to obtain a solid specialized model
                                                 for geoscience.
                                              •  Supervised Fine-tuning (SFT): This stage enhances the model’s
                                                 ability to follow geoscience-specific instructions by incorporating
                                                 QA pairs labelled by geoscientists, along with those generated
                                                 from the training corpus in CPT stage.

                                              Human Preference Alignment: This stage uses the Direct Preference
                                              Optimization (DPO) with preference data labelled by large language
                                              models to align model's responses with human expectations and
                                              preferences.

                       Testbeds or Pilot Deploy- GeoGPT platform: https:// geogpt .zero2x .org [8]
                       ments                  The open platform for GeoGPT, offering intelligent services such as
                                              literature analysis, data extraction, map chat, domain-specific Q&A,
                                              etc.


                      2      Use Case Description


                      2�1     Description

                      GeoGPT is an AI system powered by large language models (LLMs) with retrieval-augmented
                      generation  (RAG) technology,  applicable  to the  monitoring and  protection  of marine
                      ecosystems. The GeoGPT models are built upon a diversified portfolio, including Llama (USA),
                      Qwen (China), Mixtral (Europe), and DeepSeek R1 (China), offering users the flexibility to
                      choose the model best suited to their region and needs. [3][4][5][6]
                      One of its key applications is in deep-sea mineral exploration. By incorporating mineral
                      prospecting prediction models and risk assessment models, GeoGPT helps define target
                      exploration areas, assess the economic viability of extracting mineral resources, and provide
                      impact  analyse  to  support  exploration decision-making.  This  approach  helps  minimize
                      unnecessary mining activities, thereby reducing ecological damage. The system currently
                      achieves an accuracy rate of over 60%.

                      Another major application of GeoGPT is in marine biological disaster monitoring. The system
                      incorporates microalgae species classification algorithms to accurately identify harmful algae,
                      analyze algae distribution patterns, detect red tide signals, and predict the timing, location,
                      and scale of red tide outbreaks, significantly improving monitoring and early warning efficiency.
                      Furthermore, GeoGPT evaluates the impact of global warming on microalgae distribution,
                      shedding light on how climate change influences the behavior, distribution, and interactions
                      of algae in ecosystems. This capability provides critical scientific support for aquatic ecosystem



                  152
   183   184   185   186   187   188   189   190   191   192   193