How to Precisely Update Large Language Models Knowledge While Avoiding Catastrophic Forgetting

Preprint | 
10.55415/deep-2024-0007.v1
This is not the most recent version. There is anewer versionof this content available.
fei ding#*
APUS
APUS

# contributed equally to this work, * Corresponding author


Abstract

Recent advancements in Large Language Models (LLMs) have showcased their remarkable capabilities in text understanding and generation. However, even stronger LLMs are susceptible to acquiring erroneous or obsolete information from the training corpus. Direct secondary fine-tuning with data containing new knowledge may be ineffective in updating knowledge due to the conflict between old and new knowledge. In this paper, we propose a new paradigm for fine-tuning called DFT.This method utilizes parametric arithmetic to precisely pinpoint the location of knowledge and update only the minimal set of relevant parameters . Experimental results on two publicly available datasets demonstrate that our proposed DFT can obviously improve the knowledge updating performance of full fine-tuning , simultaneously outperforming the existing baselines in most cases.

Keywords
Subject Area
Version History
  • 24 Jun 2024 18:18 Version 1
Scores
 0
Rapid Rating Times: 0
· Level of Quality: -
· Level of Repeatability: -
· Level of Innovation: -
· Level of Impact: -

*Each rating ranges from 0-5

Rapid Rating
Your professional field is different from the direction of this article. Go Settings!
  • Level of Quality
    Is the publication of relevance for the academic community and does it provide important insights? Is the language correct and easy to understand for an academic in the field? Are the figures well displayed and captions properly described? Is the article systematically and logically organized?
    0.0
  • Level of Repeatability
    Is the hypothesis clearly formulated? Is the argumentation stringent? Are the data sound, well-controlled and statistically significant? Is the interpretation balanced and supported by the data? Are appropriate and state-of-the-art methods used?
    0.0
  • Level of Innovation
    Does the work represent a novel approach or new findings in comparison with other publications in the field?
    0.0
  • Level of Impact
    Does the work have potential huge impact to the related research area?
    0.0
Submit

我们使用 cookie 将您与其他用户区分开来, 并在我们的网站上为您提供更好的体验。

关闭此消息以接受 cookie 或了解如何管理您的 cookie 设置。

了解更多关于我们的隐私声明..

goTop