InstructPLM-mu: 1-Hour Fine-Tuning of ESM2 Beats ESM3 in Protein Mutation Predictions

Xu, Junde; Shi, Yapin; Lang, Lijun; Cui, Taoyong; Zhang, Zhiming; Chen, Guangyong; Qiu, Jiezhong; Heng, Pheng-Ann

定量生物学 > 定量方法

arXiv:2510.03370 (q-bio)

[提交于 2025年10月3日 (v1) ，最后修订 2025年10月10日 (此版本， v2)]

标题： InstructPLM-mu：1小时微调ESM2在蛋白质突变预测中超越ESM3

标题： InstructPLM-mu: 1-Hour Fine-Tuning of ESM2 Beats ESM3 in Protein Mutation Predictions

Authors:Junde Xu, Yapin Shi, Lijun Lang, Taoyong Cui, Zhiming Zhang, Guangyong Chen, Jiezhong Qiu, Pheng-Ann Heng

摘要：多模态蛋白质语言模型在突变效应预测上表现出色，但从头开始训练此类模型需要大量的计算资源。在本文中，我们提出了一种称为 InstructPLM-mu 的微调框架，并尝试回答一个问题： \textit{多模态微调预训练的仅序列蛋白质语言模型能否达到端到端训练模型的性能？} 令人惊讶的是，我们的实验表明，使用结构输入对 ESM2 进行微调可以达到与 ESM3 相当的性能。为了了解这是如何实现的，我们系统地比较了三种不同的特征融合设计和微调方案。我们的结果揭示了融合方法和调优策略都会显著影响最终准确性，这表明微调过程并不简单。我们希望这项工作能为将结构注入预训练蛋白质语言模型提供实际指导，并激发对更好融合机制和微调协议的进一步研究。

摘要： Multimodal protein language models deliver strong performance on mutation-effect prediction, but training such models from scratch demands substantial computational resources. In this paper, we propose a fine-tuning framework called InstructPLM-mu and try to answer a question: \textit{Can multimodal fine-tuning of a pretrained, sequence-only protein language model match the performance of models trained end-to-end? } Surprisingly, our experiments show that fine-tuning ESM2 with structural inputs can reach performance comparable to ESM3. To understand how this is achieved, we systematically compare three different feature-fusion designs and fine-tuning recipes. Our results reveal that both the fusion method and the tuning strategy strongly affect final accuracy, indicating that the fine-tuning process is not trivial. We hope this work offers practical guidance for injecting structure into pretrained protein language models and motivates further research on better fusion mechanisms and fine-tuning protocols.

评论：	预印本
主题：	定量方法 (q-bio.QM) ; 人工智能 (cs.AI); 计算工程、金融与科学 (cs.CE)
引用方式：	arXiv:2510.03370 [q-bio.QM]
	(或者 arXiv:2510.03370v2 [q-bio.QM] 对于此版本)
	https://doi.org/10.48550/arXiv.2510.03370

提交历史

来自： Junde Xu [查看电子邮件]
[v1] 星期五， 2025 年 10 月 3 日 07:42:22 UTC (1,220 KB)
[v2] 星期五， 2025 年 10 月 10 日 02:14:50 UTC (1,220 KB)

定量生物学 > 定量方法

标题： InstructPLM-mu：1小时微调ESM2在蛋白质突变预测中超越ESM3

标题： InstructPLM-mu: 1-Hour Fine-Tuning of ESM2 Beats ESM3 in Protein Mutation Predictions

提交历史

获取论文：

参考文献与引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

定量生物学 > 定量方法

标题： InstructPLM-mu：1小时微调ESM2在蛋白质突变预测中超越ESM3 显示英文标题

标题： InstructPLM-mu: 1-Hour Fine-Tuning of ESM2 Beats ESM3 in Protein Mutation Predictions

提交历史

获取论文：

参考文献与引用

BibTeX 格式的引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

标题： InstructPLM-mu：1小时微调ESM2在蛋白质突变预测中超越ESM3