arXivDaily arXiv每日学术速递 周一至周五更新
重置
2606.13620 2026-06-12 q-bio.QM 新提交

Balancing label resolution and computational cost in dynamical models of lipid metabolism

脂质代谢动力学模型中标签分辨率与计算成本的平衡

Paul Jonas Jost, Christoph Thiele, Jan Hasenauer

AI总结 研究多标签脂质代谢实验中模型标签数量对参数估计、轨迹恢复和计算成本的影响,发现使用三个标签可在实验可行性、推理能力和计算效率间取得平衡。

Comments 3 Supplementary Files

详情
AI中文摘要

脂质代谢是一个核心生物学过程,通常使用破坏性质谱实验进行研究。最近提出的一种策略利用多个标签从单次破坏性测量中提取脂质代谢的时间信息。然而,基于模型的数据分析的计算复杂度随着标签数量迅速增加,在测量信息内容和分析成本之间产生基本权衡。在这里,我们研究了建模标签数量如何影响参数估计准确性、轨迹恢复和计算成本,以及建模少于实验可用标签是否可以缓解这种权衡。使用五标签实验的合成数据,我们发现建模五个标签中的三个在实验可行性、推理能力和计算可处理性之间提供了实用的平衡。在肝细胞甘油三酯循环的应用中,我们进一步表明,最具成本效益的单标签模型可能对未观测物种产生生物学上不可信的预测,而解析更多标签的模型更好地约束了这些潜在动力学。这些结果为多标签实验中选择模型分辨率提供了实用指导,并为平衡推理能力与计算成本建立了定量基础。

英文摘要

Lipid metabolism is a central biological process that is commonly studied using destructive mass-spectrometry experiments. A recently proposed strategy, uses multiple labels to extract temporal information about lipid metabolism from a single destructive measurement. However, the computational complexity of the model-based data analysis increases rapidly with the number of labels, creating a fundamental trade-off between the information content of the measurements and the cost of analysis. Here, we examine how the number of modelled labels affects parameter estimation accuracy, trajectory recovery, and computational cost, and whether modelling fewer labels than are experimentally available can mitigate this trade-off. Using synthetic data from a five-label experiment, we find that modelling three of the five labels provides a practical balance between experimental feasibility, inferential power, and computational tractability. In an application to hepatocyte triglyceride cycling, we further show that the most cost-efficient, single-label model can yield biologically implausible predictions for unobserved species, whereas models that resolve more labels better constrain these latent dynamics. These results provide practical guidance for selecting model resolution in multi-label experiments and establish a quantitative basis for balancing inferential power against computational cost.

2606.13475 2026-06-12 q-bio.QM q-bio.PE 新提交

A likelihood-based framework for simultaneously learning both noise and growth dynamics using biologically-informed neural networks

基于似然的框架:利用生物信息神经网络同时学习噪声和生长动力学

Rebecca M. Crossley, Ruth E. Baker

AI总结 提出一种扩展的生物信息神经网络框架,通过可学习的噪声模型从数据中联合发现噪声结构和生长动力学,提高了预测准确性。

Comments 28 pages (including one page SI), 6 figures (one in SI)

详情
AI中文摘要

近年来,神经常微分方程框架如生物信息神经网络(BINNs)在从稀疏数据中学习机械定律方面显示出潜力。然而,大多数现有方法隐含地假设同方差高斯噪声,因此未考虑生物变异性中潜在有意义的结构。在此,我们提出了对现有BINNs框架的扩展,包括一个可学习的噪声模型,允许直接从数据中发现噪声模型。以种群增长为例,我们证明了该框架能够准确恢复底层噪声结构,并相比现有方法改进了对底层生长定律的预测。因此,这项工作建立了一个通用的基于似然的框架,用于在机械神经网络方法中联合学习动力学和异方差噪声。

英文摘要

In recent years, neural ordinary differential equation frameworks such as Biologically-Informed Neural Networks (BINNs) have shown promise for learning mechanistic laws from sparse data. However, most existing approaches implicitly assume homoscedastic Gaussian noise, and therefore do not account for potentially meaningful structure in biological variability. Here, we present an extension to the existing BINNs framework that includes a learnable noise model, allowing discovery of the noise model directly from data. Using population growth as an example, we demonstrate that the framework accurately recovers the underlying noise structure and improves predictions of the underlying growth laws compared to existing approaches. As such, this work establishes a general likelihood-based framework for jointly learning dynamics and heteroscedastic noise within mechanistic neural network approaches.

2606.13463 2026-06-12 q-bio.OT 新提交

Begging with a Purpose? Testing Behavioural Hallmarks of First-Order Intentionality in Free-ranging Hanuman Langurs

乞讨有目的?自由活动哈努曼叶猴一阶意向性行为标志的测试

Dishari Dasgupta, Shriparna Chattopadhyay, Sruti Banerjee, Pratyusha Adhikary, Akash Dutta, Manabi Paul, Anindita Bhadra

AI总结 通过实验测试自由活动哈努曼叶猴向人类乞食时的行为标志,发现其展示了一阶意向性的多个特征,扩展了意向性研究在非猿灵长类中的分布。

详情
AI中文摘要

意向性交流在灵长类中已被广泛研究,但来自自由活动的非猿类物种的证据仍然有限。最近描述了哈努曼叶猴(Semnopithecus entellus)向人类乞食的姿势,但这些行为是否表现出与一阶意向性相关的行为标志尚不清楚。在这里,我们通过实验调查了印度南西孟加拉邦六个地点自由活动哈努曼叶猴中这些标志的存在。我们进行了360次实验和对照试验,并量化了常用于操作化意向性交流的行为标记。实验试验引发了观众检查、接收者定向、快速接近反应、乞食姿势和姿势灵活性,而这些行为在对照试验中很少或没有出现。实验和对照条件之间的差异在所有六个研究地点均显著。信号也在获得食物后停止,符合与“明显满意结果”相关的停止规则。我们的研究结果表明,自由活动哈努曼叶猴向人类定向的姿势交流中存在与一阶意向性相关的多个行为标志。这些结果将意向性研究扩展到猿类之外,并为意向性相关特征在灵长类中的进化分布提供了新见解。

英文摘要

Intentional communication has been studied extensively in primates, yet evidence from free-ranging non-ape species remains limited. Human-directed food-solicitation gestures in Hanuman langurs (Semnopithecus entellus) have recently been described, but whether these behaviours exhibit behavioural hallmarks associated with first-order intentionality remains unknown. Here, we experimentally investigated the presence of these hallmarks in free-ranging Hanuman langurs across six anthropogenic sites in southern West Bengal, India. We conducted 360 experimental and control trials and quantified behavioural markers commonly used to operationalize intentional communication. Experimental trials elicited audience checking, recipient-directed orientation, rapid approach responses, food-solicitation gestures and gestural flexibility, whereas these behaviours were rare or absent in control trials. Differences between experimental and control conditions were significant across all six study sites. Signalling also ceased following food acquisition, consistent with the stopping rule associated with an Apparently Satisfactory Outcome. Our findings demonstrate the presence of multiple behavioural hallmarks linked to first-order intentionality in the human-directed gestural communication of free-ranging Hanuman langurs. These results extend the study of intentionality beyond apes and provide new insights into the evolutionary distribution of intentionality-related traits across primates.

2606.13132 2026-06-12 q-bio.NC 新提交

Including the Cost of Irreducible Uncertainty in the Policy Compression Framework

将不可约不确定性的成本纳入策略压缩框架

Álvaro Garrido-Pérez, Pieter Simoens, Amrapali Pednekar, Yara Khaluf

AI总结 本文扩展策略压缩框架,通过引入条件熵加权项来建模不可约不确定性的认知成本,使最优策略精度可独立于奖励敏感性变化,更准确解释人类决策偏差。

Comments Accepted at the 5th International Conference on Hybrid Human-Artificial Intelligence, 2026

详情
AI中文摘要

AI决策支持系统可以从预测人类决策中的偏差中受益。许多此类偏差可能源于人类认知限制。策略压缩框架将决策建模为奖励最大化与编码状态依赖行动策略的认知成本之间的权衡,该成本形式化为状态与行动之间的互信息(策略复杂度)。我们认为这一描述是不完整的,因为它将条件熵——给定状态下应选择哪个行动的不可约不确定性——视为无成本,尽管经验证据表明它调节反应时间。因此,我们通过将认知成本定义为策略复杂度与加权条件熵项(由新参数$\eta$控制)之和来扩展该框架。由此产生的最优策略保留标准指数形式,但随着$\eta$增加而变得更尖锐,使得策略精度可以更独立于奖励敏感性变化。这一修改意味着标准策略压缩框架可能低估行动选择的认知成本,并有可能更好地解释人类决策中的偏差。同时,它给将模型拟合到人类数据带来了额外的复杂性,这需要未来工作来解决。

英文摘要

AI decision-support systems can benefit from anticipating biases in human decision-making. Many such biases may arise from human cognitive limitations. The policy compression framework models decision-making as a trade-off between reward maximization and the cognitive cost of encoding state-dependent action policies, formalized as the mutual information between states and actions (policy complexity). We argue that this account is incomplete because it treats conditional entropy--the irreducible uncertainty about which action should be selected given a state--as costless, even though empirical evidence suggests that it modulates reaction times. We therefore extend the framework by defining cognitive cost as the sum of policy complexity and a weighted conditional-entropy term, governed by a new parameter, $η$. The resulting optimal policy retains the standard exponential form but becomes sharper as $η$ increases, allowing policy precision to vary more independently of reward sensitivity. This modification implies that the standard policy compression framework may underestimate the cognitive cost of action selection, and it has the potential to better account for biases in human decision-making. At the same time, it introduces additional complexity for fitting the model to human data, which future work will need to address.

2606.13047 2026-06-12 q-bio.BM q-bio.CB 新提交

Irregular curvature at focal adhesions modulates Piezo1 activity and low frequency ultrasound induced apoptosis in cancer cells

黏着斑处的不规则曲率调控Piezo1活性及低频超声诱导的癌细胞凋亡

Ivana Pajic-Lijakovic, Milan Milivojevic, Boris Martinac, Peter V. E. McClintock

AI总结 本文提出理论框架,解释癌细胞与健康细胞对低频超声的不同响应:癌细胞不规则黏着斑曲率导致Piezo1通道松散排列保持活性,而健康细胞规则曲率促使胆固醇重排降低Piezo1活性,从而揭示超声选择性杀伤癌细胞的物理机制。

Comments 38 pages, 4 figures

详情
Journal ref
Physics of Life Reviews, June 2026
AI中文摘要

低频低强度超声(LIUS)已成为一种有前景的物理方式,能够诱导癌细胞选择性凋亡,同时保留健康上皮细胞和成纤维细胞。迄今为止,这种选择性的机制尚不清楚,但我们现提出并发展了一个理论框架,将癌细胞与健康细胞的不同力学行为与其对LIUS的差异性响应联系起来。我们指出,癌细胞表现出不均匀的腹侧应力纤维网络,这可能在低强度超声(LIUS)下产生不规则的黏着斑几何形状和黏着斑附近的内向膜曲率。这些曲率不规则性有利于Piezo1通道的松散堆积,从而保持其活性。相反,健康上皮细胞和成纤维细胞表现出更均匀的细胞骨架组织,这可能导致黏着斑附近更规则的曲率轮廓。这导致曲率驱动的胆固醇重新分布,从而改变Piezo1簇的空间组织并降低协调通道活性,使细胞在暴露于LIUS时保持其活跃增殖状态。基于理论建模和先前的实验发现,我们提出细胞骨架组织和膜曲率的差异可能导致健康细胞与癌细胞之间不同的Piezo1激活模式。我们的分析将曲率介导的Piezo1重新分布确定为LIUS选择性的潜在物理基础,并为设计基于超声的疗法以利用癌细胞固有的细胞骨架脆弱性提供了机制基础。

英文摘要

Low-frequency, low intensity ultrasound (LIUS) has emerged as a promising physical modality capable of inducing selective apoptosis of cancer cells, while sparing healthy epithelial cells and fibroblasts. Hitherto, the mechanism underlying this selectivity has been unclear, but we now propose and develop a theoretical framework linking the distinct mechanical behaviours of cancer versus healthy cells to their differential responses to LIUS. We point out that cancer cells exhibit inhomogeneous ventral stress-fiber networks, which can produce irregular focal adhesion geometry and inward membrane curvature near focal adhesions under low-intensity ultrasound (LIUS). These curvature irregularities can favor loose packing of Piezo1 channels, thereby preserving their activity. In contrast, healthy epithelial cells and fibroblasts display more homogeneous cytoskeletal organization, which can result in more regular curvature profiles adjacent to focal adhesions. This leads to curvature-driven cholesterol redistribution, resulting in altered spatial organization of Piezo1 clusters and reduced coordinated channel activity and allowing cells to remain in their active, proliferative state when exposed to LIUS. Based on theoretical modeling and previous experimental findings, we propose that differences in cytoskeletal organization and membrane curvature can contribute to distinct Piezo1 activation patterns between healthy and cancerous cells. Our analysis identifies curvature-mediated Piezo1 redistribution as a potential physical basis for LIUS selectivity and provides a mechanistic foundation for designing ultrasound-based therapies to exploit the intrinsic cytoskeletal vulnerabilities of cancer cells.

2606.12772 2026-06-12 q-bio.QM 新提交

EasyNano: rapid epitope-targeted nanobody CDR design via differentiable distogram optimization with ESMFold2

EasyNano: 通过可微距离图优化与ESMFold2实现快速表位靶向纳米抗体CDR设计

Yue Hu, Wanyu Cheng, Junqing Wang, Yingchao Liu

AI总结 提出EasyNano流程,利用ESMFold2可微距离图优化,在10-20分钟内快速设计靶向特定表位的纳米抗体CDR,显著提升ipTM指标。

详情
AI中文摘要

计算设计结合用户指定蛋白表位的纳米抗体可能变革治疗开发,但当前方法要么依赖随机采样需要数天GPU计算,要么采用无法直接靶向表位的逆折叠方法。这里我们提出EasyNano,一个实用的流程,用于快速、表位靶向的纳米抗体互补决定区(CDR)设计,在高端个人工作站上约10-20分钟完成。EasyNano通过ESMFold2成对距离距离图进行梯度下降优化CDR残基logits,使用轻量级ESMFold2-Fast模型(721M)作为可微预言机,由包含专用表位邻近项的复合损失引导。完整的ESMFold2(1.3B)CA坐标结构先验防止框架位姿漂移。野生型logit初始化偏差作为控制CDR突变性的关键实际参数出现。在涵盖自我恢复和从头设计场景的六个靶标-框架对中,EasyNano将ipTM提升高达+0.559——从0.143到0.702(Ty1/RBD)——并在手动对接的AQP4靶向框架上实现4.6倍改进(ipTM从0.117到0.538),同时保持已有强结合剂的ipTM。随机CDR基线(每个靶标n=30)确认统计显著性(Ty1高于随机均值5.7 sigma)。多种子分析揭示多样的局部最小值,强调了重复运行的重要性。针对晶体结构的Kabsch交叉验证确认设计的CDR保留了框架位姿盆地。EasyNano证明基于ESMFold2的可微优化为纳米抗体CDR设计提供了一种快速、实用且表位特异的方法。

英文摘要

Computational design of nanobodies that bind user-specified protein epitopes could transform therapeutic development, but current methods either rely on stochastic sampling requiring days of GPU computation or inverse folding approaches unable to target epitopes directly. Here we present EasyNano, a practical pipeline for rapid, epitope-targeted nanobody complementarity-determining region (CDR) design that operates in approximately 10-20 minutes on a high-end personal workstation. EasyNano optimizes CDR residue logits via gradient descent through the ESMFold2 pairwise distance distogram, using the lightweight ESMFold2-Fast model (721M) as a differentiable oracle guided by a composite loss including a dedicated epitope proximity term. A full ESMFold2 (1.3B) CA-coordinate structure prior prevents framework pose drift. The wild-type logit initialization bias emerges as a critical practical parameter controlling CDR mutability. Across six target-framework pairs spanning self-recovery and de novo design scenarios, EasyNano improves ipTM by up to +0.559 -- from 0.143 to 0.702 (Ty1/RBD) -- and achieves a 4.6-fold improvement (ipTM 0.117 to 0.538) on a manually docked AQP4-targeting framework, while preserving ipTM on already-strong binders. Random CDR baselines (n=30 per target) confirm statistical significance (5.7 sigma above random mean for Ty1). Multi-seed analysis reveals diverse local minima, underscoring the importance of replicate runs. Kabsch cross-validation against crystal structures confirms that designed CDRs preserve the framework pose basin. EasyNano demonstrates that ESMFold2-based differentiable optimization provides a fast, practical, and epitope-specific approach to nanobody CDR design.

2606.12712 2026-06-12 q-bio.MN 新提交

Predictions for and lack of maximal information transmission in the neuromuscular junction

神经肌肉接头中最大信息传输的预测与缺失

Eitan Goldfein, Sarah Marzen

AI总结 通过信息最大化分析,比较理论预测与果蝇神经肌肉接头的实验分布,发现果蝇NMJ并未通过调节突触囊泡释放概率分布来最大化神经到肌肉的信息传输。

Comments 12 pages, 7 figures

详情
AI中文摘要

理论生物学的一个关键问题是生物系统在物理和功能约束下如何有效地保留其输入信息。我们在神经肌肉接头(NMJ)中通过研究胆碱能和谷氨酸能NMJ中神经递质浓度如何转化为电流来探讨这个问题。基于对剂量-反应关系的生物学理解,使用信息最大化分析推导出神经递质浓度的理论分布。将这些理论分布与从果蝇NMJ获得的实验分布进行比较。理论和实验分布显示出非常低的一致性,表明果蝇NMJ并未通过塑造其突触囊泡释放概率分布来最大化从神经系统到肌肉的信息传输。提供了胆碱能系统的预测。

英文摘要

A key question in theoretical biology is how effectively biological systems preserve information about their inputs while operating under physical and functional constraints. We examine that question at the neuromuscular junction (NMJ) by studying how neurotransmitter concentration is transformed into current at both cholinergic and glutamatergic NMJs. An information maximization analysis was used to derive a theoretical distribution over neurotransmitter concentrations based on biological understandings of dose-response relationships. These theoretical distributions were compared to an experimentally derived distribution obtained from a Drosophila NMJ. The theoretical and experimental distributions showed very little agreement, indicating that the Drosophila NMJ does not shape its distribution of synaptic vesicle release probabilities in order to maximize information transmission from nervous system to muscle. Predictions for cholinergic systems are provided.

2606.12597 2026-06-12 q-bio.QM q-bio.PE 新提交

A structural causal framework for interventions on evolutionary accumulation models

进化累积模型干预的结构因果框架

Ramon Diaz-Uriarte, Íñigo Ríos-Arroyo, Iain G. Johnston

AI总结 提出一个基于Pearl do算子的结构因果框架,用于从进化累积模型中提取干预预测,并区分杀死和失活两种干预类型,以排序候选干预目标。

详情
AI中文摘要

进化累积模型(EvAMs),也称为癌症进展模型(CPMs),从横截面数据推断肿瘤进展过程中突变累积顺序的依赖关系。已有研究表明EvAMs可用于识别治疗靶点,但文献中缺乏如何从这些模型中提取干预下预测的流程。简单的条件化于突变缺失的方法会给出错误预测。我们通过使用Pearl的do算子和条件干预,形式化“干预”对所有当前可用的EvAM方法(OT, OncoBN, CBN, H-ESBCN, MHN, HyperHMM, HyperTraPS)的含义,填补了这一空白。对于每个模型,我们展示了如何实施干预(在大多数情况下作为特定的参数修改),识别等效的实施程序,并分析模块化假设——干预需要定义良好——是否合理。借助将适应性作为显式变量的个体级因果DAG,我们区分了标准EvAM表示中混淆的两种干预类型(杀死和失活)。由于目标是优先考虑干预候选,我们将问题重新定义为排序问题:我们定义了三个干预目标,并提供了一个评估EvAMs对目标排序效果的协议。我们的框架不特定于癌症或EvAMs;它适用于任何可将拟合的计算模型解释为结构因果模型的情况。代码可从该网址获取。

英文摘要

Evolutionary accumulation models (EvAMs), also known as cancer progression models (CPMs), infer dependencies in the order of accumulation of mutations during tumor progression from cross-sectional data. It has been suggested that EvAMs could be used to identify therapeutic targets, but there is no procedure in the literature for how to extract predictions under intervention from these models. A simple approach of conditioning on the absence of a mutation gives incorrect predictions. We address this gap by formalizing what ``intervene'' means for all currently available EvAM methods (OT, OncoBN, CBN, H-ESBCN, MHN, HyperHMM, HyperTraPS), using Pearl's do operator and conditional interventions. For each model, we show how to implement the intervention (in most cases as specific parameter modifications), identify equivalent implementation procedures, and analyze whether the modularity assumption -- required for the intervention to be well-defined -- is justified. Drawing on individual-level causal DAGs that make fitness an explicit variable, we distinguish two types of intervention (killing and inactivating) that are conflated in standard EvAM representations. Since the goal is to prioritize intervention candidates, we recast the problem as one of ranking: we define three intervention objectives and provide a protocol for evaluating how well EvAMs rank targets. Our framework is not specific to cancer or EvAMs; it applies wherever fitted computational models can be interpreted as structural causal models. Code available from https://github.com/rdiaz02/scm-interv-evams.

2606.12449 2026-06-12 q-bio.NC 新提交

A quantum-like benchmark for context-sensitive associative memory with adaptive plasticity

具有自适应可塑性的上下文敏感联想记忆的类量子基准

Yashine H. Goolam Hossen, Lea Gassab, Travis J. A. Craddock

AI总结 提出一种顺序敏感的自适应可塑性基准,用于测试类量子联想记忆模型在弱支持条件下的真实回忆能力,发现自适应可塑性(尤其是稳态稳定化)是主要贡献因素,且类量子模型在顺序敏感性和阶段组织上更一致。

详情
AI中文摘要

学习与记忆需要在可塑性与稳定性之间取得平衡:突触连接必须编码新信息,同时不崩溃、饱和或擦除先前有用的结构。当固定背景连接已经承载了部分任务时,联想记忆模型可能看似成功学习,这使得难以区分真正的回忆动态与结构辅助。我们使用一个顺序敏感的自适应可塑性基准来测试这一问题,该基准用于阶段性联想回忆。基准将类量子联想记忆模型与匹配的实值无相位和马尔可夫率控制模型在相同任务计划、扰动轮廓、弱支持条件和可塑性设置下进行比较。这里,“类量子”指的是建模形式,而非关于量子计算的生物学主张。我们首先筛选弱结构支持,然后固定一个保守的操作点,用于跨模型家族和可塑性机制的因子比较。有用的弱支持区间狭窄且非单调。在无可塑性消融中,弱结构单独无法挽救回忆,而大多数有用的回忆增益来自自适应可塑性,尤其是稳态稳定化。马尔可夫率控制通常实现更强的原始回忆,但类量子模型更一致地保持顺序敏感性和阶段依赖的组织。这些结果不支持普遍的类量子优势。相反,它们表明模型类别通过结合回忆、时间组织和上下文敏感性的多目标轮廓比通过任何单一回忆得分更能被区分。因此,该基准为在弱支持、受调控可塑性和匹配经典比较下研究上下文敏感记忆动态提供了一个受控框架。

英文摘要

Learning and memory require a balance between plasticity and stability: synaptic connections must encode new information without collapsing, saturating, or erasing previously useful structure. Associative-memory models can appear to learn successfully when fixed background connectivity already carries part of the task, making it difficult to distinguish genuine recall dynamics from structural assistance. We test this issue using an order-sensitive adaptive-plasticity benchmark for staged associative recall. The benchmark compares a quantum-like associative-memory model with matched real-valued no-phase and Markov-rate controls under the same task schedule, perturbation profiles, weak-support conditions, and plasticity settings. Here, "quantum-like" refers to the modeling formalism, not to a biological claim about quantum computation. We first screen weak structural support and then fix a conservative operating point for factorial comparisons across model families and plasticity mechanisms. The useful weak-support regime is narrow and non-monotonic. Weak structure alone does not rescue recall in the no-plasticity ablation, whereas most useful recall gains arise from adaptive plasticity, especially homeostatic stabilization. The Markov-rate control often achieves stronger raw recall, but the quantum-like model more consistently preserves order sensitivity and stage-dependent organization. These results do not support a universal quantum-like advantage. Instead, they show that model classes are better distinguished by a multi-objective profile combining recall, temporal organization, and context sensitivity than by any single recall score. The benchmark therefore provides a controlled framework for studying context-sensitive memory dynamics under weak support, regulated plasticity, and matched classical comparison.

2606.13017 2026-06-12 q-bio.NC cs.LG 新提交

Deep Sleep Classification via EEG Signal Criticality: A Passive BCI Approach for Sleep-Improvement Neurofeedback

基于EEG信号临界性的深度睡眠分类:一种用于改善睡眠神经反馈的被动BCI方法

Stanisław Narębski, Tomasz Komendziński, Tomasz M. Rutkowski

AI总结 本研究利用去趋势波动分析(DFA)提取的临界性特征,通过朴素贝叶斯分类器实现了对深度睡眠(N3)的高精度识别(平衡准确率87.17%),为被动脑机接口中的状态依赖神经反馈提供了高效感知机制。

Comments 7 pages, 3 figures, accepted for publication in the Proceedings of the 10th Graz Brain-Computer Interface Conference 2026, Graz, Austria, September 14-17, 2026

详情
AI中文摘要

自动睡眠分期是被动脑-机接口(pBCI)的一项基础应用,它解码自发神经状态以实现独立于用户意图的闭环干预。本研究评估了从去趋势波动分析(DFA)中提取的临界性特征,用于特定识别深度睡眠(N3)。我们分析了来自290名老年女性的347,232个EEG时段,使用UMAP流形学习可视化状态转换。随后,通过10折交叉验证对六个分类器进行基准测试,使用平衡准确率确定此http URL的最佳“状态感知”引擎。朴素贝叶斯达到了最高的平均平衡准确率(87.17% ± 0.24%),显著优于全连接深度神经网络(FNN:81.58%)和随机森林(80.97%)。线性模型(LDA:57.21%;SVM:51.01%)表现不佳,表明DFA衍生的临界性特征位于一个独特的非线性流形上。EEG临界性的概率解码为pBCI提供了一种高精度的感知机制。这种稳健的分类流程支持开发状态依赖的神经反馈,例如靶向听觉刺激,以增强认知恢复。

英文摘要

Automated sleep staging is a fundamental application of passive Brain-Computer Interfaces (pBCI), decoding spontaneous neural states to enable closed-loop interventions independent of user intent. This study evaluates criticality features derived from Detrended Fluctuation Analysis (DFA) for the specific identification of deep sleep (N3). We analyzed $347,232$ EEG epochs from $290$ older women using UMAP manifold learning to visualize state transitions. Subsequently, six classifiers were benchmarked via 10-fold cross-validation, using balanced accuracy to determine the optimal "state-sensing" engine for neurofeedback.Naive Bayes achieved the highest mean balanced accuracy ($87.17\% \pm 0.24\%$), significantly outperforming a fully connected deep neural network (FNN: $81.58\%$) and Random Forest ($80.97\%$). Linear models (LDA: $57.21\%$; SVM: $51.01\%$) performed poorly, indicating that DFA-derived criticality features reside on a distinct, non-linear manifold. Probabilistic decoding of EEG criticality provides a high-accuracy sensing mechanism for pBCIs. This robust classification pipeline supports the development of state-dependent neurofeedback, such as targeted auditory stimulation, to enhance cognitive recovery.

2606.12838 2026-06-12 q-bio.QM cs.AI cs.LG q-bio.GN 新提交

OCOO-T : A Simple and Scalable Virtual Cell Model for Transcriptional Perturbation Response Prediction

OCOO-T: 一种用于转录扰动响应预测的简单可扩展虚拟细胞模型

Danning Jiang, Zheming An, Yalong Zhao, Lipeng Lai

AI总结 提出OCOO-T,一种基于流匹配的简约虚拟细胞模型,通过连续时间去噪和自适应层归一化,在多个基准上实现转录扰动预测的最优性能。

Comments 22 pages, 6 figures

详情
AI中文摘要

预测单细胞对遗传、化学和细胞因子扰动的转录响应是计算生物学和AI虚拟细胞(AIVC)建模中的一个基本挑战,对药物发现和基因调控网络的阐明具有直接影响。现有方法通常依赖辅助细胞状态编码器、分层变分自编码器、专用Transformer编码器-解码器模块或基因相互作用先验,将高维表达谱压缩为潜在表示。虽然有效,但这些设计增加了架构复杂性,可能限制可扩展性和泛化性。本文介绍了OCOO-T,一种基于流匹配的简约AIVC模型,用于转录扰动响应预测。OCOO-T利用一个直接操作连续基因表达谱的普通Transformer堆栈,并将扰动响应预测表述为连续时间去噪过程。通过自适应层归一化和上下文令牌整合扰动嵌入、剂量信息以及细胞系/细胞类型特异性。在Tahoe100M、Replogle和PBMC基准上的全面评估表明,OCOO-T在多种扰动和细胞类型上实现了最先进的性能,同时通过细胞上下文的修补和拆补有效扩展到长转录谱。通过利用基于Transformer去噪的单细胞组学简单性,OCOO-T为计算机细胞模拟提供了一个有效且可扩展的框架。

英文摘要

Predicting single-cell transcriptional responses to genetic, chemical and cytokine perturbations is a fundamental challenge in computational biology and AI Virtual Cell (AIVC) modeling, with direct implications for drug discovery and the elucidation of gene regulatory networks. Existing approaches often rely on auxiliary cell-state encoders, hierarchical variational autoencoders, dedicated Transformer encoder-decoder modules, or gene-interaction priors to compress high-dimensional expression profiles into latent representations. While effective, these designs increase architectural complexity and may limit scalability and generalizability. This paper introduces OCOO-T, a minimalist flow-matching-based AIVC model for transcriptional perturbation response prediction. OCOO-T utilizes a vanilla Transformer stack that operates directly on continuous gene expression profiles and formulates perturbation response prediction as a continuous-time denoising process. Perturbation embeddings, dosage information, and cell-line/cell-type specificity are integrated through adaptive layer normalization and in-context tokens. Comprehensive evaluations on Tahoe100M, Replogle, and PBMC benchmarks demonstrate that OCOO-T achieves state-of-the-art performance across diverse perturbations and cell types while effectively scaling to long transcriptional profiles through patching and depatching of cellular contexts. By leveraging the simplicity of Transformer-based denoising for single-cell omics, OCOO-T provides an effective and scalable framework for in-silico cellular simulation.

2606.13260 2026-06-12 cs.LG q-bio.NC 新提交

Extracting Governing Equations from Latent Dynamics via Multi-View Contrastive Learning

通过多视图对比学习从潜在动力学中提取控制方程

Paolo Muratore, Mackenzie Weygandt Mathis

发表机构 * EPFL(瑞士联邦理工学院洛桑)

AI总结 提出DYSCO算法,利用多视图时间对比学习从噪声高维观测中联合恢复潜在轨迹和动力学方程,并通过结构化基函数实现符号恢复,理论保证强可识别性。

详情
AI中文摘要

从噪声高维测量中识别潜在动力系统是表示学习、系统辨识和科学发现交叉领域的一个核心问题。我们提出了DYSCO,一种多视图时间对比学习算法,通过利用同一底层过程的多个独立噪声视图来区分信号与噪声,从而从这些观测中联合恢复潜在轨迹和控制动力学。通过在结构化函数基上参数化动力学,我们的框架进一步能够在仿射规范内符号恢复控制方程。我们提供了强可识别性的理论保证,直到仿射不确定性,将先前的可识别性结果扩展到噪声非线性观测的现实设置。实验上,我们在高斯和泊松观测噪声下(后者尤其与神经记录相关),在多种动力学 regime(如混沌、振荡和亚稳态)中展示了潜在轨迹和流场的准确恢复。

英文摘要

Identifying latent dynamical systems from noisy, high-dimensional measurements is a central problem at the intersection of representation learning, system identification, and scientific discovery. We present DYSCO, a multi-view temporal contrastive learning algorithm that jointly recovers latent trajectories and the governing dynamics from such observations, by leveraging multiple independent noisy views of the same underlying process to disentangle signal from noise. By parameterizing the dynamics in a structured functional basis, our framework further enables symbolic recovery of the governing equations within an affine gauge. We offer theoretical guarantees for strong identification up to an affine indeterminacy, extending prior identifiability results to the realistic setting of noisy nonlinear observations. Empirically, we demonstrate accurate recovery of both latent trajectories and flow fields across a diverse set of dynamical regimes (e.g., chaotic, oscillatory, and metastable) under both Gaussian and Poisson observation noise, the latter being particularly relevant for neural recordings.

2606.12854 2026-06-12 cs.CL q-bio.QM 新提交

Small LLMs for Biomedical Claim Verification: Cost-Effective Fine-Tuning, Structural Dataset Shortcuts, and Cross-Domain Generalization

小型LLM用于生物医学声明验证:成本效益微调、结构性数据集捷径与跨域泛化

Gaurav Kumar

发表机构 * Moveworks AI University of California San Diego(加州大学圣迭戈分校)

AI总结 通过QLoRA微调小型LLM(Phi-3-mini、Qwen2.5-3B、Mistral-7B),在生物医学声明验证中超越GPT-4o和GPT-5(F1提升12%),并发现SciFact数据集的结构性伪影,提出基于结构稳健数据的跨域迁移方法。

Comments 8 pages, 2 figures, 12 tables. To appear at BioNLP Workshop, ACL 2026

详情
AI中文摘要

大型语言模型如GPT-4o和GPT-5在生物医学声明验证上表现出强大的零样本性能,但成本和透明度限制了其可扩展使用。我们通过QLoRA在SciFact和HealthVer上微调了三个小型LLM:Phi-3-mini(3.8B)、Qwen2.5-3B和Mistral-7B,首次研究了QLoRA模型与GPT-4o及微调BioLinkBERT编码器的对比。Mistral-7B QLoRA在仅使用1,008个训练样本的情况下,以极低的成本超越了GPT-4o和GPT-5(F1提升高达12%)。我们进行了广泛的域内和跨域评估:在SciFact上训练的模型在HealthVer上测试,反之亦然,并匹配模型大小以隔离数据集结构与数据量的影响。我们识别了SciFact中一个先前未报告的结构性伪影,该伪影夸大了域内得分,并通过双向域外评估表明,在结构稳健的数据上训练能够实现鲁棒的跨域迁移。我们计划发布所有代码和适配器检查点。

英文摘要

Large Language Models such as GPT-4o and GPT-5 achieve strong zero-shot performance on biomedical claim verification, but cost and opacity limit scalable use. We fine-tune three small LLMs: Phi-3-mini (3.8B), Qwen2.5-3B, and Mistral-7B, via QLoRA on SciFact and HealthVer, providing the first study of QLoRA models against GPT-4o and fine-tuned BioLinkBERT encoders. Mistral-7B QLoRA surpasses both GPT-4o and GPT-5 (up to 12% F1 gain) at a fractional cost using just 1,008 training examples. We conduct extensive in-domain and cross-domain evaluation: models trained on SciFact tested on HealthVer and vice versa, at matched sizes to isolate dataset structure from data quantity. We identify a previously unreported structural artifact in SciFact that inflates in-domain scores, and show through bidirectional out-of-domain evaluation that training on structurally sound data enables robust cross-domain transfer. We plan to release all code and adapter checkpoints.

2606.12658 2026-06-12 cs.LG q-bio.QM stat.ML 新提交

Physics-Informed Neural Networks for Chemotherapy Pharmacokinetics: Benchmarking the Clinical Estimator and Exposing Parameter Identifiability

基于物理信息的神经网络用于化疗药代动力学:基准测试临床估计器并揭示参数可辨识性

Riya Bisht, Dhruv Agarwal

发表机构 * University of California, Berkeley(加州大学伯克利分校)

AI总结 本研究将物理信息神经网络(PINN)应用于化疗药代动力学,在双室线性模型上匹配临床标准方法,在Michaelis-Menten扩展模型中揭示参数不可辨识性,并通过稀疏组织观测部分恢复可辨识性。

详情
AI中文摘要

物理信息神经网络(PINN)是生物学中部分观测问题的一个有吸引力的工具,其中控制动力学已知但某些隔室无法测量。化疗药代动力学(PK)是一个清晰的实例:血浆中的药物浓度常规测量,但组织中的浓度——决定肿瘤杀伤和脱靶毒性——无法测量。我们在两个PK问题上将PINN与标准临床基线(非线性最小二乘解析双指数血浆解,以下简称NLS)和物理无关的神经基线(仅数据的MLP)进行基准测试。在线性双室问题上,NLS接近最优;PINN在匹配其性能(小常数因子内)的同时,在单次训练过程中产生组织曲线,而仅数据的MLP在组织上失败约10倍。在Michaelis-Menten扩展(可饱和消除)上,双指数闭式不再存在,因此NLS被错误指定并静默返回无意义的速率常数。PINN反而揭示了一个更深层的事实:Michaelis-Menten双室模型仅从血浆数据不可辨识,PINN通过收敛到k12 -> 0的盆地诚实地报告这一点。添加两个稀疏组织观测在很大程度上解决了可辨识性:在五个随机种子上,PINN恢复k21在真实值的1%以内,Vmax和Km在一个标准差范围内,而k12向正确方向移动(0.02 -> 0.82)但仍低于真实值约2个标准差——这是闭式NLS估计器根本无法尝试的恢复,因为其双指数假设仅描述血浆。我们的主张不是PINN击败NLS。而是PINN提供了一种统一的方案,该方案在教科书问题上与教科书估计器匹配,揭示了教科书估计器隐藏的结构可辨识性,并在单一损失中吸收异构测量。

英文摘要

Physics-Informed Neural Networks (PINNs) are an attractive tool for partial-observation problems in biology, where the governing dynamics are known but some compartments cannot be measured. Chemotherapy pharmacokinetics (PK) is a clean instance: drug concentration in plasma is routinely measured, but concentration in tissue -- which determines tumour kill and off-target toxicity -- is not. We benchmark a PINN against the standard clinical baseline (nonlinear least-squares on the analytical biexponential plasma solution, hereafter NLS) and a physics-agnostic neural baseline (a data-only MLP) on two PK problems. On the linear two-compartment problem, NLS is near-optimal; the PINN matches it to within a small constant factor while also producing the tissue curve in a single training pass, whereas the data-only MLP fails on tissue by roughly 10x. On a Michaelis-Menten extension (saturable elimination), the biexponential closed form no longer exists, so NLS is mis-specified and silently returns meaningless rate constants. The PINN instead exposes a deeper fact: the Michaelis-Menten two-compartment model is non-identifiable from plasma alone, and the PINN reports this honestly by converging to a basin with k12 -> 0. Adding two sparse tissue observations largely resolves identifiability: across five seeds the PINN recovers k21 to within 1% of truth and Vmax, Km to within one standard-deviation bar, while k12 moves in the correct direction (0.02 -> 0.82) but remains ~2 sigma below truth -- a recovery the closed-form NLS estimator cannot attempt at all, because its biexponential ansatz describes only plasma. Our claim is not that PINNs beat NLS. It is that PINNs offer a uniform recipe that ties the textbook estimator on the textbook problem, exposes structural identifiability that the textbook estimator hides, and absorbs heterogeneous measurements within a single loss.

2606.12639 2026-06-12 cs.LG q-bio.QM 新提交

The Metric Picks the Winner: Evaluation Choice Flips Model Rankings for Drug-Response Prediction in Unseen Chemistry

度量选择胜者:评估选择翻转未见化学空间中药物反应预测的模型排名

Dhruv Agarwal, Riya Bisht

发表机构 * University of California, Berkeley(加州大学伯克利分校)

AI总结 本研究通过VCPI竞赛数据,发现药物反应预测模型排名随评估指标反转:简单基线在代理指标下胜出,但真实指标下深度模型显著优于线性指纹基线,首次在真实药物化学数据上验证了度量校准效应。

详情
AI中文摘要

预测细胞转录组对其从未见过的药物的反应是计算细胞生物学中的一个核心难题:最近的基准测试表明,一旦测试化合物按化学结构留出,复杂模型往往无法击败简单基线。我们研究了一个细胞系和检测方法,即通过DRUG-seq分析的THP-1细胞,由VCPI预测竞赛的活性化合物加权MSE(wMSE)评分。我们提出了一种分阶段方法:该领域一直无法击败的简单基线(未处理对照和平均训练化合物响应);非参数检索(对留出化合物的最近训练化合物进行Tanimoto加权平均);以及一个融合阶段,将冻结的化学嵌入与检索支持特征相结合,以预测相对于均值的残差,并包含不确定性头和基因程序。在发布的VCPI THP-1 drug-seq数据(14,026个训练化合物)上,采用Bemis-Murcko骨架划分,模型排名根据度量标准反转。在逆方差每基因代理度量下,基于Morgan指纹的正则化线性回归似乎胜过了深度模型、检索和ChemBERTa——这是教科书式的“简单基线获胜”结果。但在竞赛的真实活性集度量(每(基因,化合物)的Mejia权重,经官方评分器验证;均值基线0.535 vs 组织者的0.507参考)下,情况反转:深度模型获胜,我们的融合解码器显著优于线性指纹基线(-0.012 wMSE,配对bootstrap p < 10^-4),而代理度量的胜者成为最差的化学感知预测器。选择度量即选择胜者——据我们所知,这是首次在真实留出药物化学数据上证明度量校准效应,该效应此前主要在遗传扰动中建立。我们发布了一个可复现的流水线,连接到官方评分器,可在真实的1064 x 12,995网格上生成有效提交。

英文摘要

Predicting how a cell's transcriptome responds to a drug it has never seen is a core, hard problem in computational cell biology: recent benchmarks show complex models often fail to beat trivial baselines once test compounds are held out by chemistry. We study one cell line and assay, THP-1 cells profiled by DRUG-seq, scored by the active-compound weighted MSE(wMSE) of the VCPI prediction contest. We propose a staged approach: dumb baselines (untreated control and mean training-compound response) that the field keeps failing to beat; non-parametric retrieval (a Tanimoto-weighted average of a held-out compound's nearest training compounds); and a fusion stage combining a frozen chemistry embedding with retrieval-support features to predict the residual over the mean, with an uncertainty head and gene programs. On the released VCPI THP-1 drug-seq data (14,026 training compounds), under a Bemis-Murcko scaffold split, the model ranking inverts depending on the metric. Under an inverse-variance per-gene proxy, a regularized linear regression on Morgan fingerprints appears to win over the deep models, retrieval, and ChemBERTa -- the textbook "simple baselines win" result. But under the contest's true active-set metric (per-(gene, compound) Mejia weights, validated against the official scorer; mean baseline 0.535 vs the organizers' 0.507 reference), that reverses: the deep models win, our fusion decoder significantly beats the linear fingerprint baseline (-0.012 wMSE, paired bootstrap p < 10^-4), and the proxy's winner becomes the worst chemistry-aware predictor. Picking the metric picks the winner -- to our knowledge the first demonstration on real held-out drug chemistry of the metric-calibration effect established largely on genetic perturbation. We release a reproducible pipeline wired to the official scorer that emits a valid submission over the real 1064 x 12,995 grid.

2606.12609 2026-06-12 cs.LG q-bio.QM 新提交

Viral Proteins Reveal Geometry of Protein Language Models

病毒蛋白质揭示蛋白质语言模型的几何结构

Arthur Bigot, Harmon Bhasin, Core Francisco Park, Eugene Shakhnovich, Dianzhuo Wang

发表机构 * University of Washington(华盛顿大学) DeepMind(深度思维)

AI总结 研究蛋白质语言模型在不平衡数据下对病毒蛋白的表示,发现嵌入空间中存在主导的“天然性”轴,该轴按模型困惑度排序序列,且缩放效果因病毒家族而异,但嵌入仍保留病毒特异性信号。

Comments Accepted at ICML 2026 GenBio Workshop and FM4LS Workshop. Code available at https://github.com/MisteFr/viral-proteins-plms

详情
AI中文摘要

蛋白质语言模型在高度不平衡的数据集上训练,引发了一个问题:它们如何表示代表性不足的生物序列?以病毒蛋白作为跨ESM模型家族的案例研究,我们在嵌入空间中识别出一个主导的天然性轴,该轴与掩码重建困惑度对齐,将序列从建模良好的细胞蛋白通过病毒蛋白排序到打乱和随机序列。缩放效果在不同病毒家族间不均匀地压缩该轴。尽管如此,蛋白质语言模型嵌入保留了病毒特异性信号:病毒蛋白在零样本困惑度和浅层序列特征之上仍然是线性可分的。这些结果共同表明,pLM表示由天然性的一般概念结构化,同时保留了特定于不同生物群体的信息。

英文摘要

Protein language models are trained on highly imbalanced datasets, raising the question of how they represent underrepresented biological sequences. Using viral proteins as a case study across ESM model families, we identify a dominant nativeness axis in embedding space, aligned with masked reconstruction perplexity, that orders sequences from well-modeled cellular proteins through viral proteins to shuffled and random sequences. Scaling contracts this axis unevenly across viral families. Despite this, protein language model embeddings retain viral-specific signal: viral proteins remain linearly separable beyond zero-shot perplexity and shallow sequence features. Together, these results suggest that pLM representations are structured by a general notion of nativeness while preserving information specific to distinct biological groups.

2606.12684 2026-06-12 q-bio.NC math.DS 新提交

Phase model analysis of the effect of M-current on neural synchrony in hippocampal networks

M电流对海马网络神经同步性影响的相位模型分析

Megha Manoj, Sue Ann Campbell

AI总结 通过相位模型约化,研究乙酰胆碱通过调节M电流对海马神经元集群同步性的双向作用,发现低ACh促进完全同步,高ACh导致多稳定对称集群解。

Comments 39 pages, 14 figures

详情
AI中文摘要

在海马中观察到的神经集群,即短暂协调的神经元群体,被认为是情景记忆形成的基础。乙酰胆碱(ACh)是一种神经调节剂,由海马接收,在记忆和学习中起关键作用。一个得到充分支持的假说认为,在主动探索和快速眼动(REM)睡眠期间高水平的ACh促进记忆编码,而在安静清醒和慢波睡眠(SWS)期间低水平的ACh支持记忆巩固。我们通过ACh对神经元间同步性的影响来研究其在神经集群形成中的双向作用。我们考虑一个锥体神经元网络模型,每个神经元配备一个缓慢的、电压依赖性的、非失活的钾电流(M电流),该电流在ACh存在时下调。神经集群被表示为该系统的集群解。利用一维相位模型约化,对在不同M电流水平下弱耦合的一对锥体神经元,我们预测了在具有全连接全局均匀、对称距离依赖和最近邻耦合架构的更大网络中可能出现的对称集群解。我们发现,在低ACh条件下,网络可以完全同步,而高ACh水平则可以使网络去同步,形成多个稳定的对称集群解,代表不同的神经集群。

英文摘要

Neural assemblies, transiently coordinated groups of neurons, observed in the hippocampus are thought to underlie the formation of episodic memories. Acetylcholine (ACh), a neuromodulator, that is received by the hippocampus, plays a critical role in memory and learning. A well supported hypothesis suggests that high levels of ACh during active exploration and rapid eye movement (REM) sleep promote memory encoding, while low levels during quiet waking and slow-wave sleep (SWS) support memory consolidation. We study this bidirectional role of ACh in neural assembly formation through its effect on the synchrony among neurons. We consider a network model of pyramidal neurons, each equipped with a slow, voltage-dependent, non-inactivating potassium current (M-current), which is downregulated in the presence of ACh. Neural assemblies are represented as cluster solutions to this system. Using a one-dimensional phase model reduction of a pair of weakly coupled pyramidal neurons under different levels of the M-current, we predict the symmetric cluster solutions that may emerge in larger networks equipped with all-to-all globally homogeneous, symmetric distance-dependent and nearest-neighbours coupling architectures. We find that under low ACh conditions, the network can fully synchronize, whereas high levels can desynchronize the network into multiple stable symmetric cluster solutions representing distinct neural assemblies.

2606.12573 2026-06-12 q-bio.MN math.DS 新提交

Implementation of Linear Regression and Linear Interpolation using Reaction Networks

利用反应网络实现线性回归和线性插值

Aryan Kumar, Amey Choudhary, Jiaxin Jin, Chittaranjan Hens, Abhishek Deshpande

AI总结 提出基于反应网络的方法实现单变量/多变量线性回归和线性插值,通过编码稳态浓度作为输出,并引入处理负数除法的广义除法模块,在合成数据集上验证了有效性。

Comments 30 pages, 7 figures

详情
AI中文摘要

执行统计推断是数据科学的重要组成部分。本文关注两种推断技术,即回归和插值。我们提出了一种基于反应网络的方法,可以实现线性回归(包括单变量和多变量)和线性插值。我们通过将物种的稳态浓度编码为这些推断技术的输出来实现这一点。为此,我们使用了一种新颖的广义除法模块,可以处理负数的除法。我们通过在标准合成数据集上的计算机模拟结果进行比较,验证了我们的结果。

英文摘要

Performing statistical inference is an essential component of data science. Our focus in this work is on two inference techniques, viz. regression and interpolation. We propose a reaction network based approach that can implement linear regression (both univariate and multivariate) and linear interpolation. We do this by encoding the steady state concentration of species as the output of these inference techniques. Towards this, we use a novel generalized division module that can handle division of negative numbers. We verify our results by comparing them with in-silico implementation on standard synthetic datasets.

2606.13386 2026-06-12 physics.soc-ph q-bio.QM 新提交

Mathematical Modeling of HDV RNA, HBV DNA, and HBsAg Dynamics during Lonafarnib-Based Therapy: Insights from the LOWR HDV-1 Study

基于Lonafarnib治疗中HDV RNA、HBV DNA和HBsAg动力学的数学建模:来自LOWR HDV-1研究的见解

Adquate Mhlanga, Louis Shekhtman, Rami Zakh, Sarah Duehren, Ashish Goyal, Alexander Churkin, Vladimir Reinharz, Danny Barash, Jeffrey Glenn, Ohad Etzion, Scott J. Cotler, Cihan Yurdaydin, Harel Dahari

AI总结 通过数学建模分析Lonafarnib治疗下HBV/HDV共感染患者的病毒动力学,揭示HDV抑制对HBV的激活作用及HBsAg稳定性机制。

详情
AI中文摘要

Lonafarnib (LNF) 是一种靶向丁型肝炎病毒 (HDV) 但不靶向乙型肝炎病毒 (HBV) 的研究性药物,为模拟HDV动力学以及HDV变化如何影响HBV提供了独特机会。我们对15名接受LNF治疗的HBV/HDV共感染患者进行了详细的动力学分析,并开发了一个数学模型来解释血清HBV DNA、HDV RNA和乙型肝炎表面抗原 (HBsAg) 的动力学。在0-2天的延迟后,患者经历了快速的HDV第一期下降,随后是病毒平台期、第二期较慢下降期或病毒突破 (VB)。LNF单药治疗导致平坦的部分反应(通常随后出现VB),而LNF联合利托那韦或聚乙二醇干扰素-α (PEG-IFNα) 治疗与双相HDV下降相关(无VB)。除LNF+PEG-IFNα外,所有治疗中至少有一名患者出现HBV在治疗期间升高。我们的模型成功再现了观察到的HDV和HBV动力学。我们估计血清中HDV RNA半衰期为1.26天 [95%置信区间, CI: 1.05--1.47],所有治疗中抑制HDV RNA产生的疗效为94% [95% CI: 89%--97%],这反映在第一期HDV下降中。HDV下降的第二期可以通过疗效随时间增加来解释,最大达到98.9%。该模型通过当HDV下降到抑制阈值以下时,HBV DNA产生率中位数增加4倍 [四分位距, IQR: 1--28] 来解释血清HBV DNA的增加。血清HBsAg的稳定性通过产生HBsAg的细胞数量恒定来解释。

英文摘要

Lonafarnib (LNF) is an investigational drug targeting hepatitis delta virus (HDV) but not hepatitis B virus (HBV), providing a unique opportunity to model HDV kinetics and how changes in HDV affect HBV. We performed a detailed kinetic analysis and developed a mathematical model to explain serum HBV DNA, HDV RNA and hepatitis B surface antigen (HBsAg) kinetics in 15 HBV/HDV coinfected patients receiving LNF-based treatment. After a delay of 0-2 days, patients experienced a rapid 1st-phase HDV-decline followed by either a viral plateau, 2nd slower-decline phase, or viral breakthrough (VB). LNF monotherapy led to a flat-partial-response (often followed by VB), while LNF combination therapy with ritonavir or pegylated interferon-$α$ (PEG-IFN$α$) was associated with a biphasic HDV decline (without VB). All treatments except LNF+PEG-IFN$α$ had at least one patient experiencing an increase in HBV on-treatment. Our model successfully reproduced the observed HDV and HBV kinetics. We estimated an HDV RNA half-life of 1.26 days [95% confidence interval, CI: 1.05--1.47] in serum and treatment efficacy of 94% in inhibiting HDV RNA production across all treatments [95% CI: 89%--97%], as reflected by the 1st phase HDV decline. The 2nd phase of HDV decline was explained by a time-dependent increase in efficacy, reaching a maximum of 98.9%. The model explained the increase in serum HBV DNA by a median 4-fold [interquartile range, IQR: 1--28] increase in HBV DNA production rate when HDV declined below an inhibitory threshold. The stability of serum HBsAg was explained by a constant number of HBsAg-producing cells.

2606.12836 2026-06-12 physics.data-an q-bio.QM stat.ME 新提交

Interpretable model-free inference of parametric variation across time-series data through large-scale feature extraction

通过大规模特征提取进行时间序列数据参数变化的可解释无模型推断

Ben D. Fulcher, Carl H. Lubba, Giorgio F. Gilestro, Simon R. Schultz, Nick S. Jones

AI总结 提出一种无监督数据驱动方法,利用超过7000个时间序列特征库,从时间序列数据中推断未知生成过程的参数变化维度和性质,无需指定或拟合模型。

详情
AI中文摘要

这里我们解决了直接从时间序列数据中估计未知生成过程中参数变化的维度和性质的问题,无需指定或拟合模型。特别地,我们假设时间序列集合中的实例间变化是由生成模型中的参数变化引起的。我们假设,给定一个足够大的时间序列特征库,低维参数变化将表现为特征空间中的低维结构,从而可以构建潜在自由度的可解释估计量。我们使用一个包含超过7000种多样且可解释的时间序列统计量的特征库,以及13个具有已知参数变化的模拟系统(涵盖线性随机过程、非线性振荡器和混沌动力学)来测试我们的假设。我们的无监督数据驱动方法通常能在这广泛的模拟动力系统范围内重建潜在的参数变化,同时为每个潜在维度生成可解释的估计量。应用于1143只果蝇的运动动力学,我们使用该方法提取了对应于性别和昼夜节律的生物意义成分。我们的结果为急需的数据驱动方法铺平了道路,以弥合动力学的可解释理论理解与表征现代科学问题的大规模复杂数据集之间的差距。

英文摘要

Here we address the problem of estimating the dimensionality and nature of parametric variation in an unknown generative process directly from time-series data, without specifying or fitting a model. In particular we suppose that inter-instance variation in collections of time series is caused by parametric variation in the generating model. We hypothesize that, given a sufficiently large library of time-series features, low-dimensional parametric variation will manifest as low-dimensional structure in feature space, enabling interpretable estimators of the underlying degrees of freedom to be constructed. We test our hypothesis using a library of over 7000 diverse and interpretable time-series statistics and thirteen simulated systems with known parametric variation, spanning linear stochastic processes, nonlinear oscillators, and chaotic dynamics. Our unsupervised, data-driven approach often reconstructs the underlying parametric variation across this extensive range of simulated dynamical systems while also yielding interpretable estimators for each underlying dimension. Applied to the movement dynamics of 1143 fruit flies, we use this method to extract biologically meaningful components corresponding to sex and circadian rhythmicity. Our results pave the way for much-needed data-driven methods to bridge the gap between interpretable theoretical understanding of dynamics and the large and complex datasets that characterize modern scientific problems.

2606.12564 2026-06-12 physics.soc-ph math.DS q-bio.PE 新提交

SCAR dynamics of adolescent substance use: peer influence, dropout, and bifurcation structure in a school-based model

青少年物质使用的SCAR动力学:同伴影响、辍学与基于学校模型的分岔结构

Tamantha Pizarro, Jinni Su, Yixuan He, Yun Kang

AI总结 构建青少年物质使用的SCAR模型,分析同伴驱动、学校脱离和康复重返机制,发现分岔结构与多稳态现象,提出综合学校干预策略。

详情
AI中文摘要

我们针对高中环境中的青少年物质使用,开发了一个四仓室易感-偶然-成瘾-抵抗(SCAR)模型。该模型将学生分为易感非使用者、偶然或实验性使用者、持续或物质使用障碍(SUD)水平参与的学生,以及处于保护性反使用环境中的抵抗学生。模型包括同伴驱动的起始、从偶然使用向问题使用的升级、保护性同伴影响、学校脱离以及康复后的部分重返。定性分析和分岔图显示了三个主要结果。首先,返回参数\\(\phi\\)区分了两个区域:当\\(\phi=1\\)时,总人口守恒,可能存在内部平衡;当\\(\phi<1\\)时,问题使用导致净学校人口损失,因此正的比例平衡可能不代表真正的地方病平衡。其次,起始和升级由不同的阈值控制,意味着首次使用和向问题使用的进展在动态上是不同的。第三,该模型可以表现出多稳态,包括无物质状态和稳定高使用状态之间的双稳态,因此长期结果可能取决于初始条件。这些发现表明,有效的学校政策应结合普遍预防、对偶然使用者的早期干预、对有风险成为问题使用者的学生的针对性支持、康复支持性环境以及强大的学校重返途径。

英文摘要

We develop a four-compartment susceptible--casual--addicted--resistant (SCAR) model for adolescent substance use in a high-school setting. The model divides students into susceptible non-users, casual or experimental users, students with sustained or substance-use-disorder (SUD)-level involvement, and resistant students in protective anti-use environments. It includes peer-driven initiation, escalation from casual to problematic use, protective peer influence, school disengagement, and partial re-entry after rehabilitation. Qualitative analysis and bifurcation diagrams show three main results. First, the return parameter \(ϕ\) separates two regimes: when \(ϕ=1\), the total population is conserved and interior equilibria may exist; when \(ϕ<1\), problematic use causes net school-population loss, so positive scaled equilibria may not represent true endemic equilibria. Second, initiation and escalation are governed by distinct thresholds, meaning first use and progression to problematic use are dynamically different. Third, the model can exhibit multistability, including bistability between a substance-free state and a stable high-use state, so long-term outcomes may depend on initial conditions. These findings suggest that effective school policy should combine universal prevention, early intervention for casual users, targeted support for students at risk of problematic use, recovery-supportive environments, and strong school re-engagement pathways.

2606.12465 2026-06-12 physics.soc-ph q-bio.PE 新提交

A systematic review of COVID-19 epidemic models with endogenous human behaviour. What's next?

内源性人类行为的COVID-19流行病模型的系统综述:下一步是什么?

Elena D'Agnese, Alessia Melegaro, Vittoria Offeddu, Alberto d'Onofrio, Nicola Perra, Chris T. Bauch, Piero Manfredi

AI总结 本文系统综述了内源性人类行为的COVID-19传播模型,发现数据使用扩大但行为数据有限、模型结构创新不足,并提出加强数据基础设施、AI和跨学科合作等建议。

Comments 16 pages, 5 figures, 1 table

详情
AI中文摘要

人类行为与流行病动态相互交织,但将这种反馈纳入模型仍是流行病学建模的主要挑战之一。COVID-19大流行提供了克服传统局限的机会,人们期望数据驱动的内源性行为建模方法能取得重大进展。为量化进展,我们对内源性纳入人类行为以响应流行病动态的SARS-CoV-2传播模型进行了系统综述。COVID-19大流行在流行病-行为建模中经验数据的使用范围扩大方面取得了长足进步。然而,它也显示出在行为经验数据使用有限、模型结构缺乏创新以及与其它学科和决策者互动不足等方面的缺陷。总体而言,我们的结果表明,确定模型设计和行为数据的优先事项、建立充分的数据收集基础设施、利用人工智能进展以及促进跨学科合作,对于大流行防范至关重要。

英文摘要

Human behaviour and epidemic dynamics are intertwined, yet accounting for this feedback remains one of the key challenges of epidemiological modelling. The COVID-19 pandemic was an opportunity to overcome the traditional limitations of the field, raising expectations that data-informed endogenous approaches to behaviour modelling would advance substantially. To quantify the progresses made, we conducted a systematic review of SARS-CoV-2 transmission models endogenously including human behaviour in response to epidemic dynamics. The COVID-19 pandemic saw great strides in terms of the expanded use of empirical data in epi-behavioural modelling. However, it also showed shortcomings with respect to limited use of behavioural empirical data, lack of innovation in model structure, and limited engagement with other disciplines and decision-makers. Overall, our results suggest that identifying priorities in model design and behavioural data, building an adequate data collection infrastructure, leveraging on AI advancements, and fostering interdisciplinarity are strategies of utmost importance for pandemic preparedness.

2606.12456 2026-06-12 physics.soc-ph q-bio.PE 新提交

Network-Based Multi-Layer Model Using Machine Learning for Optimal Vaccine Prioritization in Heterogeneous Populations

基于网络的异质人群中最优疫苗优先级的机器学习多层模型

Mordecai Opoku Ohemeng, Bernard Asamoah Afful

AI总结 本研究通过整合人群异质性、网络结构和机器学习决策策略,提出基于图神经网络和强化学习的疫苗优先级方法,在真实网络中显著优于传统中心性策略。

详情
AI中文摘要

这项工作通过整合人群异质性、网络结构和基于机器学习的决策策略,将流行病控制推进到传统大规模疫苗接种模型之外。利用Email-Eu-core联系网络,我们比较了经典的中心性驱动疫苗接种策略与图神经网络(GNN)和强化学习(RL)方法。在30次随机模拟中,经典启发式方法(包括度、介数和基于层的疫苗接种)表现出相似的性能,反映了网络的密集连接和适度的社区结构。相比之下,基于GNN的策略显著降低了峰值感染、最终流行规模和达到峰值的时间,展示了其识别经典指标忽略的结构关键节点的能力。这些结果表明,基于学习的疫苗接种策略通过利用真实网络中的高阶关系模式,可以显著优于传统启发式方法,为有针对性的流行病干预提供了强大的框架。

英文摘要

This work advances epidemic control beyond traditional mass vaccination models by integrating population heterogeneity, network structure, and machine-learning-based decision policies. Using the Email-Eu-core contact network, we compare classical centrality-driven vaccination strategies with graph neural network (GNN) and reinforcement learning (RL) approaches. Across 30 stochastic simulations, classical heuristics, including degree, betweenness, and layer-based vaccination, exhibit similar performance, reflecting the network's dense connectivity and modest community structure. In contrast, the GNN-based strategy substantially reduces peak infection, final epidemic size, and time to peak, demonstrating its ability to identify structurally critical nodes that classical metrics overlook. These results show that learning-based vaccination policies can significantly outperform traditional heuristics by exploiting higher-order relational patterns in real-world networks, offering a powerful framework for targeted epidemic intervention.

2606.12600 2026-06-12 q-bio.NC nlin.AO 新提交

Multifractal human signals at the edge of life reveal a heart-brain anti-correlation

生命边缘的多分形人体信号揭示心脑反相关

Yago Emanoel Ramos, Maria Eloá do Ó, Henrique Ferraz de Arruda, Mauro Copelli, G. Camelo-Neto, Pedro V. Carelli

AI总结 通过多分形去趋势波动分析,发现临终患者脑电信号多分形性减弱而心电信号异常增宽,两者呈负相关,表明心脑功能解耦和反相关动力学。

详情
AI中文摘要

本研究通过非线性动力学视角分析多分形谱,探究人类神经生理功能的终末崩溃。使用多分形去趋势波动分析(MF-DFA),我们量化了终末期患者同步脑电图(EEG)和心电图(ECG)时间序列中复杂性的时间演变。结果揭示了多分形谱宽度的显著差异:神经活动表现出多分形性向更受限状态的崩溃,而心脏信号则出现异常谱宽化,表明非线性波动和动力学不稳定性增加。这些谱宽度之间的负相关表明神经和心脏系统之间有效的功能解耦和反相关动力学的出现。这种差异并非反映统一的生理衰退,而是与身体到大脑的崩溃一致,其中外周功能障碍逐渐压倒中枢调节过程。在更广泛的背景下,观察到的相反趋势类似于其他身体驱动适应过程中报告的模式,表明当约束源于外周而非中枢机制时,耦合系统之间可能出现逆动力学。最终,死亡过程似乎代表了跨系统解离的一种极端形式,以维持整合生理功能的层级协调的崩溃为特征。

英文摘要

This study investigates the terminal breakdown of human neurophysiological function through the lens of non-linear dynamics by analyzing the multifractal spectrum. Using Multifractal Detrended Fluctuation Analysis (MF-DFA), we quantify the temporal evolution of complexity in synchronized electroencephalogram (EEG) and electrocardiogram (ECG) time series from patients in the terminal stage. Our results reveal a marked divergence in multifractal spectrum width: while neural activity exhibits a collapse of multifractality toward a more constrained state, cardiac signals undergo anomalous spectral broadening, indicating increased non-linear fluctuations and dynamical instability. A negative correlation between these spectral widths suggests effective functional decoupling and the emergence of anti-correlated dynamics between neural and cardiac systems. Rather than reflecting a uniform physiological decline, this divergence is consistent with a body-to-brain breakdown in which peripheral dysfunction progressively overwhelms central regulatory processes. In a broader context, the observed opposing trends resemble patterns reported in other body-driven adaptive processes, suggesting that inverse dynamics across coupled systems may emerge when constraints originate from peripheral rather than central mechanisms. Ultimately, the dying process appears to represent an extreme form of cross-system disintegration, marked by the collapse of the hierarchical coordination that normally sustains integrated physiological function.

2606.04525 2026-06-12 cs.CL cs.LG q-bio.GN 版本更新

GENEB: Why Genomic Models Are Hard to Compare

GENEB:为什么基因组模型难以比较

Daria Ledneva, Mikhail Nuridinov, Denis Kuznetsov

发表机构 * GitHub arXiv

AI总结 针对基因组基础模型评估碎片化的问题,提出GENEB基准,通过统一探测协议在100项任务上比较40个模型,揭示模型排名不稳定、规模收益有限等关键发现。

Comments change first page figure, fix model sizes, add more consistency

详情
AI中文摘要

由于基准碎片化、评估协议不兼容以及任务特定报告,基因组基础模型的进展难以评估。因此,关于模型优越性或通用性的声明往往无法直接比较。我们引入GENEB,这是一个大规模诊断基准,在统一的基于探测的协议下(包括少样本场景),评估来自40个基因组基础模型的冻结表示,涵盖100个任务,跨越13个功能类别。GENEB能够在明确暴露任务级权衡的同时,对模型规模、架构、分词和预训练数据进行受控比较。我们的分析表明,整体排行榜不稳定:模型排名在不同任务类别间变化剧烈,规模仅带来适度且不一致的收益,而架构和预训练对齐常常超过参数数量的影响。这些结果凸显了当前评估实践的局限性,并将GENEB定位为基因组机器学习中原则性比较和类别感知模型选择的参考框架。

英文摘要

Progress in genomic foundation models is difficult to assess due to fragmented benchmarks, incompatible evaluation protocols, and task-specific reporting. As a result, claims of superiority or generality across models are often not directly comparable. We introduce GENEB, a large-scale diagnostic benchmark that evaluates frozen representations from 40 genomic foundation models across 100 tasks spanning 13 functional categories under a unified probing-based protocol, including few-shot regimes. GENEB enables controlled comparison across model scale, architecture, tokenization, and pretraining data while explicitly exposing task-level trade-offs. Our analysis shows that aggregate leaderboards are unstable: model rankings vary sharply across task categories, scale provides only modest and inconsistent gains, and architectural and pretraining alignment frequently outweigh parameter count. These results highlight limitations of current evaluation practices and position GENEB as a reference framework for principled comparison and category-aware model selection in genomic machine learning.

2605.23111 2026-06-12 q-bio.NC 版本更新

Contextual Role Modulates Object Representational Geometry in the Human Brain

情境角色调节人脑中物体的表征几何结构

Julien Dirani, Shankar Chawla, Leila Wehbe, Bradford Z. Mahon

AI总结 本研究结合fMRI与自然电影观看,发现物体作为动作目标时激活顶叶动作网络,其表征按动作可供性组织;作为被动元素时激活枕颞网络,按语义维度组织,表明大脑根据情境角色动态重映射物体表征几何结构。

详情
AI中文摘要

人脑表征物体时既保持跨实例的不变性,又足够灵活以支持不同情境和任务。然而,当同一物体在情境角色间转换时,其表征如何被动态重映射仍不清楚。本研究结合fMRI与自然电影观看,探究同一物体作为场景中的被动元素与作为目标导向动作的目标时,其表征方式。当物体是动作目标时,它们激活了以缘上回和中央后回为中心的顶叶动作网络;而被动物体则招募了参与视觉物体识别的分布式枕颞网络。在各自情境中最强编码物体的网络内,表征几何结构表现出双重分离:目标物体表征按动作可供性和手姿势可供性维度组织,而被动物体表征则与语义维度对齐。此外,视觉表征结构在不同情境下保持不变。在这些情境特异性脑网络之外,表征内容保持情境不变性,表明灵活性和不变性在同一表征系统的不同层次上运作。总之,这些发现展示了物体表征几何结构的神经重映射,其方式依赖于自然场景中物体情境相关性的实时变化。

英文摘要

The human brain represents objects in a way that is both invariant across instances and flexible enough to support different contexts and tasks. Yet it remains unknown how object representations are dynamically remapped as the same object shifts across contextual roles. Using fMRI during naturalistic movie viewing we investigated how the same objects are represented when they are passive scene elements versus targets of goal-directed actions. Action targets engaged a parietal action network centered in the supramarginal and postcentral gyri, while passive objects recruited a distributed occipito-temporal network involved in visual object recognition. Within context-selective networks, representational geometry showed a double dissociation: target objects were organized by action affordance and hand posture affordance dimensions, while passive objects aligned with semantic dimensions. Visual representational structure was invariant to context. Outside these networks, representational content retained invariance, indicating that flexibility and invariance operate at different levels of the same representational system. These findings demonstrate neural remapping of object representations depending on moment-to-moment changes in contextual roles during a naturalistic scene.

2603.02274 2026-06-12 q-bio.QM cs.AI 版本更新

Contextual Invertible World Models: A Neuro-Symbolic Agentic Framework for Colorectal Cancer Drug Response

上下文可逆世界模型:用于结直肠癌药物反应的神经符号智能框架

Christopher Baker, Tianyu Ren, Karen Rafferty, Hui Wang

AI总结 提出上下文可逆世界模型(CIWM),结合机器学习模拟器与大语言模型推理层,通过逆推理进行CRISPR扰动,揭示KRAS突变在5-氟尿嘧啶耐药中的主导作用及PIK3CA修复的意外效应。

详情
AI中文摘要

精准肿瘤学目前受到小N大P悖论的限制,即高维基因组数据丰富但药理学反应样本稀疏。虽然深度学习实现了预测准确性,但它常常无法提供临床采用所需的机制清晰度。我们提出了上下文可逆世界模型(CIWM),这是一个神经符号智能框架,通过将定量机器学习模拟器与大语言模型推理层集成来弥合这一差距。利用在Sanger GDSC数据集(\\( N=83 \\))上严格筛选的高保真数据工程流程,我们从体外伪影中分离出真正的生物信号,为复杂转录组学建立了严格的基线预测相关性(\\( r=0.268 \\))。通过逆推理,我们在结直肠癌景观中进行了计算机CRISPR扰动。该框架自主推翻了经典机制假设,识别出突变KRAS在驱动5-氟尿嘧啶耐药(\\( \Delta=-0.0469 \\))中相对于APC/Wnt轴具有层级优势,并通过映射到MAPK/PI3K网络的“KRAS盾牌”实现。此外,智能层识别出“PIK3CA悖论”,揭示修复PIK3CA通过触发补偿性反馈环过度激活主导的MAPK生存通路,无意中增加了化疗耐药性(\\( \Delta=+0.0085 \\))。

英文摘要

Precision oncology is currently limited by the small-N, large-P paradox, where high-dimensional genomic data is abundant but pharmacological response samples are sparse. While deep learning achieves predictive accuracy, it frequently fails to provide the mechanistic clarity required for clinical adoption. We present the Contextual Invertible World Model (CIWM), a Neuro-Symbolic Agentic Framework that bridges this gap by integrating a quantitative machine learning emulator with a Large Language Model reasoning layer. Utilising a stringently curated, high-fidelity data engineering pipeline on the Sanger GDSC dataset (\( N=83 \)), we isolate true biological signals from in vitro artifacts to establish a rigorous baseline predictive correlation for complex transcriptomics (\( r=0.268 \)). Through Inverse Reasoning, we perform in silico CRISPR perturbations across the colorectal landscape. The framework autonomously overturns classical mechanistic assumptions, identifying a hierarchical dominance of mutant KRAS over the APC/Wnt-axis in driving 5-fluorouracil resistance (\( Δ=-0.0469 \)) via a "KRAS Shield" mapped to MAPK/PI3K networks. Furthermore, the agentic layer identified a "PIK3CA Paradox", revealing that repairing PIK3CA inadvertently increases chemoresistance (\( Δ=+0.0085 \)) by triggering a compensatory feedback loop that hyperactivates the dominant MAPK survival pathway.

2604.20782 2026-06-12 q-bio.QM q-bio.BM 版本更新

LAFA: A Framework for Reproducible Longitudinal Assessment of Protein Function Annotation Models

LAFA:可重复的蛋白质功能注释模型纵向评估框架

An Phan, Yanli Wang, Frimpong Boadu, Jianlin Cheng, Predrag Radivojac, Iddo Friedberg

AI总结 提出LAFA服务器,作为蛋白质功能预测方法的持续基准测试系统,通过容器化方法实现动态、可重复的评估,加速方法迭代并支持可重复性。

详情
AI中文摘要

动机:蛋白质功能预测是计算生物学中一项具有挑战性的任务和一个开放性问题。蛋白质功能注释的关键评估(CAFA)是一项三年一次的、社区驱动的倡议,通过延时基准测试实验,为蛋白质功能预测的计算方法提供独立的大规模评估。CAFA在突出高性能方法、促进详细分析和思想交流方面发挥了关键作用。然而,在定期的CAFA挑战之外,没有平台可以持续评估新开发的方法并跟踪随着功能注释积累的性能变化。结果:本文介绍了蛋白质功能注释模型的纵向评估服务器(LAFA),作为蛋白质功能预测方法的持久基准测试系统。LAFA提供对容器化功能预测方法的持续评估,能够在不断演变的真实标签下进行最新且稳健的方法性能比较评估。LAFA加速了方法迭代,支持可重复性,并提供了蛋白质功能预测进展的更动态和细粒度的视图。代码和数据可用性:LAFA可在以下网址获取:此 https URL。详细评估结果可在以下网址找到:此 https URL。

英文摘要

Motivation: Protein function prediction is a challenging task and an open problem in computational biology. The Critical Assessment of protein Function Annotation (CAFA) is a triennial, community-driven initiative that provides an independent, large-scale evaluation of computational methods for protein function prediction through time-delayed benchmarking experiments. CAFA has played a key role in highlighting high-performing methodologies and fostering detailed analysis and exchange of ideas. However, outside the periodic CAFA challenges, there is no platform for the continuous evaluation of newly developed methods and tracking performance as function annotations accumulate. Results: Here we introduce the Longitudinal Assessment of Protein Function Annotation Models server (LAFA) as a persistent benchmarking system for protein function prediction methods. LAFA provides a continuous evaluation of containerized function prediction methods, enabling up-to-date and robust comparative assessment of method performance under evolving ground truth. LAFA accelerates methodological iteration, supports reproducibility, and offers a more dynamic and fine-grained view of progress in protein function prediction. Code and Data Availability: LAFA is available at https://functionbench.net/. Detailed evaluation results can be found at https://github.com/anphan0828/CAFA_forever

2512.02528 2026-06-12 q-bio.QM 版本更新

Assessment of Simulation-based Inference Methods for Stochastic Compartmental Models in Epidemiological Research

基于模拟的推断方法在流行病学随机房室模型中的评估

Vincent Wieland, Nils Wassmuth, Lorenzo Contento, Martin Kühn, Jan Hasenauer

AI总结 比较伪边际粒子马尔可夫链蒙特卡洛和条件归一化流两种贝叶斯推断方法在三种随机房室模型上的性能,展示其准确鲁棒的推断能力和预测能力。

详情
AI中文摘要

全球大流行,如最近的COVID-19危机,凸显了对能够捕捉疾病传播固有随机性的随机流行病模型的需求。此类模型必须配备参数估计方法,以便生成快速的即时预测和短期预测,为公共卫生决策提供信息。本文比较了两种先进的贝叶斯推断方法:1) 伪边际粒子马尔可夫链蒙特卡洛,使用粒子滤波器获得的无偏似然估计;2) 条件归一化流。我们研究了它们在三种常用房室模型上的性能:经典易感-感染-易感模型、易感-感染-康复模型和双变种易感-暴露-感染-康复模型,并辅以将潜在轨迹映射到经验数据的观测模型。针对随机设置中参数推断的难处理似然挑战,我们的分析强调了这些无似然方法如何提供准确且鲁棒的推断能力。我们的模拟研究结果进一步强调了这些方法在捕捉流行病随机动态方面的有效性,为疫情爆发的控制提供了预测能力。在埃塞俄比亚队列研究上的结果展示了在真实世界噪声和不规则数据采样下的操作鲁棒性。为了促进重用并支持构建最终有助于公共卫生更好决策的流程,我们公开提供了代码和合成数据集。

英文摘要

Global pandemics, such as the recent COVID-19 crisis, highlight the need for stochastic epidemic models that can capture the randomness inherent in the spread of disease. Such models must be accompanied by methods for estimating parameters in order to generate fast nowcasts and short-term forecasts that can inform public health decisions. This paper presents a comparison of two advanced Bayesian inference methods: 1) pseudo-marginal particle Markov chain Monte Carlo, using an unbiased likelihood estimate obtained by Particle Filter (PF), and 2) Conditional Normalizing Flows (CNF). We investigate their performance on three commonly used compartmental models: A classical Susceptible-Infected-Susceptible (SIS), a Susceptible-Infected-Recovered (SIR) model and a two-variant Susceptible-Exposed-Infected-Recovered (SEIR) model, complemented by an observation model that maps latent trajectories to empirical data. Addressing the challenges of intractable likelihoods for parameter inference in stochastic settings, our analysis highlights how these likelihood-free methods provide accurate and robust inference capabilities. The results of our simulation study further underscore the effectiveness of these approaches in capturing the stochastic dynamics of epidemics, providing prediction capabilities for the control of epidemic outbreaks. Results on an Ethiopian cohort study demonstrate operational robustness under real-world noise and irregular data sampling. To facilitate reuse and to enable building pipelines that ultimately contribute to better informed decision making in public health, we make code and synthetic datasets publicly available.

2603.24603 2026-06-12 q-bio.NC cs.AI 版本更新

Fusion Learning from Dynamic Functional Connectivity: Combining the Amplitude and Phase of fMRI Signals to Identify Brain Disorders

融合动态功能连接:结合fMRI信号的幅度和相位识别脑疾病

Jinlong Hu, Jiatong Huang, Zijian Cai

AI总结 提出多尺度融合学习框架MSFL,结合滑动窗口相关和相位同步两种互补的动态功能连接特征,在自闭症和抑郁症数据集上显著优于现有模型。

详情
AI中文摘要

基于静息态功能磁共振成像(fMRI)的动态功能连接(dFC)已广泛应用于脑科学研究。滑动窗口相关(SWC)方法通过计算脑区对信号幅度时间序列之间的相关系数,是构建dFC的常用方法。在本研究中,我们提出了一种集成方法,结合fMRI信号的幅度和相位信息,以提高脑疾病的检测能力。具体而言,我们引入了一个多尺度融合学习框架MSFL,该框架利用来自SWC和相位同步(PS)的两种互补dFC特征。其中,SWC捕获幅度相关性,而PS测量dFC内的相位相干性。我们使用两个公开数据集(ABIDE I和REST-meta-MDD)评估了MSFL在分类自闭症谱系障碍和重度抑郁症方面的有效性。结果表明,MSFL显著优于现有比较模型。此外,我们使用SHAP框架进行了模型解释分析,表明来自SWC和PS的两种dFC特征均有助于检测脑疾病。

英文摘要

Dynamic functional connectivity (dFC) derived from resting-state functional magnetic resonance imaging (fMRI) has been extensively utilized in brain science research. The sliding window correlation (SWC) method is a widely used approach for constructing dFC by computing correlation coefficients between amplitude time series of signals from pairs of brain regions. In this study, we propose an integrated approach that incorporates both amplitude and phase information of fMRI signals to improve the detection of brain disorders. Specifically, we introduce a multi-scale fusion learning framework, namely MSFL, which leverages two complementary dFC features derived from SWC and phase synchronization (PS). Here, SWC captures amplitude correlations, while PS measures phase coherence within dFC. We evaluated the efficacy of MSFL in classifying autism spectrum disorder and major depressive disorder using two publicly available datasets: ABIDE I and REST-meta-MDD, respectively. The results indicate that MSFL significantly outperforms existing comparative models. Moreover, we performed model explanation analysis using the SHAP framework, which showed that both types of dFC features from SWC and PS contribute to detecting brain disorders.