arXivDaily arXiv每日学术速递 周一至周五更新
重置
2606.18058 2026-06-17 eess.IV q-bio.QM 新提交

Multiscale reconstruction of protein conformations from cryo-EM images

从冷冻电镜图像中多尺度重建蛋白质构象

David Y. W. Thong, Ozan Öktem, Joakim Andén

AI总结 提出一种多尺度算法,直接从单颗粒冷冻电镜数据恢复蛋白质原子模型,通过显式表示蛋白质主链的键、扭转角和键角,在噪声高、对比度低的数据上达到最先进精度,并提高RMSD和TM分数。

Comments 19 pages, 11 figures. Submitted to the Journal of Structural Biology

详情
AI中文摘要

我们提出了一种新颖的多尺度算法,用于从单颗粒冷冻电镜数据中直接恢复蛋白质的原子模型结构。我们的算法能够针对高噪声和低对比度的数据估计出达到最先进精度的蛋白质结构。它还对TEM图像形成模型中的错误指定具有鲁棒性。这些理想的特性主要归功于使用键、扭转角和键角对蛋白质主链进行显式表示,这为结构恢复过程提供了丰富的先验信息。我们将该方法应用于三个蛋白质冷冻电镜数据集(使用电子显微镜数字孪生产生),并表明使用多尺度方法相对于真实值在均方根偏差(RMSD)和模板建模(TM)分数上有所改进。此外,有证据表明多尺度算法优先考虑更大尺度的结构,这减少了收敛到不良局部极小值的可能性。

英文摘要

We present a novel multiscale algorithm for directly recovering the atomic model structure of a protein from single-particle cryo-EM data. Our algorithm is able to estimate protein structures to state-of-the-art accuracy for high-noise and low-contrast data. It is also robust to misspecifications in the TEM image formation model. These desirable properties are primarily due to the use of an explicit representation of the protein backbone in terms of bonds, torsion angles and bond angles, which supplies rich prior information to the structure recovery process. We apply our method on three protein cryo-EM datasets, generated using an electron microscope digital twin, and show that using a multiscale approach yields an improvement of the root-mean-square deviation (RMSD) and template modelling (TM) scores with respect to the ground truth. Furthermore, there is evidence that larger-scale structures are being prioritised with the multiscale algorithm, which reduces the possibility of convergence to bad local minima.

2606.18179 2026-06-17 q-bio.GN 新提交

PyPeakRankR: Reproducible Peak-Level Feature Extraction for Regulatory Element Ranking

PyPeakRankR:用于调控元件排序的可重现峰级特征提取

Saroja Somasundaram, Nelson J. Johansen, Trygve E. Bakken, Jeremy A. Miller

AI总结 提出PyPeakRankR开源Python包,从ATAC-seq峰中提取BigWig信号、GC含量、PhyloP保守性、分布矩和细胞类型特异性排名等特征,形成可重现的峰-特征矩阵,支持透明基准测试和跨组装评分,在BICCN挑战中排名前三。

Comments Software paper. Code: this https URL (https://github.com/AllenInstitute/PeakRankR/tree/python-package). 6 pages, 1 figure

详情
AI中文摘要

高通量染色质可及性检测(如ATAC-seq)可生成数千个候选调控元件(峰),但目前尚无标准化工具来整合多种定量特征以优先选择峰进行功能验证。本文提出PyPeakRankR,一个开源Python包,它提取峰级特征,即BigWig信号汇总、GC含量、PhyloP保守性评分、分布矩(峰度、偏度、双峰性)和细胞类型特异性排名,并将其整合为一个可重现的峰×特征矩阵,以制表符分隔值(TSV)文件存储。PyPeakRankR将确定性特征提取与下游排序分离,使得在相同上游数据上对优先排序策略进行透明基准测试成为可能。该包提供命令行界面和匹配的Python API,支持通过liftOver进行跨组装评分,并在数分钟内处理数千个峰。PyPeakRankR在脑倡议细胞普查网络(BICCN)社区挑战中得到验证,其前身PeakRankR在16种方法中排名前三,用于细胞类型特异性增强子预测。在最近的一项基底神经节研究中,PyPeakRankR被用于跨物种增强子排序管道(CERP),以识别在多种细胞类型中实现超过70%靶向特异性的增强子-AAV工具。PyPeakRankR在MIT许可下免费提供,网址为https://github.com/example/PyPeakRankR。

英文摘要

High-throughput chromatin accessibility assays such as ATAC-seq generate thousands of candidate regulatory elements (peaks), yet no standardized tool exists for assembling the diverse quantitative features needed to prioritize peaks for functional validation. Here we present PyPeakRankR, an open-source Python package that extracts peak-level features, namely BigWig signal summaries, GC content, PhyloP conservation scores, distribution moments (kurtosis, skewness, bimodality), and cell-type specificity rankings, into a single reproducible peak by feature matrix stored as a tab-separated values (TSV) file. PyPeakRankR separates deterministic feature extraction from downstream ranking, enabling transparent benchmarking of prioritization strategies on the same upstream data. The package provides both a command-line interface and a matching Python API, supports cross-assembly scoring via liftOver, and runs in minutes on thousands of peaks. PyPeakRankR was validated in the Brain Initiative Cell Census Network (BICCN) community challenge, where its predecessor PeakRankR ranked among the top 3 of 16 methods for cell-type specific enhancer prediction. In a recent basal ganglia study, PyPeakRankR was used within the Cross-species Enhancer Ranking Pipeline (CERP) to identify enhancer-AAV tools achieving greater than 70% on-target specificity across cell types. PyPeakRankR is freely available under the MIT license at this https URL.

2606.17745 2026-06-17 q-bio.NC 新提交

Separating wiring-specific from statistical control of dynamics in a complete connectome

在完整连接组中分离接线特定与统计控制对动力学的影响

Stavros Therianos

AI总结 通过运行完整连接组作为固定速率模型,并与随机化网络比较,发现粗粒度接线统计决定整体动力学状态,而精确接线模式决定活动传播路径和回路几何结构。

Comments 20 pages, 6 figures. Supplementary Information provided as an ancillary file

详情
AI中文摘要

电子显微镜重建现已产生整个小型大脑的完整突触接线图,即连接组,包括第一个完全重建的昆虫大脑——果蝇幼虫。接线图单独在多大程度上固定电路的活动,相对于它未记录的更精细的生理细节,仍存在争议。我们将一个完整的连接组作为固定的、基于速率的动力学算子运行,其中没有单个神经元参数被拟合,因此在固定的动力学状态下,模型的行为反映接线及其连接强度,而非调谐的单神经元生理学,并将其与一系列随机化网络进行比较,每个随机化网络保留了接线更粗粒度的描述。模型的整体动力学状态,即其响应的强度和丰富程度,主要是统计性的:仅保留连接组粗粒度接线统计的网络能够重现它。超出这些统计的接线则设定活动传播的位置以及哪些回路塑造它。稀疏输入被限制在一个紧凑的嗅觉通路中,而随机化网络会淹没该通路;蘑菇体(昆虫学习中心)在主导伴随侧模式中占据过大作用,这些模式决定了哪些神经元塑造循环动力学。粗粒度统计设定状态;精确的连接模式设定几何结构,这种分离澄清了哪些基于连接组的论断仅依赖于接线。

英文摘要

Electron-microscopy reconstruction now yields complete synaptic wiring diagrams, or connectomes, of entire small brains, including the larval Drosophila, the first insect brain reconstructed in full. How far a wiring diagram alone fixes a circuit's activity, as opposed to the finer physiological detail it does not record, is debated. We run a complete connectome as a fixed, rate-based dynamical operator in which no single-neuron parameter is fitted, so that, at one fixed dynamical regime, the model's behavior reflects the wiring and its connection strengths rather than tuned single-neuron physiology, and compare it against a hierarchy of randomized networks that each preserve a coarser description of the wiring. The model's overall dynamical regime, how strongly and how richly it responds, is mostly statistical: networks keeping only the connectome's coarse wiring statistics reproduce it. The wiring beyond these statistics instead sets where activity travels and which circuits shape it. Sparse input is confined to a compact olfactory pathway that randomized networks flood, and the mushroom body, the insect learning center, takes an outsized role in the leading adjoint-side modes, the directions that weigh which neurons shape the recurrent dynamics. Coarse statistics set the regime; the precise pattern of connections sets the geometry, a separation that clarifies which connectome-based claims rest on wiring alone.

2606.17736 2026-06-17 q-bio.NC 新提交

Ten Years of the Stochastic Resonance Model of Tinnitus: From Phantom Perception to Adaptive Sensory Optimization

耳鸣的随机共振模型十年:从幻想到自适应感觉优化

Patrick Krauss, Achim Schilling

AI总结 本文综述了耳鸣的随机共振模型,该模型将耳鸣重新解释为听觉系统为补偿听力损失而自适应上调神经噪声的副产品,并总结了理论、实验和临床应用进展。

详情
AI中文摘要

主观性耳鸣——在没有外部声刺激的情况下感知声音——仍然是听觉神经科学中最具争议的现象之一。2016年,随机共振(SR)模型被引入作为耳鸣相关神经元过度活跃的替代解释,提出内部产生的神经噪声被自适应上调以恢复听力损失后的信息传递。该模型没有将增加的自发活动解释为适应不良,而是将其重新定义为一种功能机制,增强感觉阈值附近的信号检测,而耳鸣则作为自适应感觉优化的副作用出现。在过去十年中,这一框架已从现象学假设发展为更广泛的神经计算理论,将信息论、自适应信号检测、多通道听觉处理和跨模态可塑性联系起来。计算建模、大规模临床分析和动物实验为关键预测提供了汇聚支持,包括特定噪声条件下的可检测性改善和频率特异性幻听。该框架还启发了基于频谱匹配近阈值噪声刺激的治疗方法,并最近被整合到一个统一的听觉幻听解释中,该解释结合了随机共振、中枢增益、稳态可塑性和预测编码。本综述按时间顺序概述了随机共振模型的发展,总结了主要理论和实验进展,并指出了机制验证和临床转化的未来方向。通过将耳鸣重新定义为自适应感觉计算的结果,该模型将概念焦点从病理功能障碍转向神经系统中信息优化的原理。

英文摘要

Subjective tinnitus - the perception of sound in the absence of an external acoustic stimulus - remains one of the most debated phenomena in auditory neuroscience. In 2016, the stochastic resonance (SR) model was introduced as an alternative account of tinnitus-related neuronal hyperactivity, proposing that internally generated neural noise is adaptively upregulated to restore information transmission after hearing loss. Rather than interpreting increased spontaneous activity as maladaptive, the model reframed it as a functional mechanism that enhances signal detection near sensory thresholds, with tinnitus emerging as a side effect of adaptive sensory optimization. Over the past decade, this framework has evolved from a phenomenological hypothesis into a broader neurocomputational theory linking information theory, adaptive signal detection, multichannel auditory processing, and cross-modal plasticity. Computational modeling, large-scale clinical analyses, and animal experiments have provided converging support for key predictions, including improved detectability under specific noise conditions and frequency-specific phantom percepts. The framework has also inspired a therapeutic approach based on spectrally matched near-threshold noise stimulation and has recently been integrated into a unified account of auditory phantom perception that combines stochastic resonance, central gain, homeostatic plasticity, and predictive coding. This review provides a chronological overview of the development of the stochastic resonance model, summarizes major theoretical and empirical advances, and outlines future directions for mechanistic validation and clinical translation. By redefining tinnitus as a consequence of adaptive sensory computation, the model shifts the conceptual focus from pathological dysfunction toward principles of information optimization in neural systems.

2606.17457 2026-06-17 q-bio.SC 新提交

Aging induced structural alterations in SR-Mitochondria interaction in skeletal muscle: Emerging insights

衰老诱导的骨骼肌SR-线粒体相互作用结构改变:新见解

Unmod Senapati, Barsha Priyadarshini Kar, Sunil Pani, Naresh Chandra Bal

AI总结 本文综述了衰老过程中骨骼肌肌浆网与线粒体接触(MAMs)的结构和功能变化,探讨了运动、营养和药物干预对延缓MAMs丢失的作用。

详情
AI中文摘要

骨骼肌在衰老过程中经历显著变化,包括解剖、超微结构以及生化方面的改变。与衰老相关的肌肉质量减少,称为肌少症,是老年功能衰退和虚弱的主要因素,导致自信心下降。在成年骨骼肌纤维中,肌浆网(SR)和线粒体与肌膜(形成T-小管)一起表现出最复杂和精确的分布,这对肌肉功能至关重要。在健康的年轻肌肉组织中,SR和线粒体膜的紧密物理接近显示出称为线粒体相关膜(MAMs)的接触。最近的文献强调了MAMs网络通过调节Ca2+信号、脂质运输和其他信号分子(如活性氧)的定位,在肌肉平滑功能中的作用。提出了几种锚定机制来稳定MAMs网络,经典的是线粒体融合蛋白(MFN1和MFN2)。新兴共识表明,骨骼肌中的MAMs促进了兴奋-代谢耦合的准确性,确保空间能量供应。然而,在衰老过程中,SR和线粒体的共定位以及串扰的精确性似乎受到影响。在这篇综述中,我们批判性地审视了关于健康和疾病中MAMs网络结构和功能的当前文献,主要从衰老的角度出发。我们进一步评估了运动、营养、营养保健品和药理学方法在减少MAMs丢失以延缓衰老进展中的作用。保持骨骼肌健康与功能是实现健康老龄化目标的主要因素。

英文摘要

Skeletal muscle undergo remarkable changes during aging including anatomical, ultrastructural, and moreover biochemical. The aging associated reduction of muscle mass, termed as sarcopenia, is a major factor in geriatric functional decline and frailty, contributing to the lowering of self-confidence. In an adult skeletal muscle fibers, sarcoplasmic reticulum (SR) and mitochondria exhibit most intricate and precise distribution along with the sarcolemmal (forming T-tubule), which is critical for muscle function. In healthy young muscle tissue, the close physical proximity of SR and mitochondrial membranes shows contacts called mitochondria-associated membranes (MAMs). Recent literature highlights the role of MAMs network in smooth functioning of muscle by regulating localization of Ca2+-signaling, lipid transport, and other signalling molecules like reactive oxygen species. Several tethering mechanisms are proposed to stabilize the MAMs network, the classical ones being the mitofusins (MFN1 and MFN2). Emerging consensus suggest that MAMs in the skeletal muscle facilitate accuracy of excitation-metabolic coupling ensuring spatial energy supply. However, upon aging the precision of SR and mitochondria co-localization as well as crosstalk seems to be affected. In this review, we have critically examined the current literature about MAMs network structure and function during health and diseases mainly from an aging perspective. We have further evaluated the role of exercise, nutritional, nutraceutical and pharmacological approaches in lowering MAMs loss in an effort to retard aging progression. Retention of skeletal muscle health and performance is a major factor in achieving the goal of healthy aging.

2606.17277 2026-06-17 q-bio.OT 新提交

Accuracy, Repeatability, and Reproducibility of a Radiographic Technique to Assess Spinal Cord Stimulation Lead Position: A Validation Study

评估脊髓刺激电极位置的放射学技术的准确性、重复性和再现性:一项验证研究

Andrew Thoreson (1), Katrina Fernandez (1 and 2), Cesar Lopez (1), Margaux Linde (1), Mark A. Bendel (3), Peter Grahn (1), Kristin D. Zhao (1 and 2 and 4) ((1) Assistive and Restorative Technology Laboratory, Rehabilitation Medicine Research Center, Department of Physical Medicine and Rehabilitation, Mayo Clinic, Rochester, MN, USA, (2) Mayo Clinic Graduate School of Biomedical Sciences, Mayo Clinic, Rochester, MN, USA, (3) Department of Anesthesiology and Perioperative Medicine, Mayo Clinic, Rochester, MN, USA, (4) Department of Physiology and Biomedical Engineering, Mayo Clinic, Rochester, MN, USA)

AI总结 本研究开发了一种通过放射线片测量脊髓刺激电极位置的技术,并验证了其准确性、重复性和再现性,发现最小可检测变化小于相邻电极间距,且变异小于总变异的10%。

Comments 11 pages, 2 tables, 6 figures

详情
AI中文摘要

脊髓刺激通过植入电极是治疗多种慢性疼痛的有效疗法。然而,电极移位是导致疗效丧失的常见并发症。以往研究使用放射线片描述电极移位,但方法不一致且缺乏严格验证。本研究旨在开发一种测量腰骶椎管内硬膜外脊髓刺激电极位置的放射学技术,并确定其准确性、重复性和再现性。对三名经皮植入两个八触点圆柱形电极的临床试验参与者进行计算机断层扫描,通过三维测量确定电极位置,并生成数字重建放射线片。两名操作员对每个电极应用数字化和测量协议。创建Bland-Altman图以确定最小可检测变化,并进行量具重复性和再现性分析。发现最小可检测变化小于相邻电极间距,且重复性和再现性引入的变异小于总研究变异的10%。我们得出结论,所开发的测量电极位置的方法具有足够的准确性以及可接受的重复性和再现性。

英文摘要

Spinal cord stimulation with implantable leads is a valuable therapy used to treat a variety of chronic pain conditions. However, lead migration is a common complication causing loss of efficacy. Previous reports have characterized lead migration using radiographs, but methods are not consistent and lack rigorous validation. The purpose of this study was to develop a technique to perform radiographic measurements of the position of epidural spinal cord leads within the lumbosacral spinal canal and establish its accuracy, repeatability, and reproducibility. Computed tomography scans were acquired from three clinical trial participants implanted percutaneously with two eight-contact cylindrical leads; from these, electrode positions were established using three-dimensional measurements, and digitally reconstructed radiographs were created. Two operators applied a digitization and measurement protocol for each lead. Bland-Altman plots were created to determine smallest detectable change, and a gage repeatability and reproducibility analysis was performed. Smallest detectable change was found to be less than the distance between adjacent electrodes and variability introduced by repeatability and reproducibility was less than 10% of the total study variability. We conclude that the method developed to measure lead electrode position has sufficient accuracy and acceptable repeatability and reproducibility.

2606.17420 2026-06-17 eess.IV cs.AI q-bio.QM 新提交

Feynman Kac Reweighted Schrödinger Bridge Matching for Surface-Based Tau PET Harmonization

基于Feynman Kac重加权薛定谔桥匹配的皮层表面Tau PET标准化

Jianwei Zhang, Xinyu Nie, Jiaxin Yue, Yonggang Shi

发表机构 * Stevens Neuroimaging and Informatics Institute, University of Southern California(斯蒂文斯神经影像与信息学研究所,南加州大学) Ming Hsieh Department of Electrical and Computer Engineering of Viterbi School of Engineering, University of Southern California(明希德电气与计算机工程系,维特比工程学院,南加州大学) Alfred E. Mann Department of Biomedical Engineering of Viterbi School of Engineering, University of Southern California(阿尔弗雷德·E·曼生物医学工程系,维特比工程学院,南加州大学)

AI总结 提出Feynman Kac重加权薛定谔桥匹配(FKRSBM)模型,通过熵正则化最优传输实现源域与目标域间的随机传输,结合子群感知端点提议和球面卷积骨干网络,在Tau PET SUVR图上实现优于现有方法的分布对齐和下游疾病分类。

详情
AI中文摘要

Tau PET成像对于追踪阿尔茨海默病进展至关重要,但不同站点间的扫描仪、协议和放射性示踪剂的系统差异引入了非生物变异性,这会增加生物标志物方差、降低对疾病效应的敏感性,并可能偏倚下游临床评估。标准化方法旨在去除这些站点引起的偏移,同时保留有生物学意义的信号,然而现有方法在源队列和目标队列具有不同子群组成时难以应对,存在将站点效应与生物学变异(如tau阳性状态)混淆的风险。我们提出Feynman Kac重加权薛定谔桥匹配(FKRSBM)模型来解决这一问题。与基于扩散的方法通过高斯噪声先验路由数据不同,FKRSBM通过熵正则化最优运输学习源分布和目标分布之间的直接随机传输过程。为了实现生物学一致的传输,FKRSBM结合了由参考桥测度的Feynman Kac重加权导出的子群感知端点提议,完全通过数据层面的分层重要性抽样实现,无需对底层桥匹配求解器或网络架构进行任何更改。对于基于表面的神经影像,FKRSBM采用在皮层网格上运行的球面卷积骨干网络进行顶点级标准化。我们在tau PET SUVR图上评估该方法,将HABS-HD队列的PI-2620数据标准化到ADNI的AV-1451域。与ComBat、CycleGAN、基于扩散的方法(DF)和无正则化的扩散薛定谔桥匹配(DSBM)相比,FKRSBM实现了更优的分布对齐、更低的tau阳性符号不匹配、更强的APOE子群对齐以及改进的下游疾病分类性能。

英文摘要

Tau PET imaging is central to tracking Alzheimer's disease progression, but systematic differences between scanners, protocols, and radiotracers across sites introduce nonbiological variability that inflates biomarker variance, reduces sensitivity to disease effects, and can bias downstream clinical assessments. Harmonization methods aim to remove these site-induced shifts while preserving biologically meaningful signal, yet existing approaches struggle when source and target cohorts differ in subgroup composition, risking conflation of site effects with biological variation such as tau-positivity status. We propose the Feynman Kac Reweighted Schröodinger Bridge Matching (FKRSBM) model to address this problem. Rather than routing data through a Gaussian noise prior as in diffusion-based methods, FKRSBM learns a direct stochastic transport process between source and target distributions via entropy-regularized optimal transport. To enforce biologically consistent transport, FKRSBM incorporates a subgroup-aware endpoint proposal derived from a Feynman Kac reweighting of the reference bridge measure, implemented entirely through stratified importance sampling at the data level and requiring no changes to the underlying bridge-matching solver or network architecture. For surface-based neuroimaging, FKRSBM employs a spherical convolutional backbone operating on cortical meshes to perform vertex-level harmonization. We evaluate the method on tau PET SUVR maps, harmonizing PI-2620 data from the HABS-HD cohort into the AV-1451 domain of ADNI. Compared against ComBat, CycleGAN, a diffusion-based method (DF), and unregularized Diffusion Schröodinger Bridge Matching (DSBM), FKRSBM achieves superior distributional alignment, reduced tau-positivity sign mismatch, stronger APOE subgroup alignment, and improved downstream disease classification performance.

2606.17327 2026-06-17 q-bio.BM cs.AR cs.ET cs.NE 新提交

Energy-efficient codon optimization on thermodynamic hardware

热力学硬件上的节能密码子优化

Andraz Jelincic, Ross C. Walker

AI总结 本文将mRNA密码子优化问题映射到伊辛模型,在热力学采样单元上实现,相比GPU能耗降低约10^6倍,为热力学计算在制药领域的应用提供了首个具体实例。

Comments Preprint available on bioRxiv: DOI TBD

详情
AI中文摘要

计算能耗的不断增长正变得日益不可持续。热力学计算利用物理热涨落作为计算资源而非抑制它们,为概率性和组合性任务提供了数量级的节能。制药研发严重依赖计算优化和采样,是一个自然的应用领域。本文提出了据我们所知首个映射到热力学硬件的具体制药应用,并给出了基于原型测量的能耗估计。我们将mRNA密码子优化(药物开发中常规解决的组合问题)简化为从伊辛模型采样,使其可直接在热力学采样单元(TSU)上执行。在SARS-CoV-2刺突蛋白上对三种方法(Potts采样、伊辛采样和遗传算法基线)进行基准测试,发现所有方法均达到相当的优化质量(得分约234-240),但基于验证硬件模型的能耗估计表明,TSU解决该问题所需的能量约为传统GPU的10^6分之一。所有代码均以开源许可证发布。

英文摘要

The growing energy demand for computation is becoming increasingly unsustainable. Thermodynamic computing, which harnesses physical thermal fluctuations as a computational resource rather than suppressing them, offers orders-of-magnitude energy savings for probabilistic and combinatorial tasks. Pharmaceutical R&D, heavily reliant on computational optimization and sampling, is a natural application domain. Here we present what is, to our knowledge, the first concrete pharmaceutical application mapped to thermodynamic hardware with energy estimates grounded in prototype measurements. We reduce mRNA codon optimization, a combinatorial problem routinely solved in drug development, to sampling from an Ising model, making it directly executable on a thermodynamic sampling unit (TSU). Benchmarking three approaches (Potts sampling, Ising sampling, and a genetic algorithm baseline) on the SARS-CoV-2 spike protein, we find that all achieve comparable optimization quality (scores ~234-240), but energy estimates based on validated hardware models indicate that a TSU could solve this problem using approximately 10e6 times less energy than a conventional GPU. All code is released under an open-source license.

2606.17127 2026-06-17 q-bio.QM cs.AI cs.LG 新提交

Agentic Discovery of Non-Canonical Antimicrobial Peptides with AMPGAN v3

AMPGAN v3 的非经典抗菌肽智能发现

Jay Jung, Xiaohan Zhang, Shenghan Song, Mahmoud Sayedahmed, Chijian Xiang, Yunong Xu, Ahmed AbdelKhalek, Severin T. Schneebeli, Matthew J. Wargo, Jianing Li, Safwan Wshah

发表机构 * University of Vermont(弗吉尼亚大学) Larner College of Medicine, University of Vermont(弗吉尼亚大学医学学院) Purdue University(普渡大学) Department of Comparative Pathobiology(比较病理科部门) Department of Horticulture and Landscape Architecture(园艺与景观建筑部门) Department of Industrial and Molecular Pharmaceutics(工业与分子药学部门)

AI总结 提出 AMPGAN v3,一种多目标条件 GAN,扩展生成词汇至 D-氨基酸和末端修饰,通过双判别器提升稳定性,体外验证显示对革兰氏阳性菌有活性,并引入 PepCraft 多智能体框架用于端到端发现。

Comments Presented at the GenBio Workshop, ICML 2026

详情
AI中文摘要

抗菌药物耐药性每年导致超过一百万人死亡。抗菌肽(AMP)是一种有前景的解决方案,但生成式 AMP 模型尚未准备好设计含有非天然氨基酸和/或化学修饰的肽,而这些对于实际肽药物至关重要。我们提出了 AMPGAN v3,一种多目标条件 GAN,它将生成词汇扩展到 D-氨基酸和 N/C 末端修饰(如酰胺化)。通过将对抗性和活性感知监督分离到两个专门的判别器中,AMPGAN v3 显著提高了训练稳定性,并在外部分类器上优于先前的生成式 AMP 模型。我们在体外验证了跨越三个结构类别的五个候选物;其中两个对革兰氏阳性菌株表现出活性,最佳候选物对枯草芽孢杆菌的 MIC 达到 8 μg/mL。为了支持下游筛选,我们进一步提出了 PepCraft,一个用于端到端 AMP 发现的多智能体框架,其中规划智能体协调专门的执行器进行生成、过滤和验证。其优先级推荐与我们的体外结果一致。这些贡献使我们能够在小型但真实的规模上研究生成式和智能体 AI 如何在治疗性肽发现中协同作用。代码:this https URL

英文摘要

Antimicrobial resistance causes to over a million deaths annually. Antimicrobial peptides (AMPs) are a promising solution, but generative AMP models are not yet ready to design peptides with non-natural amino acids and/or chemical modifications, which are essential for real-world peptide drugs. We present AMPGAN v3, a multi-objective conditional GAN that expands the generative vocabulary to D-amino acids and N/C-terminus modifications such as amidation. By separating adversarial and activity-aware supervision across two specialized discriminators, AMPGAN v3 substantially improves training stability and outperforms prior generative AMP models on external classifiers. We validated five candidates spanning three structural classes in vitro; two showed activity against Gram-positive strains, with the best candidate reaching MIC 8 {\mu}g/mL against B. subtilis. To support downstream curation, we further present PepCraft, a multi-agent framework for end-to-end AMP discovery in which a Planning Agent orchestrates specialized executors for generation, filtering, and verification. Its prioritization recommendations align with our in vitro outcomes. Together, these contributions let us examine, on a small but real scale, how generative and agentic AI compose in therapeutic peptide discovery. Code: this https URL

2606.17062 2026-06-17 q-bio.QM cs.LG 新提交

RadSEM: A Finding-by-Finding Metric for Clinical Consistency in Radiology Reports

RadSEM:放射学报告中临床一致性的逐发现指标

Zhenhong Yang, Zhuoyun Liu, Jintao Fei, Wen Tang, Shichao Quan, Jun Zhao, Jun Xu

发表机构 * JDH Algo, JD Health International Inc., China Department of Big Data in Health Science, The First Affiliated Hospital of Wenzhou Medical University, China Zhejiang Engineering Research Center for Hospital Emergency Department of Intensive Care Unit, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, Zhejiang, China

AI总结 提出RadSEM指标,通过约束LLM辅助将报告重写为原子发现句,进行矛盾感知的多对多匹配,并计算异常加权的F1分数,在SSREE测试中优于现有指标,实现高一致性评分。

详情
AI中文摘要

放射学报告评估必须区分临床兼容性与表面相似性,因为否定、侧别或正常-异常极性可能逆转发现。我们提出RadSEM(放射学句子级评估指标),一种受约束的LLM辅助指标,用于基于参考的放射学发现评估。RadSEM将参考报告和生成报告重写为有序的原子发现句,每个句子表达一个部位-发现命题。然后执行矛盾约束的多对多匹配:不兼容对(如“积液”和“无积液”)不得分,而兼容的粒度差异可获部分得分。确定性阶段根据部分-整体和异常-细节关系对配对加权,计数未匹配的发现,并生成异常加权的加权F1分数。因此,LLM支持结构化重写和局部对齐,而非充当不透明评判者。我们使用SSREE(一种受控单调性压力测试,基于2,448份去标识报告扩展为五个等级损坏水平)评估RadSEM。RadSEM的Kendall tau_b达到0.957,全对一致性97.8%,相邻一致性95.0%,81.9%的报告实现严格五级排序,优于放射学专用和通用文本指标,同时避免了极性反转报告重新获得词汇重叠的失败。在同一SSREE集上,RadSEM优于参考锚定的RadSEM-Alt策略,将相邻一致性从90.7%提升至95.0%,严格排序从67.2%提升至81.9%。在599个三元组同义词/反义词子集上,RadSEM在597个案例(99.67%)中偏好同义词。这些结果表明,显式发现单元、矛盾感知匹配和异常聚焦的确定性评分使报告评分更具可解释性,并对临床有意义的错误更敏感。代码见:此https URL。

英文摘要

Radiology report evaluation must distinguish clinical compatibility from surface similarity, because negation, laterality, or normal-abnormal polarity can reverse a finding. We propose RadSEM (Radiology Sentence-Level Evaluation Metric), a constrained LLM-assisted metric for reference-based evaluation of radiology Findings. RadSEM rewrites reference and generated reports into ordered atomic finding sentences, each expressing one site-finding proposition. It then performs contradiction-constrained many-to-many matching: incompatible pairs such as "effusion" and "no effusion" receive no credit, while compatible granularity differences can receive partial credit. A deterministic stage weights pairs by part-whole and abnormal-detail relationships, counts unmatched findings, and produces an abnormal-focused weighted F1 score. Thus, the LLM supports structured rewriting and local alignment rather than acting as an opaque judge. We evaluate RadSEM with SSREE, a controlled monotonicity stress test built from 2,448 de-identified reports expanded into five graded corruption levels. RadSEM achieves Kendall tau_b of 0.957, all-pairs concordance of 97.8%, adjacent concordance of 95.0%, and strict five-level ordering for 81.9% of reports, outperforming radiology-specific and general text metrics while avoiding the failure in which polarity-inverted reports regain lexical overlap. On the same SSREE set, RadSEM outperforms the Ref-anchored RadSEM-Alt policy, improving adjacent concordance from 90.7% to 95.0% and strict ordering from 67.2% to 81.9%. On a 599-triplet synonym/antonym subset, RadSEM prefers synonyms in 597 cases (99.67%). These results suggest that explicit finding units, contradiction-aware matching, and abnormal-focused deterministic scoring make report scoring more interpretable and sensitive to clinically meaningful errors. Code is available at this https URL.

2606.17742 2026-06-17 cs.CV q-bio.NC 新提交

BrainWorld: A Structural-Prior-Conditioned Generative Model for Whole-Brain 4D fMRI Dynamics

BrainWorld:一种用于全脑4D fMRI动力学的结构先验条件生成模型

Junfeng Xia, Wenhao Ye, Junxiang Zhang, Xuanye Pan, Mo Wang, Quanying Liu

发表机构 * Department of Biomedical Engineering, Southern University of Science and Technology(南方科技大学生物医学工程系) School of Biomedical Engineering, Shenzhen University(深圳大学生物医学工程学院)

AI总结 提出BrainWorld模型,利用结构MRI作为解剖先验条件,通过去噪过程生成全脑4D fMRI动态,在22个数据集上稳定生成400帧轨迹,并通过生成样本增强提升下游任务性能。

详情
AI中文摘要

全脑4D fMRI生成对于建模功能性脑动力学具有重要价值,然而现有的fMRI基础模型主要针对表示学习和下游预测,而非条件预测生成。我们提出BrainWorld,一种用于全脑4D fMRI动力学的结构先验条件生成模型。BrainWorld使用sMRI作为受试者级别的解剖上下文来指导未来的fMRI生成,将结构信息整合到去噪过程中,而非将其视为并行模态。在涵盖不同队列和脑状态的22个数据集上评估,BrainWorld能够生成长达400帧的稳定4D fMRI轨迹,通过生成样本增强提升下游性能,并学习到可迁移的多模态表示,优于基线方法。这些结果共同确立了BrainWorld作为长时程脑动力学建模和多模态表示学习的条件感知生成框架。

英文摘要

Whole-brain 4D fMRI generation is valuable for modeling functional brain dynamics, yet existing fMRI foundation models mainly target representation learning and downstream prediction rather than conditional predictive generation. We introduce BrainWorld, a structural-prior-conditioned generative model for whole-brain 4D fMRI dynamics. BrainWorld uses sMRI as subject-level anatomical context to guide future fMRI generation, integrating structural information into the denoising process rather than treating it as a parallel modality. Evaluated on 22 datasets spanning diverse cohorts and brain states, BrainWorld generates stable 4D fMRI trajectories up to 400 frames, improves downstream performance through generated-example augmentation, and learns transferable multimodal representations that outperform baselines. Together, these results establish BrainWorld as a condition-aware generative framework for long-horizon brain dynamics modeling and multimodal representation learning.

2606.17668 2026-06-17 cs.LG cs.AI q-bio.QM 新提交

ASTEROID: A Spatiotemporal Information Transformer for Forecasting Multi-Step Time Series of Molecular Dynamics

ASTEROID: 用于分子动力学多步时间序列预测的时空信息变换器

Kexin Wu, Luonan Chen, Renxiao Wang

AI总结 提出ASTEROID框架,通过将分子动力学轨迹重构为高维时空序列并集成时空信息变换方程到Transformer中,实现多步原子坐标的直接预测,在多个量子力学分子数据集上显著提升预测精度并降低计算成本。

Comments 32 pages,10 figures

详情
AI中文摘要

分子动力学(MD)模拟计算需求高,尤其对于需要长期分析的大规模系统。准确预测MD模拟结果不仅是一个有吸引力的科学挑战,而且具有重要的实用价值。在这项工作中,我们开发了一个数据驱动框架,称为ASTEROID(用于推断动力学的先进时空变换器),可以直接预测多步原子坐标,避免传统的迭代积分。为此,我们的ASTEROID将MD轨迹重构为高维时空序列,并将时空信息(STI)变换方程集成到Transformer架构中。ASTEROID的核心创新在于其建模多尺度时空依赖性的能力。具体来说,对于空间依赖性,局部-全局自注意力机制捕获短程和长程相互作用。对于时间依赖性,编码器-解码器结构将全局上下文与自回归预测相结合。ASTEROID在几个量子力学衍生的分子数据集上进行了评估。我们的结果表明,ASTEROID不仅在各种基准测试中实现了比现有方法更高的多步预测精度,而且显著降低了传统MD模拟的计算成本。此外,该模型支持在扩展时间尺度上的迭代多步预测。这项工作为加速MD模拟建立了一个稳健且可推广的数据驱动范式。

英文摘要

Molecular dynamics (MD) simulation is computationally demanding, particularly for large-scale systems requiring long-term analysis. Accurate forecast of the outcomes of a MD simulation is not only an attractive scientific challenge but also has substantial practical value. In this work, we developed a data-driven framework, termed ASTEROID (Advanced Spatiotemporal TransformER fOr Inferring Dynamics), that can directly predict multi-step atomic coordinates, avoiding conventional iterative integration. For this purpose, our ASTEROID reformulates MD trajectories as high-dimensional spatiotemporal sequences and integrates the Spatiotemporal Information (STI) Transformation equation into a Transformer architecture. The core innovation of ASTEROID lies in its ability to model multiscale spatiotemporal dependencies. In particular, for spatial dependencies, a local-global self-attention mechanism captures both short- and long-range interactions. For temporal dependencies, an encoder-decoder structure integrates global context with autoregressive forecasting. ASTEROID was evaluated on several quantum-mechanics derived molecular datasets. Our results indicate that ASTEROID achieved not only a higher level of accuracy in multi-step prediction than existing methods on various benchmarks, but also significantly reduced computational cost of conventional MD simulation. Moreover, the model supports iterative multi-step forecasting over an extended time scale. This work establishes a robust and generalizable data-driven paradigm for accelerating MD simulations.

2606.17456 2026-06-17 cs.RO q-bio.NC 新提交

Embodiment Shapes Rolling Behavior in a Multimodal Infant Model

具身形态塑造多模态婴儿模型中的翻滚行为

Leon Philipp, Francisco M. López, Jochen Triesch

发表机构 * Frankfurt Institute for Advanced Studies(法兰克福高等研究院) Goethe University Frankfurt(法兰克福大学) University of New South Wales(新南威尔士大学)

AI总结 通过虚拟婴儿MIMo学习仰卧到俯卧翻滚,研究婴儿运动发展中的具身形态变化如何影响行为,发现与真实婴儿一致的发育趋势和协调模式。

Comments 7 pages, 7 figures. Accepted at the 2026 IEEE ICDL Conference. Cite as: L. Philipp, F. M. López, and J. Triesch, "Embodiment Shapes Rolling Behavior in a Multimodal Infant Model", in 2026 IEEE International Conference on Development and Learning (ICDL). IEEE, 2026, pp. 1-7

详情
AI中文摘要

翻身是婴儿运动发展中最早期的里程碑之一,反映了协调的全身感觉运动控制的出现。在这里,我们使用MIMo(一个配备本体感觉和前庭感觉的虚拟婴儿具身模型)对婴儿翻滚进行计算研究。MIMo通过强化学习学习从仰卧到俯卧的翻滚。有趣的是,学习到的行为捕捉到了与真实婴儿报告一致的发育趋势和协调模式,包括随着年龄增长表现提升和执行速度加快。我们的结果解释了婴儿的能力和限制如何能在人工代理中产生逼真的行为,特别强调了运动发展如何受到不断变化的身体形态的影响。这项工作突出了具身计算模型作为研究感觉运动发展的强大工具的作用。

英文摘要

Rolling over is one of the earliest milestones in infant motor development, reflecting the emergence of coordinated, whole-body sensorimotor control. Here, we conduct a computational study of infant rolling using MIMo, a virtual infant embodiment equipped with proprioception and vestibular sensation. MIMo learns supine-to-prone rolls with reinforcement learning. Interestingly, the learned behaviors capture developmental trends and coordination patterns consistent with those reported in real infants, including improved performance and faster execution with age. Our results explain how infant capabilities and constraints can give rise to realistic behaviors in artificial agents, with a particular emphasis on how motor development is shaped by the changing body morphology. This work highlights the role of embodied computational models as a powerful tool for studying sensorimotor development.

2606.17115 2026-06-17 cs.LG cs.AI q-bio.QM 新提交

Probing, Fusion, and Trustworthiness: A Systematic Evaluation of Foundation Model Representations for Multimodal Cancer Analysis

探测、融合与可信度:基础模型表示在多模态癌症分析中的系统评估

Jingyu Hu, Giuseppe Tripodi, Reed Naidoo, Sarah F. McGough, Tapabrata Chakraborti

发表机构 * The Alan Turing Institute(艾伦·图灵研究所) University of Bristol(布里斯托大学) University of Manchester(曼彻斯特大学) The Institute of Cancer Research(癌症研究所) Genentech(基因泰克)

AI总结 系统评估基础模型表示在计算病理学任务中的性能,发现图像和组学表示互补,多模态融合在单模态不占优时有效,并利用共形预测验证了不确定性感知推理的临床价值。

详情
AI中文摘要

基础模型(FMs)已成为医学数据的强大表示提取器,但它们在分布偏移下的泛化能力仍未充分探索。本工作系统评估了基于FM的表示在计算病理学任务上的表现,涉及两个真实世界商业队列IH-BC和IH-NSCLC,这些队列来自许可的内部(IH)肿瘤学数据集。分析聚焦于两种模态:全切片图像和转录组图谱,均来自IH多模态数据。我们首先在八个下游分类任务上对五个FM进行单模态探测性能基准测试,发现图像和组学表示携带互补的预测信号。然后,我们通过比较三种基于配对表示的图像-组学融合策略,研究多模态融合是否能在单模态基线之上带来额外收益。进一步通过共形预测评估所选单模态和多模态管道的可信度。我们的结果表明,FM表示在分布外数据上取得了竞争性性能,且多模态融合主要在单模态不占主导信号时有所帮助。共形预测揭示,在点预测失败的大多数情况下,真实诊断仍可在预测集中恢复,这强化了不确定性感知推理对临床支持的价值。

英文摘要

Foundation models (FMs) have emerged as powerful representation extractors for medical data, yet their generalizability to datasets under distribution shift remains underexplored. This work systematically evaluates FM-based representations on a suite of computational pathology tasks across two real-world commercial cohorts, IH-BC and IH-NSCLC, drawn from the licensed in-house (IH) oncology dataset. The analysis focuses on two modalities, whole-slide images and transcriptomic profiles, drawn from the IH multimodal data. We first benchmark unimodal probing performance across five FMs on eight downstream classification tasks, and find that image and omics representations carry complementary predictive signals. Then we investigate whether multimodal fusion can yield additional gains over unimodal baselines by comparing three image-omics fusion strategies built on paired representations. The trustworthiness of selected unimodal and multimodal pipelines is further assessed through conformal prediction. Our results show that FM representations achieve competitive performance on out-of-distribution data and that multimodal fusion helps mainly when no single modality dominates the signal. Conformal prediction reveals that in the majority of cases where a point prediction fails, the true diagnosis remains recoverable within the prediction set, reinforcing the value of uncertainty-aware inference for clinical support.

2606.17586 2026-06-17 q-bio.PE math.DS 新提交

Aggregation as a Double-Edged Sword: Fear, Allee Effects, and Finite-Time Collapse

聚合是一把双刃剑:恐惧、Allee效应与有限时间崩溃

Kwadwo Antwi-Fordjour, Eric M. Takyi

AI总结 研究通过疾病-捕食模型揭示,猎物聚合在恐惧和Allee效应下会加速生态系统有限时间崩溃,并首次量化了行为与人口参数对崩溃速度的影响。

详情
AI中文摘要

猎物聚合被广泛视为抵御捕食的防御机制,但我们表明,在受捕食者诱导的恐惧和人口Allee阈值影响的疾病结构化种群中,聚合可能矛盾地加速生态系统崩溃。我们开发并分析了一个易感-感染-捕食者模型,该模型包含双重恐惧反应——以及一个次线性基于聚合的捕食项和Allee效应。关键地,我们推导出灭绝时间的显式上界,该上界随着捕食压力增加或聚合增强而减小,首次量化了行为和人口参数如何共同决定生态崩溃的速度。这种有限时间灭绝随后引发感染猎物和捕食者种群的级联崩溃,导致整个生态群落灭绝。分岔分析揭示了随着恐惧强度、聚合强度和Allee阈值变化而出现的跨临界、鞍结和Hopf分岔。双参数延拓进一步确定了恐惧-Allee参数平面中稳定共存、振荡共存、捕食者排除和有限时间灭绝发生的精确区域,表明更强的聚合单调地扩大了有限时间灭绝区域,而较弱的聚合支持更丰富的共存动态景观。这些结果表明,当种群水平的行为防御与疾病动态和人口脆弱性相互作用时,可能产生突然的生态临界点。

英文摘要

Prey aggregation is widely regarded as a defense against predation, yet we show that in disease-structured populations subject to predator-induced fear and demographic Allee thresholds, aggregation can paradoxically accelerate ecosystem collapse. We develop and analyze a susceptible-infectious-predator model incorporating dual fear responses -- together with a sublinear aggregation-based predation term and an Allee effect. Critically, we derive an explicit upper bound on the extinction time that decreases as predator pressure increases or aggregation strengthens, quantifying for the first time how behavioral and demographic parameters jointly determine the speed of ecological collapse. This finite-time extinction subsequently triggers a cascade collapse of the infected prey and predator populations, driving the entire ecological community to extinction. Bifurcation analysis reveals transcritical, saddle-node, and Hopf bifurcations as fear intensity, aggregation strength, and Allee threshold vary. Two-parameter continuation further identifies the precise regions of the fear--Allee parameter plane in which stable coexistence, oscillatory coexistence, predator exclusion, and finite-time extinction occur, demonstrating that stronger aggregation monotonically enlarges the finite-time extinction region while weaker aggregation supports a richer landscape of coexistence dynamics. These results demonstrate that behavioral defenses operating at the population level can generate abrupt ecological tipping points when they interact with disease dynamics and demographic vulnerability.

2606.17125 2026-06-17 q-bio.PE math.DS math.OC 新提交

Tipping the Balance: Allee Thresholds, Saddle-Node Bifurcations, and Optimal Sterile-Male Release Strategies for Anopheles Mosquitoes

打破平衡:按蚊的Allee阈值、鞍结分岔与最优不育雄蚊释放策略

Abba Gumel, C. Alex Safsten

AI总结 针对按蚊的性别和阶段结构模型,研究不育昆虫技术(SIT)下的Allee效应,证明通过释放不育雄蚊可将种群推过Allee阈值实现消除,并优化释放策略。

Comments 47 pages

详情
AI中文摘要

我们建立并分析了一个按蚊动态的性别和阶段结构模型,该模型考虑了不育昆虫技术(SIT),其动机是需要对杀虫剂抗性和户外传播具有鲁棒性的工具。模型追踪水生阶段、成年雄性、未交配雌性以及与野生或不育雄性交配的雌性;包括产卵能力和幼虫竞争;并使用一个不应期后跟密度依赖的配偶搜索。由此产生的Holling II型交配项产生了配偶寻找的Allee效应。在建立适定性后,我们证明该Allee效应使得无蚊平衡对所有允许参数局部稳定,并且当快速配偶搜索再生数$R_0^q$小于1时全局渐近稳定。当$R_0^q>1$、栖息地容量大且幼虫竞争弱时,通过鞍结分岔出现两个正平衡:一个稳定的自然平衡和一个不稳定的Allee平衡,将持续存在与灭绝分开。对于一个简化模型,Goh-Volterra Lyapunov泛函估计了持续存在的吸引域。然后我们展示了恒定和种群响应的不育雄蚊释放如何重塑这种双稳态。足够大的释放通过第二个鞍结分岔消灭了正平衡,而足够大的恒定释放从每个允许的初始状态驱动局部消除。因此,SIT只需将种群推过Allee分界线,之后配偶寻找失败即可完成灭绝。在一个具有Allee阈值停止规则的自由时域优化框架中,混合释放策略相对于最佳恒定策略将不育雄蚊需求减少约5%,相对于最佳种群响应策略减少约39%。这些结果将Allee效应重新定义为一种媒介抑制的控制杠杆。

英文摘要

We formulate and analyze a sex- and stage-structured model for Anopheles dynamics under the sterile insect technique (SIT), motivated by the need for tools robust to insecticide resistance and outdoor transmission. The model tracks aquatic stages, adult males, unmated females, and females mated with wild or sterile males; includes egg-laying capacity and larval competition; and uses a refractory period followed by density-dependent mate search. The resulting Holling type-II mating term generates a mate-finding Allee effect. After establishing well-posedness, we prove that this Allee effect makes the mosquito-free equilibrium locally stable for all admissible parameters and globally asymptotically stable when a quick-mate-search reproduction number $R_0^q$ is below one. When $R_0^q>1$, habitat capacity is large, and larval competition is weak, two positive equilibria arise through a saddle-node bifurcation: a stable natural equilibrium and an unstable Allee equilibrium separating persistence from extinction. For a reduced model, a Goh-Volterra Lyapunov functional estimates the persistence basin. We then show how constant and population-responsive sterile-male releases reshape this bistability. Sufficiently large releases annihilate the positive equilibria in a second saddle-node bifurcation, while a sufficiently large constant release drives local elimination from every admissible initial state. Thus SIT need only push the population across the Allee separatrix, after which mate-finding failure can complete extinction. In a free-horizon optimization framework with an Allee-threshold stopping rule, a hybrid release strategy reduces the sterile-male requirement by about $5\%$ relative to the best constant-only strategy and $39\%$ relative to the best population-responsive-only strategy. These results recast the Allee effect as a control lever for vector suppression.

2606.17366 2026-06-17 math.AT math.GT q-bio.BM 新提交

A Persistent Homology Signature of Knotting

打结的持续同调签名

Aurelie Jodelle Kemme, Collins A. Agyingi, Colleen Farrelly, Agnese Barbensi

AI总结 研究通过持续同调识别打结,提出基于超图曲率的评分方法,在蛋白质家族和合成例子中区分打结与未打结结构。

Comments Comments are welcome

详情
AI中文摘要

我们询问是否可以使用持续同调识别打结。从曲线的点云表示开始,我们计算一维持续同调,提取循环代表,并为这些循环分配基于超图曲率的分数。受蛋白质启发但在更广泛范围内测试,该方法揭示了蛋白质家族和合成例子中打结与未打结结构之间的系统性差异。这表明打结留下了可检测的基于持续同调的签名。

英文摘要

We ask whether knotting can be recognised using persistent homology. Starting from a point-cloud representation of a curve, we compute one-dimensional persistent homology, extract cycle representatives, and assign a hypergraph curvature-based score to these cycles. Motivated by proteins but tested more broadly, the method reveals systematic differences between knotted and unknotted structures in both protein families and synthetic examples. This suggests that knotting leaves a detectable persistent-homology-based signature.

2606.17891 2026-06-17 q-bio.CB math.AP nlin.AO physics.bio-ph 新提交

A nonlinear theory for chemotactic fronts of mixed populations

混合群体趋化前沿的非线性理论

Giulia L. Celora, Marjorie Watts, Carles Falcó, Mohit P. Dalwadi

AI总结 通过渐近分析推导非线性理论,揭示细胞扩散性、趋化剂消耗和趋化敏感性如何决定异质性细胞群体的密度分布,并预测树突细胞与T细胞共迁移中的参数平衡。

详情
AI中文摘要

异质性细胞群体的集体迁移是许多生物和生理过程的核心,包括发育和免疫反应。最近的实验和理论进展表明,与自生化学梯度的非对称相互作用如何塑造迁移群体中不同细胞类型的空间分布。然而,控制异质性细胞群体稳健空间组织的原理仍然知之甚少。在这里,我们使用渐近分析系统地推导了由自生趋化性引导的异质性细胞集体的非线性解析理论。我们的理论解析了细胞扩散性、趋化剂消耗和趋化敏感性的异质性如何塑造迁移异质性群体的密度分布,揭示了四种不同的动力学行为,共同涵盖了所有可能的区域。我们将我们的框架校准到树突细胞和T细胞共迁移的实验数据。我们预测该系统在一个参数区域内运行,该区域平衡了细胞间混合与T细胞在迁移集体前沿的定位。我们的理论揭示,这种行为是通过树突细胞对趋化剂的强消耗产生的中间长程趋化信号实现的。总的来说,我们的框架提供了理解非互惠化学相互作用如何塑造异质性细胞群体中稳健集体迁移的一般原理。

英文摘要

Collective migration of heterogeneous cell populations is central to many biological and physiological processes, including development and immune response. Recent experimental and theoretical advances have shown how asymmetric interactions with self-generated chemical gradients shape the spatial distribution of distinct cell types within migrating collectives. However, the principles governing robust spatial organisation of heterogeneous cell populations remain poorly understood. Here, we use asymptotic analysis to systematically derive a nonlinear analytical theory for heterogeneous cell collectives guided by self-generated chemotaxis. Our theory disentangles how heterogeneity in cell diffusivity, chemoattractant consumption, and chemotactic sensitivity shape the density profiles of migrating heterogeneous collectives, revealing four distinct dynamical behaviours that together capture all possible regimes. We calibrate our framework to experimental data on the co-migration of dendritic and T cells. We predict that this system operates in a parameter regime that balances intercellular mixing with T-cell localisation at the leading front of the migrating collective. Our theory reveals that this behaviour is enabled by intermediate long-range chemoattractant signalling generated through strong chemoattractant consumption by dendritic cells. Overall, our framework provides general principles for understanding how non-reciprocal chemical interactions shape robust collective migration in heterogeneous cell populations.

2606.16590 2026-06-17 cs.LG cs.AI q-bio.NC 新提交

Infant Spontaneous Movement Noise Improves Exploration in Deep RL

婴儿自发运动噪声改善深度强化学习中的探索

Francisco M. López, Markus R. Ernst, Francisco Cruz, Matej Hoffmann, and Jochen Triesch

发表机构 * Frankfurt Institute for Advanced Studies(法兰克福高等研究所) School of Computer Science and Engineering, University of New South Wales(新南威尔士大学计算机科学与工程学院) Escuela de Ingeniería, Universidad Central de Chile(智利中央大学工程学院) Faculty of Electrical Engineering, Czech Technical University(捷克理工大学电气工程学院)

AI总结 受婴儿自发运动噪声启发,提出一种在RL训练中逐步增加时间自相关的探索噪声机制,实验表明其能产生结构化探索行为并提高学习效率。

Comments 6 pages, 4 figures, 1 table. Accepted at IEEE ICDL 2026. Cite as: F. M. López, M. R. Ernst, F. Cruz, M. Hoffmann, and J. Triesch, "Infant Spontaneous Movement Noise Improves Exploration in Deep RL", in 2026 IEEE International Conference on Development and Learning (ICDL). IEEE, 2026, pp. 1-6

详情
AI中文摘要

深度强化学习(RL)中的探索通常实现为时间上不相关的白噪声。然而,最近的研究表明,时间相关的有色噪声可以通过产生更平滑的轨迹和更好的状态空间覆盖来提高探索效率。我们探究受婴儿自发运动启发的动作噪声是否也能改善深度RL中的探索。我们发现婴儿末端执行器速度的功率谱密度遵循有色噪声过程,其谱指数随年龄增长而增加。受这一发育模式的启发,我们引入了一种机制,在RL训练过程中逐步增加探索噪声的时间自相关,与婴儿统计数据相匹配。在多个RL环境中的实验表明,婴儿启发的噪声产生结构化的探索行为,并且与传统的探索策略相比可以提高学习效率。这些发现表明,人类运动和认知发展可以为人工智能体的学习机制设计提供有用的指导。我们的代码可在 https://github.com/trieschlab/baby-noise-rl 获取。

英文摘要

Exploration in deep reinforcement learning (RL) is commonly implemented as temporally uncorrelated white noise. However, recent works show that temporally correlated colored noise can improve exploration efficiency by producing smooth trajectories with better coverage of the state space. We inquire whether action noise inspired by infant spontaneous movements can also improve exploration in deep RL. We find that the power spectral densities of babies' end-effector velocities follow a colored noise process where the spectral exponent increases with age. Inspired by this developmental pattern, we introduce a mechanism that progressively increases the temporal auto-correlation of exploration noise during RL training, matching the infant statistics. Experiments across several RL environments show that infant-inspired noise produces structured exploratory behavior and can improve learning efficiency compared to conventional exploration strategies. These findings suggest that human motor and cognitive development can provide useful guidance for designing learning mechanisms in artificial agents. Our code is available at this https URL.

2606.15310 2026-06-17 q-bio.OT cond-mat.stat-mech 新提交

Biological proper time and entropy-cost invariance in cardiac and respiratory lifespan scaling

心脏与呼吸寿命标度中的生物本征时间和熵成本不变性

Mesfin Taye

AI总结 基于热力学框架提出生物时间等价原理,将寿命周期数解释为总熵产生与单周期熵成本之比,并证明在异速标度下质量比熵成本不变。

Comments 32 pages

详情
AI中文摘要

温血脊椎动物在自然寿命中积累的生理周期数大致守恒:约$10^9$次心跳和$10^8$--$3\times10^8$次呼吸。这些规律并非精确常数,但它们在体重、代谢功率、生理频率和寿命的数量级变化中持续存在,表明生物时间并非仅由时间间隔衡量。我们提出了生物时间等价原理(PBTE),这是一个热力学框架,其中寿命周期数由总寿命熵产生与一个生理周期的熵成本之比决定。从开放系统的熵平衡$\dot S=\dot e_p-\dot h_d$出发,我们将每周期熵成本定义为$σ_0=dΣ/dN$,其中$dΣ$是生理时钟前进$dN$周期时产生的熵。对于成年稳态,这给出了周期数关系$N_\star=Σ/\langleσ_0\rangle$,其中$Σ=\int_0^L \dot e_p(t)\,dt$,$N_\star$是寿命周期数,$Σ$是总寿命熵产生,$\langleσ_0\rangle$是寿命平均的每周期熵成本。在稳态极限下,$\dot e_p\simeq P/T$,因此直接测量代谢功率$P$、体温$T$和生理频率$f$可得$σ_0\simeq P/(Tf)$。PBTE将经验性的寿命周期不变性转化为熵成本不变性。在Kleiber代谢标度和四分之一幂生理频率标度下,质量比熵成本满足$\barσ_0=P/(TfM)\propto M^{3/4+1/4-1}=M^0$,为异速标度的质量抵消提供了热力学解释。

英文摘要

Warm-blooded vertebrates accumulate approximately conserved numbers of physiological cycles over a natural lifetime: of order $10^9$ heartbeats and $10^8$--$3\times10^8$ breaths. These regularities are not exact constants, but their persistence across orders-of-magnitude variation in body mass, metabolic power, physiological frequency, and lifespan suggests that biological time is not measured by chronological duration alone. We develop the Principle of Biological Time Equivalence (PBTE), a thermodynamic framework in which lifetime cycle count is determined by the ratio between total lifetime entropy production and the entropy cost of one physiological cycle. Starting from the open-system entropy balance $\dot S=\dot e_p-\dot h_d$, we define the entropy cost per cycle as $\sigma_0=d\Sigma/dN$, where $d\Sigma$ is the entropy produced as the physiological clock advances by $dN$ cycles. For an adult homeostatic regime, this gives the cycle-count relation $N_\star=\Sigma/\langle\sigma_0\rangle$, with $\Sigma=\int_0^L \dot e_p(t)\,dt$, where $N_\star$ is the lifetime cycle count, $\Sigma$ is total lifetime entropy production, and $\langle\sigma_0\rangle$ is the lifetime-averaged entropy cost per cycle. In the homeostatic limit, $\dot e_p\simeq P/T$, so direct measurement of metabolic power $P$, body temperature $T$, and physiological frequency $f$ gives $\sigma_0\simeq P/(Tf)$. PBTE converts the empirical lifetime-cycle invariants into entropy-cost invariants. Under Kleiber metabolic scaling and quarter-power physiological-frequency scaling, the mass-specific entropy cost satisfies $\bar\sigma_0=P/(TfM)\propto M^{3/4+1/4-1}=M^0$, providing a thermodynamic interpretation of allometric mass cancellation.

2606.09770 2026-06-17 q-bio.NC cs.LG 新提交

Discovering Functionally Selective Brain Regions with a Deep Topographic Multimodal Model

发现功能选择性脑区:一种深度地形多模态模型

Badr AlKhamissi, Johannes Mehrer, Lara Marinov, Ahmed Abdelaal, Abdulkadir Gokce, Martin Schrimpf

发表机构 * University of California, Berkeley(加州大学伯克利分校) Max Planck Institute for Human Cognitive and Brain Sciences(马克斯·普朗克人类认知与脑科学研究所) ETH Zurich(苏黎世联邦理工学院)

AI总结 提出Topo-Omni模型,通过空间平滑微调预训练基础模型,在单一连续虚拟皮层上整合视觉、听觉和语言/认知处理,产生与人类神经影像一致的多模态聚类,并用于发现新脑区。

Comments Preprint. First two author contributed equally

详情
AI中文摘要

皮层中的邻近神经元具有相似的反应特征,从而在感觉和认知系统中产生系统性的空间组织。最近的地形模型再现了这种结构的某些方面,但仍然是单模态的,并且对每一层分别施加空间约束,产生了碎片化的图谱,既不能捕捉皮层处理流的连续性,也不能捕捉跨模态的整合。我们引入了Topo-Omni,一种地形多模态模型,其中视觉、听觉和语言/认知处理共享一个单一的连续虚拟皮层。通过使用空间平滑目标微调预训练的基础模型,该架构在跨模态中发展出与人类神经影像一致的聚类,从感觉系统到认知系统。驱动或抑制一个聚类会选择性偏向或损害感知,这与人类干预研究相似。最后,我们使用我们的模型在虚拟皮层中筛选新的聚类,并发现了新的自然景观和动物网络,并在人类数据中验证了它们。因此,单一的空间原则组织了跨模态和处理阶段的表征,产生了关于皮层组织的可检验假设。

英文摘要

Nearby neurons in cortex share similar response profiles, producing systematic spatial organization across sensory and cognitive systems. Recent topographic models reproduce aspects of this structure but remain unimodal and spatially constrain each layer separately, yielding fragmented maps that capture neither the contiguity of cortical processing streams nor their integration across modalities. We introduce Topo-Omni, a topographic multimodal model in which visual, auditory, and language/cognitive processing share a single contiguous in-silico sheet. Built by fine-tuning a pretrained foundation model with a spatial smoothness objective, this architecture develops clusters across modalities that are consistent with human neuroimaging, from sensory to cognitive systems. Driving or suppressing a cluster selectively biases or impairs perception, paralleling human intervention studies. Finally, we use our model to screen for novel clusters in-silico and discover new natural landscape and animal networks which we validate in human data. A single spatial principle thus organizes representations across modalities and processing stages, yielding testable hypotheses about cortical organization.

2605.26921 2026-06-17 cs.CV q-bio.NC 版本更新

Similarity-based representation factorization for revealing interpretable dimensions in representational data

揭示大脑、行为和AI中表征的核心维度

Florian P. Mahner, Ka Chun Lam, Francisco Pereira, Martin N. Hebart

发表机构 * Max Planck Institute for Human Cognitive and Brain Sciences(人类认知与脑科学最大平面研究所) National Institute of Mental Health(心理健康国家研究所) Justus Liebig University Giessen(吉森约斯特-利普大学) Center for Mind, Brain and Behavior(心智、脑与行为中心)

AI总结 提出相似性基表示因子分解(SRF)方法,从相似性矩阵中恢复低维、非负、可解释的嵌入,以揭示神经、行为和计算数据中表征的潜在维度。

详情
AI中文摘要

表征研究广泛存在于神经科学、心理学和人工智能等领域。虽然通常通过刺激之间的相似性来研究和比较表征,但现有方法仅能有限地访问塑造这些表征的维度,且可解释性有限。为克服这些挑战,本文引入相似性基表示因子分解(SRF),一种通用的计算方法,用于从测量数据导出的相似性矩阵中恢复低维、非负、可解释的嵌入。在模拟以及多种神经、行为和计算数据集中,SRF能从各种形式的表征数据中恢复可解释的维度,即使对于非常稀疏采样、不完整的数据也是如此。从这些数据集中导出的维度与任务特定模型获得的维度相匹配,预测独立的行为属性,改进探索性分析,并且与比较相似性矩阵相比,为验证性假设检验提供更高的统计功效。这些结果共同确立了SRF作为一种通用方法,在揭示、理解和利用表征背后的维度方面具有广泛的应用前景。

英文摘要

The study of representations is widespread across fields, including neuroscience, psychology, and artificial intelligence. While representations are often studied and compared through similarities between stimuli, current methods provide only limited access to the dimensions that shape these representations and are often limited in interpretability. To overcome these challenges, here we introduce Similarity-Based Representation Factorization (SRF), a general computational method for recovering low-dimensional, non-negative, interpretable embeddings from similarity matrices derived from measured data. Across simulations and many neural, behavioral, and computational datasets, SRF recovers interpretable dimensions from diverse forms of representational data, even for very sparsely sampled, incomplete data. The dimensions derived from these datasets match those obtained by task-specific models, predict independent behavioral properties, improve exploratory analysis, and offer higher power for confirmatory hypothesis testing than comparing similarity matrices. Together, these results establish SRF as a general-purpose method with broad applications for uncovering, understanding, and using the dimensions underlying representations.

2604.27583 2026-06-17 q-bio.NC cs.RO 版本更新

Simulating Infant First-Person Sensorimotor Experience via Motion Retargeting from Babies to Humanoids

通过从婴儿到类人机器人的运动重定向模拟婴儿第一人称感觉运动经验

Francisco M. López, Hoshinori Kanazawa, Ondrej Fiala, Yakov Balashov, Valentin Marcel, Lukas Rustler, Miles Lenz, Dongmin Kim, Yasuo Kuniyoshi, Jochen Triesch, Matej Hoffmann

AI总结 提出一种从单视频重建婴儿3D姿态并映射到物理/虚拟类人平台的方法,实现亚厘米级精度的多感觉流模拟,为发育研究和神经发育障碍早期检测提供新工具。

Comments Accepted at IEEE ICDL 2026. 8 pages, 6 figures. Cite as: F. M. López, H. Kanazawa, O. Fiala, Y. Balashov, V. Marcel, L. Rustler, M. Lenz, D. Kim, Y. Kuniyoshi, J. Triesch, and M. Hoffmann, "Simulating infant first-person sensorimotor experience via motion retargeting from babies to humanoids'', in 2026 IEEE International Conference on Development and Learning (ICDL). IEEE, 2026, pp. 1-8

详情
AI中文摘要

随着人形机器人能力的增强,从人类到类人人工体的运动重定向变得越来越重要。然而,现有方法大多只关注运动学再现,而忽略了与人类运动相关的丰富感觉运动经验。在这项工作中,我们提出了一个框架,使用物理和虚拟类人机器人模拟婴儿的多模态感觉运动经验。从单个视频中,我们的方法通过提取骨骼结构并从每一帧估计完整的3D姿态来重建婴儿的身体配置。然后,我们将重建的运动映射到几个发育平台上:物理iCub机器人和虚拟模拟器pyCub、EMFANT和MIMo。在这些实体上重放重定向的运动会产生模拟的多感觉流,包括本体感觉(关节和肌肉)、触觉和视觉。对于最佳匹配的实体,重定向实现了亚厘米级的精度,并能够对婴儿发育进行丰富的多模态分析,以及增强的行为自动标注。该框架为婴儿的感觉运动经验提供了一个独特的窗口,为机器人学、发育科学和神经发育障碍的早期检测提供了新工具。代码可在https://this URL获取。

英文摘要

Motion retargeting from humans to human-like artificial agents is becoming increasingly important as humanoid robots grow more capable. However, most existing approaches focus only on reproducing kinematics and ignore the rich sensorimotor experience associated with human movement. In this work, we present a framework for simulating the multimodal sensorimotor experiences of infants using physical and virtual humanoids. From a single video, our method reconstructs the infant's body configuration by extracting its skeletal structure and estimating the full 3D pose from each frame. Then we map the reconstructed motion onto several developmental platforms: the physical iCub robot and the virtual simulators pyCub, EMFANT and MIMo. Replaying the retargeted motions on these embodiments produces simulated multisensory streams including proprioception (joints and muscles), touch, and vision. For the best-matching embodiment, the retargeting achieves sub-centimeter accuracy and enables a rich multimodal analysis of infant development as well as enhanced automated annotation of behaviors. This framework provides a unique window into the infant's sensorimotor experience, offering new tools for robotics, developmental science, and early detection of neurodevelopmental disorders. The code is available at this https URL.

2604.16450 2026-06-17 cs.CY cs.LG q-bio.QM 版本更新

Evaluating Intersectional Fairness across Clinical Machine Learning Use Cases using Fairlogue and the All of Us Research Program

使用Fairlogue和All of Us研究计划评估临床机器学习用例中的交叉公平性

Nick Souligne, Vignesh Subbian

AI总结 本文使用Fairlogue工具包在临床预测任务中评估交叉公平性,发现交叉群体差异大于单轴分析,但反事实诊断表明多数差异与随机分组相当。

Comments 10 pages, 7 figures, Accepted at the AMIA Annual Symposium 2026

详情
AI中文摘要

医疗数据中的交叉偏见可能在临床机器学习模型中产生复合差异,然而大多数公平性评估独立地评估人口统计属性。FairLogue是一个用于交叉公平性审计的工具包,被应用于多个临床预测任务,以评估跨组合人口统计群体的差异。使用All of Us数据集,选择两个已发表模型进行复制和评估:(A) 预测选择性5-羟色胺再摄取抑制剂相关的出血事件,(B) 房颤患者两年卒中风险。计算了跨种族、性别和交叉亚组的观察性公平性指标,随后进行反事实分析以评估差异是否可归因于群体成员身份。交叉评估揭示了比单轴分析更大的差异;然而,反事实诊断表明,大多数观察到的差异与随机群体成员身份下预期的差异相当。这些结果强调了交叉公平性审计的重要性,并展示了FairLogue如何为临床机器学习系统中的偏见提供更深入的洞察。

英文摘要

Intersectional biases in healthcare data can produce compound disparities in clinical machine learning models, yet most fairness evaluations assess demographic attributes independently. FairLogue, a toolkit for intersectional fairness auditing, was applied across multiple clinical prediction tasks to evaluate disparities across combined demographic groups. Using the All of Us dataset, two published models were selected for replication and evaluation: (A) prediction of selective serotonin reuptake inhibitor associated bleeding events and (B) two-year stroke risk in patients with atrial fibrillation. Observational fairness metrics were computed across race, gender, and intersectional subgroups, followed by counterfactual analysis to evaluate whether disparities were attributable to group membership. Intersectional evaluation revealed larger disparities than single-axis analyses; however, counterfactual diagnostics indicated that most observed disparities were comparable to those expected under randomized group membership. These results highlight the importance of intersectional fairness auditing and demonstrate how FairLogue provides deeper insight into bias in clinical machine learning systems.

2301.07386 2026-06-17 q-bio.NC stat.AP 版本更新

Hierarchical Bayesian inference for community detection and connectivity of functional brain networks

功能脑网络社区检测与连接性的层次贝叶斯推断

Lingbin Bian, Nizhuan Wang, Leonardo Novelli, Jonathan Keith, Adeel Razi

AI总结 提出基于贝叶斯潜在块模型的多层社区检测方法,在个体和群体层面稳健检测加权功能网络社区结构,保留个体变异性,并通过模拟和真实fMRI数据验证其准确性和可靠性。

详情
AI中文摘要

大多数功能性磁共振成像研究依赖于对层级组织的功能脑网络的估计,这些网络的分隔与整合反映了人类的认知和行为变化。然而,现有的从个体和群体层面分析方法中估计网络社区结构的大多数方法并未考虑受试者之间的变异性。在本文中,我们开发了一种基于贝叶斯潜在块模型(LBM)的新型多层社区检测方法。该方法能够在个体和群体层面稳健地检测具有未知社区数量的加权功能网络的社区结构,并保留个体网络的变异性。为了验证,我们提出了一种新的基于社区结构的多元高斯生成模型来模拟合成信号。我们的模拟研究表明,通过层次贝叶斯推断估计的社区成员身份与生成模型中预定义的节点标签一致。该方法还通过使用人类连接组项目中100名无关健康受试者的工作记忆任务fMRI数据的分半可重复性进行了测试。使用合成数据和真实数据的分析表明,与常用的(多层)模块性模型相比,我们提出的方法更准确、更可靠。

英文摘要

Most functional magnetic resonance imaging studies rely on estimates of hierarchically organized functional brain networks whose segregation and integration reflect the cognitive and behavioral changes in humans. However, most existing methods for estimating the community structure of networks from both individual and group-level analysis methods do not account for the variability between subjects. In this paper, we develop a new multilayer community detection method based on Bayesian latent block model (LBM). The method can robustly detect the community structure of weighted functional networks with an unknown number of communities at both individual and group levels and retain the variability of the individual networks. For validation, we propose a new community structure-based multivariate Gaussian generative model to simulate synthetic signal. Our simulation study shows that the community memberships estimated by hierarchical Bayesian inference are consistent with the predefined node labels in the generative model. The method is also tested via split-half reproducibility using working memory task fMRI data of 100 unrelated healthy subjects from the Human Connectome Project. Analyses using both synthetic and real data show that our proposed method is more accurate and reliable compared with the commonly used (multilayer) modularity models.

2603.24293 2026-06-17 q-bio.NC 版本更新

Emergence of unique hues from sparse coding of color in natural scenes

自然场景颜色稀疏编码中独特色调的出现

Alexander Belsten, E. Paxon Frady, Bruno A. Olshausen

AI总结 通过分析自然图像的锥体响应统计特性,发现颜色分布呈非高斯且具有重尾,稀疏编码模型收敛到四种独特色调(红、绿、蓝、黄)及黑白,其非线性推理产生兴奋和抑制交互,解释了色调感知现象。

Comments 27 pages, 7 figures, 1 table, 3 supplemental pages, 3 supplemental figures, 1 supplemental table

详情
AI中文摘要

我们对颜色的主观体验通常由抽象属性描述,如色调、饱和度和亮度,这些并不直接对应于视网膜锥体产生的感觉信号。在色调维度上,某些颜色——红、绿、蓝和黄——看起来是独特的,因为它们不被感知为其他颜色的组合,并且红-绿和蓝-黄对看起来是对立的。然而,这些“独特色调”在大脑中的解剖学和生理学相关性以及它们存在的原因仍然是一个谜。在这里,我们展示了这些色调与自然视觉环境统计之间的直接联系。对503张校准自然图像数据集上的模拟锥体响应分析揭示了3D颜色空间中的强非高斯分布,具有不同、不对称排列方向的重尾。然后,一个稀疏编码模型被适应于该数据,以最小化表示数据的基础向量上的系数总和。一个六基向量模型收敛到四个独特色调以及黑色和白色。此外,我们发现稀疏编码模型中推理的非线性性质产生了潜在变量之间的兴奋性和抑制性相互作用;前者有助于组合相邻的独特色调对以编码介于它们之间的中间色调,而后者则强制对立独特色调之间的互斥性。这些发现共同为自然环境中颜色的分布提供了新的视角,并为这种结构与颜色外观现象学之间提供了联系原则。

英文摘要

Our subjective experience of color is typically described by abstract properties such as hue, saturation, and brightness that do not directly correspond to sensory signals arising from cones in the retina. Along the hue dimension, certain colors -- red, green, blue, and yellow -- appear unique in that they are not perceived as a combination of other colors, and the pairs red-green and blue-yellow appear opposites. However, the anatomical and physiological correlates of these 'unique hues' within the brain and the reason for their existence remain a mystery. Here, we demonstrate a direct connection between these hues and the statistics of the natural visual environment. Analysis of simulated cone responses on a dataset of 503 calibrated natural images reveals a strongly non-Gaussian distribution in 3D color space, with heavy tails in distinct, asymmetrically arranged directions. A sparse coding model is then adapted to this data so as to minimize the total sum of coefficients on the basis vectors for representing the data. A six basis-vector model converges to the four unique hues in addition to black and white. Moreover, we find that the nonlinear nature of inference in the sparse coding model yields both excitatory and inhibitory interactions among latent variables; the former facilitates combining adjacent pairs of unique hues to encode intermediate hues situated between them, while the latter enforces mutual exclusivity between opposite unique hues. Together, these findings shed new light on the distribution of color in the natural environment and provide a linking principle between this structure and the phenomenology of color appearance.

2507.04704 2026-06-17 q-bio.QM cs.AI cs.CV 版本更新

SPATIA: Multimodal Generation and Prediction of Spatial Cell Phenotypes

SPATIA: 空间细胞表型的多模态生成与预测

Zhenglun Kong, Mufan Qiu, John Boesen, Xiang Lin, Sukwon Yun, Tianlong Chen, Manolis Kellis, Marinka Zitnik

AI总结 提出SPATIA模型,融合细胞形态、基因表达和空间上下文,通过置信感知流匹配和形态-谱对齐实现多尺度生成与预测,在12项任务中优于18个基线模型。

Comments ICML 2026

详情
AI中文摘要

理解细胞形态、基因表达和空间上下文如何共同塑造组织功能是生物学中的一个核心挑战。基于图像的空间转录组学技术现在能够提供细胞图像和基因表达谱的高分辨率测量,但现有方法通常孤立地分析这些模态或以有限的分辨率进行分析。我们通过引入SPATIA来解决这个问题,这是一个多层次的生成和预测模型,通过融合从细胞到组织水平的形态、基因表达和空间上下文,学习统一的、空间感知的表征。SPATIA还结合了一个空间条件生成框架,该框架具有置信感知的OT重加权和形态-谱对齐,用于建模目标状态形态分布。具体来说,我们提出了一个置信感知的流匹配目标,该目标基于不确定性对弱最优传输对进行重加权。我们进一步应用形态-谱对齐来鼓励有生物学意义的图像生成,从而能够建模微环境依赖的表型转变。我们组装了一个多尺度数据集,包含17个组织中的2590万个细胞-基因对。我们在12项任务上对SPATIA与18个模型进行了基准测试,涵盖表型生成、注释、聚类、基因插补和跨模态预测等类别。SPATIA相比最先进模型取得了改进,生成保真度提高了8%,预测准确率提高了3%。

英文摘要

Understanding how cellular morphology, gene expression, and spatial context jointly shape tissue function is a central challenge in biology. Image-based spatial transcriptomics technologies now provide high-resolution measurements of cell images and gene expression profiles, but existing methods typically analyze these modalities in isolation or at limited resolution. We address the problem by introducing SPATIA, a multi-level generative and predictive model that learns unified, spatially aware representations by fusing morphology, gene expression, and spatial context from the cell to the tissue level. SPATIA also incorporates a spatially conditioned generative framework with confidence-aware OT reweighting and morphology-profile alignment for modeling target-state morphology distributions. Specifically, we propose a confidence-aware flow matching objective that reweights weak optimal-transport pairs based on uncertainty. We further apply morphology-profile alignment to encourage biologically meaningful image generation, enabling the modeling of microenvironment-dependent phenotypic transitions. We assembled a multi-scale dataset consisting of 25.9 million cell-gene pairs across 17 tissues. We benchmark SPATIA against 18 models across 12 tasks, spanning categories such as phenotype generation, annotation, clustering, gene imputation, and cross-modal prediction. SPATIA achieves improved performance over state-of-the-art models, improving generative fidelity by 8% and predictive accuracy by up to 3%.

2602.04901 2026-06-17 q-bio.GN cs.LG 版本更新

Beyond Independent Genes: Learning Module-Inductive Representations for Single-Cell Gene Perturbation Prediction

超越独立基因:学习模块归纳表示用于单细胞基因扰动预测

Jiafa Ruan, Ruijie Quan, Liyang Xu, Zongxin Yang, Yi Yang

AI总结 提出scBIG框架,通过基因关系聚类、基因簇感知编码器和结构感知对齐学习协调的基因程序模块表示,结合条件流匹配实现灵活泛化的扰动预测,在多个单细胞扰动基准上平均提升6.7%。

详情
AI中文摘要

预测遗传扰动引起的转录响应是功能基因组学中的一个核心问题。实际上,扰动响应很少是基因独立的,而是表现为功能相关基因之间协调的、程序级别的转录变化。然而,大多数现有方法由于基于基因的建模范式以及依赖无法捕捉动态程序重组的静态生物学先验知识,未能显式建模这种协调性。为解决这些局限,我们提出scBIG,一种模块归纳的扰动预测框架,显式建模协调的基因程序。scBIG通过基因关系聚类从数据中归纳出连贯的基因程序,通过基因簇感知编码器捕获程序间交互,并使用结构感知对齐目标保持模块协调性。然后利用条件流匹配对这些结构化表示进行建模,以实现灵活且可泛化的扰动预测。在多个单细胞扰动基准上的大量实验表明,scBIG始终优于最先进的方法,特别是在未见和组合扰动设置中,相比最强基线平均提升6.7%。代码可在该https URL获取。

英文摘要

Predicting transcriptional responses to genetic perturbations is a central problem in functional genomics. In practice, perturbation responses are rarely gene-independent but instead manifest as coordinated, program-level transcriptional changes among functionally related genes. However, most existing methods do not explicitly model such coordination, due to gene-wise modeling paradigms and reliance on static biological priors that cannot capture dynamic program reorganization. To address these limitations, we propose scBIG, a module-inductive perturbation prediction framework that explicitly models coordinated gene programs. scBIG induces coherent gene programs from data via Gene-Relation Clustering, captures inter-program interactions through a Gene-Cluster-Aware Encoder, and preserves modular coordination using structure-aware alignment objectives. These structured representations are then modeled using conditional flow matching to enable flexible and generalizable perturbation prediction. Extensive experiments on multiple single-cell perturbation benchmarks show that scBIG consistently outperforms state-of-the-art methods, particularly on unseen and combinatorial perturbation settings, achieving an average improvement of 6.7% over the strongest baselines. The code is available at this https URL.

2503.04507 2026-06-17 q-bio.QM cs.LG 版本更新

The Morse Transform for Discrete Shape Analysis

离散形状分析的Morse变换

Alexander M. Tanaka, Aras T. Asaad, Richard Cooper, Vidit Nanda

AI总结 提出一种基于定向分段线性Morse理论的拓扑变换,通过记录多个高度函数下的临界点来量化嵌入对象的几何形状,生成的特征向量在配体虚拟筛选中取得最优平均AUROC。

Comments 37 pages, 3 main figures, 2 main tables, 12 appendix figures and 4 appendix tables

详情
AI中文摘要

物体的几何形状在调节其与物理世界的相互作用中起着至关重要的作用。然而,为了统计推断或分类任务的目的,用数值描述几何信息仍然困难。在这里,我们引入了一种新的拓扑变换,它利用定向分段线性Morse理论,通过编录多个高度函数下的临界点来量化嵌入对象的几何形状。该Morse变换的输出记录了表征底层形状的临界点的高度和局部拓扑类型(峰、谷或鞍点),保留了比欧拉特征变换更精细的信息,同时自然优先考虑形状的最外层区域。关键的是,该输出可以进一步压缩为丰富而紧凑的特征向量。我们将Morse特征向量作为配体虚拟筛选(LBVS)的描述符进行基准测试,这本质上依赖于分子的形状。在常见的梯度提升树分类流程下,与其他拓扑变换描述符和标准基于形状的LBVS描述符相比,Morse描述符实现了最高的平均AUROC。

英文摘要

The geometry of an object plays a vital role in modulating its interactions with the physical world. It nevertheless remains difficult to describe geometric information numerically for the purposes of statistical inference or classification tasks. Here, we introduce a new topological transform which leverages directional piecewise-linear Morse theory to quantify the geometry of an embedded object by cataloguing critical points across multiple height-functions. The output of this Morse transform records both the heights and the local topological type (peak, trough or saddle) of the critical points that characterise the underlying shape, retaining finer information than the Euler characteristic transform whilst naturally prioritising a shape's outermost regions. Crucially, this output can be further compressed into a rich but compact feature vector. We benchmark the Morse feature vector as a descriptor for ligand-based virtual screening (LBVS), which intrinsically depends on the shape of molecules. Under a common gradient-boosted tree classification pipeline, Morse descriptors achieve the highest mean AUROC when compared to other topological transform descriptors and to standard shape-based LBVS descriptors.

2408.06683 2026-06-17 physics.bio-ph q-bio.CB 版本更新

Scale-dependent physical constraints on active intracellular fluctuations

尺度依赖的主动细胞内涨落的物理约束

Yuika Ueda, Outa Nakashima, Takumi Saito, Shinji Deguchi

AI总结 通过荧光相关光谱与非平衡建模,发现纳米尺度主动涨落由局部肌球蛋白II驱动,而微米尺度受肌动蛋白网络约束,揭示了细胞内力学的层级组织。

详情
AI中文摘要

活细胞表现出非平衡动力学,这些动力学塑造了从纳米尺度分子组装到宏观细胞器组织的跨长度尺度的细胞内过程。虽然已知微米尺度的动力学在低频下受到肌动蛋白网格的约束,但控制纳米尺度主动涨落的物理原理仍然难以捉摸。在这里,我们提出了一个分析框架,将荧光相关光谱与非平衡建模相结合,以描绘细胞内力学的物理尺度。将该框架应用于成纤维细胞,我们证明,与较大的组分相比,纳米尺度的主动涨落在高频下仍然显著,并且主要由局部非肌肉肌球蛋白II活性驱动,从而在细胞内力学中建立了一个独特的功能层级:局部主动力促进纳米尺度分子的快速空间探索,而宏观肌动蛋白约束确保了较大分子复合物和细胞器所需的结构稳定性。为了将这些尺度依赖的行为整合到一个单一的物理框架中,我们建立了一个模型,捕捉了主动涨落跨长度尺度的转变,揭示了细胞质的物理性质由主动驱动力和被动结构约束之间的平衡控制。此外,将该模型应用于细胞衰老,揭示了与细胞骨架刚性化相关的非平衡复杂性的降低。因此,我们的发现弥合了局部分子动力学与宏观约束之间的维度差距,为理解细胞内动力学的层级组织提供了基本的物理基础。

英文摘要

Living cells exhibit nonequilibrium dynamics that shape intracellular processes across length scales, from nanoscale molecular assembly to the organization of macroscopic organelles. While dynamics at micrometer scales are known to be constrained by the actin meshwork at low frequencies, the physical principles governing active fluctuations at the nanoscale remain elusive. Here, we present an analytical framework integrating fluorescence correlation spectroscopy with nonequilibrium modeling to delineate the physical scaling of intracellular mechanics. Applying this framework to fibroblasts, we demonstrate that, in contrast to larger components, nanoscale active fluctuations remain prominent at high frequencies and are predominantly driven by local nonmuscle myosin II activity, establishing a distinct functional hierarchy in intracellular mechanics: local active forces promote rapid spatial exploration for nanoscale molecules, whereas macroscopic actin constraints ensure the structural stability required for larger molecular complexes and organelles. To integrate these scale-dependent behaviors within a single physical framework, we formulated a model that captures the transition of active fluctuations across length scales, revealing that the physical properties of the cytoplasm are governed by the balance between active driving forces and passive structural constraints. Furthermore, applying this model to cellular senescence reveals a reduction in nonequilibrium complexity associated with cytoskeletal rigidification. Thus, our findings bridge the dimensional gap between local molecular kinetics and macroscopic constraints, providing a fundamental physical basis for understanding the hierarchical organization of intracellular dynamics.