arXivDaily arXiv每日学术速递 周一至周五更新

科学与医疗

AI for Science

科学智能、蛋白质、分子、药物、材料、气象、物理和数学 AI。

今日/当前日期收录 7 信号源:cs.LG, q-bio, physics, cond-mat, math, stat.ML
2604.03275 2026-06-18 physics.ao-ph cs.AI cs.LG 版本更新 95%

IPSL-AID: Generative Diffusion Models for Climate Downscaling from Global to Regional Scales

IPSL-AID:用于从全球到区域尺度气候降尺度的生成扩散模型

Kishanthan Kingston, Olivier Boucher, Freddy Bouchet, Pierre Chapel, Rosemary Eade, Jean-Francois Lamarque, Redouane Lguensat, Kazem Ardaneh

发表机构 * Climate Modeling Center(气候建模中心) Sorbonne University(索邦大学) CNRS(法国国家科学研究中心) IPSL Paris(巴黎) France(法国)

专题命中 气象气候 :扩散模型用于气候降尺度,生成高分辨率气象场

AI总结 提出基于去噪扩散概率模型的IPSL-AID工具,利用ERA5再分析数据从粗分辨率输入生成0.25°温度、风和降水场,并建模细尺度特征概率分布以量化不确定性,准确重建统计分布、极端事件和空间结构。

Comments 17 pages, 12 figures, submitted to Climate Informatique 2026, to appear in Environmental Data Science

详情
AI中文摘要

有效的气候变化适应和减缓策略需要高分辨率预测来指导战略决策。传统的全球气候模型通常以150至200公里的分辨率运行,缺乏表示关键区域过程的能力。IPSL-AID是一种基于去噪扩散概率模型的全球到区域降尺度工具,旨在解决这一限制。该工具在ERA5再分析数据上训练,利用粗分辨率输入及其时空上下文生成0.25°分辨率的温度、风和降水场。它还建模细尺度特征的概率分布,以产生用于不确定性量化的合理情景。该模型准确重建了统计分布,包括极端事件、功率谱和空间结构。这项工作突出了生成扩散模型在高效气候降尺度及不确定性量化方面的潜力。

英文摘要

Effective adaptation and mitigation strategies for climate change require high-resolution projections to inform strategic decision-making. Conventional global climate models, which typically operate at resolutions of 150 to 200 kilometers, lack the capacity to represent essential regional processes. IPSL-AID is a global to regional downscaling tool based on a denoising diffusion probabilistic model designed to address this limitation. Trained on ERA5 reanalysis data, it generates 0.25 degree resolution fields for temperature, wind, and precipitation using coarse inputs and their spatiotemporal context. It also models probability distributions of fine-scale features to produce plausible scenarios for uncertainty quantification. The model accurately reconstructs statistical distributions, including extreme events, power spectra, and spatial structures. This work highlights the potential of generative diffusion models for efficient climate downscaling with uncertainty

2509.22020 2026-06-18 cs.LG 版本更新 95%

Task-Adaptive Parameter-Efficient Fine-Tuning for Weather Foundation Models

面向天气基础模型的任务自适应参数高效微调

Shilei Cao, Hehai Lin, Jiashun Cheng, Yang Liu, Guowen Li, Xuehe Wang, Juepeng Zheng, Haoyuan Liang, Meng Jin, Chengwei Qin, Hong Cheng, Haohuan Fu

发表机构 * Sun Yat-sen University(中山大学) The Hong Kong University of Science and Technology (Guangzhou)(香港科技大学(广州)) The Hong Kong University of Science and Technology(香港科技大学) The Chinese University of Hong Kong(香港中文大学) National Supercomputing Center in Shenzhen(深圳国家超算中心) Huawei Technologies Co., Ltd(华为技术有限公司) Tsinghua University(清华大学)

专题命中 气象气候 :针对天气基础模型的任务自适应微调

AI总结 提出WeatherPEFT框架,通过任务自适应动态提示和随机Fisher引导自适应选择,在天气下游任务上以更少参数达到全微调性能。

详情
AI中文摘要

尽管机器学习的最新进展使天气基础模型(WFM)在多种下游任务中具备了强大的泛化能力,但随着模型规模扩大,计算需求不断攀升,实际部署愈发困难。当前为视觉或语言任务设计的参数高效微调(PEFT)方法无法应对天气下游任务的独特挑战,如变量异质性、分辨率多样性和时空覆盖变化,导致在WFM上性能欠佳。为弥补这一差距,我们提出WeatherPEFT,一种新颖的PEFT框架,包含两项协同创新。首先,在前向传播中,任务自适应动态提示(TADP)通过内部和外部模式提取,将编码器中的嵌入权重动态注入预训练骨干网络的输入令牌,实现针对特定下游任务的上下文感知特征重校准。其次,在反向传播中,随机Fisher引导自适应选择(SFAS)不仅利用Fisher信息识别并更新最关键的任务参数,从而保留不变的预训练知识,还引入随机性以稳定选择过程。我们在三个下游任务上验证了WeatherPEFT的有效性和效率,现有PEFT方法与全微调相比存在显著差距,而WeatherPEFT使用更少的可训练参数达到了与全微调相当的性能。本工作代码见此https链接。

英文摘要

While recent advances in machine learning have equipped Weather Foundation Models (WFMs) with substantial generalization capabilities across diverse downstream tasks, the escalating computational requirements associated with their expanding scale increasingly hinder practical deployment. Current Parameter-Efficient Fine-Tuning (PEFT) methods, designed for vision or language tasks, fail to address the unique challenges of weather downstream tasks, such as variable heterogeneity, resolution diversity, and spatiotemporal coverage variations, leading to suboptimal performance when applied to WFMs. To bridge this gap, we introduce WeatherPEFT, a novel PEFT framework for WFMs incorporating two synergistic innovations. First, during the forward pass, Task-Adaptive Dynamic Prompting (TADP) dynamically injects the embedding weights within the encoder to the input tokens of the pre-trained backbone via internal and external pattern extraction, enabling context-aware feature recalibration for specific downstream tasks. Furthermore, during backpropagation, Stochastic Fisher-Guided Adaptive Selection (SFAS) not only leverages Fisher information to identify and update the most task-critical parameters, thereby preserving invariant pre-trained knowledge, but also introduces randomness to stabilize the selection. We demonstrate the effectiveness and efficiency of WeatherPEFT on three downstream tasks, where existing PEFT methods show significant gaps versus Full-Tuning, and WeatherPEFT achieves performance parity with Full-Tuning using fewer trainable parameters. The code of this work is available at https://github.com/ShileiCao/WeatherPEFT.

2406.14399 2026-06-18 cs.LG cs.CV physics.ao-ph stat.ML 版本更新 90%

Benchmarking Physics-Informed Time-Series Models for Operational Global Station Weather Forecasting

面向全球站点业务天气预报的物理信息时间序列模型基准测试

Tao Han, Zhibin Wen, Zhenghao Chen, Dazhao Du, Song Guo, Lei Bai

发表机构 * Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong SAR China(香港科技大学计算机科学与工程系) Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen, China(南方科技大学计算机科学与工程系) School of Computer and Information Sciences, University of Newcastle, Newcastle, Australia(新castle大学计算机与信息科学学院) Hangzhou Innovation Institute of Beihang University, Hangzhou, China(北京航空航天大学杭州创新研究院) Shanghai Artificial Intelligence Laboratory, Shanghai, China(上海人工智能实验室)

专题命中 气象气候 :物理信息模型用于全球站点天气预报

AI总结 提出大规模观测数据集WEATHER-5K和物理信息模型PhysicsFormer,通过压力-风对齐和能量感知平滑损失增强物理一致性,在多个天气变量和极端事件预测上评估学术模型与业务系统的差距。

Comments Accepted by ICML2026

详情
AI中文摘要

时间序列预测(TSF)模型的发展常受限于缺乏全面的数据集,尤其是在全球站点天气预报(GSWF)中,现有数据集规模小、时间短且空间稀疏。为解决这一问题,我们引入了WEATHER-5K,一个大规模观测天气数据集,能更好地反映真实世界条件,支持改进模型训练和评估。尽管最近的TSF方法在基准测试上表现良好,但在捕捉复杂天气动态和极端事件方面落后于业务数值天气预报系统。我们提出了PhysicsFormer,一种物理信息预测模型,结合动态核心与Transformer残差来预测未来天气状态。通过压力-风对齐和能量感知平滑损失强制物理一致性,确保在捕捉复杂时间模式的同时保持合理的动力学。我们将PhysicsFormer及其他TSF模型与业务系统在多个天气变量、极端事件预测和模型复杂度上进行基准测试,全面评估学术TSF模型与业务预报之间的差距。数据集和基准测试实现可在以下网址获取:this https URL。

英文摘要

The development of Time-Series Forecasting (TSF) models is often constrained by the lack of comprehensive datasets, especially in Global Station Weather Forecasting (GSWF), where existing datasets are small, temporally short, and spatially sparse. To address this, we introduce WEATHER-5K, a large-scale observational weather dataset that better reflects real-world conditions, supporting improved model training and evaluation. While recent TSF methods perform well on benchmarks, they lag behind operational Numerical Weather Prediction systems in capturing complex weather dynamics and extreme events. We propose PhysicsFormer, a physics-informed forecasting model combining a dynamic core with a Transformer residual to predict future weather states. Physical consistency is enforced via pressure-wind alignment and energy-aware smoothness losses, ensuring plausible dynamics while capturing complex temporal patterns. We benchmark PhysicsFormer and other TSF models against operational systems across several weather variables, extreme event prediction, and model complexity, providing a comprehensive assessment of the gap between academic TSF models and operational forecasting. The dataset and benchmark implementation are available at: https://github.com/taohan10200/WEATHER-5K.

2601.17462 2026-06-18 physics.ao-ph physics.soc-ph 版本更新 85%

Atmospheric Methane Removal as a Third Climate Intervention: Termination Risks and Air Pollutant Effects

大气甲烷去除作为第三种气候干预:终止风险与空气污染物效应

Katsumasa Tanaka, Weiwei Xiong, Didier A. Hauglustaine, Daniel J. A. Johansson, Nico Bauer, Philippe Bousquet, Philippe Ciais, Renaud de Richter, Marianne T. Lund, Ragnhild B. Skeie, Eric Zusman

专题命中 气象气候 :研究大气甲烷去除,属于气象气候

AI总结 研究大气甲烷去除(AMR)作为第三种气候干预手段,分析其终止风险与空气污染物效应,发现AMR的避免变暖不可持久,但终止后温度反弹比太阳辐射管理(SRM)缓和,且对对流层臭氧的影响受背景污染物水平调节。

详情
AI中文摘要

大气甲烷去除(AMR)是第三种气候干预类别,与二氧化碳去除(CDR)和太阳辐射管理(SRM)并列。我们表明,与CDR不同,由于甲烷的大气寿命短,AMR避免的变暖不可持久,尽管其终止后的温度反弹比SRM更缓和。AMR对对流层臭氧的影响可进一步受背景污染物水平调节。

英文摘要

Atmospheric Methane Removal (AMR) is a third class of climate intervention, along with Carbon Dioxide Removal (CDR) and Solar Radiation Management (SRM). We show that, unlike CDR, the avoided warming by AMR is not durable due to methane's short atmospheric lifetime, although its temperature rebound upon termination is less abrupt than that of SRM. AMR's impact on tropospheric ozone can be further modulated by background pollutant levels.

2508.10178 2026-06-18 q-bio.QM cs.LG 版本更新 85%

Estimating carbon pools in the European Shelf sea environment: replacing reanalysis by model-informed machine learning?

估算欧洲陆架海环境中的碳库:用模型指导的机器学习替代再分析?

Jozef Skakala

发表机构 * Plymouth Marine Laboratory(普利茅斯海洋实验室) National Centre for Earth Observation(国家地球观测中心)

专题命中 气象气候 :机器学习估算海洋碳库

AI总结 提出用深度集成神经网络学习可观测变量与海洋碳库的关系,以低成本替代昂贵再分析,在西北欧陆架海实现高效碳库预测并提供不确定性。

Comments 37 pages, 9 figures (+ 3 in the appendix), v3 - published version

Journal ref JGR - Machine Learning and Computation 3 (2026)

详情
AI中文摘要

陆架海对经济和碳循环至关重要,但碳库观测往往稀疏或高度不确定。碳再分析(无论是同化叶绿素a等代理变量还是直接同化碳)可提供替代方案,但运行成本高昂。我们提出使用计算成本低的神经网络集成(即深度集成)来学习直接可观测(大气、河流和海洋)变量与海洋碳库之间的关系,该关系来自一个物理-生物地球化学耦合模型。深度集成在西北欧陆架海(NWES)物理-生物地球化学模型自由运行模拟上训练。训练后,使用来自NWES再分析的输入而非自由运行来运行深度集成,证明它能高效预测多个NWES碳库(如碎屑、浮游动物、异养细菌),且与再分析的一致性远优于自由运行,同时提供不确定性信息。我们进一步表明,当深度集成直接由同化到再分析中的观测驱动时,其表现同样良好,但碳库只能预测在观测位置和时间。我们关注结果的可解释性,并展示了深度集成在未来气候假设情景中的潜在应用。我们认为,模型指导的机器学习为昂贵的再分析提供了可行的替代方案,并可在观测缺失和/或高度不确定的地方补充观测。

英文摘要

Shelf seas are important for the economy and the carbon cycle, but shelf sea observations for carbon pools are often sparse, or highly uncertain. An alternative can be provided by carbon reanalyses (whether assimilating proxy variables, such as chlorophyll-$a$, or directly carbon), but these are often expensive to run. We propose to use a computationally cheap ensemble of neural networks (i.e. deep ensemble) to learn the relationship between the directly observable (atmospheric, riverine and ocean) variables and marine carbon pools from a coupled physics-biogeochemistry model. The deep ensemble was trained on a North-West European Shelf (NWES) physical-biogeochemistry model free run simulation. After training, the deep ensemble was run using inputs from the NWES reanalysis instead of the free run, demonstrating that it can efficiently predict several NWES carbon pools (e.g., detritus, zooplankton, heterotrophic bacteria) in much better agreement with the reanalysis than the free run, while also providing uncertainty information. We further show that the deep ensemble performs similarly well when it is driven directly by the observations assimilated into the reanalysis, with the limitation that carbon pools can then be predicted only at the observed locations and times. We focus on explainability of the results and demonstrate potential use of the deep ensembles for future climate what-if scenarios. We suggest that model-informed machine learning presents a viable alternative to expensive reanalyses and could complement observations, wherever they are missing and/or highly uncertain.

2605.13566 2026-06-18 cs.LG 版本更新 80%

Spatiotemporal downscaling and nowcasting of urban land surface temperatures with deep neural networks

基于深度神经网络的城市地表温度时空下垫面精细化与现在预报

Solomiia Kurchaba, Angela Meyer

发表机构 * Department of Geoscience and Remote Sensing(地质科学与遥感系) Delft University of Technology(代尔夫特理工大学) School of Engineering and Computer Science(工程与计算机科学学院) Bern University of Applied Sciences(伯恩应用科学大学)

专题命中 气象气候 :利用深度神经网络实现城市地表温度高时空分辨率估计与预报。

AI总结 本文提出利用深度神经网络结合静止和极轨卫星数据,实现高时空分辨率的城市地表温度场估计与现在预报,提升城市气候与生态研究的精度与时效性。

Comments Paper after publication in IEEE Access

Journal ref IEEE Access, vol. 14, pp. 85134-85151, 2026

详情
AI中文摘要

地表温度(LST)是多种应用的关键变量,如城市气候和生态研究。然而,现有卫星衍生的LST产品提供的是高空间或高时间分辨率,导致两者之间存在根本性权衡。为解决这一权衡,我们结合静止和极轨卫星的观测数据,提供高空间和高时间分辨率(1公里,15分钟间隔)的LST场。我们展示了其在日内LST预报中的应用。为了估计高时空分辨率的LST场,训练了一个U-Net模型,将SEVIRI/MSG(3公里,15分钟分辨率)的LST场映射到Terra/Aqua MODIS(1公里,每天4次过境)的LST场,二者在空间和时间上同步。所提出的模型已在欧洲大都市的LST上进行训练,人口超过100万,且在留出测试集上达到RMSE=1.92°C和接近零偏移MVE=0.01°C。作为第二步,我们提出基于ConvLSTM架构的LST现在预报模型,训练数据为下缩的LST场,预测时间跨度为15至75分钟。该现在预报模型优于持续性和气候滚动中位数基准,对于所考虑的预测时间,RMSE为0.57至1.15°C,偏移范围从-0.1到0.14°C。此外,与独立MODIS过境的额外验证确认了鲁棒性能。我们的高时空分辨率LST预报模型可直接应用于基于卫星的LST监测操作。

英文摘要

Land Surface Temperature (LST) is a key variable for various applications, such as urban climate and ecology studies. Yet, existing satellite-derived LST products provide either high spatial or high temporal resolution, resulting in a fundamental trade-off between the two. To address this trade-off, we combine observations from a geostationary and a polar orbiting satellite and provide LST fields at high spatial and high temporal resolution (1 km at 15-min intervals). We demonstrate their application for intraday forecasting of LSTs. To estimate LST fields at high spatiotemporal resolution, a U-Net model is trained to map LST fields from SEVIRI/MSG (3 km and 15 min resolution) to LST fields from Terra/Aqua MODIS (1 km, 4 overpasses per day) that are collocated in space and time. The presented model has been trained on LSTs across large European cities with a population exceeding 1 million inhabitants, and achieves an RMSE = $1.92$°C and near-zero bias MBE = $0.01$°C on the hold-out test set. As a second step, we present an LST nowcasting model based on ConvLSTM architecture, trained across downscaled LST fields with forecast lead times of 15 to 75 minutes. The nowcasting model outperforms a persistence and a Climatological Rolling Median benchmarks, with RMSEs of $0.57$ to $1.15$°C for the considered lead times and biases ranging from $-0.1$ to $0.14$°C. An additional validation conducted against independent MODIS overpasses confirms robust performance. Our LST forecast model at high spatiotemporal resolution is directly applicable to operational satellite-based LST monitoring.

2508.02400 2026-06-18 q-bio.QM 版本更新 80%

Assimilation of machine learning-predicted nitrate to improve the quality of phytoplankton forecasting in the shelf sea environment

同化机器学习预测的硝酸盐以提高陆架海环境中浮游植物预报的质量

Deep S Banerjee, Jozef Skakala, David Ford

专题命中 气象气候 :同化机器学习预测的硝酸盐改进浮游植物预报,涉及海洋环境。

AI总结 本研究通过同化神经网络预测的表层硝酸盐浓度,显著提升了西北欧陆架海域浮游植物短期(1-5天)动力模型预报的准确性,最高改进达30%。

Comments 23 pages, 7 figures, v2 - published version

Journal ref Q.J.R.Meteorol.Soc. 152 (2026),

详情
AI中文摘要

我们证明,同化神经网络(NN)预测的表层硝酸盐可显著改善西北欧陆架(NWES)海域浮游植物短期(1-5天)动力模型预报。我们表明,在当前英国气象局NWES业务系统中仅同化海洋水色叶绿素-$a$会导致春季水华后表层硝酸盐浓度过高,这是晚春和夏季NWES浮游植物预报中一些已知快速增长偏差的主要原因。同化硝酸盐观测数据可能有助于解决这一问题,但NWES硝酸盐数据通常不足以有效同化。因此,我们使用了一个最近开发并验证的神经网络(NN)模型,该模型从一系列可观测变量预测表层硝酸盐浓度,并将NN预测的硝酸盐同化到英国气象局NWES业务预报系统的研发版本中。由于硝酸盐同化,浮游植物5天预报技能提高了30%。我们表明,尽管通过使用NN模型预测的每周硝酸盐气候学数据可以实现大部分改进,但使用流依赖的硝酸盐数据具有明显优势。我们讨论了这一改进对一系列其他富营养化指标(如溶解无机磷和海底氧)的影响。我们认为,在近实时NWES业务预报系统中,将这种方法升级为完全混合机器学习-数据同化是可行的。

英文摘要

We demonstrate that assimilating Neural Network (NN)-predicted surface nitrate leads to a major improvement in phytoplankton short-range (1-5 day) dynamical model forecasts for the North-West European Shelf (NWES) seas. We show that assimilation of only ocean color chlorophyll-$a$ in the current Met Office NWES operational system can lead to excess surface nitrate concentrations in the post-Spring bloom period and these are a major reason behind some known, fast-growing biases in NWES phytoplankton forecasts during late Spring and Summer. Assimilating observations of nitrate would potentially help address this, but NWES nitrate data are typically not available in sufficient abundance to be effectively assimilated. We have therefore used a recently developed and validated neural network (NN) model predicting surface nitrate concentrations from a range of observable variables and assimilated the NN-predicted nitrate within a research and development version of the Met Office's NWES operational forecasting system. As a result of nitrate assimilation the phytoplankton 5-day forecast skill improves by up to 30%. We show that although much of this improvement can be achieved by using a weekly nitrate climatology predicted by the NN model, there is a clear advantage in using flow-dependent nitrate data. We discuss the impacts of this improvement on a range of additional eutrophication indicators, such as dissolved inorganic phosphorus and sea bottom oxygen. We argue that it should be feasible to upgrade this approach to a fully hybrid machine learning - data assimilation within the near-real time NWES operational forecasting system.