arXivDaily arXiv每日学术速递 周一至周五更新
重置
2606.13247 2026-06-13 cs.AI 新提交

EPIG: Emotion-Based Prompting for Personalised Image Generation

EPIG:基于情感提示的个性化图像生成

Emna Othmen, Mohamed Yassine Landolsi, Lotfi Ben Romdhane

发表机构 * MARS Research Lab LR17ES05, ISITCom, University of Sousse(苏塞大学ISITCom学院MARS研究实验室LR17ES05)

AI总结 提出EPIG方法,利用心理学效价-唤醒模型在提示层面增强情感表达,无需训练即可控制生成图像的唤醒度,在10个多样化提示上平均唤醒误差降低14%-17%。

Comments Submitted to arXiv. 20 pages, 4 figures. Work on emotion-based prompt engineering for text-to-image diffusion models with applications in personalized image generation

详情
AI中文摘要

文本到图像扩散模型在从自然语言提示合成高质量图像方面取得了令人印象深刻的结果。然而,常用的提示策略仍然相对通用,限制了模型准确表达情感意图和细微情感属性的能力。本文提出EPIG,一种在图像生成之前在提示层面增强情感表达性的方法。基于心理学知情的情感表示(效价-唤醒)并利用结构化的、角色感知的提示丰富化,EPIG在不修改或重新训练图像生成主干的情况下丰富提示的情感相关组件。由此产生的情感感知提示引导生成过程朝向更情感连贯的视觉输出,在控制唤醒方面特别有效。EPIG轻量级、无需训练,非常适合资源受限和个性化图像生成场景。在10个多样化提示的基准测试上的实验结果表明,与强基线(包括朴素插入和基于LLM的提示扩展)相比,EPIG将平均唤醒误差分别降低了14%和12%。这些改进具有统计显著性。EPIG还保持了效价对齐和语义一致性,如CLIPScore所测量并由消融研究所支持。在包含人类、儿童或动物等显式主体的提示上效果更为显著,误差降低达到17%,突出了所提出方法的主题敏感行为。

英文摘要

Text-to-image diffusion models have achieved impressive results in synthesizing high-quality images from natural language prompts. However, commonly used prompting strategies remain relatively generic, limiting the model's ability to accurately express emotional intent and nuanced affective attributes. This work proposes EPIG, a method that enhances emotional expressiveness at the prompt level prior to image generation. Grounded in psychologically informed emotion representations (valence-arousal) and leveraging structured, role-aware prompt enrichment, EPIG enriches emotion-related components of prompts without modifying or retraining the image generation backbone. The resulting emotion-aware prompts guide the generative process toward more emotionally coherent visual outputs, with particular effectiveness in controlling arousal. EPIG is lightweight, training-free, and well suited for resource-constrained and personalized image generation scenarios. Experimental results on a benchmark of 10 diverse prompts show that EPIG reduces mean arousal error compared to strong baselines, including naive insertion and LLM-based prompt expansion, with reductions of 14% and 12%, respectively. These improvements are statistically significant. EPIG also preserves valence alignment and semantic consistency, as measured by CLIPScore and supported by ablation studies. The effect is more pronounced on prompts containing explicit subjects such as humans, children, or animals, where the reduction reaches 17%, highlighting the subject-sensitive behavior of the proposed method.

2606.12651 2026-06-13 cs.LG q-bio.QM 新提交

Physics-Aware Auxiliary Losses Improve Out-of-Distribution Generalization of a GNN Synthesizability Filter

物理感知辅助损失提升图神经网络可合成性滤波器的分布外泛化能力

Riya Bisht, Dhruv Agarwal

发表机构 * University of California, Berkeley(加州大学伯克利分校)

AI总结 通过在GNN上添加基于Bertz指数的拓扑复杂度回归和MMFF94力场应变能软惩罚作为辅助损失,在分布外数据上小幅但显著提升了可合成性滤波器的AUC(最高+0.0066)。

详情
AI中文摘要

机器学习药物发现流程越来越依赖生成模型,这些模型提出的分子远离用于训练下游可合成性滤波器的数据。现有滤波器(SAScore、SCScore、RAscore、DeepSA)纯粹基于统计,在分布外(OOD)场景下性能下降。我们探究廉价的闭式物理先验,作为图神经网络(GNN)的辅助监督,是否能改善OOD泛化。我们在GINE骨干网络上添加两个辅助损失:基于Bertz指数的拓扑复杂度回归,以及基于MMFF94力场能量的应变能软惩罚。在由SAScore阈值标注的65,177个分子语料库(HIV、Tox21、COCONUT)上,我们复现了强分布内基线,然后在单源OOD划分(在类药HIV+Tox21上训练,在COCONUT天然产物上测试)上评估4路消融实验(基线/+复杂度/+应变/+两者),重复5个种子并采用配对bootstrap置信区间。所有三个物理感知变体相比基线(平均OOD AUC 0.9774)均带来微小但统计显著的OOD提升:+复杂度Delta = +0.0060(95% CI [+0.0023, +0.0102]),+应变Delta = +0.0032([+0.0008, +0.0052]),+两者Delta = +0.0066([+0.0038, +0.0093]);每个区间均不包含零,且组合效果最佳。各变体在分布内表现无差异,因此效果仅在OOD评估下可见。我们明确指出效果是适度的,并报告一个警示性方法学发现:该实验的单种子版本产生了定性不同(非单调)的故事,未能在多种子评估中复现。

英文摘要

Machine-learning drug-discovery pipelines increasingly rely on generative models that propose molecules far from the data used to train downstream synthesizability filters. Existing filters (SAScore, SCScore, RAscore, DeepSA) are purely statistical and degrade in exactly this out-of-distribution (OOD) regime. We ask whether cheap, closed-form physical priors, used as auxiliary supervision on a graph neural network (GNN), improve OOD generalization. We add two auxiliary losses to a GINE backbone: a topological complexity regression supervised by the Bertz index, and a strain-energy soft penalty supervised by MMFF94 force-field energy. On a 65,177-molecule corpus (HIV, Tox21, COCONUT) labeled by SAScore thresholds we reproduce a strong in-distribution baseline, then evaluate a 4-way ablation (baseline / +complexity / +strain / +both) on a single-source OOD split (train on drug-like HIV+Tox21, test on COCONUT natural products), repeated over 5 seeds with paired bootstrap confidence intervals. All three physics-aware variants give a small but statistically significant OOD improvement over the baseline (mean OOD AUC 0.9774): +complexity Delta = +0.0060 (95% CI [+0.0023, +0.0102]), +strain Delta = +0.0032 ([+0.0008, +0.0052]), +both Delta = +0.0066 ([+0.0038, +0.0093]); every interval excludes zero, and the combination is best. The variants are indistinguishable in-distribution, so the effect is visible only under OOD evaluation. We are explicit that the effects are modest, and we report a cautionary methodological finding: a single-seed version of this experiment produced a qualitatively different (non-monotone) story that did not survive multi-seed evaluation.

2412.13012 2026-06-13 cs.LG cond-mat.mtrl-sci cond-mat.str-el

Deep Learning Based Superconductivity: Prediction and Experimental Tests

Daniel Kaplan, Adam Zhang, Joanna Blawat, Rongying Jin, Robert J. Cava, Viktor Oudovenko, Gabriel Kotliar, Anirvan M. Sengupta, Weiwei Xie

发表机构 * Department of Physics and Astronomy(物理与天文学系) Rutgers University(罗格斯大学) Department of Chemistry(化学系) Michigan State University(密歇根州立大学) University of South Carolina(南卡罗来纳大学) Princeton University(普林斯顿大学) Center for Computational Quantum Physics(计算量子物理中心) Flatiron Institute(Flatiron研究所) Center for Computational Mathematics(计算数学中心)

Comments 14 pages + 2 appendices + references. EPJ submission

详情
Journal ref
Eur. Phys. J. Plus (2025) 140:58
英文摘要

The discovery of novel superconducting materials is a longstanding challenge in materials science, with a wealth of potential for applications in energy, transportation, and computing. Recent advances in artificial intelligence (AI) have enabled expediting the search for new materials by efficiently utilizing vast materials databases. In this study, we developed an approach based on deep learning (DL) to predict new superconducting materials. We have synthesized a compound derived from our DL network and confirmed its superconducting properties in agreement with our prediction. Our approach is also compared to previous work based on random forests (RFs). In particular, RFs require knowledge of the chemical properties of the compound, while our neural net inputs depend solely on the chemical composition. With the help of hints from our network, we discover a new ternary compound $\textrm{Mo}_{20} \textrm{Re}_{6} \textrm{Si}_{4}$, which becomes superconducting below 5.4 K. We further discuss the existing limitations and challenges associated with using AI to predict and, along with potential future research directions.