EdgeCrafter: Compact ViTs for Edge Dense Prediction via Task-Specialized Distillation
Comments Code is available at: https://intellindust-ai-lab.github.io/projects/EdgeCrafter/
LoGSAM: Parameter-Efficient Cross-Modal Grounding for MRI Segmentation
Comments 10 pages, 3 figures
MM-OVSeg:Multimodal Optical-SAR Fusion for Open-Vocabulary Segmentation in Remote Sensing
Comments CVPR2026
AIDABench: AI Data Analytics Benchmark
Comments 22 pages (including appendix), 9 figures, 4 tables. Code: https://github.com/MichaelYang-lyx/AIDABench. Dataset: https://huggingface.co/datasets/MichaelYang-lyx/AIDA
UE5-Forest: A Photorealistic Synthetic Stereo Dataset for UAV Forestry Depth Estimation
Massive Redundancy in Gradient Transport Enables Sparse Online Learning
Comments 26 pages, 5 figures, 14 tables
AgentTrace: Causal Graph Tracing for Root Cause Analysis in Deployed Multi-Agent Systems
Comments 11 pages, 1 figure, 19 tables. Published at ICLR 2026 Workshop on Agents in the Wild. Camera-ready version with revised layout and framework overview figure
The Pulse of Motion: Measuring Physical Frame Rate from Visual Dynamics
Local Precise Refinement: A Dual-Gated Mixture-of-Experts for Enhancing Foundation Model Generalization against Spectral Shifts
HIFICL: High-Fidelity In-Context Learning for Multimodal Tasks
Comments Accepted to CVPR 2026. Code available at https://github.com/bbbandari/HiFICL
CLASP: Defending Hybrid Large Language Models Against Hidden State Poisoning Attacks
Comments 22 pages, 6 figures
See, Symbolize, Act: Grounding VLMs with Spatial Representations for Better Gameplay
Comments 11 pages, 13 figures. Accepted to LMReasoning Workshop at AAAI 2026
Making Training-Free Diffusion Segmentors Scale with the Generative Power
Comments Accepted to CVPR 2026
Leveraging Arbitrary Data Sources for AI-Generated Image Detection Without Sacrificing Generalization
Comments Accepted to CVPR Findings 2026
OpenFS: Multi-Hand-Capable Fingerspelling Recognition with Implicit Signing-Hand Detection and Frame-Wise Letter-Conditioned Synthesis
Comments Accepted to CVPR 2026, camera-ready version
Olbedo: An Albedo and Shading Aerial Dataset for Large-Scale Outdoor Environments
Comments CVPR 2026
Skullptor: High Fidelity 3D Head Reconstruction in Seconds with Multi-View Normal Prediction
Comments For our project page, see https://ubisoft-laforge.github.io/character/skullptor/
cc-Shapley: Measuring Multivariate Feature Importance Needs Causal Context
PedaCo-Gen: Scaffolding Pedagogical Agency in Human-AI Collaborative Video Authoring
ORION: ORthonormal Text Encoding for Universal VLM AdaptatION
Journal ref Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026
DUET-VLM: Dual stage Unified Efficient Token reduction for VLM Training and Inference
Comments 15 Pages, 8 figures, 15 tables, CVPR 2026; Code: AGI/DUET-VLM" target="_blank" rel="noopener">https://github.com/AMD-AGI/DUET-VLM
IRIS-SLAM: Unified Geo-Instance Representations for Robust Semantic Localization and Mapping
Advancing AI Trustworthiness Through Patient Simulation: Risk Assessment of Conversational Agents for Antidepressant Selection
PISCO: Precise Video Instance Insertion with Sparse Control
TernaryLM: Memory-Efficient Language Modeling via Native 1.5-Bit Quantization with Adaptive Layer-wise Scaling
PokeFusion Attention: A Lightweight Cross-Attention Mechanism for Style-Conditioned Image Generation
Comments 12 pages, 5 figures. Revised version with improved method description and corrected references
Revisiting Diffusion Model Predictions Through Dimensionality
Comments 19 pages, 5 figures
NRR-Phi: Text-to-State Mapping for Ambiguity Preservation in LLM Inference
Comments 25 pages, 5 figures, 7 tables. Replacement synced to repository snapshot v39. Series hub link: https://github.com/kei-saito-research/nrr-series-hub
ReflexSplit: Single Image Reflection Separation via Layer Fusion-Separation
Comments CVPR 2026 Camera Ready; Project page: https://wuw2135.github.io/ReflexSplit-ProjectPage/