计算机视觉与模式识别

最近提交的作者和标题

查看今天的新的变化

总共 569 条目 : 1-50 51-100 101-150 151-200 201-250 251-300 301-350 351-400 ... 551-569

显示最多 50 每页条目：较少 | 更多 | 所有

[201] arXiv:2509.13083 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：使用KL散度聚焦低光图像增强中的频率信息

标题： Using KL-Divergence to Focus Frequency Information in Low-Light Image Enhancement

Yan Xingyang, Huang Xiaohong, Zhang Zhao, You Tian, Xu Ziheng

主题：计算机视觉与模式识别 (cs.CV)
[202] arXiv:2509.13070 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： TFANet：用于鲁棒指代图像分割的三阶段图像-文本特征对齐网络

标题： TFANet: Three-Stage Image-Text Feature Alignment Network for Robust Referring Image Segmentation

Qianqi Lu, Yuxiang Xie, Jing Zhang, Shiwei Zou, Yan Chen, Xidao Luan

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[203] arXiv:2509.13067 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： HERO：重新思考高分辨率大型视觉语言模型中视觉标记的早期丢弃

标题： HERO: Rethinking Visual Token Early Dropping in High-Resolution Large Vision-Language Models

Xu Li, Yuxuan Liang, Xiaolei Chen, Yi Zheng, Haotian Chen, Bin Li, Xiangyang Xue

主题：计算机视觉与模式识别 (cs.CV)
[204] arXiv:2509.13031 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：感知先于推理：视觉语言模型中视觉推理的两阶段强化学习

标题： Perception Before Reasoning: Two-Stage Reinforcement Learning for Visual Reasoning in Vision-Language Models

Yan Chen, Long Li, Teng Xi, Long Zeng, Jingdong Wang

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[205] arXiv:2509.13013 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： Dream3DAvatar：从单张图像进行文本控制的3D角色重建

标题： Dream3DAvatar: Text-Controlled 3D Avatar Reconstruction from a Single Image

Gaofeng Liu, Hengsen Li, Ruoyu Gao, Xuetong Li, Zhiyuan Ma, Tao Fang

主题：计算机视觉与模式识别 (cs.CV)
[206] arXiv:2509.12997 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：使用低功耗神经形态虚拟警戒线的无人机检测

标题： Drone Detection Using a Low-Power Neuromorphic Virtual Tripwire

Anton Eldeborg Lundin, Rasmus Winzell, Hanna Hamrell, David Gustafsson, Hannes Ovrén

期刊参考： ECCV 2024 工作坊。ECCV 2024。计算机科学讲义，第15646卷。Springer，查姆。

主题：计算机视觉与模式识别 (cs.CV)
[207] arXiv:2509.12995 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：带着枪去与刀搏斗：现代VFM基线在真实场景AI图像检测中超越专业检测器

标题： Brought a Gun to a Knife Fight: Modern VFM Baselines Outgun Specialized Detectors on In-the-Wild AI Image Detection

Yue Zhou, Xinan He, Kaiqing Lin, Bing Fan, Feng Ding, Jinhua Zeng, Bin Li

主题：计算机视觉与模式识别 (cs.CV)
[208] arXiv:2509.12990 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：双阶段重加权MoE用于长尾自我中心错误检测

标题： Dual-Stage Reweighted MoE for Long-Tailed Egocentric Mistake Detection

Boyu Han, Qianqian Xu, Shilong Bao, Zhiyong Yang, Sicong Li, Qingming Huang

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI) ; 机器学习 (cs.LG)
[209] arXiv:2509.12989 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：全景：具身人工智能时代全向视觉的兴起

标题： PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era

Xu Zheng, Chenfei Liao, Ziqiao Weng, Kaiyu Lei, Zihao Dongfang, Haocong He, Yuanhuiyi Lyu, Lutao Jiang, Lu Qi, Li Chen, Danda Pani Paudel, Kailun Yang, Linfeng Zhang, Luc Van Gool, Xuming Hu

评论：本文提出了在具身人工智能背景下新兴的全向视觉领域的初步概述

主题：计算机视觉与模式识别 (cs.CV)
[210] arXiv:2509.12980 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：提高隐式神经表示的准确性与效率：使SIREN成为赢家

标题： Improving Accuracy and Efficiency of Implicit Neural Representations: Making SIREN a WINNER

Hemanth Chandravamsi, Dhanush V. Shenoy, Steven H. Frankel

主题：计算机视觉与模式识别 (cs.CV) ; 机器学习 (cs.LG)
[211] arXiv:2509.12976 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： SHREC 2025：包括静电势的蛋白质表面形状检索

标题： SHREC 2025: Protein surface shape retrieval including electrostatic potential

Taher Yacoub, Camille Depenveiller, Atsushi Tatsuma, Tin Barisin, Eugen Rusakov, Udo Gobel, Yuxu Peng, Shiqiang Deng, Yuki Kagaya, Joon Hong Park, Daisuke Kihara, Marco Guerra, Giorgio Palmieri, Andrea Ranieri, Ulderico Fugacci, Silvia Biasotti, Ruiwen He, Halim Benhabiles, Adnane Cabani, Karim Hammoudi, Haotian Li, Hao Huang, Chunyan Li, Alireza Tehrani, Fanwang Meng, Farnaz Heidar-Zadeh, Tuan-Anh Yang, Matthieu Montes

评论：发表于《计算机与图形》, Elsevier。59页，12图

期刊参考：计算机与图形学第132卷，2025年11月，文章104394

主题：计算机视觉与模式识别 (cs.CV) ; 生物大分子 (q-bio.BM)
[212] arXiv:2509.12965 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： ICDAR 2025 古代手写文档的少样本文本行分割竞赛（FEST）

标题： ICDAR 2025 Competition on FEw-Shot Text line segmentation of ancient handwritten documents (FEST)

Silvia Zottin, Axel De Nardin, Giuseppe Branca, Claudio Piciarelli, Gian Luca Foresti

评论：被ICDAR 2025接收

期刊参考：文档分析与识别，ICDAR 2025。ICDAR 2025。计算机科学讲座笔记，第16027卷。Springer，查姆。

主题：计算机视觉与模式识别 (cs.CV)
[213] arXiv:2509.12963 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： MMMS：多模态多表面交互分割

标题： MMMS: Multi-Modal Multi-Surface Interactive Segmentation

Robin Schön, Julian Lorenz, Katja Ludwig, Daniel Kienzle, Rainer Lienhart

评论： 19页，11图，10页

主题：计算机视觉与模式识别 (cs.CV) ; 机器学习 (cs.LG)
[214] arXiv:2509.12959 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：时间步混拌用于从外观域到事件域的高效脉冲知识迁移

标题： Time-step Mixup for Efficient Spiking Knowledge Transfer from Appearance to Event Domain

Yuqi Xie, Shuhan Ye, Chong Wang, Jiazhen Xu, Le Shen, Yuanbin Qian, Jiangbo Qian

主题：计算机视觉与模式识别 (cs.CV)
[215] arXiv:2509.12938 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：超越平均值：使用高斯点云和嵌入集合的开放词汇3D场景理解

标题： Beyond Averages: Open-Vocabulary 3D Scene Understanding with Gaussian Splatting and Bag of Embeddings

Abdalla Arafa, Didier Stricker

主题：计算机视觉与模式识别 (cs.CV)
[216] arXiv:2509.12931 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： 4DRadar-GS：基于4D雷达的自监督动态驾驶场景重建

标题： 4DRadar-GS: Self-Supervised Dynamic Driving Scene Reconstruction with 4D Radar

Xiao Tang, Guirong Zhuo, Cong Wang, Boyuan Zheng, Minqing Huang, Lianqing Zheng, Long Chen, Shouyi Lu

主题：计算机视觉与模式识别 (cs.CV)
[217] arXiv:2509.12924 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： MATTER：用于配准误差回归的多尺度注意力

标题： MATTER: Multiscale Attention for Registration Error Regression

Shipeng Liu, Ziliang Xiong, Khac-Hoang Ngo, Per-Erik Forssén

主题：计算机视觉与模式识别 (cs.CV)
[218] arXiv:2509.12918 (交叉列表自 cs.CV) [中文pdf, pdf, 其他]: 标题：一种用于YOLOv8的新型压缩框架：通过结构化剪枝和通道知识蒸馏实现在边缘设备上的实时航空目标检测

标题： A Novel Compression Framework for YOLOv8: Achieving Real-Time Aerial Object Detection on Edge Devices via Structured Pruning and Channel-Wise Distillation

Melika Sabaghian, Mohammad Ali Keyvanrad, Seyyedeh Mahila Moghadami

评论： 28页，11图

主题：计算机视觉与模式识别 (cs.CV)
[219] arXiv:2509.12913 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： T-SiamTPN：用于鲁棒高效无人机跟踪的时序孪生变换金字塔网络

标题： T-SiamTPN: Temporal Siamese Transformer Pyramid Networks for Robust and Efficient UAV Tracking

Hojat Ardi (1), Amir Jahanshahi (1), Ali Diba (2) ((1) Department of Electrical Engineering, Amirkabir University of Technology (AUT), Tehran, Iran (2) Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar)

主题：计算机视觉与模式识别 (cs.CV)
[220] arXiv:2509.12905 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： AREPAS：基于重建的语义块评分在细粒度解剖中的异常检测

标题： AREPAS: Anomaly Detection in Fine-Grained Anatomy with Reconstruction-Based Semantic Patch-Scoring

Branko Mitic, Philipp Seeböck, Helmut Prosch, Georg Langs

主题：计算机视觉与模式识别 (cs.CV)
[221] arXiv:2509.12901 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： MSGFusion：多模态场景图引导的红外与可见光图像融合

标题： MSGFusion: Multimodal Scene Graph-Guided Infrared and Visible Image Fusion

Guihui Li, Bowei Dong, Kaizhi Dong, Jiayi Li, Haiyong Zheng

主题：计算机视觉与模式识别 (cs.CV)
[222] arXiv:2509.12897 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：跨层视觉平滑：通过在大型视觉-语言模型中持续关注关键对象来增强视觉理解

标题： Cross-Layer Vision Smoothing: Enhancing Visual Understanding via Sustained Focus on Key Objects in Large Vision-Language Models

Jianfei Zhao, Feng Zhang, Xin Sun, Lingxing Kong, Zhixing Tan, Chong Feng

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[223] arXiv:2509.12894 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： DialNav：与远程引导的多轮对话导航

标题： DialNav: Multi-turn Dialog Navigation with a Remote Guide

Leekyeung Han, Hyunji Min, Gyeom Hwangbo, Jonghyun Choi, Paul Hongsuck Seo

评论： 18页，8张图，ICCV 2025

主题：计算机视觉与模式识别 (cs.CV)
[224] arXiv:2509.12893 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： MEJO：通过任务间和任务内联合优化的MLLM参与的外科三元组识别

标题： MEJO: MLLM-Engaged Surgical Triplet Recognition via Inter- and Intra-Task Joint Optimization

Yiyi Zhang, Yuchen Yuan, Ying Zheng, Jialun Pei, Jinpeng Li, Zheng Li, Pheng-Ann Heng

主题：计算机视觉与模式识别 (cs.CV)
[225] arXiv:2509.12888 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：龙格-库塔近似与解耦注意力用于校正流反转和语义编辑

标题： Runge-Kutta Approximation and Decoupled Attention for Rectified Flow Inversion and Semantic Editing

Weiming Chen, Zhihan Zhu, Yijia Wang, Zhihai He

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[226] arXiv:2509.12883 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：乐高编辑：一种具有模型级积木和MLLM构建器的通用图像编辑框架

标题： Lego-Edit: A General Image Editing Framework with Model-Level Bricks and MLLM Builder

Qifei Jia, Yu Liu, Yajie Chai, Xintong Yao, Qiming Lu, Yasen Zhang, Runyu Shi, Ying Huang, Guoquan Zhang

主题：计算机视觉与模式识别 (cs.CV)
[227] arXiv:2509.12878 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：少量到大量：通过扩散学习者进行点云少量样本语义分割的原型扩展网络

标题： Few to Big: Prototype Expansion Network via Diffusion Learner for Point Cloud Few-shot Semantic Segmentation

Qianguang Zhao, Dongli Wang, Yan Zhou, Jianxun Li, Richard Irampa

主题：计算机视觉与模式识别 (cs.CV)
[228] arXiv:2509.12871 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：累积共识评分：部署中对象检测器的无标签和模型无关评估

标题： Cumulative Consensus Score: Label-Free and Model-Agnostic Evaluation of Object Detectors in Deployment

Avinaash Manoharan, Xiangyu Yin, Domenik Helm, Chih-Hong Cheng

主题：计算机视觉与模式识别 (cs.CV)
[229] arXiv:2509.12866 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：利用大型语言模型为犬类骨骼肌肉诊断有效生成视觉数据

标题： Leveraging Large Language Models to Effectively Generate Visual Data for Canine Musculoskeletal Diagnoses

Martin Thißen, Thi Ngoc Diep Tran, Barbara Esteve Ratsch, Ben Joel Schönbein, Ute Trapp, Beate Egner, Romana Piat, Elke Hergenröther

期刊参考：计算机科学研究笔记 3501(1) (2025) 27-38

主题：计算机视觉与模式识别 (cs.CV)
[230] arXiv:2509.12836 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：探索度量融合用于NeRFs的评估

标题： Exploring Metric Fusion for Evaluation of NeRFs

Shreyas Shivakumara, Gabriel Eilertsen, Karljohan Lundin Palmerius

评论：被第17届国际多媒体体验质量会议（QoMEX 25）接受

主题：计算机视觉与模式识别 (cs.CV)
[231] arXiv:2509.12818 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：放射学基础模型的数据缩放定律

标题： Data Scaling Laws for Radiology Foundation Models

Maximilian Ilse, Harshita Sharma, Anton Schwaighofer, Sam Bond-Taylor, Fernando Pérez-García, Olesya Melnichenko, Anne-Marie G. Sykes, Kelly K. Horst, Ashish Khandelwal, Maxwell Reynolds, Maria T. Wetscherek, Noel C. F. Codella, Javier Alvarez-Valle, Korfiatis Panagiotis, Valentina Salvatelli

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[232] arXiv:2509.12817 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： SAGA：用于高效且表达性强的线性注意力的选择性自适应门控

标题： SAGA: Selective Adaptive Gating for Efficient and Expressive Linear Attention

Yuan Cao, Dong Wang

主题：计算机视觉与模式识别 (cs.CV)
[233] arXiv:2509.12815 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： Hunyuan3D Studio：面向游戏的3D资源生成端到端AI流程

标题： Hunyuan3D Studio: End-to-End AI Pipeline for Game-Ready 3D Asset Generation

Biwen Lei, Yang Li, Xinhai Liu, Shuhui Yang, Lixin Xu, Jingwei Huang, Ruining Tang, Haohan Weng, Jian Liu, Jing Xu, Zhen Zhou, Yiling Zhu, Jiankai Xing, Jiachen Xu, Changfeng Ma, Xinhao Yan, Yunhan Yang, Chunshi Wang, Duoteng Xu, Xueqi Ma, Yuguang Chen, Jing Li, Mingxin Yang, Sheng Zhang, Yifei Feng, Xin Huang, Di Luo, Zebin He, Puhua Jiang, Changrong Hu, Zihan Qin, Shiwei Miao, Haolin Liu, Yunfei Zhao, Zeqiang Lai, Qingxiang Lin, Zibo Zhao, Kunhong Li, Xianghui Yang, Huiwen Shi, Xin Yang, Yuxuan Wang, Zebin Yao, Yihang Lian, Sicong Liu, Xintong Han, Wangchen Qin, Caisheng Ouyang, Jianyin Liu, Tianwen Yuan, Shuai Jiang, Hong Duan, Yanqi Niu, Wencong Lin, Yifu Sun, Shirui Huang, Lin Niu, Gu Gong, Guojian Xiao, Bojian Zheng, Xiang Yuan, Qi Chen, Jie Xiao, Dongyang Zheng, Xiaofeng Yang, Kai Liu, Jianchen Zhu, Lifu Wang, Qinglin Lu, Jie Liu, Liang Dong, Fan Jiang, Ruibin Chen, Lei Wang, Chao Zhang, Jiaxin Lin, Hao Zhang, Zheng Ye, Peng He, Runzhou Wu, Yinhe Wu, Jiayao Du, Jupeng Chen, Xinyue Mao, Dongyuan Guo, Yixuan Tang, Yulin Tsai, Yonghao Tan, Jiaao Yu, Junlin Yu, Keren Zhang, Yifan Li, Peng Chen, Tian Liu, Di Wang, Yuhong Liu, Linus, Jie Jiang, Zhuo Chen, Chunchao Guo

评论：技术报告

主题：计算机视觉与模式识别 (cs.CV)
[234] arXiv:2509.12791 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：超像素任意：一种准确且规则的超像素分割的通用基于对象的框架

标题： Superpixel Anything: A general object-based framework for accurate yet regular superpixel segmentation

Julien Walther, Rémi Giraud, Michaël Clément

主题：计算机视觉与模式识别 (cs.CV)
[235] arXiv:2509.12787 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：双螺旋扩散用于跨域异常图像生成

标题： Double Helix Diffusion for Cross-Domain Anomaly Image Generation

Linchun Wu, Qin Zou, Xianbiao Qi, Bo Du, Zhongyuan Wang, Qingquan Li

主题：计算机视觉与模式识别 (cs.CV)
[236] arXiv:2509.12784 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：使用上下文表示建模多变量关系以有效检测人-物体交互

标题： Modeling the Multivariate Relationship with Contextualized Representations for Effective Human-Object Interaction Detection

Zhehao Li, Yucheng Qian, Chong Wang, Yinghao Lu, Zhihao Yang, Jiafei Wu

主题：计算机视觉与模式识别 (cs.CV)
[237] arXiv:2509.12777 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： CECT-Mamba：一种分层对比增强感知模型，用于多相CECT胰腺肿瘤亚型分类

标题： CECT-Mamba: a Hierarchical Contrast-enhanced-aware Model for Pancreatic Tumor Subtyping from Multi-phase CECT

Zhifang Gong, Shuo Gao, Ben Zhao, Yingjing Xu, Yijun Yang, Shenghong Ju, Guangquan Zhou

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[238] arXiv:2509.12768 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： BATR-FST：少样本变换器的双级自适应标记精炼

标题： BATR-FST: Bi-Level Adaptive Token Refinement for Few-Shot Transformers

Mohammed Al-Habib, Zuping Zhang, Abdulrahman Noman

评论：本文已被接受发表于IEEE国际神经网络联合会议（IJCNN），2025年意大利罗马

主题：计算机视觉与模式识别 (cs.CV) ; 机器学习 (cs.LG)
[239] arXiv:2509.12763 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： DyGLNet：用于医学图像分割的混合全局-局部特征融合与动态上采样

标题： DyGLNet: Hybrid Global-Local Feature Fusion with Dynamic Upsampling for Medical Image Segmentation

Yican Zhao, Ce Wang, You Hao, Lei Li, Tianli Liao

评论： 18页，正在审稿中

主题：计算机视觉与模式识别 (cs.CV)
[240] arXiv:2509.12759 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： A-TDOM：通过实时3DGS的主动TDOM

标题： A-TDOM: Active TDOM via On-the-Fly 3DGS

Yiwei Xu, Xiang Wang, Yifei Yu, Wentian Gan, Luca Morelli, Giulio Perda, Xiongwu Xiao, Zongqian Zhan, Xin Wang, Fabio Remondino

评论：这是一篇即将发表的期刊论文的简短白皮书

主题：计算机视觉与模式识别 (cs.CV)
[241] arXiv:2509.12757 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：递归跨视图物体几何定位

标题： Recurrent Cross-View Object Geo-Localization

Xiaohan Zhang, Si-Yuan Cao, Xiaokai Bai, Yiming Li, Zhangkai Shen, Zhe Wu, Xiaoxi Hu, Hui-liang Shen

主题：计算机视觉与模式识别 (cs.CV)
[242] arXiv:2509.12750 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：什么构成了高质量的生成图像？研究人类和多模态大语言模型的图像偏好对齐

标题： What Makes a Good Generated Image? Investigating Human and Multimodal LLM Image Preference Alignment

Rishab Parthasarathy, Jasmine Collins, Cory Stephenson

评论： 7页，9图，3表；附录16页，9图，6表

主题：计算机视觉与模式识别 (cs.CV)
[243] arXiv:2509.12746 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：基于尺度空间理论的理想感受野，对“主键滤波器假说”中的8个滤波器在深度可分离深度网络中的建模与分析

标题： Modelling and analysis of the 8 filters from the "master key filters hypothesis" for depthwise-separable deep networks in relation to idealized receptive fields based on scale-space theory

Tony Lindeberg, Zahra Babaiee, Peyman M. Kiasari

评论： 24页，11图，17表

主题：计算机视觉与模式识别 (cs.CV)
[244] arXiv:2509.12742 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：高保真目标重构的有效高斯管理

标题： Effective Gaussian Management for High-fidelity Object Reconstruction

Jiateng Liu, Hao Gao, Jiu-Cheng Xie, Chi-Man Pun, Jian Xiong, Haolun Li, Feng Xu

主题：计算机视觉与模式识别 (cs.CV)
[245] arXiv:2509.12724 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：防御到攻击：绕过弱防御在视觉-语言模型中实现更强的越狱

标题： Defense-to-Attack: Bypassing Weak Defenses Enables Stronger Jailbreaks in Vision-Language Models

Yunhan Zhao, Xiang Zheng, Xingjun Ma

评论：此作品已提交给IEEE以可能发表

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[246] arXiv:2509.12721 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： SPGen：球面投影作为单图像3D形状生成的一致且灵活的表示方法

标题： SPGen: Spherical Projection as Consistent and Flexible Representation for Single Image 3D Shape Generation

Jingdong Zhang, Weikai Chen, Yuan Liu, Jionghao Wang, Zhengming Yu, Zhuowen Shen, Bo Yang, Wenping Wang, Xin Li

主题：计算机视觉与模式识别 (cs.CV)
[247] arXiv:2509.12718 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： EvoEmpirBench：具有智能体-验证的动态空间推理

标题： EvoEmpirBench: Dynamic Spatial Reasoning with Agent-ExpVer

Pukun Zhao, Longxiang Wang, Miaowei Wang, Chen Chen, Fanqing Zhou, Haojian Huang

评论：正在工作，29页，3图，7表

主题：计算机视觉与模式识别 (cs.CV)
[248] arXiv:2509.12715 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： AsyMoE：利用模态不对称性增强大型视觉-语言模型中的专家专业化

标题： AsyMoE: Leveraging Modal Asymmetry for Enhanced Expert Specialization in Large Vision-Language Models

Heng Zhang, Haichuan Hu, Yaomin Shen, Weihao Yu, Yilei Yuan, Haochen You, Guo Cheng, Zijian Zhang, Lubin Gan, Huihui Wei, Hao Zhang, Jin Huang

主题：计算机视觉与模式识别 (cs.CV) ; 机器人技术 (cs.RO)
[249] arXiv:2509.12711 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：通过想象学习：组合零样本学习的去偏特征增强

标题： Learning by Imagining: Debiased Feature Augmentation for Compositional Zero-Shot Learning

Haozhe Zhang, Chenchen Jing, Mingyu Liu, Qingsheng Wang, Hao Chen

主题：计算机视觉与模式识别 (cs.CV)
[250] arXiv:2509.12710 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： RIS-FUSION：从指代图像分割的角度重新思考文本驱动的红外与可见光图像融合

标题： RIS-FUSION: Rethinking Text-Driven Infrared and Visible Image Fusion from the Perspective of Referring Image Segmentation

Siju Ma, Changsiyu Gong, Xiaofeng Fan, Yong Ma, Chengjie Jiang

评论： 5页，2图

主题：计算机视觉与模式识别 (cs.CV)

总共 569 条目 : 1-50 51-100 101-150 151-200 201-250 251-300 301-350 351-400 ... 551-569

显示最多 50 每页条目：较少 | 更多 | 所有

计算机视觉与模式识别

最近提交的作者和标题

2025年09月17日， 星期三 (继续， 展示 132 之 50 条目 )

2025年09月17日，星期三 (继续，展示 132 之 50 条目 )