计算机视觉与模式识别

最近提交的作者和标题

查看今天的新的变化

总共 552 条目 : 1-50 51-100 101-150 151-200 ... 551-552

显示最多 50 每页条目：较少 | 更多 | 所有

[1] arXiv:2601.04956 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： TEA：时间自适应卫星图像语义分割

标题： TEA: Temporal Adaptive Satellite Image Semantic Segmentation

Zeren Jiang, Chuanxia Zheng, Iro Laina, Diane Larlus, Andrea Vedaldi

评论：正在审核中。代码将可在此 https URL 查看

主题：计算机视觉与模式识别 (cs.CV)
[2] arXiv:2601.04946 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：原型偏差揭示了多模态评估指标中的盲点

标题： Prototypicality Bias Reveals Blindspots in Multimodal Evaluation Metrics

Daniele Lizzio Bosco, Shuteng Wang, Giuseppe Serra, Vladislav Golyanik

评论：第一个版本

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[3] arXiv:2601.04860 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： DivAS：通过深度加权体素聚合的NeRF交互式3D分割

标题： DivAS: Interactive 3D Segmentation of NeRFs via Depth-Weighted Voxel Aggregation

Yuan-Kang Lee, Kuan-Lin Chen, Chia-Che Chang, Yu-Lun Liu

主题：计算机视觉与模式识别 (cs.CV)
[4] arXiv:2601.04800 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：用于从石刻、金属板和纸张文档中选择和增强古马拉地语铭文图像的综合框架

标题： Integrated Framework for Selecting and Enhancing Ancient Marathi Inscription Images from Stone, Metal Plate, and Paper Documents

Gangwei Xu, Haotong Lin, Hongcheng Luo, Haiyang Sun, Bing Wang, Guang Chen, Sida Peng, Hangjun Ye, Xin Yang

评论： 9页，5图

主题：计算机视觉与模式识别 (cs.CV)
[5] arXiv:2601.04776 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：基于物理模型的极化驱动单目形状分割

标题： Segmentation-Driven Monocular Shape from Polarization based on Physical Model

Henghui Ding, Chang Liu, Shuting He, Xudong Jiang, Yu-Gang Jiang

评论： 11页，10幅图，提交至IEEE图像处理汇刊

主题：计算机视觉与模式识别 (cs.CV)
[6] arXiv:2601.04752 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：基于骨架的对抗扰动在大型视觉语言模型数学文本识别中的应用

标题： Skeletonization-Based Adversarial Perturbations on Large Vision Language Model's Mathematical Text Recognition

Boyang Wang, Haoran Zhang, Shujie Zhang, Jinkun Hao, Mingda Jia, Qi Lv, Yucheng Mao, Zhaoyang Lyu, Jia Zeng, Xudong Xu, Jiangmiao Pang

评论：被ITC-CSCC 2025接受

期刊参考：第25届ITC-CSCC会议论文集

主题：计算机视觉与模式识别 (cs.CV)
[7] arXiv:2601.05149 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：多尺度局部推测解码用于图像生成

标题： Multi-Scale Local Speculative Decoding for Image Generation

Xiao Fu, Shitao Tang, Min Shi, Xian Liu, Jinwei Gu, Ming-Yu Liu, Dahua Lin, Chen-Hsuan Lin

评论：项目页面位于 https://qualcomm-ai-research.github.io/mulo-sd-webpage

主题：计算机视觉与模式识别 (cs.CV)
[8] arXiv:2601.05116 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：从射线到投影：前馈视图合成的更好输入

标题： From Rays to Projections: Better Inputs for Feed-Forward View Synthesis

Rustin Soraki, Homanga Bharadhwaj, Ali Farhadi, Roozbeh Mottaghi

评论：项目页面：https://wuzirui.github.io/pvsm-web

主题：计算机视觉与模式识别 (cs.CV)
[9] arXiv:2601.05244 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： GREx：广义指称表达式分割、理解与生成

标题： GREx: Generalized Referring Expression Segmentation, Comprehension, and Generation

Danilo Danese, Angela Lombardi, Matteo Attimonelli, Giuseppe Fasano, Tommaso Di Noia

评论： IJCV，项目页面：https://henghuiding.com/GREx/

主题：计算机视觉与模式识别 (cs.CV)
[10] arXiv:2601.05239 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：光场视频生成

标题： Plenoptic Video Generation

Zichen Wang, Ang Cao, Liam J. Wang, Jeong Joon Park

评论：项目页面：https://research.nvidia.com/labs/dir/plenopticdreamer/

主题：计算机视觉与模式识别 (cs.CV)
[11] arXiv:2601.05143 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：一种轻量级且可解释的视觉-语言框架用于作物疾病视觉问答

标题： A Lightweight and Explainable Vision-Language Framework for Crop Disease Visual Question Answering

William Rudman, Michal Golovanevsky, Dana Arad, Yonatan Belinkov, Ritambhara Singh, Carsten Eickhoff, Kyle Mahowald

评论：预印本，稿件正在评审中

主题：计算机视觉与模式识别 (cs.CV) ; 计算与语言 (cs.CL)
[12] arXiv:2601.04792 (交叉列表自 cs.CV) [中文pdf, pdf, 其他]: 标题：金字塔瓦：关于使预训练视频模型为高效推理而金字塔化的研究

标题： PyramidalWan: On Making Pretrained Video Model Pyramidal for Efficient Inference

Zuhair Ahmed Khan Taha, Mohammed Mudassir Uddin, Shahnawaz Alam

主题：计算机视觉与模式识别 (cs.CV)
[13] arXiv:2601.05191 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：降低AI研究成本：任务感知压缩如何使大型语言模型代理变得经济实惠

标题： Cutting AI Research Costs: How Task-Aware Compression Makes Large Language Model Agents Affordable

Shuming Liu, Mingchen Zhuge, Changsheng Zhao, Jun Chen, Lemeng Wu, Zechun Liu, Chenchen Zhu, Zhipeng Cai, Chong Zhou, Haozhe Liu, Ernie Chang, Saksham Suri, Hongyu Xu, Qi Qian, Wei Wen, Balakrishnan Varadarajan, Zhuang Liu, Hu Xu, Florian Bordes, Raghuraman Krishnamoorthi, Bernard Ghanem, Vikas Chandra, Yunyang Xiong

主题：计算机视觉与模式识别 (cs.CV) ; 机器学习 (cs.LG)
[14] arXiv:2601.04968 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： SparseLaneSTP：利用稀疏变换器的时空先验进行三维车道检测

标题： SparseLaneSTP: Leveraging Spatio-Temporal Priors with Sparse Transformers for 3D Lane Detection

Haoyu Zhao, Akide Liu, Zeyu Zhang, Weijie Wang, Feng Chen, Ruihan Zhu, Gholamreza Haffari, Bohan Zhuang

评论：发表于 IEEE/CVF 国际计算机视觉会议（ICCV）2025

主题：计算机视觉与模式识别 (cs.CV)
[15] arXiv:2601.04798 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：用于长时间无人机跟踪的检测器增强型SAMURAI

标题： Detector-Augmented SAMURAI for Long-Duration Drone Tracking

Shuliang Liu, Songbo Yang, Dong Fang, Sihang Jia, Yuqi Tang, Lingfeng Su, Ruoshui Peng, Yibo Yan, Xin Zou, Xuming Hu

评论：被接受至WACV 2026“现实世界监控：应用与挑战”研讨会

主题：计算机视觉与模式识别 (cs.CV)
[16] arXiv:2601.04779 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：离焦像差理论证实了大多数成像设备中的高斯模型

标题： Defocus Aberration Theory Confirms Gaussian Model in Most Imaging Devices

Elia Peruzzo, Guillaume Sautière, Amirhossein Habibian

评论： 13页，9图，11个.jpg文件

主题：计算机视觉与模式识别 (cs.CV)
[17] arXiv:2601.05201 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：提示引起的幻觉机制在视觉-语言模型中

标题： Mechanisms of Prompt-Induced Hallucination in Vision-Language Models

Maximilian Alber, Timo Milbich, Alexandra Carpen-Amarie, Stephan Tietz, Jonas Dippel, Lukas Muttenthaler, Beatriz Perez Cancer, Alessandro Benetti, Panos Korfiatis, Elias Eulig, Jérôme Lüscher, Jiasen Wu, Sayed Abid Hashimi, Gabriel Dernbach, Simon Schallenberg, Neelay Shah, Moritz Krügener, Aniruddh Jammoria, Jake Matras, Patrick Duffy, Matt Redlon, Philipp Jurmeister, David Horst, Lukas Ruff, Klaus-Robert Müller, Frederick Klauschen, Andrew Norgan

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI) ; 计算与语言 (cs.CL)
[18] arXiv:2601.05172 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： CoV：空间推理的视角链提示

标题： CoV: Chain-of-View Prompting for Spatial Reasoning

Md. Zahid Hossain, Most. Sharmin Sultana Samu, Md. Rakibul Islam, Md. Siam Ansary

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[19] arXiv:2601.05212 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： FlowLet：使用小波流匹配的条件3D脑部MRI合成

标题： FlowLet: Conditional 3D Brain MRI Synthesis using Wavelet Flow Matching

Sixiao Zheng, Minghao Yin, Wenbo Hu, Xiaoyu Li, Ying Shan, Yanwei Fu

主题：计算机视觉与模式识别 (cs.CV)
[20] arXiv:2601.05208 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： MoE3D：用于三维重建的专家混合模块

标题： MoE3D: A Mixture-of-Experts Module for 3D Reconstruction

Ignacio de Rodrigo, Alvaro J. Lopez-Lopez, Jaime Boal

主题：计算机视觉与模式识别 (cs.CV)
[21] arXiv:2601.05175 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： VideoAuto-R1：通过一次思考，两次回答的视频自动推理

标题： VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice

Runze He, Yiji Cheng, Tiankai Hang, Zhimin Li, Yu Xu, Zijin Yin, Shiyi Zhang, Wenxun Dai, Penghui Du, Ao Ma, Chunyu Wang, Qinglin Lu, Jizhong Han, Jiao Dai

评论：项目页面：https://ivul-kaust.github.io/projects/videoauto-r1/

主题：计算机视觉与模式识别 (cs.CV)
[22] arXiv:2601.05138 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： VerseCrafter：具有4D几何控制的动态真实视频世界模型

标题： VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control

Zirui Wu, Zeren Jiang, Martin R. Oswald, Jie Song

评论：项目页面：https://sixiaozheng.github.io/VerseCrafter_page/

主题：计算机视觉与模式识别 (cs.CV)
[23] arXiv:2601.05125 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： VERSE：视觉嵌入缩减与空间探索基于聚类的见解，用于视觉丰富文档理解的训练数据增强

标题： VERSE: Visual Embedding Reduction and Space Exploration. Clustering-Guided Insights for Training Data Enhancement in Visually-Rich Document Understanding

Filippo Ghilotti, Samuel Brucker, Nahku Saidy, Matteo Matteucci, Mario Bijelic, Felix Heide

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[24] arXiv:2601.04785 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： SRU-Pix2Pix：一种基于少样本学习的医学图像翻译融合驱动生成网络

标题： SRU-Pix2Pix: A Fusion-Driven Generator Network for Medical Image Translation with Few-Shot Learning

Ellington Kirby, Alexandre Boulch, Yihong Xu, Yuan Yin, Gilles Puy, Éloi Zablocki, Andrei Bursuc, Spyros Gidaris, Renaud Marlet, Florent Bartoccioni, Anh-Quan Cao, Nermin Samet, Tuan-Hung VU, Matthieu Cord

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[25] arXiv:2601.04777 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： GeM-VG：面向多图像视觉定位的通用方法与多模态大语言模型

标题： GeM-VG: Towards Generalized Multi-image Visual Grounding with Multimodal Large Language Models

Suyash Mishra, Qiang Li, Srikanth Patil, Anubhav Girdhar

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[26] arXiv:2601.04754 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： ProFuse：用于开放词汇3D高斯泼溅的高效跨视图上下文融合

标题： ProFuse: Efficient Cross-View Context Fusion for Open-Vocabulary 3D Gaussian Splatting

Ruochen Chen, Thuy Tran, Shaifali Parashar

评论： 10页，5图

主题：计算机视觉与模式识别 (cs.CV)
[27] arXiv:2601.04734 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： AIVD：用于准确和高效工业视觉检测的自适应边缘云协作

标题： AIVD: Adaptive Edge-Cloud Collaboration for Accurate and Efficient Industrial Visual Detection

Jens Bayer, Stefan Becker, David Münch, Michael Arens, Jürgen Beyerer

主题：计算机视觉与模式识别 (cs.CV)
[28] arXiv:2601.05148 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： Atlas 2 -- 用于临床部署的基础模型

标题： Atlas 2 -- Foundation models for clinical deployment

Minseong Kweon, Jinsun Park

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI) ; 机器学习 (cs.LG)
[29] arXiv:2601.04727 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：在五个异构图像数据集上训练自定义的CNN

标题： Training a Custom CNN on Five Heterogeneous Image Datasets

Maximilian Pittner, Joel Janai, Mario Faigle, Alexandru Paul Condurache

主题：计算机视觉与模式识别 (cs.CV) ; 神经与进化计算 (cs.NE)
[30] arXiv:2601.05251 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： Mesh4D：从单目视频中进行4D网格重建和跟踪

标题： Mesh4D: 4D Mesh Reconstruction and Tracking from Monocular Video

Juyuan Kang, Hao Zhu, Yan Zhu, Wei Zhang, Jianing Chen, Tianxiang Xiao, Yike Ma, Hao Jiang, Feng Dai

评论： 15页，8张图，项目页面：https://mesh-4d.github.io/

主题：计算机视觉与模式识别 (cs.CV)
[31] arXiv:2601.04899 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：旋转鲁棒的卷积模型树回归

标题： Rotation-Robust Regression with Convolutional Model Trees

Subhadeep Roy, Gagan Bhatia, Steffen Eger

主题：计算机视觉与模式识别 (cs.CV) ; 机器学习 (cs.LG)
[32] arXiv:2601.04791 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：测量一致的朗之万校正器：潜在扩散逆求解器的解决方案

标题： Measurement-Consistent Langevin Corrector: A Remedy for Latent Diffusion Inverse Solvers

Hongyi Li, William Ward Armstrong, Jun Xu

评论：待审核

主题：计算机视觉与模式识别 (cs.CV) ; 机器学习 (cs.LG)
[33] arXiv:2601.04778 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： CounterVid：减轻视频-语言模型中动作和时间幻觉的反事实视频生成

标题： CounterVid: Counterfactual Video Generation for Mitigating Action and Temporal Hallucinations in Video-Language Models

Suyash Mishra, Qiang Li, Srikanth Patil, Satyanarayan Pati, Baddu Narendra

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI) ; 计算与语言 (cs.CL) ; 多媒体 (cs.MM)
[34] arXiv:2601.04715 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：论检测人像伪造的整体方法

标题： On the Holistic Approach for Detecting Human Image Forgery

Ayush Pande

评论： 6幅图，5张表

主题：计算机视觉与模式识别 (cs.CV)
[35] arXiv:2601.05159 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：视觉-语言内省：通过可解释的双因果引导减轻多模态大语言模型的过度自信幻觉

标题： Vision-Language Introspection: Mitigating Overconfident Hallucinations in MLLMs via Interpretable Bi-Causal Steering

Alessandra Scotto di Freca, Tiziana D Alessandro, Francesco Fontanella, Filippo Sarria, Claudio De Stefano

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[36] arXiv:2601.05124 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：重新对齐：结构化推理引导的上下文图像生成与编辑对齐

标题： Re-Align: Structured Reasoning-guided Alignment for In-Context Image Generation and Editing

Oriol Rabasseda, Zenjie Li, Kamal Nasrollahi, Sergio Escalera

评论： 13页，9图，项目页面：https://github.com/hrz2000/realign

主题：计算机视觉与模式识别 (cs.CV)
[37] arXiv:2601.05105 (交叉列表自 cs.CV) [中文pdf, pdf, 其他]: 标题： UniLiPs：基于几何基础的动态场景分解的统一激光雷达伪标签方法

标题： UniLiPs: Unified LiDAR Pseudo-Labeling with Geometry-Grounded Dynamic Scene Decomposition

Bapu D. Chendage, Rajivkumar S. Mente

主题：计算机视觉与模式识别 (cs.CV)
[38] arXiv:2601.05035 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：基于块的表示和学习用于高效的变形建模

标题： Patch-based Representation and Learning for Efficient Deformation Modeling

Tamara R. Lenhard, Andreas Weinmann, Hichem Snoussi, Tobias Koch

主题：计算机视觉与模式识别 (cs.CV)
[39] arXiv:2601.05249 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： RL-AWB：低光夜间场景中的自动白平衡校正深度强化学习

标题： RL-AWB: Deep Reinforcement Learning for Auto White Balance Correction in Low-Light Night-time Scenes

Denis Korzhenkov, Adil Karjauv, Animesh Karnewar, Mohsen Ghafoorian, Amirhossein Habibian

评论：项目页面：https://ntuneillee.github.io/research/rl-awb/

主题：计算机视觉与模式识别 (cs.CV)
[40] arXiv:2601.05241 (交叉列表自 cs.CV) [中文pdf, pdf, 其他]: 标题： RoboVIP：具有视觉身份提示的多视角视频生成增强机器人操作

标题： RoboVIP: Multi-View Video Generation with Visual Identity Prompting Augments Robot Manipulation

Lee Hyoseok, Sohwi Lim, Eunju Cha, Tae-Hyun Oh

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI) ; 机器人技术 (cs.RO)
[41] arXiv:2601.05059 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：从理解到参与：通过视觉语言模型（VLMs）个性化的药学视频片段

标题： From Understanding to Engagement: Personalized pharmacy Video Clips via Vision Language Models (VLMs)

Xihe Qiu, Yang Dai, Xiaoyu Tan, Sijia Li, Fenghao Sun, Lu Gan, Liang Liu

评论：为顶级会议在视觉语言模型领域做出了原创研究；目前正在进行同行评审

主题：计算机视觉与模式识别 (cs.CV) ; 机器学习 (cs.LG)
[42] arXiv:2601.04891 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：在工业通用人工智能平台上的药物长视频推理的视觉语言模型扩展

标题： Scaling Vision Language Models for Pharmaceutical Long Form Video Reasoning on Industrial GenAI Platform

Akbar Saadat

评论：提交至顶级会议的产业赛道；目前处于同行评审中

主题：计算机视觉与模式识别 (cs.CV) ; 机器学习 (cs.LG)
[43] arXiv:2601.04984 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：海洋飞溅：具有三目视图一致性的对象感知高斯飞溅用于水下场景重建

标题： OceanSplat: Object-aware Gaussian Splatting with Trinocular View Consistency for Underwater Scene Reconstruction

Tobia Poppi, Burak Uzkent, Amanmeet Garg, Lucas Porto, Garin Kessler, Yezhou Yang, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara, Florian Schiffers

评论：被AAAI 2026接收。项目页面：https://oceansplat.github.io

主题：计算机视觉与模式识别 (cs.CV)
[44] arXiv:2601.04834 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：使用 YOLO 的字符检测用于多本中世纪书籍的作者识别

标题： Character Detection using YOLO for Writer Identification in multiple Medieval books

Shurong Zheng, Yousong Zhu, Hongyin Zhao, Fan Yang, Yufei Zhan, Ming Tang, Jinqiao Wang

评论： 7页，2图，1表。被IEEE-CH 2025接收

主题：计算机视觉与模式识别 (cs.CV)
[45] arXiv:2601.04824 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： SOVABench：多模态大语言模型的车辆监控动作检索基准

标题： SOVABench: A Vehicle Surveillance Action Retrieval Benchmark for Multimodal Large Language Models

Jinyu Zhang, Xu Ma, Weili Chen, Gonzalo R. Arce

评论：此作品已被接受在《现实世界监控：应用与挑战》第六届（WACV研讨会）上发表。

主题：计算机视觉与模式识别 (cs.CV)
[46] arXiv:2601.05083 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：寄存器上的驾驶

标题： Driving on Registers

Yen-Jen Chiou, Wei-Tse Cheng, Yuan-Fu Yang

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI) ; 机器人技术 (cs.RO)
[47] arXiv:2601.04991 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：针对实时目标检测器的高阶对抗补丁

标题： Higher-Order Adversarial Patches for Real-Time Object Detectors

Masatomo Yoshida, Haruto Namura, Nicola Adami, Masahiro Okuda

评论：正在审稿（ICPR2026）

主题：计算机视觉与模式识别 (cs.CV)
[48] arXiv:2601.05250 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： QNeRF：模拟基于门的量子计算机上的神经辐射场

标题： QNeRF: Neural Radiance Fields on a Simulated Gate-Based Quantum Computer

Yunqing Hu, Zheming Yang, Chang Zhao, Qi Guo, Meng Gao, Pengcheng Li, Wen Ji

评论： 30页，15图，11表；项目页面：https://4dqv.mpi-inf.mpg.de/QNeRF/

主题：计算机视觉与模式识别 (cs.CV)
[49] arXiv:2601.05246 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：像素精确的视觉几何估计

标题： Pixel-Perfect Visual Geometry Estimation

Anika Tabassum, Tasnuva Mahazabin Tuba, Nafisa Naznin

评论：代码：https://github.com/gangweix/pixel-perfect-depth

主题：计算机视觉与模式识别 (cs.CV)
[50] arXiv:2601.05237 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： ObjectForesight：从人类视频中预测未来3D物体轨迹

标题： ObjectForesight: Predicting Future 3D Object Trajectories from Human Videos

Xiao Guo, Jie Zhu, Anil Jain, Xiaoming Liu

评论：预印本。项目网站：objectforesight.github.io

主题：计算机视觉与模式识别 (cs.CV)

总共 552 条目 : 1-50 51-100 101-150 151-200 ... 551-552

显示最多 50 每页条目：较少 | 更多 | 所有

计算机视觉与模式识别

最近提交的作者和标题

2026年01月09日， 星期五 (展示 首先 97 之 50 条目 )

2026年01月09日，星期五 (展示首先 97 之 50 条目 )