Computer Vision and Pattern Recognition

Authors and titles for recent submissions


Total of 743 entries: 1-50 51-100 101-150 151-200 201-250 251-300 ... 701-743


[101] arXiv:2508.03064 (cross-list from cs.CV) [Chinese pdf, pdf, other]

Title: CORE-ReID: Comprehensive Optimization and Refinement through Ensemble fusion in Domain Adaptation for person re-identification

Trinh Quoc Nguyen, Oky Dicky Ardiansyah Prima, Katsuyoshi Hotta

Journal-ref: Software 2024, 3(2), 227-249

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[102] arXiv:2508.03060 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: CHARM: Collaborative Harmonization across Arbitrary Modalities for Modality-agnostic Semantic Segmentation

Lekang Wen, Jing Xiao, Liang Liao, Jiajun Chen, Mi Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[103] arXiv:2508.03055 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Uncertainty-Guided Face Matting for Occlusion-Aware Face Transformation

Hyebin Cho, Jaehyup Lee

Comments: Accepted to ACM MM 2025. 9 pages, 8 figures, 6 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[104] arXiv:2508.03050 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Multi-human Interactive Talking Dataset

Zeyu Zhu, Weijia Wu, Mike Zheng Shou

Comments: 9 pages, 4 figures, 4 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[105] arXiv:2508.03039 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: VideoForest: Person-Anchored Hierarchical Reasoning for Cross-Video Question Answering

Yiran Meng, Junhong Ye, Wei Zhou, Guanghui Yue, Xudong Mao, Ruomei Wang, Baoquan Zhao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[106] arXiv:2508.03034 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: MoCA: Identity-Preserving Text-to-Video Generation via Mixture of Cross Attention

Qi Xie (1), Yongjia Ma (2), Donglin Di (2), Xuehao Gao (3), Xun Yang (1) ((1) University of Science and Technology of China, (2) Li Auto, (3) Northwestern Polytechnical University)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[107] arXiv:2508.03017 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: SA-3DGS: A Self-Adaptive Compression Method for 3D Gaussian Splatting

Liheng Zhang, Weihao Yu, Zubo Lu, Haozhi Gu, Jin Huang

Comments: 9 pages, 7 figures. Under review at AAAI 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[108] arXiv:2508.03009 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Enhancing Long Video Question Answering with Scene-Localized Frame Grouping

Xuyi Yang, Wenhao Zhang, Hongbo Jin, Lin Liu, Hongbo Xu, Yongwei Nie, Fei Yu, Fei Ma

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[109] arXiv:2508.03007 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Multi-Granularity Feature Calibration via VFM for Domain Generalized Semantic Segmentation

Xinhui Li, Xiaojie Guo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[110] arXiv:2508.03006 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Seeing It Before It Happens: In-Generation NSFW Detection for Diffusion-Based Text-to-Image Models

Fan Yang, Yihao Huang, Jiayi Zhu, Ling Shi, Geguang Pu, Jin Song Dong, Kailong Wang

Comments: 8 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[111] arXiv:2508.02987 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Adversarial Attention Perturbations for Large Object Detection Transformers

Zachary Yahn, Selim Furkan Tekin, Fatih Ilhan, Sihao Hu, Tiansheng Huang, Yichang Xu, Margaret Loper, Ling Liu

Comments: ICCV 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[112] arXiv:2508.02981 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: MoExDA: Domain Adaptation for Edge-based Action Recognition

Takuya Sugimoto, Ning Ding, Toru Tamaki

Comments: 7 pages

Journal-ref: 19th International Conference on Machine Vision Applications (MVA2025)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[113] arXiv:2508.02978 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Separating Shared and Domain-Specific LoRAs for Multi-Domain Learning

Yusaku Takama, Ning Ding, Tatsuya Yokota, Toru Tamaki

Comments: 9 pages

Journal-ref: CVPR2025 Workshop on Domain Generalization: Evolution, Breakthroughs, and Future Horizons (DGEBF)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[114] arXiv:2508.02973 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Diffusion Models with Adaptive Negative Sampling Without External Resources

Alakh Desai, Nuno Vasconcelos

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[115] arXiv:2508.02967 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Towards Robust Image Denoising with Scale Equivariance

Dawei Zhang, Xiaojie Guo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[116] arXiv:2508.02944 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: X-Actor: Emotional and Expressive Long-Range Portrait Acting from Audio

Chenxu Zhang, Zenan Li, Hongyi Xu, You Xie, Xiaochen Zhao, Tianpei Gu, Guoxian Song, Xin Chen, Chao Liang, Jianwen Jiang, Linjie Luo

Comments: Project page at https://byteaigc.github.io/X-Actor/

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[117] arXiv:2508.02927 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Infrared Object Detection with Ultra Small ConvNets: Is ImageNet Pretraining Still Useful?

Srikanth Muralidharan, Heitor R. Medeiros, Masih Aminbeidokhti, Eric Granger, Marco Pedersoli

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[118] arXiv:2508.02923 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: How Diffusion Prior Landscapes Shape the Posterior in Blind Deconvolution

Minh-Hai Nguyen, Edouard Pauwels, Pierre Weiss

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[119] arXiv:2508.02917 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Following Route Instructions using Large Vision-Language Models: A Comparison between Low-level and Panoramic Action Spaces

Vebjørn Haug Kåsene, Pierre Lison

Comments: This paper has been accepted to ICNSLP 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Robotics (cs.RO)
[120] arXiv:2508.02905 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: How Would It Sound? Material-Controlled Multimodal Acoustic Profile Generation for Indoor Scenes

Mahnoor Fatima Saad, Ziad Al-Halah

Comments: Accepted to ICCV 2025. Project page: https://mahnoor-fatima-saad.github.io/m-capa.html

Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[121] arXiv:2508.02903 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: RDDPM: Robust Denoising Diffusion Probabilistic Model for Unsupervised Anomaly Segmentation

Mehrdad Moradi, Kamran Paynabar

Comments: 10 pages, 5 figures. Accepted to the ICCV 2025 workshop on industrial visual inspection (VISION)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[122] arXiv:2508.02890 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: VisuCraft: Enhancing Large Vision-Language Models for Complex Visual-Guided Creative Content Generation via Structured Information Extraction

Rongxin Jiang, Robert Long, Chenghao Gu, Mingrui Yan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[123] arXiv:2508.02871 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Evaluation and Analysis of Deep Neural Transformers and Convolutional Neural Networks on Modern Remote Sensing Datasets

J. Alex Hurt, Trevor M. Bajkowski, Grant J. Scott, Curt H. Davis

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[124] arXiv:2508.02858 (cross-list from cs.CV) [Chinese pdf, pdf, other]

Title: MIDAR: Mimicking LiDAR Detection for Traffic Applications with a Lightweight Plug-and-Play Model

Tianheng Zhu, Yiheng Feng

Comments: 18 pages, 9 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[125] arXiv:2508.02844 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: RefineSeg: Dual Coarse-to-Fine Learning for Medical Image Segmentation

Anghong Du, Nay Aung, Theodoros N. Arvanitis, Stefan K. Piechnik, Joao A C Lima, Steffen E. Petersen, Le Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[126] arXiv:2508.02831 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: GENIE: Gaussian Encoding for Neural Radiance Fields Interactive Editing

Mikołaj Zieliński, Krzysztof Byrski, Tomasz Szczepanik, Przemysław Spurek

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[127] arXiv:2508.02829 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Elucidating the Role of Feature Normalization in IJEPA

Adam Colton

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[128] arXiv:2508.02807 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: DreamVVT: Mastering Realistic Video Virtual Try-On in the Wild via a Stage-Wise Diffusion Transformer Framework

Tongchun Zuo, Zaiyu Huang, Shuliang Ning, Ente Lin, Chao Liang, Zerong Zheng, Jianwen Jiang, Yuan Zhang, Mingyuan Gao, Xin Dong

Comments: 18 pages, 12 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[129] arXiv:2508.02806 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: PyCAT4: A Hierarchical Vision Transformer-based Framework for 3D Human Pose Estimation

Zongyou Yang, Jonathan Loo

Comments: 10 pages, 20 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[130] arXiv:2508.03654 (cross-list from cs.CL) [Chinese pdf, pdf, html, other]

Title: Can Large Vision-Language Models Understand Multimodal Sarcasm?

Xinyu Wang, Yue Zhang, Liqiang Jing

Comments: Accepted by CIKM 2025

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[131] arXiv:2508.03645 (cross-list from cs.RO) [Chinese pdf, pdf, html, other]

Title: DiWA: Diffusion Policy Adaptation with World Models

Akshay L Chandra, Iman Nematollahi, Chenguang Huang, Tim Welschehold, Wolfram Burgard, Abhinav Valada

Comments: Accepted at the 2025 Conference on Robot Learning (CoRL)

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[132] arXiv:2508.03644 (cross-list from cs.CL) [Chinese pdf, pdf, html, other]

Title: Are We on the Right Way for Assessing Document Retrieval-Augmented Generation?

Wenxuan Shen, Mingjia Wang, Yaochen Wang, Dongping Chen, Junjie Yang, Yao Wan, Weiwei Lin

Comments: Submitted. Project website: https://double-bench.github.io/

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[133] arXiv:2508.03594 (cross-list from eess.IV) [Chinese pdf, pdf, html, other]

Title: CADD: Context aware disease deviations via restoration of brain images using normative conditional diffusion models

Ana Lawry Aguila, Ayodeji Ijishakin, Juan Eugenio Iglesias, Tomomi Takenaga, Yukihiro Nomura, Takeharu Yoshikawa, Osamu Abe, Shouhei Hanaoka

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
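[134] arXiv:2508.03461 (cross-list from eess.IV) [Chinese pdf, pdf, html, other]

Title: Evaluating the Predictive Value of Preoperative MRI for Erectile Dysfunction Following Radical Prostatectomy

Gideon N. L. Rouwendaal, Daniël Boeke, Inge L. Cox, Henk G. van der Poel, Margriet C. van Dijk-de Haan, Regina G. H. Beets-Tan, Thierry N. Boellaard, Wilson Silva

Comments: 13 pages, 5 figures, 2 tables. Accepted at the PRIME-MICCAI workshop (PRedictive Intelligence in MEdicine) at MICCAI 2025. This is the submitted manuscript, with a GitHub repository link, funding acknowledgements, and author names and affiliations added. No further post-submission improvements or corrections have been made. The final version has not yet been published.

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)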
[135] arXiv:2508.03457 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: READ: Real-time and Efficient Asynchronous Diffusion for Audio-driven Talking Head Generation

Haotian Wang, Yuzhe Weng, Jun Du, Haoran Xu, Xiaoyan Wu, Shan He, Bing Yin, Cong Liu, Jianqing Gao, Qingfeng Liu

Comments: 9 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD)
[136] arXiv:2508.03357 (cross-list from eess.IV) [Chinese pdf, pdf, html, other]

Title: GL-LCM: Global-Local Latent Consistency Models for Fast High-Resolution Bone Suppression in Chest X-Ray Images

Yifei Sun, Zhanghao Chen, Hao Zheng, Yuqing Lu, Lixin Duan, Fenglei Fan, Ahmed Elazab, Xiang Wan, Changmiao Wang, Ruiquan Ge

Comments: 11 pages, 3 figures, accepted by MICCAI 2025

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[137] arXiv:2508.03339 (cross-list from cs.RO) [Chinese pdf, pdf, html, other]

Title: UniFucGrasp: Human-Hand-Inspired Unified Functional Grasp Annotation Strategy and Dataset for Diverse Dexterous Hands

Haoran Lin, Wenrui Chen, Xianchi Chen, Fan Yang, Qiang Diao, Wenxin Xie, Sijie Wu, Kailun Yang, Maojun Li, Yaonan Wang

Comments: Project page at https://haochen611.github.io/UFG

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[138] arXiv:2508.03291 (cross-list from astro-ph.IM) [Chinese pdf, pdf, html, other]

Title: Investigation on deep learning-based galaxy image translation models

Hengxin Ruan, Qiufan Lin, Shupei Chen, Yang Wang, Wei Zhang

Comments: Accepted by A&A; 18+6 pages; 12+6 figures

Subjects: Instrumentation and Methods for Astrophysics (astro-ph.IM); Astrophysics of Galaxies (astro-ph.GA); Computer Vision and Pattern Recognition (cs.CV)
[139] arXiv:2508.03221 (cross-list from cs.CR) [Chinese pdf, pdf, html, other]

Title: BadBlocks: Low-Cost and Stealthy Backdoor Attacks Tailored for Text-to-Image Diffusion Models

Yu Pan, Jiahao Chen, Lin Wang, Bingrong Dai, Yi Du

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[140] arXiv:2508.03091 (cross-list from cs.AI) [Chinese pdf, pdf, html, other]

Title: T2UE: Generating Unlearnable Examples from Text Descriptions

Xingjun Ma, Hanxun Huang, Tianwei Song, Ye Sun, Yifeng Gao, Yu-Gang Jiang

Comments: To appear at ACM MM 2025

Subjects: Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[141] arXiv:2508.03073 (cross-list from eess.IV) [Chinese pdf, pdf, html, other]

Title: Nexus-INR: Diverse Knowledge-guided Arbitrary-Scale Multimodal Medical Image Super-Resolution

Bo Zhang, JianFei Huo, Zheng Zhang, Wufan Wang, Hui Gao, Xiangyang Gong, Wendong Wang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[142] arXiv:2508.03057 (cross-list from eess.IV) [Chinese pdf, pdf, html, other]

Title: A Survey of Medical Point Cloud Shape Learning: Registration, Reconstruction and Variation

Tongxu Zhang, Zhiming Liang, Bei Wang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[143] arXiv:2508.03008 (cross-list from eess.IV) [Chinese pdf, pdf, html, other]

Title: ClinicalFMamba: Advancing Clinical Assessment using Mamba-based Multimodal Neuroimaging Fusion

Meng Zhou, Farzad Khalvati

Comments: Accepted at the MICCAI MLMI 2025 workshop

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[144] arXiv:2508.02995 (cross-list from cs.NE) [Chinese pdf, pdf, html, other]

Title: VCNet: Recreating High-Level Visual Cortex Principles for Robust Artificial Vision

Brennen A. Hill, Zhang Xinyu, Timothy Putra Prasetio

Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[145] arXiv:2508.02957 (cross-list from eess.IV) [Chinese pdf, pdf, html, other]

Title: AMD-Mamba: A Phenotype-Aware Multi-Modal Framework for Robust AMD Prognosis

Puzhen Wu, Mingquan Lin, Qingyu Chen, Emily Y. Chew, Zhiyong Lu, Yifan Peng, Hexin Dong

Comments: Accepted at the MICCAI 2025 MIML workshop

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[146] arXiv:2508.02889 (cross-list from eess.IV) [Chinese pdf, pdf, html, other]

Title: REFLECT: Rectified Flows for Efficient Brain Anomaly Correction Transport

Farzad Beizaee, Sina Hajimiri, Ismail Ben Ayed, Gregory Lodygensky, Christian Desrosiers, Jose Dolz

Comments: Accepted by the Medical Image Computing and Computer Assisted Intervention Society (MICCAI 2025)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[147] arXiv:2508.02880 (cross-list from eess.IV) [Chinese pdf, pdf, html, other]

Title: Evaluation of 3D Counterfactual Brain MRI Generation

Pengwei Sun, Wei Peng, Lun Yu Li, Yixin Wang, Kilian M. Pohl

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[148] arXiv:2508.02765 (cross-list from cs.CY) [Chinese pdf, pdf, html, other]

Title: The Architecture of Trust: A Framework for AI-Augmented Real Estate Valuation in the Era of Structured Data

Petteri Teikari, Mike Jarrell, Maryam Azh, Harri Pesola

Comments: 46 pages, 6 figures

Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)

[149] arXiv:2508.02671 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Raw Data Matters: Enhancing Prompt Tuning by Internal Augmentation on Vision-Language Models

Haoyang Li, Liang Wang, Chao Wang, Siyu Zhou, Jing Jiang, Yan Peng, Guodong Long

Comments: 16 pages, 6 figures, 15 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[150] arXiv:2508.02669 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: MedVLThinker: Simple Baselines for Multimodal Medical Reasoning

Xiaoke Huang, Juncheng Wu, Hui Liu, Xianfeng Tang, Yuyin Zhou

Comments: Project page and code: https://ucsc-vlaa.github.io/MedVLThinker/

Subjects: Computer Vision and Pattern Recognition (cs.CV)


Wednesday, 6 August 2025 (continued, showing last 48 of 148 entries)

Tuesday, 5 August 2025 (showing first 2 of 278 entries)
