arXiv cs.CV论文6 小时前DFIR-DETR: Frequency-Domain Iterative Refinement and Dynamic Feature Aggregation for Small Object Detection阅读
arXiv cs.CV论文6 小时前NeuralBoneReg: An Instance-Specific Label-Free Point Cloud-Based Method for Multi-Modal Bone Surface Registration阅读
arXiv cs.CV论文6 小时前Edge Assisted Multi-Camera Vehicle Tracking Framework for Real-Time and Scalable Deployment阅读
arXiv cs.CV论文6 小时前Investigating Robot Control Policy Learning for Autonomous X-ray-guided Spine Procedures阅读
arXiv cs.CV论文6 小时前A solution to generalized learning from small training sets found in infant repeated visual experiences of individual objects阅读
arXiv cs.CV论文6 小时前Dynamic Weight-based Temporal Aggregation for Low-light Video Enhancement Under Extreme Noise阅读
arXiv cs.CV论文6 小时前A drone-based framework for coral habitat mapping via weakly supervised segmentation阅读
arXiv cs.CV论文6 小时前UniEmo: Unifying Emotional Understanding and Generation with Learnable Expert Queries阅读
arXiv cs.CV论文6 小时前Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models阅读
arXiv cs.CV论文6 小时前PixelPonder: Dynamic Patch Adaptation for Enhanced Multi-Conditional Text-to-Image Generation阅读
arXiv cs.CV论文6 小时前RT-NeRV: Rethinking Hybrid Neural Representations for Video via Residual Tokenization阅读