An Open-Source Two-Stage Computer Vision Pipeline for Fine-Grained Vehicle Classification using Vision Transformers
Affiliation: Department of Electrical and Computer Engineering, University of California, Los Angeles, CA, USA
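AI summary: Proposes a two-stage pipeline that combines an RT-DETR detector with a fine-tuned ViT-Base/16 classifier for six-class vehicle body-type classification, and introduces a confidence-based abstention mechanism. The pipeline reaches an accuracy of 0.94 on the in-distribution dataset and 0.89 on the out-of-distribution dataset.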
Comments: 24 pages, 10 figures, venue TBD
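
The confidence-based abstention named in the summary can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the use of the maximum softmax probability as the confidence score, and the threshold value of 0.7 are all assumptions made for illustration. The sketch takes the six class logits that a classifier such as the fine-tuned ViT-Base/16 might emit for one detected vehicle crop, and returns either a class index or `None` when the prediction is not confident enough.

```python
import math
from typing import Optional, Sequence, Tuple


def softmax(logits: Sequence[float]) -> list:
    """Convert raw logits to probabilities (numerically stable form)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify_with_abstention(
    logits: Sequence[float],
    threshold: float = 0.7,  # hypothetical value, not taken from the paper
) -> Tuple[Optional[int], float]:
    """Return (class_index, confidence), or (None, confidence) to abstain.

    Confidence is assumed here to be the maximum softmax probability;
    the paper may define it differently.
    """
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    confidence = probs[best]
    if confidence < threshold:
        return None, confidence  # abstain: top class is not confident enough
    return best, confidence


# One clearly dominant logit: the classifier commits to class 0.
confident_label, confident_score = classify_with_abstention([5.0, 0.0, 0.0, 0.0, 0.0, 0.0])

# Six equal logits: every class has probability 1/6, so the classifier abstains.
uncertain_label, uncertain_score = classify_with_abstention([0.0] * 6)
```

Raising the threshold trades coverage for accuracy: more crops are rejected, but the accepted predictions are more likely to be correct, which matters most on out-of-distribution inputs.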