Global Instance Tracking: Locating Target More Like Humans

Hu, Shiyu; Zhao, Xin; Huang, Lianghua; Huang, Kaiqi

doi:10.1109/TPAMI.2022.3153312

arXiv:2202.13073 (cs)

[Submitted on 26 Feb 2022]

Abstract: Target tracking, an essential ability of the human visual system, has been simulated by computer vision tasks. However, existing trackers perform well in austere experimental environments but fail under challenges such as occlusion and fast motion. This massive gap indicates that current research measures only tracking performance rather than intelligence. How can the intelligence level of trackers be judged scientifically? Unlike decision-making problems, tracking lacks three requirements (a challenging task, a fair environment, and a scientific evaluation procedure), which makes the question difficult to answer. In this article, we first propose the global instance tracking (GIT) task, which searches for an arbitrary user-specified instance in a video without any assumptions about camera or motion consistency, to model the human visual tracking ability. We then construct a high-quality, large-scale benchmark, VideoCube, to create a challenging environment. Finally, we design a scientific evaluation procedure that uses human capabilities as the baseline for judging tracking intelligence. Additionally, we provide an online platform with a toolkit and an updated leaderboard. Although the experimental results indicate a definite gap between trackers and humans, we expect this work to be a step toward authentic human-like trackers. The database, toolkit, evaluation server, and baseline results are available at this http URL.

Comments:	This paper is published in IEEE TPAMI (refer to DOI). Please cite the published IEEE TPAMI version.
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2202.13073 [cs.CV]
	(or arXiv:2202.13073v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2202.13073
Related DOI:	https://doi.org/10.1109/TPAMI.2022.3153312

Submission history

From: Xin Zhao
[v1] Sat, 26 Feb 2022 06:16:34 UTC (27,508 KB)
