Object detection

COCO

dataset capable to detect objects of 80 common classes

Object detection is a computer technology related to

video surveillance

.

Uses

It is widely used in

face recognition, video object co-segmentation. It is also used in tracking objects

, for example tracking a ball during a football match, tracking movement of a cricket bat, or tracking a person in a video.

Often, the test images are sampled from a different data distribution, making the object detection task significantly more difficult.[5] To address the challenges caused by the domain gap between training and test data, many unsupervised domain adaptation approaches have been proposed.^[5]^[6]^[7]^[8]^[9] A simple and straightforward solution of reducing the domain gap is to apply an image-to-image translation approach, such as cycle-GAN.^[10] Among other uses, cross-domain object detection is applied in autonomous driving, where models can be trained on a vast amount of video game scenes, since the labels can be generated without manual labor.

Concept

Every object class has its own special features that help in classifying the class – for example all circles are round. Object class detection uses these special features. For example, when looking for circles, objects that are at a particular distance from a point (i.e. the center) are sought. Similarly, when looking for squares, objects that are perpendicular at corners and have equal side lengths are needed. A similar approach is used for face identification where eyes, nose, and lips can be found and features like skin color and distance between eyes can be found.

Methods

false positive

result for sea urchin.
In reality, textures and outlines would not be represented by single nodes, but rather by associated weight patterns of multiple nodes.

Methods for object detection generally fall into either neural network-based or non-neural approaches. For non-neural approaches, it becomes necessary to first define features using one of the methods below, then using a technique such as support vector machine (SVM) to do the classification. On the other hand, neural techniques are able to do end-to-end object detection without specifically defining features, and are typically based on convolutional neural networks (CNN).

Non-neural approaches:
- Viola–Jones object detection framework based on Haar features
- Scale-invariant feature transform (SIFT)
- Histogram of oriented gradients (HOG) features^[12]
Neural network approaches:
- Region Proposals (R-CNN,^[13] Fast R-CNN,^[14] Faster R-CNN,^[15] cascade R-CNN.^[16])
- Single Shot MultiBox Detector (SSD) ^[17]
- Single-Shot Refinement Neural Network for Object Detection (RefineDet) ^[18]
- Retina-Net ^[19]^[16]
- Deformable convolutional networks ^[20]^[21]

References

^ Dasiopoulou, Stamatia, et al. "Knowledge-assisted semantic video object detection." IEEE Transactions on Circuits and Systems for Video Technology 15.10 (2005): 1210–1224.
ISBN 978-1-4398-3087-1
.

S2CID 233194604
.

^ Wu, Jianxin, et al. "A scalable approach to activity recognition based on object use." 2007 IEEE 11th international conference on computer vision. IEEE, 2007.

^
arXiv:2105.13502 [cs.CV
].

arXiv:1904.02361 [cs.LG
].

S2CID 208138033
.

S2CID 253251380
.

arXiv:2208.14662 [cs.CV
].

arXiv:1703.10593 [cs.CV
].

ISBN 1492671207.{{cite book}}: CS1 maint: multiple names: authors list (link
)

^ Dalal, Navneet (2005). "Histograms of oriented gradients for human detection" (PDF). Computer Vision and Pattern Recognition. 1.

S2CID 215827080
.

Bibcode:2015arXiv150408083G
.

arXiv:1506.01497
.

^
arXiv:1904.02701v1 [cs.CV
].

S2CID 2141740
.

Bibcode:2017arXiv171106897Z
.

S2CID 47252984
.

arXiv:1811.11168 [cs.CV
].

arXiv:1703.06211 [cs.CV
].

"Object Class Detection". Vision.eecs.ucf.edu. Archived from the original on 2013-07-14. Retrieved 2013-10-09.

"ETHZ – Computer Vision Lab: Publications". Vision.ee.ethz.ch. Archived from the original on 2013-06-03. Retrieved 2013-10-09.

External links

Multiple object class detection

Spatio-temporal action localization

Online Object Detection Demo

Video object detection and co-segmentation

Retrieved from "https://en.wikipedia.org/w/index.php?title=Object_detection&oldid=1186971351"

[1] Dasiopoulou, Stamatia, et al. "Knowledge-assisted semantic video object detection." IEEE Transactions on Circuits and Systems for Video Technology 15.10 (2005): 1210–1224.

[GuanHe2012-2] ISBN 978-1-4398-3087-1
.

[3] S2CID 233194604
.

[4] Wu, Jianxin, et al. "A scalable approach to activity recognition based on object use." 2007 IEEE 11th international conference on computer vision. IEEE, 2007.

[:0-5] 
arXiv:2105.13502 [cs.CV
].

[6] rXiv:1904.02361 [cs.LG
].

[7] S2CID 208138033
.

[8] S2CID 253251380
.

[9] rXiv:2208.14662 [cs.CV
].

[10] rXiv:1703.10593 [cs.CV
].

[11] ISBN 1492671207.{{cite book}}: CS1 maint: multiple names: authors list (link
)

[12] Dalal, Navneet (2005). "Histograms of oriented gradients for human detection" (PDF). Computer Vision and Pattern Recognition. 1.

[13] S2CID 215827080
.

[14] Bibcode:2015arXiv150408083G
.

[15] rXiv:1506.01497
.

[Pang_Chen_Shi_Feng_2019-16] 
arXiv:1904.02701v1 [cs.CV
].

[17] S2CID 2141740
.

[18] Bibcode:2017arXiv171106897Z
.

[19] S2CID 47252984
.

[20] rXiv:1811.11168 [cs.CV
].

[21] rXiv:1703.06211 [cs.CV
].

[5]

[6]

[7]

[8]

[9]

[10]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

Uses

Concept

Methods

See also

References

External links