Multiscale spectral residue for faster image object detection
| Main Author: | |
|---|---|
| Publication Date: | 2013 |
| Format: | Master thesis |
| Language: | por |
| Source: | Repositório Institucional da UFBA |
| Download full: | http://www.repositorio.ufba.br/ri/handle/ri/13203 |
Summary: | Accuracy in image object detection has been usually achieved at the expense of much computational load. Therefore a trade-o between detection performance and fast execution commonly represents the ultimate goal of an object detector in real life applications. Most images are composed of non-trivial amounts of background nformation, such as sky, ground and water. In this sense, using an object detector against a recurring background pattern can require a signi cant amount of the total processing time. To alleviate this problem, search space reduction methods can help focusing the detection procedure on more distinctive image regions. Among the several approaches for search space reduction, we explored saliency information to organize regions based on their probability of containing objects. Saliency detectors are capable of pinpointing regions which generate stronger visual stimuli based solely on information extracted from the image. The fact that saliency methods do not require prior training is an important bene t, which allows application of these techniques in a broad range of machine vision domains. We propose a novel method toward the goal of faster object detectors. The proposed method was grounded on a multi-scale spectral residue (MSR) analysis using saliency detection. For better search space reduction, our method enables ne control of search scale, more robustness to variations on saliency intensity along an object length and also a direct way to control the balance between search space reduction and false negatives caused by region selection. Compared to a regular sliding window search over the images, in our experiments, MSR was able to reduce by 75% (in average) the number of windows to be evaluated by an object detector while improving or at least maintaining detector ROC performance. The proposed method was thoroughly evaluated over a subset of LabelMe dataset (person images), improving detection performance in most cases. This evaluation was done comparing object detection performance against di erent object detectors, with and without MSR. Additionally, we also provide evaluation of how di erent object classes interact with MSR, which was done using Pascal VOC 2007 dataset. Finally, tests made showed that window selection performance of MSR has a good scalability with regard to image size. From the obtained data, our conclusion is that MSR can provide substantial bene ts to existing sliding window detectors. |
| id |
UFBA-2_ec202d5e4fb38bf80a94eb64bb1f31ba |
|---|---|
| oai_identifier_str |
oai:repositorio.ufba.br:ri/13203 |
| network_acronym_str |
UFBA-2 |
| network_name_str |
Repositório Institucional da UFBA |
| repository_id_str |
1932 |
| spelling |
Silva Filho, José Grimaldo daSilva Filho, José Grimaldo daOliveira, Luciano Rebouças de2013-10-11T19:14:00Z2013-10-11T19:14:00Z2013-10-11http://www.repositorio.ufba.br/ri/handle/ri/13203Accuracy in image object detection has been usually achieved at the expense of much computational load. Therefore a trade-o between detection performance and fast execution commonly represents the ultimate goal of an object detector in real life applications. Most images are composed of non-trivial amounts of background nformation, such as sky, ground and water. In this sense, using an object detector against a recurring background pattern can require a signi cant amount of the total processing time. To alleviate this problem, search space reduction methods can help focusing the detection procedure on more distinctive image regions. Among the several approaches for search space reduction, we explored saliency information to organize regions based on their probability of containing objects. Saliency detectors are capable of pinpointing regions which generate stronger visual stimuli based solely on information extracted from the image. The fact that saliency methods do not require prior training is an important bene t, which allows application of these techniques in a broad range of machine vision domains. We propose a novel method toward the goal of faster object detectors. The proposed method was grounded on a multi-scale spectral residue (MSR) analysis using saliency detection. For better search space reduction, our method enables ne control of search scale, more robustness to variations on saliency intensity along an object length and also a direct way to control the balance between search space reduction and false negatives caused by region selection. Compared to a regular sliding window search over the images, in our experiments, MSR was able to reduce by 75% (in average) the number of windows to be evaluated by an object detector while improving or at least maintaining detector ROC performance. The proposed method was thoroughly evaluated over a subset of LabelMe dataset (person images), improving detection performance in most cases. This evaluation was done comparing object detection performance against di erent object detectors, with and without MSR. Additionally, we also provide evaluation of how di erent object classes interact with MSR, which was done using Pascal VOC 2007 dataset. Finally, tests made showed that window selection performance of MSR has a good scalability with regard to image size. From the obtained data, our conclusion is that MSR can provide substantial bene ts to existing sliding window detectors.Submitted by LIVIA FREITAS (livia.freitas@ufba.br) on 2013-10-11T12:05:23Z No. of bitstreams: 1 dissertacao_mestrado_jose-grimaldo.pdf: 19406681 bytes, checksum: d108855fa0fb0d44ee5d1cb59579a04c (MD5)Approved for entry into archive by LIVIA FREITAS(livia.freitas@ufba.br) on 2013-10-11T19:14:00Z (GMT) No. of bitstreams: 1 dissertacao_mestrado_jose-grimaldo.pdf: 19406681 bytes, checksum: d108855fa0fb0d44ee5d1cb59579a04c (MD5)Made available in DSpace on 2013-10-11T19:14:00Z (GMT). No. of bitstreams: 1 dissertacao_mestrado_jose-grimaldo.pdf: 19406681 bytes, checksum: d108855fa0fb0d44ee5d1cb59579a04c (MD5)SalvadorVisão por computadorProcessamento de imagensMultiscale spectral residue for faster image object detectioninfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisporreponame:Repositório Institucional da UFBAinstname:Universidade Federal da Bahia (UFBA)instacron:UFBAinfo:eu-repo/semantics/openAccessLICENSElicense.txtlicense.txttext/plain1365https://repositorio.ufba.br/bitstream/ri/13203/2/license.txt5371a150bdc863f78dcf39281543bd86MD52ORIGINALdissertacao_mestrado_jose-grimaldo.pdfdissertacao_mestrado_jose-grimaldo.pdfapplication/pdf19406681https://repositorio.ufba.br/bitstream/ri/13203/1/dissertacao_mestrado_jose-grimaldo.pdfd108855fa0fb0d44ee5d1cb59579a04cMD51TEXTdissertacao_mestrado_jose-grimaldo.pdf.txtdissertacao_mestrado_jose-grimaldo.pdf.txtExtracted texttext/plain208826https://repositorio.ufba.br/bitstream/ri/13203/3/dissertacao_mestrado_jose-grimaldo.pdf.txte0d792610b0ba093db14db46d161ebaaMD53ri/132032022-07-05 14:03:55.354oai:repositorio.ufba.br:ri/13203VGVybW8gZGUgTGljZW7vv71hLCBu77+9byBleGNsdXNpdm8sIHBhcmEgbyBkZXDvv71zaXRvIG5vIFJlcG9zaXTvv71yaW8gSW5zdGl0dWNpb25hbCBkYSBVRkJBLg0KDQogUGVsbyBwcm9jZXNzbyBkZSBzdWJtaXNz77+9byBkZSBkb2N1bWVudG9zLCBvIGF1dG9yIG91IHNldSByZXByZXNlbnRhbnRlIGxlZ2FsLCBhbyBhY2VpdGFyIA0KZXNzZSB0ZXJtbyBkZSBsaWNlbu+/vWEsIGNvbmNlZGUgYW8gUmVwb3NpdO+/vXJpbyBJbnN0aXR1Y2lvbmFsIGRhIFVuaXZlcnNpZGFkZSBGZWRlcmFsIGRhIEJhaGlhIA0KbyBkaXJlaXRvIGRlIG1hbnRlciB1bWEgY++/vXBpYSBlbSBzZXUgcmVwb3NpdO+/vXJpbyBjb20gYSBmaW5hbGlkYWRlLCBwcmltZWlyYSwgZGUgcHJlc2VydmHvv73vv71vLiANCkVzc2VzIHRlcm1vcywgbu+/vW8gZXhjbHVzaXZvcywgbWFudO+/vW0gb3MgZGlyZWl0b3MgZGUgYXV0b3IvY29weXJpZ2h0LCBtYXMgZW50ZW5kZSBvIGRvY3VtZW50byANCmNvbW8gcGFydGUgZG8gYWNlcnZvIGludGVsZWN0dWFsIGRlc3NhIFVuaXZlcnNpZGFkZS4NCg0KIFBhcmEgb3MgZG9jdW1lbnRvcyBwdWJsaWNhZG9zIGNvbSByZXBhc3NlIGRlIGRpcmVpdG9zIGRlIGRpc3RyaWJ1ae+/ve+/vW8sIGVzc2UgdGVybW8gZGUgbGljZW7vv71hIA0KZW50ZW5kZSBxdWU6DQoNCiBNYW50ZW5kbyBvcyBkaXJlaXRvcyBhdXRvcmFpcywgcmVwYXNzYWRvcyBhIHRlcmNlaXJvcywgZW0gY2FzbyBkZSBwdWJsaWNh77+977+9ZXMsIG8gcmVwb3NpdO+/vXJpbw0KcG9kZSByZXN0cmluZ2lyIG8gYWNlc3NvIGFvIHRleHRvIGludGVncmFsLCBtYXMgbGliZXJhIGFzIGluZm9ybWHvv73vv71lcyBzb2JyZSBvIGRvY3VtZW50bw0KKE1ldGFkYWRvcyBlc2NyaXRpdm9zKS4NCg0KIERlc3RhIGZvcm1hLCBhdGVuZGVuZG8gYW9zIGFuc2Vpb3MgZGVzc2EgdW5pdmVyc2lkYWRlIGVtIG1hbnRlciBzdWEgcHJvZHXvv73vv71vIGNpZW5077+9ZmljYSBjb20gDQphcyByZXN0cmnvv73vv71lcyBpbXBvc3RhcyBwZWxvcyBlZGl0b3JlcyBkZSBwZXJp77+9ZGljb3MuDQoNCiBQYXJhIGFzIHB1YmxpY2Hvv73vv71lcyBzZW0gaW5pY2lhdGl2YXMgcXVlIHNlZ3VlbSBhIHBvbO+/vXRpY2EgZGUgQWNlc3NvIEFiZXJ0bywgb3MgZGVw77+9c2l0b3MgDQpjb21wdWxz77+9cmlvcyBuZXNzZSByZXBvc2l077+9cmlvIG1hbnTvv71tIG9zIGRpcmVpdG9zIGF1dG9yYWlzLCBtYXMgbWFudO+/vW0gYWNlc3NvIGlycmVzdHJpdG8gDQphbyBtZXRhZGFkb3MgZSB0ZXh0byBjb21wbGV0by4gQXNzaW0sIGEgYWNlaXRh77+977+9byBkZXNzZSB0ZXJtbyBu77+9byBuZWNlc3NpdGEgZGUgY29uc2VudGltZW50bw0KIHBvciBwYXJ0ZSBkZSBhdXRvcmVzL2RldGVudG9yZXMgZG9zIGRpcmVpdG9zLCBwb3IgZXN0YXJlbSBlbSBpbmljaWF0aXZhcyBkZSBhY2Vzc28gYWJlcnRvLg==Repositório InstitucionalPUBhttps://repositorio.ufba.br/oai/requestrepositorio@ufba.bropendoar:19322022-07-05T17:03:55Repositório Institucional da UFBA - Universidade Federal da Bahia (UFBA)false |
| dc.title.pt_BR.fl_str_mv |
Multiscale spectral residue for faster image object detection |
| title |
Multiscale spectral residue for faster image object detection |
| spellingShingle |
Multiscale spectral residue for faster image object detection Silva Filho, José Grimaldo da Visão por computador Processamento de imagens |
| title_short |
Multiscale spectral residue for faster image object detection |
| title_full |
Multiscale spectral residue for faster image object detection |
| title_fullStr |
Multiscale spectral residue for faster image object detection |
| title_full_unstemmed |
Multiscale spectral residue for faster image object detection |
| title_sort |
Multiscale spectral residue for faster image object detection |
| author |
Silva Filho, José Grimaldo da |
| author_facet |
Silva Filho, José Grimaldo da |
| author_role |
author |
| dc.contributor.author.fl_str_mv |
Silva Filho, José Grimaldo da Silva Filho, José Grimaldo da |
| dc.contributor.advisor1.fl_str_mv |
Oliveira, Luciano Rebouças de |
| contributor_str_mv |
Oliveira, Luciano Rebouças de |
| dc.subject.por.fl_str_mv |
Visão por computador Processamento de imagens |
| topic |
Visão por computador Processamento de imagens |
| description |
Accuracy in image object detection has been usually achieved at the expense of much computational load. Therefore a trade-o between detection performance and fast execution commonly represents the ultimate goal of an object detector in real life applications. Most images are composed of non-trivial amounts of background nformation, such as sky, ground and water. In this sense, using an object detector against a recurring background pattern can require a signi cant amount of the total processing time. To alleviate this problem, search space reduction methods can help focusing the detection procedure on more distinctive image regions. Among the several approaches for search space reduction, we explored saliency information to organize regions based on their probability of containing objects. Saliency detectors are capable of pinpointing regions which generate stronger visual stimuli based solely on information extracted from the image. The fact that saliency methods do not require prior training is an important bene t, which allows application of these techniques in a broad range of machine vision domains. We propose a novel method toward the goal of faster object detectors. The proposed method was grounded on a multi-scale spectral residue (MSR) analysis using saliency detection. For better search space reduction, our method enables ne control of search scale, more robustness to variations on saliency intensity along an object length and also a direct way to control the balance between search space reduction and false negatives caused by region selection. Compared to a regular sliding window search over the images, in our experiments, MSR was able to reduce by 75% (in average) the number of windows to be evaluated by an object detector while improving or at least maintaining detector ROC performance. The proposed method was thoroughly evaluated over a subset of LabelMe dataset (person images), improving detection performance in most cases. This evaluation was done comparing object detection performance against di erent object detectors, with and without MSR. Additionally, we also provide evaluation of how di erent object classes interact with MSR, which was done using Pascal VOC 2007 dataset. Finally, tests made showed that window selection performance of MSR has a good scalability with regard to image size. From the obtained data, our conclusion is that MSR can provide substantial bene ts to existing sliding window detectors. |
| publishDate |
2013 |
| dc.date.accessioned.fl_str_mv |
2013-10-11T19:14:00Z |
| dc.date.available.fl_str_mv |
2013-10-11T19:14:00Z |
| dc.date.issued.fl_str_mv |
2013-10-11 |
| dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
| dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
| format |
masterThesis |
| status_str |
publishedVersion |
| dc.identifier.uri.fl_str_mv |
http://www.repositorio.ufba.br/ri/handle/ri/13203 |
| url |
http://www.repositorio.ufba.br/ri/handle/ri/13203 |
| dc.language.iso.fl_str_mv |
por |
| language |
por |
| dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
| eu_rights_str_mv |
openAccess |
| dc.source.none.fl_str_mv |
reponame:Repositório Institucional da UFBA instname:Universidade Federal da Bahia (UFBA) instacron:UFBA |
| instname_str |
Universidade Federal da Bahia (UFBA) |
| instacron_str |
UFBA |
| institution |
UFBA |
| reponame_str |
Repositório Institucional da UFBA |
| collection |
Repositório Institucional da UFBA |
| bitstream.url.fl_str_mv |
https://repositorio.ufba.br/bitstream/ri/13203/2/license.txt https://repositorio.ufba.br/bitstream/ri/13203/1/dissertacao_mestrado_jose-grimaldo.pdf https://repositorio.ufba.br/bitstream/ri/13203/3/dissertacao_mestrado_jose-grimaldo.pdf.txt |
| bitstream.checksum.fl_str_mv |
5371a150bdc863f78dcf39281543bd86 d108855fa0fb0d44ee5d1cb59579a04c e0d792610b0ba093db14db46d161ebaa |
| bitstream.checksumAlgorithm.fl_str_mv |
MD5 MD5 MD5 |
| repository.name.fl_str_mv |
Repositório Institucional da UFBA - Universidade Federal da Bahia (UFBA) |
| repository.mail.fl_str_mv |
repositorio@ufba.br |
| _version_ |
1847339133897277440 |