Ідентифікація людей за фото та відео засобами Computer Vision

Гаврилов, Микола Вікторович; Havrylov, Mykola

Empreu aquest identificador per citar o enllaçar aquest ítem: http://elartu.tntu.edu.ua/handle/lib/39509

Registre complet de metadades

Camp DC	Valor	Lengua/Idioma
dc.contributor.advisor	Александер Марек, Богуслав Антонович	-
dc.contributor.advisor	Aleksander Marek, Bohuslav	-
dc.contributor.author	Гаврилов, Микола Вікторович	-
dc.contributor.author	Havrylov, Mykola	-
dc.date.accessioned	2022-12-23T10:37:04Z	-
dc.date.available	2022-12-23T10:37:04Z	-
dc.date.issued	2022-12-21	-
dc.identifier.citation	Гаврилов М. В. Ідентифікація людей за фото та відео засобами Computer Vision : кваліфікаційна робота магістра за спеціальністю „125 — кібербезпека“ / М. В. Гаврилов. — Тернопіль : ТНТУ, 2022. — 96 с.	uk_UA
dc.identifier.uri	http://elartu.tntu.edu.ua/handle/lib/39509	-
dc.description	Ідентифікація людей за фото та відео засобами Computer Vision // Кваліфікаційна робота освітнього рівня «Магістр» // Гаврилов Микола Вікторович // Тернопільський національний технічний університет імені Івана Пулюя, факультет комп’ютерно-інформаційних систем і програмної інженерії, кафедра кібербезпеки, група СБм-61 // Тернопіль, 2022	uk_UA
dc.description.abstract	У кваліфікаційній роботі розроблено інструмент для знаходження конкретної людини на фото та відео з використанням інструментів Data Science. Інформаційну систему, яка дає змогу обробляти фото чи відео, створювати свою базу даних, тренувати модель для конкретної людини і подальшого використання моделі для ідентифікації людини на різних камерах та локаціях. У першому розділі було проведено загальний огляд проблем виявлення людини на фото та відео та виконано аналіз наявних методик для повторної ідентифікації людини. В другому розділі було проведено дослідження наявних моделей для виявлення людини на фото та відео, проведена їх оцінка. Також, другий розділ містить більш детальний аналіз підходів для повторної ідентифікаціх людини. У третьому розділі програмно реалізовано мовою програмування Python інформаційної системи, з використанням наявних рішень та архітектур для виявлення та ідентифікації людини, таких репозиторіїв як YOLOv7 та Torchreid.	uk_UA
dc.description.abstract	In this qualification paper an information system that gives you an opportunity to find a specific person in a photo and video using Data Science tools was developed. The information system allows to process a photo or video, create your own database, train a model for a specific person and then use the model to identify a person on different cameras and locations. The first section includes a general overview of the problems of person detection in photos/videos and an analysis of existing methods for person re-identification. In the second section, existing models for person detection in photos/videos and model’s evaluation were researched. Also, the second section contains more detailed analysis of approaches for re-identification of a person. In the third chapter, the information system was implemented using Python programming language, using existing solutions and architectures for person detection and identification, such repositories as YOLOv7 and Torchreid.	uk_UA
dc.description.tableofcontents	ВСТУП ....10 РОЗДІЛ 1. ЗАГАЛЬНИЙ ОГЛЯД OD ТА RE-ID В СФЕРІ DS....12 1.1 Виявлення об’єктів у відкритому світі....12 1.2 Повторне виявлення об’єктів....16 РОЗДІЛ 2. ОБГРУНТУВАННЯ ВИБОРУ МОДЕЛІ ТА ПДІХОДІВ....22 2.1 Порівняльний аналіз архітектур моделей для Object Detection....22 2.1.1 SSD....23 2.1.2 Faster R-CNN ....24 2.1.3 YOLOv3 ....27 2.1.4 Порівняння швидкодії моделей YOLO ....28 2.1.5 Обгрунтування вибору моделі від YOLO....34 2.2 Порівняння підходів для ReID особи....35 2.2.1 ReID особи....36 2.2.2 Підсумок досліджених підходів....41 2.2.3 Обгрунтування вибору підходу....43 РОЗДІЛ 3. ПРАКТИЧНА РЕАЛІЗАЦІЯ....45 3.1 Підготовка середовища....45 3.2 Особливості встановлення Torchreid....47 3.3 Налаштування YOLOv7....50 3.4 Створення набору даних для тренування та оцінювання....52 3.5 Тренування Torchreid....54 3.5 Оцінка моделей....55 РОЗДІЛ 4. ОХОРОНА ПРАЦІ ТА БЕЗПЕКА В НАДЗВИЧАЙНИХ СИТУАЦІЯХ....58 4.1 Охорона праці....58 4.1.1 Обов’язки роботодавця....59 4.1.2 Обов’язки працівників....60 4.2 Вплив виробничого середовища на здоров'я та працездатність користувачів комп'ютерів....61 ВИСНОВОК....64 ПЕРЕЛІК використаних джерел....65 Додаток А – Лістинг файлу config.yaml....71 Додаток Б – Лістинг файлу generate_detections.py....73 Додаток В – Лістинг файлу yolov7_detector.py....76 Додаток Г – Лістинг файлу process_separated_cluster.py....84 Додаток Д – Лістинг файлу train_dataset.py....86 Додаток Е – Лістинг файлу train.py....88 Додаток Є – Лістинг файлу evaluate.py....90 Додаток Ж – Лістинг файлу utils_reid.py....94 Додаток З – Тези наукової конференції....95	uk_UA
dc.format.extent	96	-
dc.language.iso	uk	uk_UA
dc.publisher	ТНТУ	uk_UA
dc.subject	125	uk_UA
dc.subject	кібербезпека	uk_UA
dc.subject	наука про дані	uk_UA
dc.subject	наука про дані	uk_UA
dc.subject	модель	uk_UA
dc.subject	виявлення людини	uk_UA
dc.subject	ідентифікація людини	uk_UA
dc.subject	повторна ідентифікація людини	uk_UA
dc.subject	data science	uk_UA
dc.subject	architecture	uk_UA
dc.subject	model	uk_UA
dc.subject	person detection	uk_UA
dc.subject	person identification	uk_UA
dc.subject	person re-id	uk_UA
dc.title	Ідентифікація людей за фото та відео засобами Computer Vision	uk_UA
dc.title.alternative	Identification of people by photo and video using Computer Vision tools	uk_UA
dc.type	Master Thesis	uk_UA
dc.rights.holder	© Гаврилов Микола Вікторович, 2022	uk_UA
dc.contributor.committeeMember	Нікітюк, Вячеслав Вячеславович	-
dc.contributor.committeeMember	Nikitiuk, Vyachesval	-
dc.coverage.placename	Тернопіль	uk_UA
dc.relation.references	1. Ross Girshick. Fast r-cnn. In Proceedings of the IEEE international conference on computer vision, pages 1440–1448, 2015. 1, 2	uk_UA
dc.relation.references	2. John A Meacham. Wisdom and the context of knowledge: Knowing that one doesn’t know. On the development of developmental psychology, 8:111–134, 1983. 1	uk_UA
dc.relation.references	3. Susan Engel. Children’s need to know: Curiosity in schools. Harvard educational review, 81(4):625–645, 2011. 1	uk_UA
dc.relation.references	4. Walter J Scheirer, Anderson de Rezende Rocha, Archana Sapkota, and Terrance E Boult. Toward open set recognition. IEEE transactions on pattern analysis and machine intelligence, 35(7):1757–1772, 2012. 1	uk_UA
dc.relation.references	5. Abhijit Bendale and Terrance Boult. Towards open world recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 1893–1902, 2015. 1, 2	uk_UA
dc.relation.references	6. Dimity Miller, Lachlan Nicholson, Feras Dayoub, and Niko Sunderhauf. Dropout sampling for robust object detection in ¨ open-set conditions. In 2018 IEEE International Conference on Robotics and Automation (ICRA), pages 1–7. IEEE, 2018. 2, 6, 8	uk_UA
dc.relation.references	7. Walter J Scheirer, Anderson de Rezende Rocha, Archana Sapkota, and Terrance E Boult. Toward open set recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 7(35):1757–1772, 2013. 2	uk_UA
dc.relation.references	8. Abhijit Bendale and Terrance E Boult. Towards open set deep networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 1563–1572, 2016. 2	uk_UA
dc.relation.references	9. Zongyuan Ge, Sergey Demyanov, Zetao Chen, and Rahil Garnavi. Generative openmax for multi-class open set classification. In British Machine Vision Conference 2017. British Machine Vision Association and Society for Pattern Recognition, 2017. 2	uk_UA
dc.relation.references	10. Ziwei Liu, Zhongqi Miao, Xiaohang Zhan, Jiayun Wang, Boqing Gong, and Stella X Yu. Large-scale long-tailed recognition in an open world. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 2537–2546, 2019. 2	uk_UA
dc.relation.references	11. Federico Pernici, Federico Bartoli, Matteo Bruni, and Alberto Del Bimbo. Memory based online learning of deep representations from video streams. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 2324–2334, 2018. 2	uk_UA
dc.relation.references	12. Hu Xu, Bing Liu, Lei Shu, and P Yu. Open-world learning and application to product classification. In The World Wide Web Conference, pages 3413–3419, 2019. 2	uk_UA
dc.relation.references	13. Akshay Dhamija, Manuel Gunther, Jonathan Ventura, and Terrance Boult. The overlooked elephant of object detection: Open set. In The IEEE Winter Conference on Applications of Computer Vision, pages 1021–1030, 2020. 2, 3, 6	uk_UA
dc.relation.references	14. David Hall, Feras Dayoub, John Skinner, Haoyang Zhang, Dimity Miller, Peter Corke, Gustavo Carneiro, Anelia Angelova, and Niko Sunderhauf. Probabilistic object detection: Definition and evaluation. In The IEEE Winter Conference on Applications of Computer Vision, pages 1031–1040, 2020. 2	uk_UA
dc.relation.references	15. Yarin Gal and Zoubin Ghahramani. Dropout as a bayesian approximation: Representing model uncertainty in deep learning. In international conference on machine learning, pages 1050–1059, 2016. 2	uk_UA
dc.relation.references	16. Yifan Sun, Liang Zheng, Yi Yang, Qi Tian, and Shengjin Wang. Beyond part models: Person retrieval with refined part pooling (and a strong convolutional baseline). In ECCV, pages 480–496, 2018. 1, 2	uk_UA
dc.relation.references	17. Guanshuo Wang, Yufeng Yuan, Xiong Chen, Jiwei Li, and Xi Zhou. Learning discriminative features with multiple granularities for person re-identification. In ACMMM, pages 274–282, 2018. 1, 2, 3, 8	uk_UA
dc.relation.references	18. Hao Luo, Youzhi Gu, Xingyu Liao, Shenqi Lai, and Wei Jiang. Bag of tricks and a strong baseline for deep person re-identification. In CVPRW, pages 0–0, 2019. 1, 2, 3	uk_UA
dc.relation.references	19. Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, and Xiaohua et al. Zhai. An image is worth 16x16 words: Transformers for image recognition at scale. ICLR, 2021. 2, 3	uk_UA
dc.relation.references	20. Hugo Touvron, Matthieu Cord, Matthijs Douze, Francisco Massa, Alexandre Sablayrolles, and Herve J ´ egou. Training ´ data-efficient image transformers & distillation through attention. arXiv preprint arXiv:2012.12877, 2020. 2, 3	uk_UA
dc.relation.references	21. Hao Luo, Wei Jiang, Xuan Zhang, Xing Fan, Jingjing Qian, and Chi Zhang. Alignedreid++: Dynamically matching local information for person re-identification. Pattern Recognition, 94:53–61, 2019. 2	uk_UA
dc.relation.references	22. Dechao Meng, Liang Li, Xuejing Liu, Yadong Li, Shijie Yang, Zheng-Jun Zha, Xingyu Gao, Shuhui Wang, and Qingming Huang. Parsing-based view-aware embedding network for vehicle re-identification. In CVPR, pages 7103– 7112, 2020. 2, 3, 8	uk_UA
dc.relation.references	23. Mang Ye, Jianbing Shen, Gaojie Lin, Tao Xiang, Ling Shao, and Steven CH Hoi. Deep learning for person reidentification: A survey and outlook. IEEE TPAMI, 2021. 2	uk_UA
dc.relation.references	24. Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recognition. In CVPR, pages 770–778, 2016. 2	uk_UA
dc.relation.references	25. Zhedong Zheng, Liang Zheng, and Yi Yang. A discriminatively learned cnn embedding for person reidentification. ACM TOMM, 14(1):13, 2018. 2	uk_UA
dc.relation.references	26. Hao Liu, Jiashi Feng, Meibin Qi, Jianguo Jiang, and Shuicheng Yan. End-to-end comparative attention networks for person re-identification. IEEE TIP, 26(7):3492–3506, 2017. 2	uk_UA
dc.relation.references	27. Jingjing Qian, Wei Jiang, Hao Luo, and Hongyan Yu. Stripebased and attribute-aware network: A two-branch deep model for vehicle re-identification. Measurement Science and Technology, 31(9):095401, 2020. 3, 8	uk_UA
dc.relation.references	28. Jiaxu Miao, Yu Wu, Ping Liu, Yuhang Ding, and Yi Yang. Pose-guided feature alignment for occluded person re-identification. In ICCV, pages 542–551, 2019. 2, 3, 5, 8	uk_UA
dc.relation.references	29. Zhihui Zhu, Xinyang Jiang, Feng Zheng, Xiaowei Guo, Feiyue Huang, Xing Sun, and Weishi Zheng. Aware loss with angular regularization for person re-identification. In AAAI, volume 34, pages 13114–13121, 2020. 3	uk_UA
dc.relation.references	30. Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. Attention is all you need. In NeurIPS, pages 6000–6010, 2017. 3	uk_UA
dc.relation.references	31. Salman Khan, Muzammal Naseer, Munawar Hayat, Syed Waqas Zamir, Fahad Shahbaz Khan, and Mubarak Shah. Transformers in vision: A survey. arXiv preprint arXiv:2101.01169, 2021. 3	uk_UA
dc.relation.references	32. Hanting Chen, Yunhe Wang, Tianyu Guo, Chang Xu, Yiping Deng, Zhenhua Liu, Siwei Ma, Chunjing Xu, Chao Xu, and Wen Gao. Pre-trained image processing transformer. CVPR, 2021. 3	uk_UA
dc.relation.references	33. Ren S, He K, Girshick R, Sun J. Faster r-cnn: Towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell. 2016;39(6):1137–49.	uk_UA
dc.relation.references	34. Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC. Ssd: single shot multibox detector. In: European conference on computer vision. Cham: Springer; 2016, p. 21–37.	uk_UA
dc.relation.references	35. Alganci U, Soydas M, Sertel E. Comparative research on deep learning approaches for airplane detection from very high-resolution satellite images. Remote Sensing. 2020;12(3):458.	uk_UA
dc.relation.references	36. Reza Z. N. (2019). Real-time automated weld quality analysis from ultrasonic B-scan using deep learning (Doctoral dissertation, University of Windsor (Canada)).	uk_UA
dc.relation.references	37. Shen X, Wu Y. A unified approach to salient object detection via low rank matrix recovery. In: 2012 IEEE conference on computer vision and pattern recognition. IEEE; 2012, p. 853–60.	uk_UA
dc.relation.references	38. Madden C., Cheng E. D., Piccardi M., 2007. Tracking people across disjoint camera views by an illumination tolerant appearance representation. Journal of Machine Vision and Applications 18(3), pp. 233–247.	uk_UA
dc.relation.references	39. Colombo A., Orwell J., and Velastin S., 2008. Colour constan-cy techniques for re-recognition of pedestrians from mul-tiple surveillance cameras. In Workshop on Multi-camera and Multi-modal Sensor Fusion Algorithms and Applications, 2008, Marseille, France	uk_UA
dc.relation.references	40. Truong Congl D.-N., Khoudour L., Achard C., Meurie C., Lezoray O., 2010. People re-identification by spectral classification of silhouettes. Signal Processing, vol. 90, no. 8, pp 2362-2374.	uk_UA
dc.relation.references	41. Javed O., Shafique K., Shah M., 2005. Appearance modeling for tracking in multiple non-overlapping cameras. In IEEE Conf. on Computer Vision and Pattern Recognition, vol. 2, pp. 26-33.	uk_UA
dc.relation.references	42. Jeong K., Jaynes C., 2008. Object matching in disjoint cameras using a colour transfer approach. Journal of Machine Vision and Applications, 19(5), pp. 88–96	uk_UA
dc.relation.references	43. Bäuml M., Bernardin K., Fischer M.,. Ekenel H. K, 2010. Multi-pose face recognition for person retrieval in camera networks. In IEEE 7 th Int. Conf. on Advanced Video and Signal Based Surveillance, pp. 441-447.	uk_UA
dc.relation.references	44. Dantcheva A.,. Dugelay J. L, 2011. Frontal-to-side face reidentification based on hair, skin and clothes patches. In IEEE Int. Conf. on Advanced Video and SignalBased Surveillance, pp. 309-313	uk_UA
dc.relation.references	45. Roy A., Sural S., Mukherjee J., 2012. A hierarchical method combining gait and phase of motion with spatiotemporal model for person re-identification. In Pattern Recognition Letters, 33(14), pp. 1891-1901.	uk_UA
dc.relation.references	46. Cai Q. and Aggarwal J.K., 1999. Tracking Human Motion Using a Distributed-Camera System. In IEEE Trans. On PAMI., 21(12), pp. 1241-1247	uk_UA
dc.relation.references	47. Mittal A., Davis, L. 2003. M2 tracker: a multi-view approach to segmenting and tracking people in a cluttered scene. Int. J. Comput. Vis. 51(3), pp.189–203	uk_UA
dc.relation.references	48. Satta R., Fumera G., Roli F., 2012. Fast person reidentification based on dissimilarity representations. In Pattern Recognition Letters, 33(14), pp. 1838-1848.	uk_UA
dc.relation.references	49. Visual Studio Code on Windows [Електронний ресурс]. – 2022. – Режим доступу до ресурсу: https://code.visualstudio.com/docs/setup/windows.	uk_UA
dc.relation.references	50. How to Install WSL 2 on Windows 10 (Updated) [Електронний ресурс]. – 2022. – Режим доступу до ресурсу: https://www.omgubuntu.co.uk/how-to-install-wsl2-on-windows-10	uk_UA
dc.relation.references	51. Simple Python Version Management: pyenv [Електронний ресурс] – Режим доступу до ресурсу: https://github.com/pyenv/pyenv.	uk_UA
dc.relation.references	52. Torchreid [Електронний ресурс] – Режим доступу до ресурсу: https://github.com/KaiyangZhou/deep-person-reid.	uk_UA
dc.relation.references	53. Official YOLOv7 [Електронний ресурс] – Режим доступу до ресурсу: https://github.com/WongKinYiu/yolov7.	uk_UA
dc.contributor.affiliation	Тернопільський національний технічний університет імені Івана Пулюя	uk_UA
dc.coverage.country	UA	uk_UA
Apareix a les col·leccions:	125 — кібербезпека, Кібербезпека та захист інформації

Arxius per aquest ítem:

Arxiu	Descripció	Mida	Format
Dyplom_Havrylov_M_V_2022.pdf	Кваліфікаційна робота магістра	2,89 MB	Adobe PDF	Veure/Obrir
avtorska_dov_Havrylov_M_V_2022.pdf	Авторська довідка	663,88 kB	Adobe PDF	Veure/Obrir

Mostrar el registre simplificat de l'ítem

Els ítems de DSpace es troben protegits per copyright, amb tots els drets reservats, sempre i quan no s’indiqui el contrari.

Eines d'Administrador