Surveillance video processing requires high efficiency, given its large datasets, demands significant resources for timely and effective analysis. This study aims to enhance surveillance systems by developing an automated method for extracting key events from outdoor surveillance videos. The proposed model comprises four phases: preprocessing and feature extraction, training and testing, and validation. Before utilizing a convolution neural networks approach to extract features from videos, the videos are pre-processed. Events classification uses gated recurrent units. In validation, motions and objects are extraction then feature extraction. Results show satisfactory performance, achieving 79% accuracy in events classification, highlighting the effectiveness of the methodology in identifying significant outdoor events.
J. Almeida, N. J. Leite, and R. D. S. Torres, “VISON: VIdeo Summarization for ONline applications,” Pattern Recognit Lett., vol. 33, no. 4, pp. 397–409, 2012.
Q. Xu, X. Li, J. Wang, R. Scopigno, Y. Liu, Z. Yang, and M. Sbert, “Browsing and exploration of video sequences: A new scheme for key frame extraction and 3D visualization using entropy based Jensen divergence,” Inf. Sci., vol. 278, pp. 736–756, 2014.
S. Mei, G. Guan, Z. Wang, S. Wan, M. He, and D. D. Feng, “Video summarization via minimum sparse reconstruction,” Pattern Recognit, vol. 48, no. 2, pp. 522–533, 2015.
M. A. Wahsh and Z. M. Hussain, “Irregular boundaries stereo images dataset creating using depth estimation model,” Pollack Period., vol. 19, no. 1, pp. 143–150, 2024.
G. B. Martins, J. P. Papa, and J. Almeida, “Temporal-and spatial-driven video summarization using optimum-path forest,” in 29th SIBGRAPI Conference on Graphics, Patterns and Images, Sao Paulo, Brazil, October 4–7, 2016, pp. 335–339.
L. dos S. Belo, C. A. Caetano, Jr, Z. K. G. do Patrocínio, and S. J. F. Guimarães, “Summarizing video sequence using a graph-based hierarchical approach,” Neurocomputing, vol. 173, pp. 1001–1016, 2016.
O. Elharrouss, N. Almaadeed, S. Al-Maadeed, A. Bouridane, and A. Beghdadi, “A combined multiple action recognition and summarization for surveillance video sequences,” Appl. Intelligence, vol. 51, no. 2, pp. 690–712, 2021.
S. Brassai and L. Bakó, “Visual trajectory control of a mobile robot using FPGA implemented neural network,” Pollack Period., vol. 4, no. 3, pp. 129–142, 2009.
P. Kaur and R. Kumar, “Analysis of video summarization techniques,” Int. J. Res. Appl. Sci. Eng. Technol., vol. 6, no. 1, pp. 1157–1162, 2018.
A. A. A. Karim and R. A. Sameer, “Static and dynamic video summarization,” Iraqi J. Sci., vol. 60, no. 7, pp. 1627–1638, 2019.
T. Chavan, V. Patil, P. Rokade, and S. Dholay, “Superintendence video summarization,” in International Conference on Emerging Trends in Information Technology and Engineering, Vellore, India, February 24–25, 2020, 2020, pp. 1–7.
V. Tiwari and C. Bhatnagar, “A survey of recent work on video summarization: approaches and techniques,” Multimed Tools Appl., vol. 80, no. 18, pp. 27187–27221, 2021.
M. U. Sreeja and B. C. Kovoor, “Towards genre-specific frameworks for video summarisation: A survey,” J. Vis. Commun. Image Represent, vol. 62, pp. 340–358, 2019.
S. M. Tirupathamma, “Key frame based video summarization using frame difference,” Int. J. Innovative Computer Sci. Eng., vol. 4, no. 3, pp. 160–165, 2017.
O. Elharrouss, N. Al-Maadeed, and S. Al-Maadeed, “Video summarization based on motion detection for surveillance systems,” in 15th International Wireless Communications and Mobile Computing Conference, Tangier, Morocco, June 24–28, 2019, pp. 366–371.
T. M. Moses and K. Balachandran, “A deterministic key-frame indexing and selection for surveillance video summarization,” in International Conference on Data Science and Communication, Bangalore, India, March 1–2, 2019, pp. 1–5.
R. Hannane, A. Elboushaki, and K. Afdel, “MSKVS: Adaptive mean shift-based keyframe extraction for video summarization and a new objective verification approach,” J. Vis. Commun. Image Represent, vol. 55, pp. 179–200, 2018.
H. Ullah and A. Munir, “Human activity recognition using cascaded dual attention CNN and Bi-Directional GRU framework,” J. Imaging, vol. 9, no. 7, 2023, Art no. 130.
S. Oh, A. Hoogs, A. Perera, N. Cuntoor, C. C. Chen, J. T. Lee, S. Mukherjee, J. K. Aggarwal, H. Lee, L. Davis, E. Swears, X. Wang, Q. Ji, K. Reddy, M. Shah, C. Vondrick, H. Pirsiavash, D. Ramanan, J. Yuen, A. Torralba, B. Song, A. Fong, A. Roy-Chowdhury, and M. Desai, “A large-scale benchmark dataset for event recognition in surveillance video,” in CVPR 2011, Colorado Springs, CO, USA, June 20–25, 2011, pp. 3153–3160.
P. Nair, U. Kumar, and S. Nandan, “COVID-19 social distance surveillance using deep learning,” Int. Conf. Computer Vis. Image Process. Commun. Computer Inf. Sci., vol. 1568, Springer, 2022, pp. 288–298.
C. Micheloni, E. Salvador, F. Bigaran, and G. L. Foresti, “An integrated surveillance system for outdoor security,” in IEEE Conference on Advanced Video and Signal Based Surveillance, Como, Italy, September 15–16, 2005, pp. 480–485.
Y. Xie, L. Ning, M. Wang, and C. Li, “Image enhancement based on histogram equalization,” J. Phys. Conf. Ser., vol. 1314, 2019, Art no. 12161.
Y. Zhu, N. M. Nayak, and A. K. Roy-Chowdhury, “Context-aware modeling and recognition of activities in video,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA, June 23–28, 2013, pp. 2491–2498.
X. Wang and Q. Ji, “A hierarchical context model for event recognition in surveillance video,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, June 23–28, 2014, pp. 2561–2568.
S. H. Hendi, H. B. Taher, and K. Q. Hussein, “Automated video events detection and classification using CNN-GRU model,” Wasit J. Computer Mathematics Sci., vol. 2, no. 4, pp. 77–86, 2023.