Performance Study of Deep-Learning Based Surveillance Systems

  • Mau-Luen Tham, Jia Lim Foo, Yoong Choon Chang, Ezra Morris, Nordin Ramli


Recent advances in deep learning have transformed video surveillance systems into authentic visual intelligence ecosystems capable of detecting and recognizing objects. Existing literature generally analyses the performance of surveillance in terms of detection accuracy and inference time, not the overall process. Moving towards Internet of things (IoT) trend, real-time monitoring has become a new paradigm for automation safety systems. This motivates us to investigate the holistic performance of deep-learning based surveillance systems in an embedded platform. Specifically, this paper presents a real-time surveillance system which encodes video clips only when human is detected, and subsequently pushes notifications for video retrieval. The implementation runs on Raspberry Pi Zero W (RPZW) with a pre-trained network of single shot detector (SSD) algorithm and a Mobile Net architecture. We further accelerate the hardware performance by utilizing Movidius Neural Compute Stick (NCS). Results show that RPZW with NCS can process live video stream at a rate of 3frames per second (fps).Experiment evaluations also reveal that how different video codecs impact the video quality and notification time. These findings offer insights into the quality of footage in terms of potential crime evidence and emergency response time for an IoT surveillance system.

How to Cite
Ezra Morris, Nordin Ramli, M.-L. T. J. L. F. Y. C. C. (2020). Performance Study of Deep-Learning Based Surveillance Systems. International Journal of Advanced Science and Technology, 29(1), 206 - 212. Retrieved from