The implementation process of the top-left corner

Source publication

The proposed network architecture for vehicle detection

Qualitative comparison results of these detectors on the test video: a...

The “ground-truth” heatmaps of the vehicles in training processing....

The architecture of the multi-target corner pooling module

A multi-target corner pooling-based neural network for vehicle detection

Article

Full-text available

Sep 2020

Convolutional neural networks have shown a strong capability to improve performance in vehicle detection, which is one of the main research topics in intelligent transportation systems. Aiming to efficiently detect occluded vehicles in real traffic scenes, we propose a novel convolutional neural network based on multi-target corner pooling layers....

A Generalizable Deepfake Detector based on Neural Conditional Distribution Modelling

Conference Paper

Full-text available

Sep 2020

Photo- and video-realistic generation techniques have become a reality following the advent of deep neural networks. Consequently, there are immense concerns regarding the difficulty in differentiating what content is real from what is synthetic. An example of video-realistic generation techniques is the infamous Deepfakes, which exploit the main m...

Faster R-CNN based on frame difference and spatiotemporal context for vehicle detection

Article

Full-text available

Jun 2024

Vehicle detection is a very important part of intelligent transportation systems. To improve detection speed without sacrificing accuracy, this paper proposes an improved Faster R-CNN algorithm based on frame difference and spatiotemporal context for real-time vehicle detection. We improve the training and testing speed of Faster R-CNN by improving the RPN module. Unlike the anchor-based strategy of the original Faster R-CNN, the inter-frame difference is used mainly to extract the region of interest of the target, and spatiotemporal context information is introduced to assist detection. The spatial context is formed by adding association information from outside the target area, which enhances the expression of target information and improves detection accuracy; the temporal context is integrated to filter the anchors of the original Faster R-CNN, which improves detection efficiency. The improved RPN is well suited to detecting moving vehicles. This strategy not only lets our branch network run in parallel with the original Faster R-CNN, but also avoids the extra time cost of the added algorithm. More importantly, it can be added to existing Faster R-CNN based application systems without algorithm adjustment or network retraining. Experimental results show that the proposed method has high detection efficiency and low sensitivity to background changes.
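
The abstract above uses the inter-frame difference to extract a region of interest. A minimal sketch of that general idea follows; it is not the paper's RPN branch. The function name `frame_difference_roi`, the threshold value, and the toy grayscale frames are all assumptions made for illustration.

```python
def frame_difference_roi(prev_frame, curr_frame, threshold=25):
    """Return (top, left, bottom, right) of the pixels that changed between
    two grayscale frames, or None when nothing exceeds the threshold.

    Hypothetical helper: frames are lists of rows of integer intensities.
    """
    rows, cols = [], []
    for r, (prev_row, curr_row) in enumerate(zip(prev_frame, curr_frame)):
        for c, (p, q) in enumerate(zip(prev_row, curr_row)):
            # A pixel counts as "moving" if its intensity changed enough.
            if abs(q - p) > threshold:
                rows.append(r)
                cols.append(c)
    if not rows:
        return None
    # The bounding box of all changed pixels is the candidate region.
    return (min(rows), min(cols), max(rows), max(cols))


# Toy 4x5 frames (assumed data): a bright object appears in the second frame.
prev_frame = [
    [10, 10, 10, 10, 10],
    [10, 10, 10, 10, 10],
    [10, 10, 10, 10, 10],
    [10, 10, 10, 10, 10],
]
curr_frame = [
    [10, 10, 10, 10, 10],
    [10, 10, 200, 210, 10],
    [10, 10, 190, 205, 10],
    [10, 10, 10, 10, 10],
]
roi = frame_difference_roi(prev_frame, curr_frame)
print(roi)
```

A static background yields no region at all, which is why a difference-based proposal step suits moving vehicles in particular.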

TRCA-Net: stronger U structured network for human image segmentation

Article

Full-text available

Jan 2023
NEURAL COMPUT APPL

Human image segmentation has been a practical and active research topic due to its wide range of potential applications. Previous studies on manual, semi-automatic and automatic segmentation methods have investigated the semantic segmentation of human parts for real-world human analysis scenarios, but further research is still needed. This paper presents a novel semantic segmentation network, named TRCA-Net, for human image segmentation tasks. With TransUNet as the backbone, TRCA-Net incorporates Res2Net and Coordinate Attention to improve performance. Res2Net blocks and the Transformer obtain better feature maps by encoding the input images. The Coordinate Attention in the decoder aggregates and upsamples the encoded feature maps, and connects to the high-resolution CNN feature maps to obtain accurate segmentation. TRCA-Net can enhance finer details by recovering local spatial information. We compare TRCA-Net with state-of-the-art (SOTA) semantic segmentation networks: the original U-Net, DeepLabv3+, and TransUNet. The experimental results demonstrate that our proposed TRCA-Net outperforms these networks.

Enhanced Vehicle Re-Identification for Smart City Applications Using Zone Specific Surveillance

Article

Full-text available

Jan 2023

Vehicle re-identification is an important feature of an intelligent transportation system as part of a smart city application. Vehicle re-identification aims at matching vehicles from images acquired by surveillance cameras at different locations. During rush hours, vehicles are densely concentrated in regions such as the entries and exits of gated campuses, railways, airports, educational institutions, etc. Due to this uneven flow of traffic, vehicles may violate traffic rules, which can lead to a security breach. In such scenarios, to speed up the re-identification process, it is justified to look into a specific group of surveillance cameras to detect and re-identify vehicles on a day-to-day basis in near real time. However, the existing vehicle re-identification datasets do not contain zone-specific information and therefore cannot be used to evaluate the performance of re-identification algorithms in different zones. In the proposed work, a framework is developed that performs vehicle re-identification across a group of cameras that monitor varying traffic movements over an area. These areas, defined as “strategic zones”, comprise a subset of non-overlapping cameras that are installed to monitor non-uniform vehicle movements. The re-identification framework is evaluated on a novel dataset developed to understand the performance of vehicle re-identification across strategic zones. The dataset consists of videos of vehicles captured by 20 CCTV surveillance cameras that are grouped into four different zones. Various experiments are conducted to study the performance of re-identification across the four zones using a deep neural network with triplet loss, L2 regularization, and re-ranking. The experiments conducted with an image dimension of 224 × 224 have demonstrated an overall mAP of 77.22%. The four individual zones achieve mAPs of 82.16%, 69.1%, 66.5%, and 75.76%, respectively.
The experimental results demonstrate large variations in the accuracy of the vehicle re-identification method across different zones. Therefore, the study assesses the possible measures that can be taken to improve the performance in individual zones for accurate vehicle re-identification in intelligent transport systems.
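
The abstract above trains its network with a triplet loss. As a rough illustration of the standard margin-based formulation (not the authors' training code), the sketch below computes the loss for one triplet of embeddings. The margin of 0.3 and the two-dimensional toy embeddings are assumptions.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length embedding vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def triplet_loss(anchor, positive, negative, margin=0.3):
    """Standard triplet loss: max(0, d(a, p) - d(a, n) + margin).

    The loss is zero once the negative is farther from the anchor than the
    positive by at least the margin. The margin value is an assumption.
    """
    return max(0.0, euclidean(anchor, positive) - euclidean(anchor, negative) + margin)


# Toy embeddings (assumed data): same vehicle, and two different vehicles.
anchor = [0.0, 0.0]
positive = [0.3, 0.4]       # distance 0.5 from the anchor
hard_negative = [0.0, 0.6]  # distance 0.6: violates the margin
easy_negative = [3.0, 4.0]  # distance 5.0: already well separated

hard_loss = triplet_loss(anchor, positive, hard_negative)
easy_loss = triplet_loss(anchor, positive, easy_negative)
print(hard_loss, easy_loss)
```

Only the hard negative contributes a non-zero loss here, which is the behaviour that pushes embeddings of different vehicles apart during training.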

DVPPIR: privacy-preserving image retrieval based on DCNN and VHE

Article

Full-text available

Sep 2022
NEURAL COMPUT APPL

With 5G and Internet technologies developing rapidly, outsourcing images to cloud servers has attracted growing attention. In existing technologies, images are often outsourced to cloud servers to reduce storage and computing burdens. However, outsourcing images to cloud servers without any processing may reveal the users’ privacy, because the images may contain sensitive information about users, such as faces and locations, especially in electronic investigation. To overcome the security problems in image retrieval, we propose a privacy-preserving image retrieval scheme based on deep convolutional neural network (DCNN) and vector homomorphic encryption (VHE). We adopt DCNN and hash algorithms to extract image feature vectors, which improves retrieval accuracy. By combining VHE and K-means outsourcing clustering algorithms, the cloud server can build encrypted index trees, which speeds up the search and reduces the computational cost. In addition, a lightweight access control technique is used to allow image owners to set access policies for datasets flexibly. We prove the security of the proposed scheme and show the effectiveness of the scheme through experiments. Our scheme is suitable for application in electronic image investigation systems (EIIs) to optimize the storage and search of police data.

Corner based statistical modelling in vehicle detection under various condition for traffic surveillance

Article

Full-text available

Aug 2022
MULTIMED TOOLS APPL

The paper proposes a robust approach to detect and track vehicles under various climatic conditions and in the presence of camera shake, shadows, and sudden illumination changes. Corners are significant features for detecting and tracking vehicles. Corner points from the vehicular region are separated from non-vehicular regions based on a statistical background corner point model. The foreground corner points that belong to the vehicular region are grouped using Euclidean distance, as they are closely associated with each other. The flickering effects caused by the corner detection algorithm are handled by tracking these corner points. The detection accuracy of the algorithm is 94.32%.
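
The abstract above groups foreground corner points by Euclidean distance. One plausible way to do such grouping is sketched below with a union-find structure: points closer than a distance threshold end up in the same group. The paper does not state its grouping procedure, so the function `group_points`, the threshold, and the sample points are assumptions.

```python
import math


def group_points(points, max_dist):
    """Group 2-D points so that any two points within max_dist of each other
    (directly or through a chain of neighbours) share a group.

    Hypothetical helper using union-find; returns a sorted list of groups.
    """
    parent = list(range(len(points)))

    def find(i):
        # Follow parent links to the root, compressing the path on the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(points)):
        for j in range(i + 1, len(points)):
            if math.dist(points[i], points[j]) <= max_dist:
                parent[find(i)] = find(j)  # merge the two groups

    groups = {}
    for i, p in enumerate(points):
        groups.setdefault(find(i), []).append(p)
    return sorted(groups.values())


# Toy corner points (assumed data): two well-separated vehicles.
corners = [(0, 0), (1, 0), (1, 1), (10, 10), (11, 10)]
vehicle_groups = group_points(corners, max_dist=1.5)
print(vehicle_groups)
```

The pairwise loop is quadratic in the number of corners, which is acceptable for a sketch but would likely need a spatial index for dense scenes.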

Using the deep neural networks for normal and abnormal situation recognition in the automatic access monitoring and control system of vehicles

Article

Full-text available

Apr 2021
NEURAL COMPUT APPL

A new mathematical model of an intelligent access monitoring and control system based on the cybernetic approach is proposed for solving the problems of vehicle access to the territory of an organization. The distinctive feature of the mathematical model is its ability to take into account and recognize normal and abnormal situations at the protected object and to develop control actions. To localize vehicles and recognize their license plates, a combination of traditional image processing methods and two-pass classification, performed by a modified architecture of the MobileNet convolutional neural network, is proposed. This combination makes it possible to define additional identification features of the object. The proposed adaptations make it possible to recognize the situation in real time with low computational costs and high accuracy. A field experiment has shown that integrating modern hardware with object detection and recognition algorithms, even in the harsh conditions of outdoor closed-circuit television monitoring, provides an accuracy of at least 96%, with a processing time of no more than 0.094 s per frame on an Nvidia GeForce 1080Ti graphics processor. A program module in Python using the Tensorflow and Keras libraries is developed to carry out the access control functions.

CSE-YOLOv5: A Lightweight Attention Guided YOLOv5 Network based on EIoU Loss

Conference Paper

Dec 2023

Real-Time Ship Detection Algorithm Based on Improved YOLOv8 Network

Conference Paper

Dec 2023

Multi-target vehicle detection based on corner pooling with attention mechanism

Article

Full-text available

Oct 2023
APPL INTELL

Multi-target detection based on corner pooling provides a distinctive framework without anchor boxes, which has achieved wide application in the area of intelligent transportation system. To effectively detect small vehicles in the distant view, we propose an improved detection network termed corner pooling with attention mechanism (CPAM). A newly devised network called Hourglass with Coordinate Attention (Hourglass-CA) is proposed as an alternative to the Hourglass-104 backbone network. This one incorporates a multi-level attention mechanism to optimize the efficiency of feature extraction. Additionally, a novel multi-level attention loss (MLA loss) is presented, which dynamically adjusts the offsets during the feature extraction process. The experimental results demonstrate that our proposed CPAM achieves lightweight detection, reducing the parameters from 201M to 117M with an FPS from 4.2 to 16.1. Moreover, the AP can reach 51.6%, surpassing several existing detectors.
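
For readers unfamiliar with corner pooling, the sketch below shows the standard top-left corner pooling operation as introduced with CornerNet. It is not the multi-target variant or the attention-based network from the papers listed here. The function names and the 3 × 3 toy feature map are assumptions.

```python
def top_pool(fmap):
    """Each position takes the max of the values at or below it in its column."""
    out = [row[:] for row in fmap]
    for i in range(len(out) - 2, -1, -1):  # scan rows from bottom to top
        for j in range(len(out[i])):
            out[i][j] = max(out[i][j], out[i + 1][j])
    return out


def left_pool(fmap):
    """Each position takes the max of the values at or to the right of it in its row."""
    out = [row[:] for row in fmap]
    for row in out:
        for j in range(len(row) - 2, -1, -1):  # scan columns from right to left
            row[j] = max(row[j], row[j + 1])
    return out


def top_left_corner_pool(f_top, f_left):
    """Sum of top pooling on one feature map and left pooling on another."""
    t, l = top_pool(f_top), left_pool(f_left)
    return [[a + b for a, b in zip(t_row, l_row)] for t_row, l_row in zip(t, l)]


# Toy single-channel feature map (assumed data), used for both inputs.
fmap = [
    [1, 0, 2],
    [0, 3, 0],
    [4, 0, 1],
]
pooled = top_left_corner_pool(fmap, fmap)
print(pooled)
```

A top-left corner usually has no object evidence at its own location, so the pooling looks downward and rightward along the box edges to gather it.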

Multi-Feature Fusion Based Structural Deep Neural Network for Predicting Answer Time on Stack Overflow

Article

May 2023
J COMPUT SCI TECHNOL

Stack Overflow provides a platform for developers to seek suitable solutions by asking questions and receiving answers on various topics. However, many questions are not answered quickly enough. Since questioners are eager to know when a question is likely to be answered, predicting and reporting the expected answer time becomes an important task for Stack Overflow. To address this issue, we propose a model for predicting the answer time of questions, named Predicting Answer Time (i.e., the PAT model), which consists of two parts: a feature acquisition and fusion model, and a deep neural network model. The framework uses a variety of features mined from questions on Stack Overflow, including the question description, question title, question tags, the creation time of the question, and other temporal features. These features are fused and fed into the deep neural network to predict the answer time of the question. As a case study, post data from Stack Overflow are used to assess the model. We use traditional regression algorithms as the baselines, such as Linear Regression, K-Nearest Neighbors Regression, Support Vector Regression, Multilayer Perceptron Regression, and Random Forest Regression. Experimental results show that the PAT model can predict the answer time of questions more accurately than traditional regression algorithms, and reduces the error of the predicted answer time by nearly 10 hours.
