Block diagram of YOLOv3-tiny architecture.

Source publication

Figure 1. Processing speed comparison by the number of kernels on...

Figure 2. Block diagram of YOLOv3-tiny architecture.

Figure 3. Scalable convolutional blocks.

Figure 4. Block diagram of SF-YOLO (medium) architecture.

Figure 5. Expansion of SF-YOLO by replacing the scalable convolutional...

Design of a Scalable and Fast YOLO for Edge-Computing Devices

Article

Full-text available

Nov 2020

With the increase in research cases of the application of a convolutional neural network (CNN)-based object detection technology, studies on the light-weight CNN models that can be performed in real time on the edge-computing devices are also increasing. This paper proposed scalable convolutional blocks that can be easily designed CNN networks of Y...

Context 1

... first starts by analyzing the state-of-the-art YOLO detector. Figure 2 shows the architecture of YOLOv3-tiny that is the light-weight version of YOLOv3. At the point of feature extraction, YOLOv3-tiny uses the five pooling layers to obtain the final feature map, and through this, the input image of W × H dimensions is converted into the final feature map of (W/32) × (H/32) dimensions. ...

View in full-text

Figure 1. Schematic overview of the National Program of Elderly Care.

Figure 2. Official kick-off meeting of the NPEC, hand-over of check.

Figure 3. Image of older person representing a participant in a client...

Figure 4. The narrative cycle (Source: Development of central...

Figure 5. Network analysis tool (Source: Development of central...

On Staging Work: How Research Funding Bodies Create Adaptive Coherence in Times of Projectification

Article

Full-text available

Apr 2021

While recent science and technology studies literature focuses on “projectification” and its felt tensions for researchers, a surprising scarcity of empirical work addresses experiences at the “other end,” such as funding bodies often held “responsible” for tensions encountered by researchers. Actors in funding bodies experience similar tensions, h...

Figure 1. ROC curve of the GMB Grid model.

Description of the data set with its respective characteristics

Description of quality metrics of models with training data.

GBM Grid classification model confusion matrix.

Development of a Classification Model for Predicting Student Payment Behavior Using Artificial Intelligence and Data Science Techniques

Article

Full-text available

Jun 2023

Artificial intelligence today has become a valuable tool for decision-making, where universities have to adapt and optimize their processes, improving the quality of their services. In this context, the economic income from collections is vital for sustainability. There are several problems that can contribute to student delinquency, such as econom...

Digitalization between environmental activism and counter-activism: The case of satellite data on deforestation in the Brazilian Amazon

Article

Full-text available

Apr 2022

This paper analyzes the uses of digital satellite data on deforestation in the Amazon region, drawing on poststructuralist studies of scientific knowledge practices and Science and Technology Studies (STS). Focusing on changes under the government of President Jair Bolsonaro, we argue that populist right-wing rhetoric, policies, and practices towar...

THE NEW WORLD ORDER OF TECHNOLOGY AND INNOVATION: AN ANALYSIS OF HOW THE INVESTMENT ON INNOVATION BASED IN ARTIFICIAL INTELLIGENCE CAN MODIFY TODAY'S INDUSTRIAL PROTECTION MEANS

Article

Full-text available

Mar 2021

There is no acceptable definition for artificial intelligence (AI), one of which is that it is a computer system capable of solving complex problems (World Economic Forum 2018). The WIPO AI report and the highest order fields AI was the starting point for the construction of this exploratory, statistical study with surveys of WIPO, UNESCO, World Ba...

Who cares? How care practices uphold the decentralised energy order

Article

Full-text available

Jul 2022

This paper represents the decentralised energy order as a matter of care: so as to make visible the unequal burden of care and to encourage active caring. It extends an emerging overlap that exists in studies of repair and maintenance of material objects from science and technology studies (STS) and an increasing interest in the creation and mainte...

YOLOv1 to v8: Unveiling Each Variant — A Comprehensive Review of YOLO

Article

Full-text available

Jan 2024

Muhammad Hussain

This paper implements a systematic methodological approach to review the evolution of YOLO variants. Each variant is dissected by examining its internal architectural composition, providing a thorough understanding of its structural components. Subsequently, the review highlights key architectural innovations introduced in each variant, shedding light on the incremental refinements. The review includes benchmarked performance metrics, offering a quantitative measure of each variant’s capabilities. The paper further presents the performance of YOLO variants across a diverse range of domains, manifesting their real-world impact. This structured approach ensures a comprehensive examination of YOLOs journey, methodically communicating its internal advancements and benchmarked performance before delving into domain applications. It is envisioned, the incorporation of concepts such as federated learning can introduce a collaborative training paradigm, where YOLO models benefit from training across multiple edge devices, enhancing privacy, adaptability, and generalisation.

Software Aging in a Real-Time Object Detection System on an Edge Server

Conference Paper

Jun 2023

A Dragon Fruit Picking Detection Method Based on YOLOv7 and PSP-Ellipse

Article

Full-text available

Apr 2023
SENSORS-BASEL

Dragon fruit is one of the most popular fruits in China and Southeast Asia. It, however, is mainly picked manually, imposing high labor intensity on farmers. The hard branches and complex postures of dragon fruit make it difficult to achieve automated picking. For picking dragon fruits with diverse postures, this paper proposes a new dragon fruit detection method, not only to identify and locate the dragon fruit, but also to detect the endpoints that are at the head and root of the dragon fruit, which can provide more visual information for the dragon fruit picking robot. First, YOLOv7 is used to locate and classify the dragon fruit. Then, we propose a PSP-Ellipse method to further detect the endpoints of the dragon fruit, including dragon fruit segmentation via PSPNet, endpoints positioning via an ellipse fitting algorithm and endpoints classification via ResNet. To test the proposed method, some experiments are conducted. In dragon fruit detection, the precision, recall and average precision of YOLOv7 are 0.844, 0.924 and 0.932, respectively. YOLOv7 also performs better compared with some other models. In dragon fruit segmentation, the segmentation performance of PSPNet on dragon fruit is better than some other commonly used semantic segmentation models, with the segmentation precision, recall and mean intersection over union being 0.959, 0.943 and 0.906, respectively. In endpoints detection, the distance error and angle error of endpoints positioning based on ellipse fitting are 39.8 pixels and 4.3°, and the classification accuracy of endpoints based on ResNet is 0.92. The proposed PSP-Ellipse method makes a great improvement compared with two kinds of keypoint regression method based on ResNet and UNet. Orchard picking experiments verified that the method proposed in this paper is effective. The detection method proposed in this paper not only promotes the progress of the automatic picking of dragon fruit, but it also provides a reference for other fruit detection.

A real-time fire and flame detection method for electric vehicle charging station based on machine vision

Article

Full-text available

Mar 2023

In the charging process of electric vehicle (EV), high voltage and high current charging methods are widely used to reduce charging time, resulting in severe battery heating and an increased risk of fire. To improve fire detection efficiency, this paper proposes a real-time fire and smoke detection method for EV charging station based on Machine Vision. The algorithm introduces the Kmeans + + algorithm in the GhostNet-YOLOv4 model to rescreen anchor boxes for fire smoke targets to optimize the classification quality for the complex and variable features of targets; and introduces the coordinate attention (CA) module after the lightweight backbone network GhostNet to improve the classification quality. In this paper, we use EV charging station monitoring video as a model detection input source to achieve real-time detection of multiple pairs of sites. The experimental results demonstrate that the improved algorithm has a model parameter number of 11.436 M, a mAP value of 87.70%, and a video detection FPS value of 75, which has a good continuous target tracking capability and satisfies the demand for real-time monitoring and is crucial for the safe operation of EV charging station and the emergency extinguishing of fire.

An Embedded Framework for Fully Autonomous Object Manipulation in Robotic-Empowered Assisted Living

Article

Full-text available

Dec 2022
SENSORS-BASEL

Most of the humanoid social robots currently diffused are designed only for verbal and animated interactions with users, and despite being equipped with two upper arms for interactive animation, they lack object manipulation capabilities. In this paper, we propose the MONOCULAR (eMbeddable autONomous ObjeCt manipULAtion Routines) framework, which implements a set of routines to add manipulation functionalities to social robots by exploiting the functional data fusion of two RGB cameras and a 3D depth sensor placed in the head frame. The framework is designed to: (i) localize specific objects to be manipulated via RGB cameras; (ii) define the characteristics of the shelf on which they are placed; and (iii) autonomously adapt approach and manipulation routines to avoid collisions and maximize grabbing accuracy. To localize the item on the shelf, MONOCULAR exploits an embeddable version of the You Only Look Once (YOLO) object detector. The RGB camera outcomes are also used to estimate the height of the shelf using an edge-detecting algorithm. Based on the item's position and the estimated shelf height, MONOCULAR is designed to select between two possible routines that dynamically optimize the approach and object manipulation parameters according to the real-time analysis of RGB and 3D sensor frames. These two routines are optimized for a central or lateral approach to objects on a shelf. The MONOCULAR procedures are designed to be fully automatic, intrinsically protecting sensitive users' data and stored home or hospital maps. MONOCULAR was optimized for Pepper by SoftBank Robotics. To characterize the proposed system, a case study in which Pepper is used as a drug delivery operator is proposed. The case study is divided into: (i) pharmaceutical package search; (ii) object approach and manipulation; and (iii) delivery operations. Experimental data showed that object manipulation routines for laterally placed objects achieves a best grabbing success rate of 96%, while the routine for centrally placed objects can reach 97% for a wide range of different shelf heights. Finally, a proof of concept is proposed here to demonstrate the applicability of the MONOCULAR framework in a real-life scenario.

Lightweight Deep Learning Model for Weed Detection for IoT Devices

Conference Paper

Full-text available

Jun 2022

A Real Time 1280x720 Object Detection Chip With 585MB/s Memory Traffic

Preprint

Full-text available

May 2022

Memory bandwidth has become the real-time bottleneck of current deep learning accelerators (DLA), particularly for high definition (HD) object detection. Under resource constraints, this paper proposes a low memory traffic DLA chip with joint hardware and software optimization. To maximize hardware utilization under memory bandwidth, we morph and fuse the object detection model into a group fusion-ready model to reduce intermediate data access. This reduces the YOLOv2's feature memory traffic from 2.9 GB/s to 0.15 GB/s. To support group fusion, our previous DLA based hardware employes a unified buffer with write-masking for simple layer-by-layer processing in a fusion group. When compared to our previous DLA with the same PE numbers, the chip implemented in a TSMC 40nm process supports 1280x720@30FPS object detection and consumes 7.9X less external DRAM access energy, from 2607 mJ to 327.6 mJ.

Benchmark Analysis of YOLO Performance on Edge Intelligence Devices

Article

Full-text available

Apr 2022

In the 5G intelligent edge scenario, more and more accelerator-based single-board computers (SBCs) with low power consumption and high performance are being used as edge devices to run the inferencing part of the artificial intelligence (AI) model to deploy intelligent applications. In this paper, we investigate the inference workflow and performance of the You Only Look Once (YOLO) network, which is the most popular object detection model, in three different accelerator-based SBCs, which are NVIDIA Jetson Nano, NVIDIA Jetson Xavier NX and Raspberry Pi 4B (RPi) with Intel Neural Compute Stick2 (NCS2). Different video contents with different input resize windows are detected and benchmarked by using four different versions of the YOLO model across the above three SBCs. By comparing the inference performance of the three SBCs, the performance of RPi + NCS2 is more friendly to lightweight models. For example, the FPS of detected videos from RPi + NCS2 running YOLOv3-tiny is 7.6 times higher than that of YOLOv3. However, in terms of detection accuracy, we found that in the process of realizing edge intelligence, how to better adapt a AI model to run on RPi + NCS2 is much more complex than the process of Jetson devices. The analysis results indicate that Jetson Nano is a trade-off SBCs in terms of performance and cost; it achieves up to 15 FPSs of detected videos when running YOLOv4-tiny, and this result can be further increased by using TensorRT.

Adopting the YOLOv4 Architecture for Low-Latency Multispectral Pedestrian Detection in Autonomous Driving

Article

Full-text available

Jan 2022
SENSORS-BASEL

Detecting pedestrians in autonomous driving is a safety-critical task, and the decision to avoid a a person has to be made with minimal latency. Multispectral approaches that combine RGB and thermal images are researched extensively, as they make it possible to gain robustness under varying illumination and weather conditions. State-of-the-art solutions employing deep neural networks offer high accuracy of pedestrian detection. However, the literature is short of works that evaluate multispectral pedestrian detection with respect to its feasibility in obstacle avoidance scenarios, taking into account the motion of the vehicle. Therefore, we investigated the real-time neural network detector architecture You Only Look Once, the latest version (YOLOv4), and demonstrate that this detector can be adapted to multispectral pedestrian detection. It can achieve accuracy on par with the state-of-the-art while being highly computationally efficient, thereby supporting low-latency decision making. The results achieved on the KAIST dataset were evaluated from the perspective of automotive applications, where low latency and a low number of false negatives are critical parameters. The middle fusion approach to YOLOv4 in its Tiny variant achieved the best accuracy to computational efficiency trade-off among the evaluated architectures.

A package auto-counting model based on tailored YOLO and DeepSort techniques

Article

Full-text available

Jan 2022

In the industrial area, the deployment of deep learning models in object detection and tracking are normally too large, also, it requires appropriate trade-offs between speed and accuracy. In this paper, we present a compressed object identification model called Tailored-YOLO (T-YOLO), and builds a lighter deep neural network construction based on the T-YOLO and DeepSort. The model greatly reduces the number of parameters by tailoring the two layers of Conv and BottleneckCSP. We verify the construction by realizing the package counting during the input-output warehouse process. The theoretical analysis and experimental results show that the mean average precision (mAP) is 99.50%, the recognition accuracy of the model is 95.88%, the counting accuracy is 99.80%, and the recall is 99.15%. Compared with the YOLOv5 combined DeepSort model, the proposed optimization method ensures the accuracy of packages recognition and counting and reduces the model parameters by 11MB.

Block diagram of YOLOv3-tiny architecture.

Context in source publication

Similar publications

Citations