Bit-plane representation of a DCT residue matrix.

Source publication

Figure 2: Bit-plane representation of a DCT residue matrix.

Figure 3: A part of a frame where the card is selected as the ROI. The...

Figure 6: A macroblock m i, j divided into four 8 ¢ 8 subblocks is...

Figure 7: A part of a frame of the video sequence "hall monitor." The...

Lightweight Object Tracking in Compressed Video Streams Demonstrated in Region-of-Interest Coding

Article

Full-text available

Jan 2007

Video scalability is a recent video coding technology that allows content providers to offer multiple quality versions from a single encoded video file in order to target different kinds of end-user devices and networks. One form of scalability utilizes the region-of-interest concept, that is, the possibility to mark objects or zones within the vid...

Context 1

... difference with more traditional video encod- ing schemes is a novel approach to encode the residual val- ues. The FGS bit-plane DCT residue value encoding occurs by zig-zag scanning the values and by placing them in their binary form in a matrix (Figure 2). Note that the sign of a value is stored separately, so only the absolute value is used for the binary representation. ...

View in full-text

Specialization of Run-time Configuration Space at Compile-time: An Exploratory Study

Preprint

Full-text available

Oct 2022

Numerous software systems are highly configurable through run-time options, such as command-line parameters. Users can tune some of the options to meet various functional and non-functional requirements such as footprint, security, or execution time. However, some options are never set for a given system instance, and their values remain the same w...

Motion vector predictor selection for the enhancement layer in the H.264/AVC extension-spatial SVC

Conference Paper

Full-text available

Jun 2009

Scalable Video Coding (SVC) has been approved as the extension of the H.264/AVC video coding standard recently. In current spatial scalability scheme, the technique to examine both inter modes with residual prediction and without residual prediction for enhancement layers can achieve the highest possible coding efficiency, but it's typically one of...

A Rate Control Scheme of the Even Low Bit-rate Video Encoder

Article

Full-text available

Jan 2009

Rate control plays an important role in transmitting low-delay and high-quality images over the channel of very low bandwidth. The rate control algorithm in MPEG-4 or H.26X only defined the rate control model of P-frame, and did not introduce the rate control model of I-frame as it supposed that only the first frame is an I-frame, the others are al...

Traffic Analysis and Modeling of Real World Video Encoders

Article

Full-text available

Jan 1999

Any performance evaluation of broadband networks requires statistical analysis and modeling of the actual network traffic. Since multimedia services and especially MPEG coded video streams are expected to be a major traffic component over these networks, modeling of such services and accurate estimation of the network resources are crucial for the...

On the Performance of Compressive Video Streaming for Wireless Multimedia Sensor Networks

Conference Paper

Full-text available

Jun 2010

This paper investigates the potential of the compressed sensing (CS) paradigm for video streaming in Wireless Multimedia Sensor Networks. The objective is to study performance limits and outline key design principles that will be the basis for cross-layer protocol stacks for efficient transport of compressive video streams. Hence, this paper invest...

Temporal Motion Vector Filter for Fast Object Detection on Compressed Video

Article

Full-text available

May 2014

A novel Temporal Motion Vector Filter (TF) is presented and evaluated for real-time object detection on com- pressed videos in MPEG-2, MPEG-4 or H.264/AVC formats. The filter significantly reduces the noisy motion vectors that do not represent a real object movement . The filter analyses the temporal coherence of block motion vectors to determine if they are likely to represent true motion in the recorded scene. Experiments are performed using the CLEAR metrics for object detection and public available datasets from CAVIAR, PETS and CLEAR. These experiments demonstrate that the TF outperforms the Vector Median Filter, by providing better object detection accuracy with reduced computational complexity. The good results obtained by the TF make it suitable as a first step towards implementing systems that aim to detect and track objects from compressed video by using motion vectors. The TF could also be used to improve other techniques based on motion vectors such as Global Motion Estimation (GME) and Motion-Compensated Frame Interpolation (MCFI).

An Interactive Video Streaming Architecture Featuring Bitrate Adaptation

Article

Apr 2012

This paper describes an interactive and adaptive streaming architecture that exploits temporal concatenation of H.264/AVC video bit-streams to dynamically adapt to both user commands and network conditions. The architecture has been designed to improve the viewing experience when accessing video content through individual and potentially bandwidth constrained connections. On the one hand, the user commands typically gives the client the opportunity to select interactively a preferred version among the multiple video clips that are made available to render the scene, e.g. using different view angles, or zoomed-in and slow-motion factors. On the other hand, the adaptation to the network bandwidth ensures effective management of the client buffer, which appears to be fundamental to reduce the client-server interaction latency, while maximizing video quality and preventing buffer underflow. In addition to user interaction and network adaptation, the deployment of fully autonomous infrastructures for interactive content distribution also requires the development of automatic versioning methods. Hence, the paper also surveys a number of approaches proposed for this purpose in surveillance and sport event contexts. Both objective metrics and subjective experiments are exploited to assess our system. Index Terms— interactive streaming, clip versioning, RoI extraction, bitrate adaption, H.264/AVC.

Low-Complexity Tracking-Aware H.264 Video Compression for Transportation Surveillance

Article

Full-text available

Nov 2011
IEEE T CIRC SYST VID

In centralized transportation surveillance systems, video is captured and compressed at low processing power remote nodes and transmitted to a central location for processing. Such compression can reduce the accuracy of centrally run automated object tracking algorithms. In typical systems, the majority of communications bandwidth is spent on encoding temporal pixel variations such as acquisition noise or local changes to lighting. We propose a tracking-aware, H.264-compliant compression algorithm that removes temporal components of low tracking interest and optimizes the quantization of frequency coefficients, particularly those that most influence trackers, significantly reducing bitrate while maintaining comparable tracking accuracy. We utilize tracking accuracy as our compression criterion in lieu of mean squared error metrics. Our proposed system is designed with low processing power and memory requirements in mind, and as such can be deployed on remote nodes. Using H.264/AVC video coding and a commonly used state-of-the-art tracker we show that our algorithm allows for over 90% bitrate savings while maintaining comparable tracking accuracy.

Content-aware H.264 encoding for traffic video tracking applications

Conference Paper

Apr 2010
Acoust Speech Signal Process

Compressed domain indexing of scalable H.264/SVC streams

Article

Jul 2009
SIGNAL PROCESS-IMAGE

We present methods to efficiently analyze scalable, compressed H.264/scalable video coding (SVC) video streams. Relying solely on information present in the compressed stream, we estimate the global camera motion, perform motion segmentation and use a simple matching process to track moving objects over time. Object energy images are constructed in order to help resolve the problem of object correspondence during the occlusions of multiple objects. To save computing time, we analyze lower spatial layers of the stream and add higher layer information only if necessary. We draw 2-D object trajectories in the view plane of the camera and use the temporal evolution of the objects’ properties to estimate the relative distance to the camera, resulting in a pseudo 3-D representation of the trajectories. Finally, the suitability of the motion parameters to perform video retrieval/copy detection tasks is demonstrated. We therefore form two simple descriptors that are invariant to a series of transformations.

An Approach to Trajectory Estimation of Moving Objects in the H.264 Compressed Domain

Conference Paper

Full-text available

Jan 2009

This paper presents a simple and fast method for unsupervised trajectory estimation of multiple moving objects within a video scene. It is entirely based on the motion vectors that are present in compressed H.264/AVC or SVC video streams. We extract these motion vectors, perform robust frame-wise global motion estimation and use these estimates to form outlier masks. Motion segmentation on the spatio-temporally filtered outlier masks is performed to detect moving regions in the scene, which are analyzed over time in order to identify similar objects in adjacent frames. The construction of so-called Object History Images (OHIs) is proposed to stabilize the trajectories, which are finally interpolated with X-splines. The system enables real-time analysis with standard hardware.

Automated Video Adaptation Based on Time-Varying Context Parameters

Thesis

Jan 2006

R De Sutter

Application-Aware Approach to Compression and Transmission of H.264 Encoded Video for Automated and Centralized Transportation Surveillance

Article

Dec 2013
IEEE T INTELL TRANSP

In this paper, we present a transportation video coding and wireless transmission system specifically tailored to automated vehicle tracking applications. By taking into account the video characteristics and the lossy nature of the wireless channels, we propose video preprocessing and error control approaches to enhance tracking performance while conserving bandwidth resources and computational power at the transmitter. Compared with current state-of-the-art H.264-based implementations, our system is shown to yield over 80% bitrate savings for comparable tracking accuracy.

A Spatiotemporal Motion-Vector Filter for Object Tracking on Compressed Video

Article

Aug 2010

In this paper, a novel filter for real-time object trackingfrom compressed domain is presented and evaluated. Thefilter significantly reduces the noisy motion vectors, that donot represent a real object movement, from Mpeg familycompressed videos. The filter analyses the spatial (neighborhood)and temporal coherence of block motion vectorsto determine if they are likely to represent true motion fromthe recorded scene. Qualitative and quantitative experimentsare performed displaying that the proposed spatiotemporalfilter (STF) outperforms the currently widelyused vector median filter. The results obtained with the spatiotemporalfilter make it suitable as a first step of any systemthat aims to detect and track objects from compressedvideo using its motion vectors.

Compressed domain aided analysis of traffic surveillance videos

Conference Paper

Oct 2009

We present a novel system to perform efficient, compressed domain aided video analysis in the context of traffic surveillance applications. After camera installation, the system initializes by performing two short and fully automatic learning stages to gather information about the background and the principal moving directions in the scene. This knowledge is later used to assist the detection and tracking of vehicles. We combine processing in the pixel domain on decoded I-frames with motion based information from the H.264/SVC compressed domain in order to obtain a hybrid solution that delivers robust results at low computational complexity. Pan-tilt-zoom cameras are supported by the system, since global motion estimation is performed using the motion vectors that are present in the compressed stream.

Bit-plane representation of a DCT residue matrix.

Context in source publication

Similar publications

Citations