Home
Harbin Institute of Technology
School of Computer Science and Technology
Xianming Liu

Xianming Liu
Harbin Institute of Technology | HIT · School of Computer Science and Technology

About

177

Publications

19,893

Reads

3,288

Citations

Publications

Incrementally Adapting Pretrained Model Using Network Prior for Multi-Focus Image Fusion

Article

Jun 2024

Multi-focus image fusion can fuse the clear parts of two or more source images captured at the same scene with different focal lengths into an all-in-focus image. On the one hand, previous supervised learning-based multi-focus image fusion methods relying on synthetic datasets have a clear distribution shift with real scenarios. On the other hand,...

OpticalDR: A Deep Optical Imaging Model for Privacy-Protective Depression Recognition

Conference Paper

Full-text available

Jun 2024

Depression Recognition (DR) poses a considerable challenge , especially in the context of the growing concerns surrounding privacy. Traditional automatic diagnosis of DR technology necessitates the use of facial images, undoubtedly expose the patient identity features and poses privacy risks. In order to mitigate the potential risks associated with...

Spatial Annealing Smoothing for Efficient Few-shot Neural Rendering

Preprint

Jun 2024

Neural Radiance Fields (NeRF) with hybrid representations have shown impressive capabilities in reconstructing scenes for view synthesis, delivering high efficiency. Nonetheless, their performance significantly drops with sparse view inputs, due to the issue of overfitting. While various regularization strategies have been devised to address these...

Mix-DDPM: Enhancing Diffusion Models through Fitting Mixture Noise with Global Stochastic Offset

Article

Jun 2024

Denoising diffusion probabilistic models (DDPM) have shown impressive performance in various domains as a class of deep generative models. In this paper, we introduce the Mixture noise-based DDPM (Mix-DDPM), which considers the Markov diffusion posterior as a Gaussian mixture model. Specifically, Mix-DDPM randomly selects a Gaussian component and t...

The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition

Preprint

Full-text available

May 2024

In the realm of autonomous driving, robust perception under out-of-distribution conditions is paramount for the safe deployment of vehicles. Challenges such as adverse weather, sensor malfunctions, and environmental unpredictability can severely impact the performance of autonomous systems. The 2024 RoboDrive Challenge was crafted to propel the dev...

DINO-SD for Robust Multi-View Supervised Depth Estimation

Preprint

Full-text available

May 2024

This technical report summarizes the champion solution for the RoboDepth Challenge, which is held in the ICRA 2024 RoboDrive Workshop. DINO-SD is a multi-view supervised depth estimation model. Our model primarily focuses on addressing robustness issues in corrupted environments of autonomous driving. We use pretrained DINOv2 as the backbone, M-DPT...

DINO-SD: Champion Solution for ICRA 2024 RoboDepth Challenge

Preprint

Full-text available

May 2024

Surround-view depth estimation is a crucial task aims to acquire the depth maps of the surrounding views. It has many applications in real world scenarios such as autonomous driving, AR/VR and 3D reconstruction, etc. However, given that most of the data in the autonomous driving dataset is collected in daytime scenarios, this leads to poor depth mo...

Illumination-Aware Low-Light Image Enhancement with Transformer and Auto-Knee Curve

Article

May 2024

Images captured under low-light conditions suffer from several combined degradation factors, including low brightness, low contrast, noise, and color bias. Many learning-based techniques attempt to learn the low-to-clear mapping between low-light and normal-light images. However, they often fall short when applied to low-light images taken in wide-...

Deep Lossy Plus Residual Coding for Lossless and Near-Lossless Image Compression

Article

May 2024

Lossless and near-lossless image compression is of paramount importance to professional users in many technical fields, such as medicine, remote sensing, precision engineering and scientific research. But despite rapidly growing research interests in learning-based image compression, no published method offers both lossless and near-lossless modes....

Low-Light Face Super-resolution via Illumination, Structure, and Texture Associated Representation

Article

Mar 2024

Human face captured at night or in dimly lit environments has become a common practice, accompanied by complex low-light and low-resolution degradations. However, the existing face super-resolution (FSR) technologies and derived cascaded schemes are inadequate to recover credible textures. In this paper, we propose a novel approach that decomposes...

FMRNet: Image Deraining via Frequency Mutual Revision

Article

Mar 2024

The wavelet transform has emerged as a powerful tool in deciphering structural information within images. And now, the latest research suggests that combining the prowess of wavelet transform with neural networks can lead to unparalleled image deraining results. By harnessing the strengths of both the spatial domain and frequency space, this innova...

Learning from History: Task-agnostic Model Contrastive Learning for Image Restoration

Article

Mar 2024

Contrastive learning has emerged as a prevailing paradigm for high-level vision tasks, which, by introducing properly negative samples, has also been exploited for low-level vision tasks to achieve a compact optimization space to account for their ill-posed nature. However, existing methods rely on manually predefined and task-oriented negatives, w...

Image Deblurring by Exploring In-Depth Properties of Transformer

Article

Feb 2024

Image deblurring continues to achieve impressive performance with the development of generative models. Nonetheless, there still remains a displeasing problem if one wants to improve perceptual quality and quantitative scores of recovered image at the same time. In this study, drawing inspiration from the research of transformer properties, we intr...

Overhead-free Noise-tolerant Federated Learning: A New Baseline

Article

Jan 2024

Federated learning (FL) is a promising decentralized machine learning approach that enables multiple distributed clients to train a model jointly while keeping their data private. However, in real-world scenarios, the supervised training data stored in local clients inevitably suffer from imperfect annotations, resulting in subjective, inconsistent...

Structure Prior-Aware Dynamic Network for Face Super-Resolution

Article

Jan 2024

The recent emergence of deep learning neural networks has propelled advancements in the field of face super-resolution. While these deep learning-based methods have shown significant performance improvements, they depend overwhelmingly on fixed, spatially shared kernels within standard convolutional layers. This leads to a neglect of the diverse fa...

HoloFormer: Contrastive Regularization Based Transformer for Holographic Image Reconstruction

Article

Jan 2024

Deep learning has emerged as a prominent technique in the field of holographic imaging, owing to its rapidity and high performance. Prevailing deep neural networks employed for holographic image reconstruction predominantly rely on convolutional neural networks (CNNs). While CNNs have yielded impressive results, their intrinsic limitations, charact...

GroupedMixer: An Entropy Model with Group-wise Token-Mixers for Learned Image Compression

Article

Jan 2024

Transformer-based entropy models have gained prominence in recent years due to their superior ability to capture long-range dependencies in probability distribution estimation compared to convolution-based methods. However, previous transformer-based entropy models suffer from sluggish coding process due to pixel-wise autoregression or duplicated c...

Learning a 3D-CNN and Transformer prior for hyperspectral image super-resolution

Article

Dec 2023

Reciprocal transformer for hyperspectral and multispectral image fusion

Article

Nov 2023

Super-Resolving Face Image by Facial Parsing Information

Article

Oct 2023

Face super-resolution is a technology that transforms a low-resolution face image into the corresponding high-resolution one. In this paper, we build a novel parsing map guided face super-resolution network which extracts the face prior (i.e., parsing map) directly from low-resolution face image for the following utilization. To exploit the extract...

FMRNet: Image Deraining via Frequency Mutual Revision

Preprint

Sep 2023

DepthFormer: Exploiting Long-range Correlation and Local Information for Accurate Monocular Depth Estimation

Article

Full-text available

Sep 2023

This paper aims to address the problem of supervised monocular depth estimation. We start with a meticulous pilot study to demonstrate that the long-range correlation is essential for accurate depth estimation. Moreover, the Transformer and convolution are good at long-range and close-range depth estimation, respectively. Therefore, we propose to a...

Fully $1\times1$ Convolutional Network for Lightweight Image Super-Resolution

Preprint

Jul 2023

Deep models have achieved significant process on single image super-resolution (SISR) tasks, in particular large models with large kernel ($3\times3$ or more). However, the heavy computational footprint of such models prevents their deployment in real-time, resource-constrained environments. Conversely, $1\times1$ convolutions bring substantial com...

Unsupervised Deep Exemplar Colorization via Pyramid Dual Non-Local Attention

Article

Jul 2023

Exemplar-based colorization is a challenging task, which attempts to add colors to the target grayscale image with the aid of a reference color image, so as to keep the target semantic content while with the reference color style. In order to achieve visually plausible chromatic results, it is important to sufficiently exploit the global color styl...

A Practical Contrastive Learning Framework for Single-Image Super-Resolution

Article

Jul 2023

Contrastive learning has achieved remarkable success on various high-level tasks, but there are fewer contrastive learning-based methods proposed for low-level tasks. It is challenging to adopt vanilla contrastive learning technologies proposed for high-level visual tasks to low-level image restoration problems straightly. Because the acquired high...

Learning Lossless Compression for High Bit-Depth Medical Imaging

Conference Paper

Jul 2023

Self-Supervised Arbitrary-Scale Implicit Point Clouds Upsampling

Article

Jun 2023

Point clouds upsampling (PCU), which aims to generate dense and uniform point clouds from the captured sparse input of 3D sensor such as LiDAR, is a practical yet challenging task. It has potential applications in many real-world scenarios, such as autonomous driving, robotics, AR/VR, etc. Deep neural network based methods achieve remarkable succes...

Thermal Image Super-Resolution Challenge Results - PBVS 2023

Conference Paper

Jun 2023

Spatial-Frequency Mutual Learning for Face Super-Resolution

Conference Paper

Jun 2023

Backdoor Attacks Against Incremental Learners: An Empirical Evaluation Study

Preprint

Full-text available

May 2023

Large amounts of incremental learning algorithms have been proposed to alleviate the catastrophic forgetting issue arises while dealing with sequential data on a time series. However, the adversarial robustness of incremental learners has not been widely verified, leaving potential security risks. Specifically, for poisoning-based backdoor attacks,...

Comparison of 2D (a) and 3D convolution (b) on multi-band data.

Simplified flowchart of several models: (a) full 2D CNNs, (b) full 3D...

Some visual results and error maps of MUN (the upper line) and F3DUN...

Rethinking 3D-CNN in Hyperspectral Image Super-Resolution

Article

Full-text available

May 2023

Recently, CNN-based methods for hyperspectral image super-resolution (HSISR) have achieved outstanding performance. Due to the multi-band property of hyperspectral images, 3D convolutions are natural candidates for extracting spatial–spectral correlations. However, pure 3D CNN models are rare to see, since they are generally considered to be too co...

Super-Resolving Face Image by Facial Parsing Information

Preprint

Apr 2023

Deep Attentional Guided Image Filtering

Article

Mar 2023

Guided filter is a fundamental tool in computer vision and computer graphics, which aims to transfer structure information from the guide image to the target image. Most existing methods construct filter kernels from the guidance itself without considering the mutual dependency between the guidance and the target. However, since there typically exi...

Fig. 1: Illustration the effectiveness of VGG and ViT features. The...

Fig. 2: Similarity visualization of different ViT features on the image...

Fig. 3: The workflow illustration of proposed perceptual losses.

Image Deblurring by Exploring In-depth Properties of Transformer

Preprint

Full-text available

Mar 2023

Figure 3. The architecture of the proposed TCSR.

Figure 9. LAM [12] comparison. (a) The ground truth of the reference...

Incorporating Transformer Designs into Convolutions for Lightweight Image Super-Resolution

Preprint

Full-text available

Mar 2023

In recent years, the use of large convolutional kernels has become popular in designing convolutional neural networks due to their ability to capture long-range dependencies and provide large receptive fields. However, the increase in kernel size also leads to a quadratic growth in the number of parameters, resulting in heavy computation and memory...

Guided Depth Map Super-resolution: A Survey

Preprint

Feb 2023

Guided depth map super-resolution (GDSR), which aims to reconstruct a high-resolution (HR) depth map from a low-resolution (LR) observation with the help of a paired HR color image, is a longstanding and fundamental problem, it has attracted considerable attention from computer vision and image processing communities. A myriad of novel and effectiv...

Efficient Single-Image Depth Estimation on Mobile Devices, Mobile AI & AIM 2022 Challenge: Report

Chapter

Feb 2023

Various depth estimation models are now widely used on many mobile and IoT devices for image segmentation, bokeh effect rendering, object tracking and many other mobile tasks. Thus, it is very crucial to have efficient and accurate depth estimation models that can run fast on low-power mobile chipsets. In this Mobile AI challenge, the target was to...

Guided Depth Map Super-resolution: A Survey

Article

Feb 2023

LiteDepth: Digging into Fast and Accurate Depth Estimation on Mobile Devices

Chapter

Feb 2023

Monocular depth estimation is an essential task in the computer vision community. While tremendous successful methods have obtained excellent results, most of them are computationally expensive and not applicable for real-time on-device inference. In this paper, we aim to address more practical applications of monocular depth estimation, where the...

Asymmetric Loss Functions for Noise-Tolerant Learning: Theory and Applications

Article

Feb 2023

Supervised deep learning has achieved tremendous success in many computer vision tasks, which however is prone to overfit noisy labels. To mitigate the undesirable influence of noisy labels, robust loss functions offer a feasible approach to achieve noise-tolerant learning. In this work, we systematically study the problem of noise-tolerant learnin...

ReSmooth: Detecting and Utilizing OOD Samples When Training With Data Augmentation

Article

Nov 2022

Data augmentation (DA) is a widely used technique for enhancing the training of deep neural networks. Recent DA techniques which achieve state-of-the-art performance always meet the need for diversity in augmented training samples. However, an augmentation strategy that has a high diversity usually introduces out-of-distribution (OOD) augmented sam...

Efficient Single-Image Depth Estimation on Mobile Devices, Mobile AI & AIM 2022 Challenge: Report

Preprint

Nov 2022

Unsupervised Domain Adaptation for Monocular 3D Object Detection via Self-training

Chapter

Nov 2022

Monocular 3D object detection (Mono3D) has achieved unprecedented success with the advent of deep learning techniques and emerging large-scale autonomous driving datasets. However, drastic performance degradation remains an unwell-studied challenge for practical cross-domain deployment as the lack of labels on the target domain. In this paper, we f...

Fusion from Decomposition: A Self-Supervised Decomposition Approach for Image Fusion

Chapter

Nov 2022

Image fusion is famous as an alternative solution to generate one high-quality image from multiple images in addition to image restoration from a single degraded image. The essence of image fusion is to integrate complementary information or best parts from source images. The current fusion methods usually need a large number of paired samples or s...

Propagating Facial Prior Knowledge for Multi-Task Learning in Face Super-Resolution

Article

Nov 2022

Existing face hallucination methods always achieve improved performance through regularizing the model with facial prior. Most of them always estimate facial prior information first and then leverage it to help the prediction of the target high-resolution face image. However, the accuracy of prior estimation is difficult to guarantee, especially fo...

ZMFF: Zero-shot multi-focus image fusion

Article

Nov 2022

Multi-focus image fusion (MFF) is an effective way to eliminate the out-of-focus blur generated in the imaging process. The difficulties in distinguishing different blur levels and the lack of real supervised data make multi-focus image fusion remain a challenging task after decades of research. According to deep image prior (DIP) (Ulyanov et al.,...

Hybrid Conditional Deep Inverse Tone Mapping

Conference Paper

Oct 2022

Multi-Camera Collaborative Depth Prediction via Consistent Structure Estimation

Conference Paper

Oct 2022

ChebyLighter: Optimal Curve Estimation for Low-light Image Enhancement

Conference Paper

Oct 2022

Multi-Camera Collaborative Depth Prediction via Consistent Structure Estimation

Preprint

Oct 2022

Depth map estimation from images is an important task in robotic systems. Existing methods can be categorized into two groups including multi-view stereo and monocular depth estimation. The former requires cameras to have large overlapping areas and sufficient baseline between cameras, while the latter that processes each image independently can ha...

Deep Lossy Plus Residual Coding for Lossless and Near-lossless Image Compression

Preprint

Sep 2022

Fig. 1. Illustration of our proposed network architecture that follows...

Fig. 4. Illustration of valid mask calculation for gradience loss (x...

Fig. 6. Illustration of our multi-scale distillation strategy.

Ranking results in the MAI&AIM2022 Monocular Depth Estimation Chal-...

Inference time of our network (AI Benchmark).

LiteDepth: Digging into Fast and Accurate Depth Estimation on Mobile Devices

Preprint

Full-text available

Sep 2022

Self-Supervised Monocular Depth Estimation via Discrete Strategy and Uncertainty

Article

Jul 2022

Dear Editor, This letter is concerned with self-supervised monocular depth estimation. To estimate uncertainty simultaneously, we propose a simple yet effective strategy to learn the uncertainty for self-supervised monocular depth estimation with the discrete strategy that explicitly associates the prediction and the uncertainty to train the networ...

Towards End-to-End Image Compression and Analysis with Transformers

Article

Jun 2022

We propose an end-to-end image compression and analysis model with Transformers, targeting to the cloud-based image classification application. Instead of placing an existing Transformer-based image classification model directly after an image codec, we aim to redesign the Vision Transformer (ViT) model to perform image classification from the comp...

Local Surface Descriptor for Geometry and Feature Preserved Mesh Denoising

Article

Jun 2022

3D meshes are widely employed to represent geometry structure of 3D shapes. Due to limitation of scanning sensor precision and other issues, meshes are inevitably affected by noise, which hampers the subsequent applications. Convolultional neural networks (CNNs) achieve great success in image processing tasks, including 2D image denoising, and have...

SimIPU: Simple 2D Image and 3D Point Cloud Unsupervised Pre-training for Spatial-Aware Visual Representations

Article

Jun 2022

Pre-training has become a standard paradigm in many computer vision tasks. However, most of the methods are generally designed on the RGB image domain. Due to the discrepancy between the two-dimensional image plane and the three-dimensional space, such pre-trained models fail to perceive spatial information and serve as sub-optimal solutions for 3D...

Learning Towards the Largest Margins

Preprint

Jun 2022

One of the main challenges for feature representation in deep learning-based classification is the design of appropriate loss functions that exhibit strong discriminative power. The classical softmax loss does not explicitly encourage discriminative learning of features. A popular direction of research is to incorporate margins in well-established...

Prototype-Anchored Learning for Learning with Imperfect Annotations

Preprint

Jun 2022

The success of deep neural networks greatly relies on the availability of large amounts of high-quality annotated data, which however are difficult or expensive to obtain. The resulting labels may be class imbalanced, noisy or human biased. It is challenging to learn unbiased classification models from imperfectly annotated datasets, on which we us...

NTIRE 2022 Image Inpainting Challenge: Report

Conference Paper

Jun 2022

GLaMa: Joint Spatial and Frequency Loss for General Image Inpainting

Conference Paper

Jun 2022

Thermal Image Super-Resolution Challenge Results - PBVS 2022

Conference Paper

Full-text available

Jun 2022

This paper presents results from the third Thermal Image Super-Resolution (TISR) challenge organized in the Perception Beyond the Visible Spectrum (PBVS) 2022 workshop. The challenge uses the same thermal image dataset as the first two challenges, with 951 training images and 50 validation images at each resolution. A set of 20 images was kept asid...

From Less to More: Spectral Splitting and Aggregation Network for Hyperspectral Face Super-Resolution

Conference Paper

Jun 2022

Shadows can be Dangerous: Stealthy and Effective Physical-world Adversarial Attack by Natural Phenomenon

Conference Paper

Jun 2022

Self-Supervised Arbitrary-Scale Point Clouds Upsampling via Implicit Neural Representation

Conference Paper

Jun 2022

Fully Unsupervised Person Re-Identification via Selective Contrastive Learning

Article

May 2022

Person re-identification (ReID) aims at searching the same identity person among images captured by various cameras. Existing fully supervised person ReID methods usually suffer from poor generalization capability caused by domain gaps. Unsupervised person ReID has attracted a lot of attention recently, because it works without intensive manual ann...

ReSmooth: Detecting and Utilizing OOD Samples when Training with Data Augmentation

Preprint

May 2022

BaMBNet: A Blur-Aware Multi-Branch Network for Dual-Pixel Defocus Deblurring

Article

May 2022

Reducing the defocus blur that arises from the finite aperture size and short exposure time is an essential problem in computational photography. It is very challenging because the blur kernel is spatially varying and difficult to estimate by traditional methods. Due to its great breakthrough in low-level tasks, convolutional neural networks (CNNs)...

Multi-Task Interaction Learning for Spatiospectral Image Super-Resolution

Article

Mar 2022

High spatial resolution and high spectral resolution images (HR-HSIs) are widely applied in geosciences, medical diagnosis, and beyond. However, how to get images with both high spatial resolution and high spectral resolution is still a problem to be solved. In this paper, we present a deep spatial-spectral feature interaction network (SSFIN) for r...

Exploiting the Potential of Datasets: A Data-Centric Approach for Model Robustness

Preprint

Full-text available

Mar 2022

Robustness of deep neural networks (DNNs) to malicious perturbations is a hot topic in trustworthy AI. Existing techniques obtain robust models given fixed datasets, either by modifying model structures, or by optimizing the process of inference or training. While significant improvements have been made, the possibility of constructing a high-quali...

Figure 6. The test result of our scheduled attack on 8 German speed...

Performances of the proposed Shadow Attack with dif- ferent number of...

Shadows can be Dangerous: Stealthy and Effective Physical-world Adversarial Attack by Natural Phenomenon

Preprint

Full-text available

Mar 2022

Estimating the risk level of adversarial examples is essential for safely deploying machine learning models in the real world. One popular approach for physical-world attacks is to adopt the "sticker-pasting" strategy, which however suffers from some limitations, including difficulties in access to the target or printing by valid colors. A new type...

Rectified Meta-learning from Noisy Labels for Robust Image-based Plant Disease Classification

Article

Jan 2022

Plant diseases serve as one of main threats to food security and crop production. It is thus valuable to exploit recent advances of artificial intelligence to assist plant disease diagnosis. One popular approach is to transform this problem as a leaf image classification task, which can be then addressed by the powerful convolutional neural network...

Learning a 3D-CNN and Transformer Prior for Hyperspectral Image Super-Resolution

Article

Jan 2022

Deep Unfolding Network for Spatiospectral Image Super-Resolution

Article

Dec 2021

In this paper, we explore the spatiospectral image super-resolution (SSSR) task, i.e., joint spatial and spectral super-resolution, which aims to generate a high spatial resolution hyperspectral image (HR-HSI) from a low spatial resolution multispectral image (LR-MSI). To tackle such a severely ill-posed problem, one straightforward but inefficient...

Towards End-to-End Image Compression and Analysis with Transformers

Preprint

Dec 2021

Fig. 2. The network architecture of the proposed deep attentional...

Fig. 3. The network architecture of the proposed attentional kernel...

Fig. 4. Qualitative comparison for recovered depth maps (8×). (a)...

Fig. 5. Visual comparison of 8× saliency map super-resolution on the...

Deep Attentional Guided Image Filtering

Preprint

Full-text available

Dec 2021

Guided filter is a fundamental tool in computer vision and computer graphics which aims to transfer structure information from guidance image to target image. Most existing methods construct filter kernels from the guidance itself without considering the mutual dependency between the guidance and the target. However, since there typically exist sig...

High-Resolution Depth Maps Imaging via Attention-Based Hierarchical Multi-Modal Fusion

Article

Dec 2021

Depth map records distance between the viewpoint and objects in the scene, which plays a critical role in many real-world applications. However, depth map captured by consumer-grade RGB-D cameras suffers from low spatial resolution. Guided depth map super-resolution (DSR) is a popular approach to address this problem, which attempts to restore a hi...

Deep Learning-based Face Super-resolution: A Survey

Article

Nov 2021

Face super-resolution (FSR), also known as face hallucination, which is aimed at enhancing the resolution of low-resolution (LR) face images to generate high-resolution face images, is a domain-specific image super-resolution problem. Recently, FSR has received considerable attention and witnessed dazzling advances with the development of deep lear...

Target-guided Adaptive Base Class Reweighting for Few-Shot Learning

Conference Paper

Oct 2021

Learning with Noisy Labels via Sparse Regularization

Conference Paper

Oct 2021

Deep Learning-based Face Super-resolution: A Survey

Preprint

Full-text available

Sep 2021

Face super-resolution (FSR), also known as face hallucination, which is aimed at enhancing the resolution of low-resolution (LR) face images to generate high-resolution (HR) face images, is a domain-specific image super-resolution problem. Recently, FSR has received considerable attention and witnessed dazzling advances with the development of deep...

Figure 1. Visualization of learned representations on MNIST with 0.8...

Figure 5. Visualization of learned representations on MNIST with...

Validation accuracy on imbalanced CIFAR-10/-100.

Learning with Noisy Labels via Sparse Regularization

Preprint

Full-text available

Jul 2021

Learning with noisy labels is an important and challenging task for training accurate deep neural networks. Some commonly-used loss functions, such as Cross Entropy (CE), suffer from severe overfitting to noisy labels. Robust loss functions that satisfy the symmetric condition were tailored to remedy this problem, which however encounter the underf...

NormalNet: Learning-Based Mesh Normal Denoising via Local Partition Normalization

Article

Jul 2021

Mesh denoising is a critical technology in geometry processing that aims to recover high-fidelity 3D mesh models of objects from noise-corrupted versions. In this work, we propose a learning-based mesh normal denoising scheme, called NormalNet , which employs deep networks to find the correlation between the volumetric representation and denoised...

Heatmap-Aware Pyramid Face Hallucination

Conference Paper

Jul 2021

Zero-Shot Multi-Focus Image Fusion

Conference Paper

Jul 2021

Figure 5. Visualization for GCE (top) and AGCE (bottom) on MNIST with...

Figure 7. Visualization for GCE (top) and AGCE (bottom) on MNIST with...

Asymmetric Loss Functions for Learning with Noisy Labels

Preprint

Full-text available

Jun 2021

Robust loss functions are essential for training deep neural networks with better generalization power in the presence of noisy labels. Symmetric loss functions are confirmed to be robust to label noise. However, the symmetric condition is overly restrictive. In this work, we propose a new class of loss functions, namely \textit{asymmetric loss fun...

Fast and Accurate Single-Image Depth Estimation on Mobile Devices, Mobile AI 2021 Challenge: Report

Conference Paper

Jun 2021

Learning Scalable ℓ ∞ -constrained Near-lossless Image Compression via Joint Lossy Image and Residual Compression

Conference Paper

Jun 2021

Physics-based Iterative Projection Complex Neural Network for Phase Retrieval in Lensless Microscopy Imaging

Conference Paper

Jun 2021

BaMBNet: A Blur-aware Multi-branch Network for Defocus Deblurring

Preprint

May 2021

The defocus deblurring raised from the finite aperture size and exposure time is an essential problem in the computational photography. It is very challenging because the blur kernel is spatially varying and difficult to estimate by traditional methods. Due to its great breakthrough in low-level tasks, convolutional neural networks (CNNs) have been...

Fast and Accurate Single-Image Depth Estimation on Mobile Devices, Mobile AI 2021 Challenge: Report

Preprint

Full-text available

May 2021

Depth estimation is an important computer vision problem with many practical applications to mobile devices. While many solutions have been proposed for this task, they are usually very computationally expensive and thus are not applicable for on-device inference. To address this problem, we introduce the first Mobile AI challenge, where the target...

Fig. 1. Comparison of state-of-the-art methods for 4× DSR on Middlebury...

Fig. 2. The network architecture of our proposed attention-based...

Fig. 3. Multi-modal attention based fusion (MMAF), where is pixel-wise...

Fig. 4. Hierarchical feature collaboration with four BHFCs, where ⊕...

Fig. 5. Visual comparison of 16× upsampling results on Art, Moebius and...

High-resolution Depth Maps Imaging via Attention-based Hierarchical Multi-modal Fusion

Preprint

Full-text available

Apr 2021

Learning Scalable $\ell_\infty$-constrained Near-lossless Image Compression via Joint Lossy Image and Residual Compression

Preprint

Mar 2021

We propose a novel joint lossy image and residual compression framework for learning $\ell_\infty$-constrained near-lossless image compression. Specifically, we obtain a lossy reconstruction of the raw image through lossy image compression and uniformly quantize the corresponding residual to satisfy a given tight $\ell_\infty$ error bound. Suppose...

Multilayer Spectral-Spatial Graphs for Label Noisy Robust Hyperspectral Image Classification

Article

Oct 2020

In hyperspectral image (HSI) analysis, label information is a scarce resource and it is unavoidably affected by human and nonhuman factors, resulting in a large amount of label noise. Although most of the recent supervised HSI classification methods have achieved good classification results, their performance drastically decreases when the training...

Unsupervised Constrative Person Re-identification

Preprint

Oct 2020

Person re-identification (ReID) aims at searching the same identity person among images captured by various cameras. Unsupervised person ReID attracts a lot of attention recently, due to it works without intensive manual annotation and thus shows great potential of adapting to new conditions. Representation learning plays a critical role in unsuper...

Single Image Deraining via Scale-space Invariant Attention Neural Network

Conference Paper

Oct 2020

Semi-Supervised Graph Convolutional Hashing Network For Large-Scale Cross-Modal Retrieval

Conference Paper

Full-text available

Oct 2020

Single Image Deraining via Scale-space Invariant Attention Neural Network

Preprint

Jun 2020

Image enhancement from degradation of rainy artifacts plays a critical role in outdoor visual computing systems. In this paper, we tackle the notion of scale that deals with visual changes in appearance of rain steaks with respect to the camera. Specifically, we revisit multi-scale representation by scale-space theory, and propose to represent the...

Learning Spectral-Spatial Prior for Super-Resolution of Hyperspectral Imagery

Article

May 2020

Recently, single gray/RGB image super-resolution reconstruction task has been extensively studied and made significant progress by leveraging the advanced machine learning techniques based on deep convolutional neural networks (DCNNs). However, there has been limited technical development focusing on single hyperspectral image super-resolution due...