Home
Nanjing University of Posts and Telecommunications
Zheng Liu

Zheng Liu
Nanjing University of Posts and Telecommunications · School of Computer Science

Doctor of Philosophy

About

Publications

19,017

Reads

348

Citations

Publications

Multiplex network of user interactions in Twitter, where nodes are...

Overview of LIAMNE. For each node, two types of embeddings are finally...

Performance of GATNE and LIAMNE with different imbalance ratios on IMDB...

Ablation study of LIAMNE*, LIAMNE- and LIAMNE

Layer imbalance-aware multiplex network embedding

Article

Full-text available

Mar 2024

Multiplex network embedding is an effective technique to jointly learn the low-dimensional representations of nodes across network layers. However, the number of edges among layers may vary significantly. This data imbalance will lead to performance degradation especially on the sparse layer due to learning bias and the adverse effects of irrelevan...

M$$^3$$FGM: A Node Masking and Multi-granularity Message Passing-Based Federated Graph Model for Spatial-Temporal Data Prediction

Chapter

Nov 2023

Researchers are solving the challenges of spatial-temporal prediction by combining Federated Learning (FL) and graph models with respect to the constrain of privacy and security. In order to make better use of the power of graph model, some researchs also combine split learning (SL). However, there are still several issues left unattended: 1) Clien...

Collaborative bi-aggregation for directed graph embedding

Article

May 2023

Directed graph is able to model asymmetric relationships between nodes and research on directed graph embedding is of great significance in downstream graph analysis and inference. Learning source and target embeddings of nodes separately to preserve edge asymmetry has become the dominant approach, but also poses challenge for learning representati...

Collaborative Bi-Aggregation for Directed Graph Embedding

Article

Jan 2023

PM $^{2}$ VE: Power Metering Model for Virtualization Environments in Cloud Data Centers

Article

Jul 2023

Virtualization technologies provide solutions for cloud computing. Virtual resource scheduling is a crucial task in data centers, and the power consumption of virtual resources is a critical foundation of virtualization scheduling. Containers are the smallest unit of virtual resource scheduling and migration. Although many practical models for esti...

Layer Imbalance Aware Multiplex Network Embedding

Preprint

Dec 2022

Collaborative Bi-Aggregation for Directed Graph Embedding

Preprint

Nov 2022

Directed graphs model asymmetric relationships between nodes and research on directed graph embedding is of great significance in downstream graph analysis and inference. Learning source and target embedding of nodes separately to preserve edge asymmetry has become the dominant approach, but also poses challenge for learning representations of low...

M3FGM:a node masking and multi-granularity message passing-based federated graph model for spatial-temporal data prediction

Preprint

Oct 2022

Researchers are solving the challenges of spatial-temporal prediction by combining Federated Learning (FL) and graph models with respect to the constrain of privacy and security. However, there are still several issues left unattended: 1) Clients might not be able to access the server during inference phase; 2) The graph of clients designed manuall...

Cost-Sensitive Tensor-Based Dual-stage Attention LSTM with Feature Selection for Data Center Server Power Forecasting

Article

Oct 2022

The power forecasting has a guiding effect on power-aware scheduling strategies to reduce unnecessary power consumption in data centers. Many metrics related to power consumption can be collected in physical servers, such as the status of CPU, memory, and other components. However, most existing methods empirically exploit a small number of metrics...

Figure 7. Complex token transfer pattern: sequential first, radial later

THE PARAMETERS AND THE RESULTS OF DBSCAN

Fighting Sybils in Airdrops

Preprint

Full-text available

Sep 2022

Airdrop is a crucial concept in tokenomics. Startups of decentralized applications (DApps) reward early supporters by airdropping newly issued tokens up to a certain amount as a free giveaway. This naturally induces greedy hackers, called Sybils, to create multiple accounts for more shares. Most airdrops have prerequisites for qualification, in whi...

One-to-many comparative summarization for patents

Article

Mar 2022

Patents bring technology companies commercial values in modern business operations. However, companies have to bear the high cost of handling patent applications or infringement cases. A common yet expensive task among these jobs is to analyze relevant patent literature. Lengthy and technically complicated patents require a large number of human ef...

Data characteristics aware prediction model for power consumption of data center servers

Article

Feb 2022

Due to the rapid increase in the number and scale of data centers, the information and communication technology (ICT) equipment in data centers consumes an enormous amount of power. A power prediction model is therefore essential for decision‐making optimization and power management of ICT equipment. However, it is difficult to predict the power co...

Heterogeneous graph convolutional network with local influence

Article

Nov 2021

Graph convolutional networks (GCNs) have recently drawn extensive attention due to their superior learning performance on graph data. Through graph convolution, topological structure and node attributes can be simultaneously aggregated in a local neighborhood. In heterogeneous information networks (HINs), the diversity of node and edge types poses...

Towards better time series prediction with model-independent, low-dispersion clusters of contextual subsequence embeddings

Article

Oct 2021

Many applications in financial investment management, environmental pollution reduction, energy resource scheduling, and consumer sales promotion need to forecast future values in multivariate tie series. Despite extensive research efforts laid on the task for decades from the perspective of prediction models, this paper explores boost the predicti...

Loss Attenuation for Time Series Prediction Respecting Categories of Values

Chapter

Aug 2021

Forecasting future values is a core task in many applications dealing with multivariate time series data. In pollution monitoring, for example, forecasting future PM$_{\boldsymbol{2.5}}$ values in air is very common, which is a crucial indicator of the air quality index (AQI). These values in time series are sometimes affiliated with category inf...

An Interpretability Method of Workflow Power Consumption Based on Feature Contribution

Conference Paper

Mar 2021

What nodes vote to? Graph classification without readout phase

Conference Paper

Jan 2021

DPAST-RNN: A Dual-Phase Attention-Based Recurrent Neural Network Using Spatiotemporal LSTMs for Time Series Prediction

Chapter

Nov 2020

For time series forecasting, the weight distribution among multivariables and the long-short-term time dependence are always very important and challenging. Traditional machine forecasting can’t automatically select the effective features of multivariable input and can’t capture the time dependence of sequences. The key to solve this problem is to...

Estimating Power Consumption of Containers and Virtual Machines in Data Centers

Conference Paper

Sep 2020

Cross Message Passing Graph Neural Network

Conference Paper

Jul 2020

Collaboratively Modeling and Embedding of Latent Topics for Short Texts

Article

Full-text available

May 2020

Deriving a successful document representation is the critical challenge in many downstream tasks in NLP, especially when documents are very short. It is challenging to handle the sparsity and the noise problems confronting short texts. Some approaches employ latent topic models, based on global word co-occurrence, to obtain topic distribution as th...

A Non-Intrusive, Traffic-Aware Prediction Framework for Power Consumption in Data Center Operations

Article

Full-text available

Feb 2020

Modern cloud computing relies heavily on data centers, which usually host tens of thousands of servers. Predicting the power consumption accurately in data center operations is crucial for energy optimization. In this paper, we formulate the power consumption prediction at both the fine-grained and coarse-grained level. We carefully discuss the des...

PPGCN: A Message Selection Based Approach for Graph Classification

Chapter

Dec 2019

Recently, the Graph Convolutional Networks (GCNs) have achieved state-of-the-art performance in many graph data related tasks. However, traditional GCNs may generate redundant information in the message passing phase. In order to solve this problem, we propose a novel graph convolution named Push-and-Pull Convolution (PPC), which follows the messag...

Behavior Flow Graph Construction from System Logs for Anomaly Analysis

Chapter

Full-text available

Dec 2019

Anomaly analysis plays a significant role in building a secure and reliable system. Raw system logs contain important system information, such as execution paths and execution time. People often use system logs for fault diagnosis and root cause localization. However, due to the complexity of raw system logs, these tasks can be arduous and ineffect...

Multi-Granularity Power Prediction for Data Center Operations via Long Short-Term Memory Network

Conference Paper

Full-text available

Dec 2019

PCP-2LSTM: Two Stacked LSTM-Based Prediction Model for Power Consumption in Data Centers

Conference Paper

Full-text available

Sep 2019

ADPR: An Attention-based Deep Learning Point-of-Interest Recommendation Framework

Conference Paper

Full-text available

Jul 2019

SC-NER: A Sequence-to-Sequence Model with Sentence Classification for Named Entity Recognition

Chapter

Full-text available

Mar 2019

Named Entity Recognition (NER) is a basic task in Natural Language Processing (NLP). Recently, the sequence-to-sequence (seq2seq) model has been widely used in NLP task. Different from the general NLP task, 60% sentences in the NER task do not contain entities. Traditional seq2seq method cannot address this issue effectively. To solve the aforement...

Event Extraction from Streaming System Logs: ICISA 2018

Chapter

Jan 2019

Log data is typically the only available data source recording system health information. Event extraction converts unstructured log messages into structured event signatures. Existing methods, whether batch or streaming methods, require true event signatures to guide parameter selection. This paper presents a streaming event extraction method that...

SMAS: An Investor-Oriented Social Media Analysis System for Movies

Conference Paper

Dec 2018

Multiple Relational Topic Modeling for Noisy Short Texts

Article

Nov 2018

Understanding contents in social networks by inferring high-quality latent topics from short texts is a significant task in social analysis, which is challenging because social network contents are usually extremely short, noisy and full of informal vocabularies. Due to the lack of sufficient word co-occurrence instances, well-known topic modeling...

Neural network architecture of the skip-gram model

PatSearch: an integrated framework for patentability retrieval

Article

Full-text available

Oct 2018

Patent retrieval primarily focuses on searching relevant legal documents with respect to a given query. Depending on the purposes of specific retrieval tasks, processes of patent retrieval may differ significantly. Given a patent application, it is challenging to determine its patentability, i.e., to decide whether a similar invention has been publ...

Keyphrase Extraction Based on Optimized Random Walks on Multiple Word Relations: Second International Joint Conference, APWeb-WAIM 2018, Macau, China, July 23–25, 2018, Proceedings, Part II

Chapter

Jul 2018

Extracting keyphrases from documents helps to reduce the document information and further assist in information retrieval. In this paper, we construct a multi-relational graph by considering heterogeneous latent word relations (the co-occurrence and the semantic) in a document. Then we optimize the random walks on the multi-relational graph to dete...

Topic Modeling for Noisy Short Texts with Multiple Relations

Conference Paper

Full-text available

Jul 2018

Extracting Keyphrases Using Heterogeneous Word Relations

Conference Paper

Full-text available

Sep 2017

Extracting keyphrases from documents for providing a quick and insightful summarization is an interesting and important task, on which lots of research efforts have been laid. Most of the existing methods could be categorized as co-occurrence based, statistic-based, or semantics-based. The co-occurrence based methods do not take various word relati...

STAR: A System for Ticket Analysis and Resolution

Conference Paper

Full-text available

Aug 2017

In large scale and complex IT service environments, a problematic incident is logged as a ticket and contains the ticket summary (sys-tem status and problem description). The system administrators log the step-wise resolution description when such tickets are resolved. The repeating service events are most likely resolved by inferring similar histo...

FLAP: An End-to-End Event Log Analysis Platform for System Management

Conference Paper

Full-text available

Aug 2017

Many systems, such as distributed operating systems, complex networks, and high throughput web-based applications, are continuously generating large volume of event logs. These logs contain useful information to help system administrators to understand the system running status and to pinpoint the system failures. Generally, due to the scale and co...

FIU-Miner (a fast, integrated, and user-friendly system for data mining) and its applications

Article

Full-text available

Aug 2017

The advent of Big Data era drives data analysts from different domains to use data mining techniques for data analysis. However, performing data analysis in a specific domain is not trivial; it often requires complex task configuration, onerous integration of algorithms, and efficient execution in distributed environments. Few efforts have been pai...

Data-Driven Techniques in Computing System Management

Article

Jul 2017

Modern forms of computing systems are becoming progressively more complex, with an increasing number of heterogeneous hardware and software components. As a result, it is quite challenging to manage these complex systems and meet the requirements in manageability, dependability, and performance that are demanded by enterprise customers. This survey...

Identifying Relevant Subgraphs in Large Networks

Conference Paper

Sep 2016

Structural relationships between objects are used to model as graphs in many applications. In this paper, we study the problem of identifying relevant subgraphs in large networks. Relevant subgraphs in large networks contain network elements which are maintained by network administrators. We formalize the problem and propose a framework consisting...

Frequent Subgraph Summarization with Error Control

Conference Paper

Jun 2013

Frequent subgraph mining has been an important research problem in the literature. However, the huge number of discovered frequent subgraphs becomes the bottleneck for exploring and understanding the generated patterns. In this paper, we propose to summarize frequent subgraphs with an independence probabilistic model, with the goal to restore the f...

Detecting Priming News Events

Article

Jan 2012

We study a problem of detecting priming events based on a time series index and an evolving document stream. We define a priming event as an event which triggers abnormal movements of the time series index, i.e., the Iraq war with respect to the president approval index of President Bush. Existing solutions either focus on organizing coherent keywo...

Approximate Homogeneous Graph Summarization

Article

Jan 2012

Graph patterns are able to represent the complex structural relations among objects in many applications in various domains. The objective of graph summarization is to obtain a concise representation of a single large graph, which is interpretable and suitable for analysis. A good summary can reveal the hidden relationships between nodes in a graph...

On Summarizing Graph Homogeneously

Conference Paper

Apr 2011

Graph summarization is to obtain a concise representation of a large graph, which is suitable for visualization and analysis. The main idea is to construct a super-graph by grouping similar nodes together. In this paper, we propose a new information-preserving approach for graph summarization, which consists of two parts: a super-graph and a list o...

Fires on the Web: Towards Efficient Exploring Historical Web Graphs

Conference Paper

Apr 2010

Discovery of evolving regions in large graphs is an important issue because it is the basis of many applications such as spam websites detection in the Web, community lifecycle exploration in social networks, and so forth. In this paper, we aim to study a new problem, which explores the evolution process between two historic snapshots of an evolvin...

Discovering Burst Areas in Fast Evolving Graphs

Conference Paper

Apr 2010

Evolving graphs are used to model the relationship variations between objects in many application domains such as social networks, sensor networks, and telecommunication. In this paper, we study a new problem of discovering burst areas that exhibit dramatic changes during some periods in evolving graphs. We focus on finding the top-k results in a s...

Spotting Significant Changing Subgraphs in Evolving Graphs

Conference Paper

Dec 2008

Graphs are popularly used to model structural relation- ships between objects. In many application domains such as social networks, sensor networks and telecommunication, graphs evolve over time. In this paper, we study a new prob- lem of discovering the subgraphs that exhibit significant changes in evolving graphs. This problem is challenging sinc...

Integrating Multiple Data Sources for Stock Prediction

Conference Paper

Aug 2008

In many real world applications, decisions are usually made by collecting and judging information from multiple different data sources. Let us take the stock market as an example. We never make our decision based on just one single piece of advice, but always rely on a collection of information, such as the stock price movements, exchange volumes,...

Mining Multiple Time Series Co-movements

Conference Paper

Apr 2008

In this paper, we propose a new model, called co-movement model, for constructing financial portfolios by analyzing and mining the co-movement patterns among multiple time series. Unlike the existing approaches where the portfolios’ expected risks are computed based on the co-variances among the assets in the portfolios, we model their risks by con...

Detection of Shape Anomalies: A Probabilistic Approach Using Hidden Markov Models

Conference Paper

Apr 2008

We study the problem of detecting the shape anomalies in this paper. Our shape anomaly detection algorithm is performed on the one-dimensional representation (time series) of shapes, whose similarity is modeled by a generalized segmental hidden Markov model (HMM) under a scaling, translation and rotation invariant manner. Experimental results show...

Conference Paper

Oct 2005

Driven by many real applications, in this paper we study the problem of similarity search with implicit object features; that is, the features of each object are not pre-computed/evaluated. As the ex- isting similarity search techniques are not applicable, a novel and ecient algorithm is developed in this paper to approach the problem. The R-tree b...

Locating Motifs in Time-Series Data

Conference Paper

Jan 2005

Finding motifs in time-series is proposed to make clustering of time-series subsequences meaningful, because most existing algorithms of clustering time-series subsequences are reported meaningless in recent studies. The existing motif finding algorithms emphasize the efficiency at the expense of quality, in terms of the number of time-series subse...