Chapter

Answer Passage Ranking Enhancement Using Shallow Linguistic Features

Abstract

Question Answering (QA) systems play an important role in decision support systems. Deep neural network-based passage rankers have recently been developed to more effectively rank passages likely to contain answers for QA purposes. These rankers rely on distributed word or sentence embeddings. Such distributed representations mostly capture the semantic relatedness of text units, while explicit linguistic features remain under-represented. In this paper, we take novel approaches to combining linguistic features (such as different part-of-speech measures) with distributed sentence representations of questions and passages. Our experiments use the fact-seeking questions and short text passages of the QUASAR-T dataset. They show that ensembling deep relevance measures based on pure sentence embeddings with linguistic features, using several machine learning techniques, fails to improve upon the passage ranking performance of our baseline neural network ranker, whereas concatenating the same features within the network structure significantly improves the overall passage ranking performance for QA.
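The concatenation strategy described in the abstract can be pictured as appending a small vector of linguistic feature values to the question and passage embeddings before the scoring layers of the ranker. The following is a minimal PyTorch-style sketch of this idea; the layer sizes, feature count, and module names are illustrative assumptions, not the authors' exact architecture.

```python
# Minimal sketch of concatenating shallow linguistic features with
# sentence embeddings inside a neural passage ranker (PyTorch).
# Dimensions, layers, and the feature set are illustrative assumptions.
import torch
import torch.nn as nn

class FeatureAugmentedRanker(nn.Module):
    def __init__(self, emb_dim: int = 384, num_ling_feats: int = 6):
        super().__init__()
        # Question embedding + passage embedding + linguistic features
        in_dim = 2 * emb_dim + num_ling_feats
        self.scorer = nn.Sequential(
            nn.Linear(in_dim, 256),
            nn.ReLU(),
            nn.Linear(256, 1),          # single relevance score per pair
        )

    def forward(self, q_emb, p_emb, ling_feats):
        # q_emb, p_emb: (batch, emb_dim); ling_feats: (batch, num_ling_feats)
        x = torch.cat([q_emb, p_emb, ling_feats], dim=-1)
        return self.scorer(x).squeeze(-1)

# Usage with random tensors standing in for real embeddings and features.
ranker = FeatureAugmentedRanker()
q = torch.randn(4, 384)               # 4 question embeddings
p = torch.randn(4, 384)               # 4 candidate passage embeddings
f = torch.rand(4, 6)                  # e.g. noun/pronoun counts, term coverage
print(ranker(q, p, f))                # one relevance score per question-passage pair
```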
... In such a pipelined architecture, the effectiveness of passage retrieval and ranking has been shown to consistently have a positive correlation with the overall performance of a QA system [11-13]. Recent advances in the QA and passage retrieval domains provide evidence that linguistic features of passages and questions play an important role in improving answer passage retrieval effectiveness [14]. To illustrate this, Table 1 summarizes two examples that show the effect of features such as the number of pronouns, the number of nouns, and the query term coverage on the correct identification of answers to the respective natural language questions. ...
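The features mentioned in this excerpt are inexpensive to compute from raw text. The sketch below uses NLTK part-of-speech tags to count nouns and pronouns and to measure query term coverage; the exact feature definitions in the cited work may differ, so this is an illustrative assumption rather than the original implementation.

```python
# Sketch of shallow linguistic features for a question/passage pair:
# noun count, pronoun count, and query term coverage.
# Requires: pip install nltk, plus the 'punkt' and
# 'averaged_perceptron_tagger' NLTK data packages.
import nltk

NOUN_TAGS = {"NN", "NNS", "NNP", "NNPS"}
PRONOUN_TAGS = {"PRP", "PRP$", "WP", "WP$"}

def shallow_features(question: str, passage: str) -> dict:
    q_tokens = [t.lower() for t in nltk.word_tokenize(question)]
    p_tokens = [t.lower() for t in nltk.word_tokenize(passage)]
    tags = [tag for _, tag in nltk.pos_tag(nltk.word_tokenize(passage))]
    # Fraction of distinct question terms that also appear in the passage.
    coverage = len(set(q_tokens) & set(p_tokens)) / max(len(set(q_tokens)), 1)
    return {
        "num_nouns": sum(tag in NOUN_TAGS for tag in tags),
        "num_pronouns": sum(tag in PRONOUN_TAGS for tag in tags),
        "query_term_coverage": coverage,
    }

print(shallow_features(
    "Who wrote the novel Moby-Dick?",
    "Moby-Dick is a novel that Herman Melville wrote in 1851.",
))
```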
... every single question. In contrast to existing works in the domain, e.g., those in [11,14], this dynamic passage ranking process therefore goes beyond a generic, static model that would be applied across several different questions with the same set of feature importance measures. DEA was originally developed to measure the technical and operational efficiency of Decision Making Units (DMUs) that use multiple inputs to produce multiple outputs [15]. ...
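For readers unfamiliar with DEA, the efficiency of each DMU is obtained by solving a small linear program. The sketch below implements the standard input-oriented CCR envelopment model with scipy.optimize.linprog on made-up data; treating each candidate passage as a DMU with feature-based inputs and relevance-score outputs is an assumption for illustration, not the exact model of the cited work.

```python
# Sketch of the input-oriented CCR DEA model, solved once per DMU.
# X: (m_inputs, n_dmus), Y: (s_outputs, n_dmus). Data below are made up.
import numpy as np
from scipy.optimize import linprog

def ccr_efficiency(X, Y, o):
    m, n = X.shape
    s, _ = Y.shape
    # Decision variables: z = [theta, lambda_1, ..., lambda_n]
    c = np.zeros(n + 1)
    c[0] = 1.0                                  # minimise theta
    A_ub, b_ub = [], []
    for i in range(m):                          # inputs: sum_j l_j * x_ij <= theta * x_io
        A_ub.append(np.concatenate(([-X[i, o]], X[i, :])))
        b_ub.append(0.0)
    for r in range(s):                          # outputs: sum_j l_j * y_rj >= y_ro
        A_ub.append(np.concatenate(([0.0], -Y[r, :])))
        b_ub.append(-Y[r, o])
    bounds = [(0, None)] * (n + 1)
    res = linprog(c, A_ub=np.array(A_ub), b_ub=np.array(b_ub), bounds=bounds)
    return res.x[0]                             # efficiency score in (0, 1]

# Toy example: 4 DMUs, 2 inputs, 2 outputs (illustrative numbers only).
X = np.array([[2.0, 3.0, 4.0, 5.0],
              [1.0, 2.0, 1.5, 3.0]])
Y = np.array([[4.0, 5.0, 6.0, 5.0],
              [2.0, 3.0, 2.5, 4.0]])
for o in range(X.shape[1]):
    print(f"DMU {o}: efficiency = {ccr_efficiency(X, Y, o):.3f}")
```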
... Such embeddings capture contextual information around any given unit of text in such a way that relationships between semantically related units can be established (for example, between the words city and Melbourne, or laptop and computer). The studies in [11,14,42-48], those in [49,50] using Bidirectional Encoder Representations from Transformers (BERT) [51], and the work in [52] using phrase-level learning models and dense representations [53,54] are examples of the utilization of deep neural structures to improve passage retrieval. BERT and dense representations have been used in [55] to overcome low passage ranking performance in the presence of typos in query terms. ...
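A minimal illustration of this kind of embedding-based relevance scoring follows, assuming the sentence-transformers library and the publicly available all-MiniLM-L6-v2 model; neither is specified by the excerpt above, so both are assumptions made for the example.

```python
# Sketch: ranking passages by cosine similarity of sentence embeddings.
# Assumes: pip install sentence-transformers
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")

question = "What is the capital of Victoria, Australia?"
passages = [
    "Melbourne is the capital city of the Australian state of Victoria.",
    "A laptop is a small portable personal computer.",
]

q_emb = model.encode(question, convert_to_tensor=True)
p_embs = model.encode(passages, convert_to_tensor=True)
scores = util.cos_sim(q_emb, p_embs)[0].tolist()   # one similarity per passage

# Higher cosine similarity is taken as higher semantic relatedness.
for passage, score in sorted(zip(passages, scores), key=lambda x: -x[1]):
    print(f"{score:.3f}  {passage}")
```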
Article
Question Answering (QA) systems play an important role in today’s human–computer interaction systems. QA performance can be significantly improved using effective answer passage retrieval and ranking techniques. In this paper, we focus on both non-machine-learning-based and deep-learning-based passage retrieval and ranking systems for QA, leveraging linguistic features within the text of questions and passages to improve passage ranking effectiveness. We propose a decoupled linguistic and linear programming-based approach for passage ranking using the Data Envelopment Analysis (DEA) technique to improve over well-established answer passage retrieval techniques. Our method scores passages using information retrieval and deep learning relevance metrics, represents retrieved passages using their relevance scores and several linguistic features, and finally makes use of DEA to re-rank the retrieved list of passages. The high effectiveness and significance of our proposed passage ranking method are demonstrated through several experiments conducted on a standard benchmark dataset.
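The decoupled pipeline described in this abstract amounts to three steps: score each retrieved passage with several relevance measures, attach linguistic features, and re-rank by efficiency. The sketch below strings these steps together, using a simple ratio of outputs to inputs as a stand-in for the full per-passage DEA linear programs; the feature choices and the treatment of relevance scores as DEA outputs are illustrative assumptions.

```python
# Sketch of the decoupled re-ranking pipeline: score, represent, re-rank.
# The efficiency function is a simplified stand-in for DEA; a real DEA
# model would solve one linear program per passage. Scores are assumed
# to be normalised to comparable scales.
from dataclasses import dataclass

@dataclass
class ScoredPassage:
    text: str
    bm25_score: float        # information retrieval relevance (output)
    neural_score: float      # deep relevance measure (output)
    length: int              # simple structural feature (input)

def simple_efficiency(p: ScoredPassage) -> float:
    # Ratio of summed outputs to inputs; illustrative only.
    outputs = p.bm25_score + p.neural_score
    inputs = max(p.length, 1)
    return outputs / inputs

def rerank(passages):
    return sorted(passages, key=simple_efficiency, reverse=True)

candidates = [
    ScoredPassage("Short, highly relevant passage.", 0.71, 0.92, 5),
    ScoredPassage("A much longer passage that is only loosely related "
                  "to the question being asked.", 0.68, 0.55, 16),
]
for p in rerank(candidates):
    print(f"{simple_efficiency(p):.3f}  {p.text}")
```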
... Re-ranking with a PLM can be performed with a cross-encoder architecture, where the question and passage are passed to an encoder trained to identify their relevance. However, the question-passage pair contains semantic features that can help produce a better re-ranking; see, e.g., [16,17]. In this paper, we analyze the use of additional information extraction from both the question and the passage, and its effect on re-ranking. ...
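A minimal sketch of the cross-encoder re-ranking step mentioned in this excerpt follows; the model name is an assumption (a publicly available MS MARCO cross-encoder), not necessarily the model used in the cited work.

```python
# Sketch of cross-encoder re-ranking: the question and each candidate
# passage are scored jointly by one encoder.
# Assumes: pip install sentence-transformers
from sentence_transformers import CrossEncoder

reranker = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2")

question = "Who painted the ceiling of the Sistine Chapel?"
passages = [
    "Michelangelo painted the ceiling of the Sistine Chapel between 1508 and 1512.",
    "The Sistine Chapel is located in Vatican City.",
]

scores = reranker.predict([(question, p) for p in passages])
for p, s in sorted(zip(passages, scores), key=lambda x: -x[1]):
    print(f"{s:.3f}  {p}")
```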
... A significant difference in the distribution of linguistic features between relevant and irrelevant passages is identified. Linguistic features, like parts of speech, are combined with sentence embedding models using the feature concatenation technique in [17]. This enhances the vector representations of the questions and answer passages. ...
Conference Paper
Passage re-ranking in question answering (QA) systems is a method to reorder a set of retrieved passages related to a given question, so that answer-containing passages are ranked higher than non-answer-containing passages. With recent advances in language models, passage ranking has become more effective due to improved natural language understanding of the relationship between questions and answer passages. With neural network models, question-passage pairs are used to train a cross-encoder that predicts the semantic relevance score of the pairs and is subsequently used to rank retrieved passages. This paper reports on the use of open information extraction (OpenIE) triples in the form <subject, verb, object> for questions and passages to enhance answer passage ranking in neural network models. Coverage and overlap scores of question-passage triples are studied, and a novel loss function is developed using the proposed triple-based features to better learn a cross-encoder model for re-ranking passages. Experiments on three benchmark datasets compare the proposed loss function against the baseline BERT and ERNIE models, demonstrating improved passage re-ranking performance. Keywords: Passage re-ranking, Information extraction, Passage retrieval, Linked open data
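As a rough illustration of the triple-based signals described above, the sketch below computes coverage and overlap between question and passage triples that are assumed to have been extracted already (e.g., by an OpenIE system); the exact score definitions and the loss function of the cited paper are not reproduced here.

```python
# Sketch of triple-based coverage/overlap features between a question
# and a passage. Triples are pre-extracted (subject, verb, object)
# tuples; the scoring definitions are illustrative assumptions.

def triple_terms(triples):
    # Flatten (subject, verb, object) triples into a lowercased term set.
    return {term.lower() for triple in triples for term in triple}

def coverage(question_triples, passage_triples):
    # Fraction of question triple terms that appear among passage triple terms.
    q_terms = triple_terms(question_triples)
    p_terms = triple_terms(passage_triples)
    return len(q_terms & p_terms) / max(len(q_terms), 1)

def overlap(question_triples, passage_triples):
    # Jaccard overlap between the two term sets.
    q_terms = triple_terms(question_triples)
    p_terms = triple_terms(passage_triples)
    return len(q_terms & p_terms) / max(len(q_terms | p_terms), 1)

q_triples = [("who", "wrote", "Hamlet")]
p_triples = [("Shakespeare", "wrote", "Hamlet"),
             ("Hamlet", "is", "a tragedy")]
print(coverage(q_triples, p_triples), overlap(q_triples, p_triples))
```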