Fig 1.
Illustration of the new natural language processing method for computing the frequency with which user-specified propositions are expressed in a corpus describing a disaster recovery process. These frequencies can be plotted across time.
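The figure describes counting how often a user-specified proposition is expressed in a dated news corpus. As a rough illustration only (not the paper's statistical syntax-based matcher), the sketch below scores sentences against a proposition with TF-IDF cosine similarity and bins the matches by month; the threshold, function names, and date handling are assumptions.

```python
# Minimal sketch (not the paper's method): approximate "how often is this
# proposition expressed?" with TF-IDF cosine similarity, then bin by month.
from collections import Counter

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

def proposition_frequency(proposition, sentences, dates, threshold=0.4):
    """Count sentences that match the proposition, grouped by month.

    `sentences` and `dates` are parallel lists (dates are datetime objects);
    `threshold` is an assumed cut-off, not a value from the paper.
    """
    vectorizer = TfidfVectorizer(stop_words="english")
    doc_matrix = vectorizer.fit_transform(sentences)
    prop_vector = vectorizer.transform([proposition])
    scores = cosine_similarity(prop_vector, doc_matrix).ravel()

    counts = Counter()
    for score, date in zip(scores, dates):
        if score >= threshold:
            counts[date.strftime("%Y-%m")] += 1
    return dict(sorted(counts.items()))
```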


Citations

... This procedure is used to investigate, process, and organize data. Further, we can examine textual relationships between structured and unstructured data using preprocessing [11]. ...
... Furthermore, [24] combines Twitter and Instagram datasets related to earthquakes. Three scenarios were used to evaluate the CRF model, namely CRF combined with LSTM-CRF, optimization of CRF, and a combination of LSTM with CRF. ...
... The results show that CRF optimization is superior to the other models. To evaluate this model, [24] developed a natural language processing (NLP) model to analyze a collection of disaster recovery texts. The presented method uses statistical syntax-based semantic matching. ...
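As a loose illustration of the CRF-based labelling these snippets describe (not the cited authors' implementation, and without the LSTM component), the sketch below trains a linear-chain CRF with sklearn-crfsuite on token-level features; the feature set and label scheme are assumptions.

```python
# Illustrative sketch only: a linear-chain CRF for token-level labelling of
# disaster-related tweets, using sklearn-crfsuite.
import sklearn_crfsuite

def token_features(tokens, i):
    """Simple hand-crafted features for token i (assumed feature set)."""
    word = tokens[i]
    return {
        "lower": word.lower(),
        "is_upper": word.isupper(),
        "is_digit": word.isdigit(),
        "prev": tokens[i - 1].lower() if i > 0 else "<BOS>",
        "next": tokens[i + 1].lower() if i < len(tokens) - 1 else "<EOS>",
    }

def train_crf(X, y):
    """X: per-sentence lists of feature dicts; y: matching lists of labels (e.g. "B-LOC", "O")."""
    crf = sklearn_crfsuite.CRF(algorithm="lbfgs", c1=0.1, c2=0.1,
                               max_iterations=100)
    crf.fit(X, y)
    return crf
```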
... An automated natural language processing (NLP) technique searches for, compiles, characterises, structures, and collates news coverage descriptions of disaster recovery, which is important for broadening the appeal of analysing text-heavy corpora in the study of pre- and post-disaster planning [8]. ...
Article
Many news sources picked up on Typhoon Rai (known locally as Typhoon Odette), and so did fake news outlets. The study homed in on this issue, aiming to create a model that can distinguish legitimate from illegitimate news articles. With this in mind, we chose the following machine learning algorithms: Logistic Regression, Random Forest, and Multinomial Naive Bayes, and implemented Bag of Words, TF-IDF, and lemmatization in the model. Gathering 160 datasets from legitimate and illegitimate sources, we trained and tested the models. Combining all the machine learning techniques, the Combined BOW model reached an accuracy of 91.07%, precision of 88.33%, recall of 94.64%, and an F1 score of 91.38%, while the Combined TF-IDF model reached an accuracy of 91.18%, precision of 86.89%, recall of 94.64%, and an F1 score of 90.60%.
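For context, a pipeline of the kind the abstract describes might look like the hedged sketch below; the train/test split, hyperparameters, and the assumption that the texts are already lemmatized are mine, not the authors'.

```python
# Minimal sketch of the kind of pipeline described above (assumed settings):
# BoW / TF-IDF features feeding Logistic Regression, Random Forest, and
# Multinomial Naive Bayes, with standard classification metrics.
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

def evaluate(texts, labels):
    """texts: lemmatized article strings; labels: 1 = legitimate, 0 = fake."""
    X_train, X_test, y_train, y_test = train_test_split(
        texts, labels, test_size=0.2, random_state=42)  # assumed split

    vectorizers = {"BOW": CountVectorizer(), "TF-IDF": TfidfVectorizer()}
    classifiers = {"LogReg": LogisticRegression(max_iter=1000),
                   "RandomForest": RandomForestClassifier(),
                   "MultinomialNB": MultinomialNB()}

    for vname, vec in vectorizers.items():
        for cname, clf in classifiers.items():
            model = make_pipeline(vec, clf)
            model.fit(X_train, y_train)
            pred = model.predict(X_test)
            print(vname, cname,
                  accuracy_score(y_test, pred), precision_score(y_test, pred),
                  recall_score(y_test, pred), f1_score(y_test, pred))
```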
... Following Lin et al. (2018), our extraction procedure is decomposed into two phases: one for speed and one for precision. The first phase serves as a coarse-grained filter to efficiently collect an initial pool of ATOMIC knowledge candidates P_i for each datapoint d_i ∈ D: if there is word overlap between d_i and an ATOMIC candidate event, the candidate is added to P_i. ...
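One plausible reading of that coarse-grained first phase is a simple word-overlap filter, sketched below; the tokenization and stopword list are assumptions.

```python
# Sketch of the coarse-grained first phase as described: keep any ATOMIC
# candidate event that shares at least one non-stopword token with the
# datapoint. Tokenization and the stopword list are assumptions.
STOPWORDS = {"the", "a", "an", "to", "of", "and", "in", "is"}

def tokens(text):
    return {t for t in text.lower().split() if t not in STOPWORDS}

def coarse_filter(datapoints, atomic_events):
    """Return, for each datapoint d_i, its candidate pool P_i."""
    pools = []
    for d in datapoints:
        d_tokens = tokens(d)
        pools.append([e for e in atomic_events if d_tokens & tokens(e)])
    return pools
```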
... To overcome this limitation, a system for damage data collection and report processing is required. In addition, techniques for unstructured report processing [28,41,42] may be adopted to enable automatic analysis and further reduce labor and time costs. ...
Article
Full-text available
This study developed a chatbot to improve the efficiency with which the government activates mine safety procedures during natural disasters. Taiwan has a comprehensive governmental system dedicated to responding to frequent natural disasters, and the Bureau of Mines has instituted clear procedures to ensure the delivery of disaster alarms and damage reports. However, these labor- and time-consuming procedures are inefficient. In this study, we propose a system framework for disaster-related information retrieval and immediate notifications to support the execution of mine safety procedures. The framework uses instant messaging (IM) applications as the user interface to look up information and send messages announcing the occurrence of disaster events. We evaluated the efficiency of the procedures before and after adopting the system and achieved a time-cost reduction of 55.8 min across three types of disaster events. The study demonstrates the feasibility of adopting novel techniques for decision-making and shows improvements in the efficiency and effectiveness of procedure activation.
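A schematic of the IM-based lookup idea (not the study's implementation) might look like the following Flask sketch; the endpoint, message format, and in-memory report store are placeholders.

```python
# Schematic sketch only: an incoming IM message with a site name returns the
# latest damage report; the message format and data source are assumptions.
from flask import Flask, jsonify, request

app = Flask(__name__)

# Stand-in for the disaster-event/damage-report store used by a real system.
DAMAGE_REPORTS = {
    "site-a": "No damage reported after the most recent earthquake.",
    "site-b": "Access road blocked; crews dispatched.",
}

@app.route("/im-webhook", methods=["POST"])
def im_webhook():
    """Handle a message from the IM platform and reply with a report lookup."""
    text = request.get_json(force=True).get("text", "").strip().lower()
    reply = DAMAGE_REPORTS.get(text, "No report found for that site.")
    return jsonify({"reply": reply})
```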
... McDaniels and Chang characterize lifeline failure interdependencies using manual content analysis of newspapers and technical reports [94,95]. In contrast, Lin et al. [96] make use of natural language processing to analyze newspaper stories from New Zealand after the Canterbury earthquakes with the goal of tracking long-term recovery. Doubleday et al. [97] use daily bicycle and pedestrian activity as an indicator of disaster recovery. ...
... As another alternative data source, Lin et al. [96] use natural language processing to generate recovery data from news stories about disasters. While their analysis focuses on long-term recovery, a similar approach could be used to model shorter-term restoration, perhaps using a different source such as Twitter data [101,102,103]. ...
Preprint
Disaster recovery is widely regarded as the least understood phase of the disaster cycle. In particular, the literature on lifeline infrastructure restoration modeling frequently mentions the lack of empirical quantitative data available. Despite these limitations, there is a growing body of research on modeling lifeline infrastructure restoration, often developed using empirical quantitative data. This study reviews this body of literature and identifies the data collection and usage patterns present across modeling approaches to inform future efforts using empirical quantitative data. We classify the modeling approaches into simulation, optimization, and statistical modeling. The number of publications in this domain has increased over time, with statistical modeling growing most rapidly. Electricity infrastructure restoration is modeled most frequently, followed by the restoration of multiple infrastructures, water infrastructure, and transportation infrastructure. Interdependency between multiple infrastructures is increasingly considered in recent literature. Researchers gather data from a variety of sources, including collaborations with utility companies, national databases, and post-event damage and restoration reports. This study provides discussion and recommendations around data usage practices within the lifeline restoration modeling field. Following the recommendations would facilitate the development of a community of practice around restoration modeling and provide greater opportunities for future data sharing.
... From both our user study and post-hoc evaluation with other models, we found that while entailment models offer small improvements over the word-vector-averaging baselines, our application requires more than detecting sentence-level entailment. For more measurement examples drawn from our earthquake recovery data, please refer to Lin et al. (2018). ...
Preprint
We consider the case of a domain expert who wishes to explore the extent to which a particular idea is expressed in a text collection. We propose the task of semantically matching the idea, expressed as a natural language proposition, against a corpus. We create two preliminary tasks derived from existing datasets, and then introduce a more realistic one on disaster recovery designed for emergency managers, whom we engaged in a user study. On the latter, we find that a new model built from natural language entailment data produces higher-quality matches than simple word-vector averaging, both on expert-crafted queries and on ones produced by the subjects themselves. This work provides a proof-of-concept for such applications of semantic matching and illustrates key challenges.
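The word-vector-averaging baseline mentioned in the abstract can be sketched roughly as follows; the embedding table, dimensionality, and function names are assumptions, not the authors' code.

```python
# Minimal sketch of a word-vector-averaging baseline: score each corpus
# sentence against the proposition by cosine similarity of averaged word
# embeddings. `embeddings` is an assumed {word: np.ndarray} map.
import numpy as np

def average_vector(text, embeddings, dim=300):
    vecs = [embeddings[w] for w in text.lower().split() if w in embeddings]
    return np.mean(vecs, axis=0) if vecs else np.zeros(dim)

def rank_matches(proposition, sentences, embeddings):
    """Return sentences sorted by similarity to the proposition (highest first)."""
    p = average_vector(proposition, embeddings)

    def score(s):
        v = average_vector(s, embeddings)
        denom = np.linalg.norm(p) * np.linalg.norm(v)
        return float(p @ v / denom) if denom else 0.0

    return sorted(sentences, key=score, reverse=True)
```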