Joyce Y. Chai's research while affiliated with Michigan State University and other places

Publications (86)

Preprint
Full-text available
In this paper, we study the problem of recognizing compositional attribute-object concepts within the zero-shot learning (ZSL) framework. We propose an episode-based cross-attention (EpiCA) network which combines the merits of the cross-attention mechanism and an episode-based training strategy to recognize novel compositional concepts. Firstly, EpiCA bases o...
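As context for the mechanism named above, the following is a minimal NumPy sketch of one cross-attention step between a concept embedding and image-region features. It is an illustrative assumption, not EpiCA's implementation; the names and shapes (concept, regions, d=64) are made up.

    import numpy as np

    def softmax(x, axis=-1):
        e = np.exp(x - x.max(axis=axis, keepdims=True))
        return e / e.sum(axis=axis, keepdims=True)

    def cross_attention(query, context, d_k):
        # query:   (n_q, d) concept (attribute-object) embeddings
        # context: (n_c, d) image region features
        scores = query @ context.T / np.sqrt(d_k)  # (n_q, n_c) similarities
        weights = softmax(scores, axis=-1)         # attend over regions
        return weights @ context                   # concept-conditioned summary

    # Toy usage: one "red apple" concept query against 5 region features.
    d = 64
    concept = np.random.randn(1, d)
    regions = np.random.randn(5, d)
    print(cross_attention(concept, regions, d_k=d).shape)  # (1, 64)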
Preprint
Full-text available
In the NLP community, recent years have seen a surge of research activities that address machines' ability to perform deep language understanding that goes beyond what is explicitly stated in text, relying instead on reasoning and knowledge of the world. Many benchmark tasks and datasets have been created to support the development and evaluation o...
Preprint
Full-text available
We present a new explainable AI (XAI) framework aimed at increasing justified human trust and reliance in the AI machine through explanations. We pose explanation as an iterative communication process, i.e., dialog, between the machine and human user. More concretely, the machine generates a sequence of explanations in a dialog which takes into accoun...
Preprint
This paper presents an explainable AI (XAI) system that provides explanations for its predictions. The system consists of two key components -- namely, the prediction And-Or graph (AOG) model for recognizing and localizing concepts of interest in input data, and the XAI model for providing explanations to the user about the AOG's predictions. In th...
Conference Paper
Full-text available
Language communication plays an important role in human learning and knowledge acquisition. With the emergence of a new generation of cognitive robots, empowering these robots to learn directly from human partners becomes increasingly important. This paper gives a brief introduction to interactive task learning where humans can teach physical agent...
Article
Full-text available
One significant simplification in most previous work on robot learning is the closed-world assumption where the robot is assumed to know ahead of time a complete set of predicates describing the state of the physical world. However, robots are not likely to have a complete model of the world especially when learning a new task. To address this prob...
Article
To enable situated human-robot dialogue, techniques to support grounded language communication are essential. One particular challenge is to ground human language to a robot's internal representation of the physical world. Although copresent in a shared environment, humans and robots have mismatched capabilities in reasoning, perception, and action...
Conference Paper
Robotic systems are traditionally programmed through off-line coding interfaces for manufacturing tasks. These programming methods are usually time-consuming and require substantial human effort. They cannot meet the emerging requirements of robotic systems in many areas such as intelligent manufacturing and customized production. To address this issue,...
Conference Paper
Full-text available
To enable effective collaborations between humans and cognitive robots, it is important for robots to continuously acquire task knowledge from human partners. To address this issue, we are currently developing a framework that supports task learning through visual demonstration and natural language dialogue. One core component of this framework is...
Conference Paper
Previous work has shown that, when the agent and the human have mismatched representations of the shared world, traditional approaches that generate a single long referring expression to refer to an object are inadequate for listeners to correctly identify the target object. To mediate the mismatched representations, collaborative models have b...
Conference Paper
Enabling natural language control of robots is challenging, since human users are often not familiar with the underlying robotic system, and its capabilities and limitations. Many exceptions may occur when natural language commands are translated into lower-level robot actions. This paper gives a brief introduction to three levels of exceptions and...
Conference Paper
Full-text available
In human-robot dialogue, although a robot and its human partner are co-present in a shared environment, they have significantly mismatched perceptual capabilities (e.g., recognizing objects in the surroundings). When a shared perceptual basis is missing, it becomes difficult for the robot to identify referents in the physical world that are referre...
Conference Paper
Robots often have limited knowledge and need to continuously acquire new knowledge and skills in order to collaborate with their human partners. To address this issue, this paper describes an approach which allows human partners to teach a robot (i.e., a robotic arm) new high-level actions through natural language instructions. In particular, built u...
Conference Paper
In situated dialogue with artificial agents (e.g., robots), although a human and an agent are co-present, the agent's representation and the human's representation of the shared environment are significantly mismatched. Because of this misalignment, our previous work has shown that when the agent applies traditional approaches to generate referring...
Conference Paper
A new planning and control scheme for natural language control of robotic operations using perceptive feedback is presented. Different from traditional open-loop natural language control, the scheme incorporates the high-level planning and low-level control of the robotic systems and makes high-level planning a closed-loop process...
Conference Paper
Full-text available
In situated human-robot dialogue, although humans and robots are co-present in a shared environment, they have significantly mismatched capabilities in perceiving the shared environment. Their representations of the shared world are misaligned. In order for humans and robots to communicate with each other successfully using language, it is importan...
Article
This editorial introduction first explains the origin of this special section. It then outlines how each of the two articles included sheds light on possibilities for conversational dialog systems to use eye gaze as a signal that reflects aspects of participation in the dialog: degree of engagement and turn taking behavior, respectively.
Chapter
In situated dialogue, although an artificial agent and its human partner are co-present in a shared environment, they have significantly mismatched capabilities in perceiving the environment. When a shared perceptual basis is broken, referential grounding between partners becomes more challenging. Our hypothesis is that in such a situation, non-ver...
Article
Nominal predicates often carry implicit arguments. Recent work on semantic role labeling has focused on identifying arguments within the local context of a predicate; implicit arguments, however, have not been systematically examined. To address this limitation, we have manually annotated a corpus of implicit arguments for ten predicates from NomBa...
Conference Paper
Full-text available
In language-based interaction between a human and an artificial agent (e.g., robot) in a physical world, because the human and the agent have different knowledge and capabilities in perceiving the shared environment, referential grounding is very difficult. To facilitate such interaction, it is important for the agent to continuously learn and acqu...
Conference Paper
Full-text available
To enable effective referential grounding in situated human robot dialogue, we have conducted an empirical study to investigate how conversation partners collaborate and mediate shared basis when they have mismatched visual perceptual capabilities. In particular, we have developed a graph-based representation to capture linguistic discourse and vis...
Conference Paper
Text input aids such as automatic correction systems play an increasingly important role in facilitating fast text entry and efficient communication between text message users. Although these tools are beneficial when they work correctly, they can cause significant communication problems when they fail. To improve its autocorrection performance, it...
Article
Software (soft) keyboards are becoming increasingly popular on mobile devices. To attempt to improve soft keyboard input accuracy, key-target resizing algorithms that dynamically change the size of each key's target area have been developed. Although methods that employ personalized touch models have been shown to outperform general models, previou...
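Key-target resizing of this kind is often framed as Bayesian key selection: a per-key touch likelihood combined with a language-model prior effectively enlarges the targets of likely keys. The sketch below illustrates that framing under assumed names, coordinates, and parameters; it is not this article's algorithm.

    import math

    def gaussian_2d(x, y, mu_x, mu_y, sigma):
        # Isotropic Gaussian likelihood of a touch landing near a key center.
        return math.exp(-((x - mu_x) ** 2 + (y - mu_y) ** 2) / (2 * sigma ** 2))

    def select_key(touch, keys, lm_prior, sigma=12.0):
        # touch: (x, y) pixel coordinates; keys: {char: (center_x, center_y)}
        # lm_prior: {char: P(char | typed prefix)} from a language model.
        # Likelier keys win ambiguous touches, i.e., their targets grow.
        def score(ch):
            cx, cy = keys[ch]
            return gaussian_2d(*touch, cx, cy, sigma) * lm_prior.get(ch, 1e-6)
        return max(keys, key=score)

    keys = {"q": (15, 30), "w": (45, 30), "e": (75, 30)}
    prior = {"q": 0.05, "w": 0.15, "e": 0.80}  # e.g., after typing "th"
    print(select_key((55, 31), keys, prior))   # 'e', though the touch lands nearer 'w'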
Article
Given the recent advances in eye tracking technology and the availability of nonintrusive and high-performance eye tracking devices, there has never been a better time to explore new opportunities to incorporate eye gaze in intelligent and natural human-machine communication. In this special issue, we present six articles that cover various aspects...
Conference Paper
Many prior studies have investigated the recovery of semantic arguments for nominal predicates. The models in many of these studies have assumed that arguments are independent of each other. This assumption simplifies the computational modeling of semantic arguments, but it ignores the joint nature of natural language. This paper presents a prelimi...
Article
To tackle the vocabulary problem in conversational systems, previous work has applied unsupervised learning approaches on co-occurring speech and eye gaze during interaction to automatically acquire new words. Although these approaches have shown promise, several issues related to human language behavior and human-machine conversation have not been...
Article
Most conversation systems tend to fail when unexpected words are encountered. To overcome this problem, conversational systems must be able to learn new words automatically during human machine conversation. Motivated by psycholinguistic findings on eye gaze and human language processing, we have developed several techniques to incorporate human ey...
Conference Paper
This workshop brought researchers from academia and industry together to share recent advances and discuss research directions and opportunities for the next generation of intelligent human-machine interaction that incorporates eye gaze.
Conference Paper
Full-text available
Despite its substantial coverage, NomBank does not account for all within-sentence arguments and ignores extra-sentential arguments altogether. These arguments, which we call implicit, are important to semantic processing, and their recovery could potentially benefit many NLP applications. We present a study of implicit arguments for a sele...
Conference Paper
In situated dialogue humans often utter linguistic expressions that refer to extralinguistic entities in the environment. Correctly resolving these references is critical yet challenging for artificial agents partly due to their limited speech recognition and language understanding capabilities. Motivated by psycholinguistic studies demonstrating a...
Article
Full-text available
In human robot dialogue, identifying intended referents from human partners' spatial language is challenging. This is partly due to the automated inference of a potentially ambiguous underlying reference system (i.e., frame of reference). To improve spatial language understanding, we conducted an empirical study to investigate the prevalence of ambigui...
Conference Paper
The second person pronoun you serves different functions in English. Each of these different types often corresponds to a different term when translated into another language. Correctly identifying different types of you can be beneficial to machine translation systems. To address this issue, we investigate disambiguation of different types of you...
Conference Paper
While a significant amount of research has been devoted to textual entailment, automated entailment from conversational scripts has received less attention. To address this limitation, this paper investigates the problem of conversation entailment: automated inference of hypotheses from conversation scripts. We examine two levels of semantic re...
Conference Paper
During multiparty meetings, participants can use non-verbal modalities such as hand gestures to make reference to the shared environment. Therefore, one hypothesis is that incorporating hand gestures can improve coreference identification, a task that automatically identifies what participants refer to with their linguistic expressions. To evaluate...
Conference Paper
Full-text available
In multimodal human machine conversation, successfully interpreting human attention is critical. While attention has been studied extensively in linguistic processing and visual processing, it is not clear how linguistic attention is aligned with visual attention in multimodal conversational interfaces. To address this issue, we conducted a prelimi...
Conference Paper
Full-text available
Nominals frequently surface without overtly expressed arguments. In order to measure the potential benefit of nominal SRL for downstream processes, such nominals must be accounted for. In this paper, we show that a state-of-the-art nominal SRL system with an overall argument F1 of 0.76 suffers a performance loss of more than 9% when nominals...
Conference Paper
Given the increasing amount of conversation data, techniques to automatically acquire information about conversation participants have become more important. Towards this goal, we investigate the problem of conversation entailment, a task that determines whether a given conversation discourse entails a hypothesis about the participants. T...
Conference Paper
Motivated by the psycholinguistic finding that human eye gaze is tightly linked to speech production, previous work has applied naturally occurring eye gaze for automatic vocabulary acquisition. However, unlike in the typical settings for psycholinguistic studies, eye gaze can serve different functions in human-machine conversation. Some...
Conference Paper
One major bottleneck in conversational systems is their inability to interpret unexpected user language inputs such as out-of-vocabulary words. To overcome this problem, conversational systems must be able to learn new words automatically during human machine conversation. Motivated by psycholinguistic findings on eye gaze and human...
Conference Paper
Multimodal conversational interfaces allow users to carry a dialog with a graphical display using speech to accomplish a particular task. Motivated by previous psycholinguistic findings, we examine how eye-gaze contributes to reference resolution in such a setting. Specifically, we present an integrated probabilistic framework that combines...
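One simple way to realize the kind of integration this abstract describes is a weighted mixture of a speech-based match distribution and a gaze-salience distribution over candidate objects. The sketch below is an illustration of that idea under stated assumptions, not the paper's actual framework; alpha and the object names are made up.

    def resolve_reference(expr_match, gaze_salience, alpha=0.6):
        # expr_match:    {object_id: P(object | spoken expression)}
        # gaze_salience: {object_id: P(object | recent fixations)}
        # alpha weights linguistic vs. gaze evidence (assumed; tuned on data in practice)
        scores = {o: alpha * p + (1 - alpha) * gaze_salience.get(o, 0.0)
                  for o, p in expr_match.items()}
        total = sum(scores.values()) or 1.0
        return {o: s / total for o, s in scores.items()}

    # "the red one" is ambiguous between two red objects; gaze breaks the tie.
    speech = {"red_cup": 0.5, "red_book": 0.5, "blue_cup": 0.0}
    gaze = {"red_cup": 0.7, "red_book": 0.2, "blue_cup": 0.1}
    dist = resolve_reference(speech, gaze)
    print(max(dist, key=dist.get))  # red_cup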
Conference Paper
In a multimodal conversational interface supporting speech and deictic gesture, deictic gestures on the graphical display have been traditionally used to identify user attention, for example, through reference resolution. Since the context of the identified attention can potentially constrain the associated intention, our hypothesis is that deict...
Article
In a conversational system, determining the user's focus of attention is crucial to the success of the system. Motivated by previous psycholinguistic findings, we are currently examining how eye gaze contributes to automated identification of user attention during conversation. In particular, we are developing techniques that can predict an obj...
Article
Full-text available
This paper reports our first participation in the ciQA task. Instead of exploring conversation strategies in question answering (3, 4), we decided to focus on simple interaction strategies using relevance feedback. In our view, the ciQA task is not designed to evaluate user initiative interaction strategies. Since NIST assessors act as users, the motivation to take an init...
Article
Motivated by the recent effort on scenario-based context question answering (QA), this paper investigates the role of discourse processing and its implication on query expansion for a sequence of questions. Our view is that a question sequence is not random, but rather unfolds in a coherent manner to serve some information goals. Therefore, this seque...
Article
Full-text available
Text queries are natural and intuitive for users to describe their information needs. However, text-based image retrieval faces many challenges. Traditional text retrieval techniques on image descriptions have not been very successful. This is mainly due to the inconsistent textual descriptions and the discrepancies between user queries and terms...
Conference Paper
Motivated by psycholinguistic findings, we are currently investigating the role of eye gaze in spoken language understanding for multimodal conversational systems. Our assumption is that, during human machine conversation, a user's eye gaze on the graphical display indicates salient entities on which the user's attention is focused. The spe...
Conference Paper
In a conversational system, determining a user's focus of attention is crucial to the success of the system. Motivated by previous psycholinguistic findings, we are currently examining how eye gaze contributes to automated identification of user attention during human-machine conversation. As part of this effort, we investigate the contribut...
Conference Paper
Full-text available
Motivated by psycholinguistic findings that eye gaze is tightly linked to human language production, we developed an unsupervised approach based on translation models to automatically learn the mappings between words and objects on a graphic display during human machine conversation. The experimental results indicate that user eye gaze can...
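Translation-model approaches of this kind are typically instantiated as IBM-Model-1-style EM over parallel (spoken words, gazed objects) pairs. The following is a compact sketch under that assumption; the toy data and function names are illustrative, not the paper's code.

    from collections import defaultdict

    def train_word_object_model(pairs, iters=10):
        # pairs: list of (words, objects), where the objects were fixated
        # while the words were spoken. Learns t(word | object) by EM,
        # in the style of IBM Model 1.
        vocab = {w for words, _ in pairs for w in words}
        t = defaultdict(lambda: 1.0 / len(vocab))  # uniform initialization
        for _ in range(iters):
            count = defaultdict(float)
            total = defaultdict(float)
            for words, objects in pairs:
                for w in words:
                    z = sum(t[(w, o)] for o in objects)  # E-step: soft alignments
                    for o in objects:
                        c = t[(w, o)] / z
                        count[(w, o)] += c
                        total[o] += c
            for (w, o), c in count.items():              # M-step: renormalize
                t[(w, o)] = c / total[o]
        return t

    pairs = [(["the", "red", "cup"], ["cup1", "table"]),
             (["a", "red", "book"], ["book1"]),
             (["the", "cup"], ["cup1"])]
    t = train_word_object_model(pairs)
    print(round(t[("cup", "cup1")], 3))  # "cup" aligns strongly with cup1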
Article
Full-text available
Resolving ambiguity in the process of query translation is crucial to cross-language information retrieval (CLIR), given the short length of queries. This problem is even more challenging when only a bilingual dictionary is available, which is the focus of our work described here. In this paper, we will present a statistical framework for dictionar...
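A common dictionary-based disambiguation baseline, plausibly related to the statistical framework described here, scores each combination of candidate translations by their target-language co-occurrence and keeps the most cohesive one. The sketch below is illustrative only; the dictionary entries and scores are toy assumptions.

    from itertools import product

    def best_translation(query_terms, dictionary, cooccur):
        # dictionary: {source_term: [candidate translations]}
        # cooccur:    {(t1, t2): co-occurrence score from a target-language corpus}
        # Keep the combination of candidates with the highest pairwise cohesion.
        def cohesion(combo):
            return sum(cooccur.get((a, b), 0.0) + cooccur.get((b, a), 0.0)
                       for i, a in enumerate(combo) for b in combo[i + 1:])
        return max(product(*(dictionary[t] for t in query_terms)), key=cohesion)

    dictionary = {"banque": ["bank", "bench"], "argent": ["money", "silver"]}
    cooccur = {("bank", "money"): 8.0, ("bench", "silver"): 0.5}
    print(best_translation(["banque", "argent"], dictionary, cooccur))
    # ('bank', 'money')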
Conference Paper
Previous studies have shown that, in multimodal conversational systems, fusing information from multiple modalities together can improve the overall input interpretation through mutual disambiguation. Inspired by these findings, this paper investigates non-verbal modalities, in particular deictic gesture, in spoken language processing. Our assu...
Article
Multimodal conversational interfaces provide a natural means for users to communicate with computer systems through multiple modalities such as speech and gesture. To build effective multimodal interfaces, automated interpretation of user multimodal inputs is important. Inspired by the previous investigation on cognitive status in multimodal human...
Conference Paper
In interactive question answering (QA), users and systems take turns to ask questions and provide answers. In such an interactive setting, user questions largely depend on the answers provided by the system. One question is whether user follow-up questions can provide feedback for the system to automatically assess its performance (e.g., assess whet...
Conference Paper
Question answering (QA) systems take users' natural language questions and retrieve relevant answers from large repositories of free texts. Despite recent progress in QA research, most work on question answering is still focused on isolated questions. In a real-world information seeking scenario, questions are not asked in isolation, but rather in...
Conference Paper
To enable conversational QA, it is important to examine key issues addressed in conversational systems in the context of question answering. In conversational systems, understanding user intent is critical to the success of interaction. Recent studies have also shown that the capability to automatically identify problematic situations during intera...
Conference Paper
To alleviate the vocabulary problem, this paper investigates the role of user term feedback in interactive text-based image retrieval. Term feedback refers to the feedback from a user on specific terms regarding their relevance to a target image. Previous studies have indicated the effectiveness of term feedback in interactive text retrieval [14]....
Conference Paper
Typical cross language retrieval requires special linguistic resources, such as bilingual dictionaries and parallel corpora. In this study, we focus on the cross lingual retrieval problem that only uses online translation systems. We compare two approaches: a translation-based approach that directly translates queries into the language of documents...
Conference Paper
Full-text available
One key to cross-language information retrieval is how to efficiently resolve the translation ambiguity of queries given their short length. This problem is even more challenging when only bilingual dictionaries are available, which is the focus of this paper. In previous research on cross-language information retrieval using bilingual dictiona...
Conference Paper
Multimodal conversational interfaces provide a natural means for users to communicate with computer systems through multiple modalities such as speech, gesture, and gaze. To build effective multimodal interfaces, understanding user multimodal inputs is important. Previous linguistic and cognitive studies indicate that user language behavior does no...
Conference Paper
To improve the robustness in multimodal input interpretation, this paper presents a new salience driven approach. This approach is based on the observation that, during multimodal conversation, information from deictic gestures (e.g., point or circle) on a graphical display can signal a part of the physical world (i.e., representation of the domain...
Chapter
In a multimodal human-machine conversation, user inputs are often abbreviated or imprecise. Simply fusing multimodal inputs together may not be sufficient to derive a complete understanding of the inputs. Aiming to handle a wide variety of multimodal inputs, we are building a context-based multimodal interpretation framework called MIND (Multimodal...
Conference Paper
How to assign appropriate weights to terms is one of the critical issues in information retrieval. Many term weighting schemes are unsupervised. They are either based on the empirical observation in information retrieval, or based on generative approaches for language modeling. As a result, the existing term weighting schemes are usually insufficie...
Conference Paper
The goal of automatic image annotation is to automatically generate annotations for images to describe their content. In the past, statistical machine translation models have been successfully applied to the automatic image annotation task [8]. This line of work views the process of annotating images as a process of translating the content from a 'visual language' to...
Conference Paper
Image annotations allow users to access a large image database with textual queries. There have been several studies on automatic image annotation utilizing machine learning techniques, which automatically learn statistical models from annotated images and apply them to generate annotations for unseen images. One common problem shared by most previ...
Conference Paper
In this report, we describe our studies with cross language and interactive image retrieval in ImageCLEF 2004. Typical cross language retrieval requires special linguistic resources, such as bilingual dictionaries. In this study, we focus on the issue of how to achieve good retrieval performance given only an online translation system. We compare t...
Conference Paper
Collaborative filtering identifies information interest of a particular user based on the information provided by other similar users. The memory-based approaches for collaborative filtering (e.g., Pearson correlation coefficient approach) identify the similarity between two users by comparing their ratings on a set of items. In these approaches, d...
Article
Collaborative filtering identifies information interest of a particular user based on the information provided by other similar users. The memory-based approaches for collaborative filtering (e.g., Pearson correlation coefficient approach) identify the similarity between two users by comparing their ratings on a set of items. In these approaches, d...
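The Pearson correlation coefficient approach named in the two abstracts above computes user-user similarity over co-rated items. A minimal sketch of that memory-based similarity follows; the toy ratings are illustrative, not data from the paper.

    import math

    def pearson_sim(ratings_a, ratings_b):
        # ratings_*: {item: rating}; similarity is computed over co-rated items only.
        common = set(ratings_a) & set(ratings_b)
        if len(common) < 2:
            return 0.0
        mean_a = sum(ratings_a[i] for i in common) / len(common)
        mean_b = sum(ratings_b[i] for i in common) / len(common)
        num = sum((ratings_a[i] - mean_a) * (ratings_b[i] - mean_b) for i in common)
        den = math.sqrt(sum((ratings_a[i] - mean_a) ** 2 for i in common)
                        * sum((ratings_b[i] - mean_b) ** 2 for i in common))
        return num / den if den else 0.0

    alice = {"m1": 5, "m2": 3, "m3": 4}
    bob = {"m1": 4, "m2": 2, "m3": 5}
    print(round(pearson_sim(alice, bob), 3))  # ~0.655: fairly similar tastes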
Conference Paper
Multimodal user interfaces allow users to interact with computers through multiple modalities, such as speech, gesture, and gaze. To be effective, multimodal user interfaces must correctly identify all objects which users refer to in their inputs. To systematically resolve different types of references, we have developed a probabilistic approach th...
Conference Paper
In a multimodal conversation, the way users communicate with a system depends on the available interaction channels and the situated context (e.g., conversation focus, visual feedback). These dependencies form a rich set of constraints from various perspectives such as temporal alignments between different modalities, coherence of conversation, an...
Article
Multimodal reference resolution is a process that automatically identifies what users refer to during multimodal human-machine conversation. Given the substantial work on multimodal reference resolution, it is important to evaluate the current state of the art, understand the limitations, and identify directions for future improvement. We conducted...
Article
In a real-world setting, questions are not asked in isolation, but rather in a cohesive manner that involves a sequence of related questions to meet a user's information needs. The capability to interpret and answer questions based on context is important. In this paper, we discuss the role of discourse modeling in context question answering. In part...
Article
In a multimodal conversation, user referring patterns could be complex, involving multiple referring expressions from speech utterances and multiple gestures. To resolve those references, multimodal integration based on semantic constraints is insufficient. In this paper, we describe a graph-based probabilistic approach that simultaneously combines...
Conference Paper
In a multimodal human-machine conversation, user inputs are often abbreviated or imprecise. Sometimes, merely fusing multimodal inputs together cannot derive a complete understanding. To address these inadequacies, we are building a semantics-based multimodal interpretation framework called MIND (Multimodal Interpretation for Natural Dialog). The u...
Article
Full-text available
In situated dialogue, although artificial agents and their human partners are co-present in a shared environment, their representations of the environment are significantly different. When a shared basis is missing, referential grounding between partners becomes more challenging. Our hypothesis is that in such a situation, non-verbal modalities...
Article
Non-standard spellings in text messages often convey extra pragmatic information not found in the standard word form. However, text message normalization systems that transform non-standard text message spellings to standard form tend to ignore this information. To address this problem, this paper examines the types of extra pragmatic information...
Article
This paper presents a preliminary investigation into the use of NomLex classes for NomBank semantic role labeling (SRL). We hypothesize that modeling each class individually will result in more homogeneous training data and better performance compared to a baseline approach that is not class-based. Our current experimental results, which are...

Citations

... That is, the symbols in the system should be linked to meanings in the real world. However, there are discrepancies in the definition of grounding under different contexts (Chai et al., 2018; Mollo and Millière, 2023). In addition to the high-level concept of interpreting symbols in the real world, grounding can be interpreted differently in various scenarios, including but not necessarily limited to the following NLP-centric ones, where some of them are indeed instantiations of the high-level definition by Harnad (1990): ...
... [flattened table fragment listing graph-embedding methods: GraphGAN [67], NetMF [68], GraphSAGE [69]; generative models that embed graphs into a latent space: VGAE [70], GraphRNN [71], Graph-GAE [70], Graph-VIN [72]; semantics-aware embedding: DGMG [73], Sem-GAN [74]] Method: The algorithm consists of two major parts: 1) a random walk generator and 2) an update procedure. Random walk generator: The algorithm has two nested loops; the outer loop represents the number of times (γ) the walk should start from each vertex v_i of the graph. ...
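Following the two nested loops described in this excerpt, a DeepWalk-style random walk generator can be sketched as below. This is a hedged illustration, not the cited paper's code; the adjacency structure is a toy.

    import random

    def random_walks(adj, gamma=10, walk_len=40, seed=0):
        # adj: {vertex: [neighbors]}. Generates gamma walks of length walk_len
        # rooted at every vertex, as in the generator described above.
        rng = random.Random(seed)
        walks = []
        for _ in range(gamma):            # outer loop: gamma passes over all vertices
            vertices = list(adj)
            rng.shuffle(vertices)
            for v in vertices:            # inner loop: one walk rooted at each vertex
                walk = [v]
                while len(walk) < walk_len and adj[walk[-1]]:
                    walk.append(rng.choice(adj[walk[-1]]))
                walks.append(walk)
        return walks

    adj = {"a": ["b", "c"], "b": ["a"], "c": ["a"]}
    print(len(random_walks(adj, gamma=2, walk_len=5)))  # 6 walks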
... The value of a system is the measure of the collection of functions or benefits it renders to its users. However, this value becomes obvious when those benefits or functions enhance the effectiveness and efficiency of users' operations [2]. Usability of a system that provides some services is the extent to which the system can be applied proficiently and satisfactorily to achieve specific goals for a given user. ...
... 1. Collaboration based on conversational grounding: according to Baker, Hansen, Joiner, and Traum (1999), a good collaboration is possible if collaborating entities can create a shared knowledge base, beliefs, or assumptions surrounding their goal. The main challenge for this approach is the difficulty of grounding human language to a suitable representation of the real world that machines can fully understand and process (Chai, Fang, Liu, & She, 2016). 2. Collaboration based on theory of mind: this collaboration relies on self-understanding and interpretation of others' understandings (Koch & Oulasvirta, 2018). ...
... Several methods have been proposed during the past years for the human teaching process, such as force-sensor-based teaching [21], vision-system-based teaching [22], and natural-language-based teaching [23]. On the other hand, different learning methods have been used for the robot learning process, which allow the robot to extract, learn, and figure out task strategies from human teachers [24]. ...
... Lesh and Etzioni (1995) introduced a goal recognizer that observes human actions to prune inconsistent actions or goals from an input graph state. She and Chai (2016) use linguistic and environment features to induce a hypothesis set of goal predicates that can be handed to a planner. These approaches have demonstrated the ability to infer goals in small-sized domains (grid-world-like domains) with limited scalability to complex domains. ...
... Initially, visual event recognition was limited to k-way classification tasks [20,30,108], but Gupta et al. [31] expanded the definition of a visual event through the introduction of "visual semantic role labeling", or the task of mapping images to a limited taxonomy of scenarios following a subject-object-action template. A "grounded" version of this task was proposed by Yang et al. [130]. These ideas were built on by Yatskar et al. [131] and Pratt et al. [85] who presented a more complex task of mapping images to template-based scenarios, but with a much wider range of templates following the FrameNet ontology [7]. ...
... There has also been early related work on generating sportscast commentaries from simulation (RoboCup) soccer videos represented as non-visual state information (Chen and Mooney, 2008). Also, Liu et al. (2016a) presented some initial ideas on robots learning grounded task representations by watching and interacting with humans performing the task (i.e., by converting human demonstration videos to Causal And-Or graphs). On the other hand, we propose a new video-chat dataset where the dialogue models need to generate the next response in the sequence of chats, conditioned both on the raw video features as well as the previous textual chat history. ...
... Objective measures are preferred to represent users' feedback because they are less affected by logical and analytical elements (Liu et al., 2013). In this perspective, previous studies used physiological responses such as electrocardiography (ECG), skin conductance and electromyography to obtain objective responses of users (Gouizi et al., 2011; Lackey et al., 2015; Xu et al., 2015). ...
... The most widely-used method is to use the GUI to select the primitive [37,38]. Additionally, language-based demonstrations can guide the robot to implement certain motion primitives in specific sequences with language instructions [39,40]. ...