The manipulation system

Source publication

Design of Hand Gestures for Manipulating Objects in Virtual Reality

Conference Paper

Full-text available

May 2017

Virtual reality requires high levels of interaction with the user, a type of human computer interaction. Interactions that match the way humans usually interact with their surroundings should improve training effectiveness. A 3D hand gesture based interface allows users to control the position and orientation of 3D objects by simply moving their ha...

Context 1

... are many devices that provide hand pose data such as Intel RealSense, Leap Motion, Kinect etc. Due to the accuracy of Leap Motion and its compatibility with the Oculus Rift [8], we chose the pose data provided by the Leap Motion for gesture recognition. The system set-up is shown in Fig. 2 and the objects that the user sees are indicated in the box. A Leap Motion hand tracking device was mounted on an Oculus Rift VR headset using a custom mount that oriented the Leap Motion device to point 13° below a line perpendicular to the headset surface. The task involved manipulating a virtual hand to grab a virtual dice from a ...

View in full-text

A Human-Centered Approach to Designing Gestures for Natural User Interfaces

Chapter

Full-text available

Jul 2020

As technology matures, human-computer interfaces have changed to meet the needs of interacting with more complex systems in user-friendly ways. Gesture-based interfaces, a type of natural user interface (NUI), allow users to use their bodies to interact with computers or virtual/augmented reality (VR/AR) and offer a more natural and intuitive user...

VR based gesture elicitation for user—Interfaces with low vision

Article

Full-text available

Mar 2024

p>User interfaces (UI) and menus in virtual reality (VR), which frequently replicate traditional UI for computers and smartphones, are not created factoring for individuals with low eyesight as they demand accurate pointing and good vision to engage effectively. As an alternative method of user interaction with UI, using gestures can be recommended. Comparing gesture-based interaction with the conventional point-and-click technique for changing system settings like volume, brightness, and window manipulation in order to test this hypothesis is employed. Accessibility, spatial awareness, and precision for those with low vision while lowering cognitive load and enhancing immersion for all users can be improved by leveraging gestures. The objective of the research work is to explore the framework of Gesture Elicitation in VR environments for users with low vision. In this research work the usage of gestures as a more effective and immersive means of interacting with menus, which will not only enhance the experience of normal VR users but also drastically reduce the friction experienced by those with visual impairments is proposed. User studies demonstrate a noticeable improvement in the aforementioned areas, with faster work completion times, more immersion, and better user satisfaction.</p

Coming in handy: CeTI-Age — A comprehensive database of kinematic hand movements across the lifespan

Article

Full-text available

Nov 2023

The Tactile Internet aims to advance human-human and human-machine interactions that also utilize hand movements in real, digitized, and remote environments. Attention to elderly generations is necessary to make the Tactile Internet age inclusive. We present the first age-representative kinematic database consisting of various hand gesturing and grasping movements at individualized paces, thus capturing naturalistic movements. We make this comprehensive database of kinematic hand movements across the adult lifespan (CeTI-Age-Kinematic-Hand) publicly available to facilitate a deeper understanding of intra-individual–focusing especially on age-related differences–and inter-individual variability in hand kinematics. The core of the database contains participants’ hand kinematics recorded with wearable resistive bend sensors, individual static 3D hand models, and all instructional videos used during the data acquisition. Sixty-three participants ranging from age 20 to 80 years performed six repetitions of 40 different naturalistic hand movements at individual paces. This unique database with data recorded from an adult lifespan sample can be used to advance machine-learning approaches in hand kinematic modeling and movement prediction for age-inclusive applications.

Grab It, While You Can: A VR Gesture Evaluation of a Co-Designed Traditional Narrative by Indigenous People

Conference Paper

Full-text available

Apr 2023

FPSI-Fingertip pose and state-based natural interaction techniques in virtual environments

Article

Full-text available

Oct 2022
MULTIMED TOOLS APPL

Simple and natural interaction has a vital role in any realistic virtual environment (VE). This research proposes a set of lightweight gesture-based techniques for interaction in VEs with a focus on high accuracy, performance, and usability. The proposed techniques use a single fingertip pose and state for object/task selection, translation, navigation, rotation, and scaling. Four different techniques are proposed for interaction, i.e., MSGE (Menu-based task selection and gesture-based task execution), GSGE (Gesture-based task selection and gesture-based task execution), SGTE (Single gesture for task selection and execution), and TSGE (Time slice-based task selection and gesture-based task execution). Keeping in mind the concept of re-usability, the index-tip spatial position is used for task operation in all techniques. For experimental evaluation of the proposed techniques, a VE is designed in Unity3D, while interaction is carried out using the Leap Motion controller. The experimental study was conducted with forty (40) volunteer participants and two experts (authors). Experimental results show improved accuracy for TSGE (participants 97.22%, and experts 97.22%) as compared to others (participants: SGTE 95.55%, GSGE 94.44%, and MSGE 92.75%, experts: SGTE 94.44%, GSGE 94.44%, and MSGE 91.67%). Similarly, the results show high task performance for TSGE (participants 112.9 seconds, SD 5.3, experts 101.75 seconds, SD 3.3) as compared to others (participants: SGTE 117.2 seconds, SD 5.7, GSGE 121.8 seconds, SD 8.0, and MSGE 126.7 seconds, SD 12.9 and experts: SGTE 107.0, SD 5.7, GSGE 113.25, SD 3.5, and MSGE 122.0 seconds, SD 3.6). In addition, usability analysis shows high usability for the proposed interaction techniques, i.e., TSGE (SUS score 98.5), SGTE (SUS score 95.75), GSGE (SUS score 95.25), MSGE (SUS score 94.75). Furthermore, a comparative study with state-of-the-art interaction techniques showed a high accuracy rate, multiple tasks, and reusability support, use of easy to learn and use and fewer features-based gestures (fingertip gestures), and multiple interaction techniques (four techniques) support for the proposed techniques.

Efficient gesture recognition for the assistance of visually impaired people using multi-head neural networks

Article

Full-text available

Sep 2022
ENG APPL ARTIF INTEL

Existing research for the assistance of visually impaired people mainly focus on solving a single task (such as reading a text or detecting an obstacle), hence forcing the user to switch applications to perform other actions. This paper proposes an interactive system for mobile devices controlled by hand gestures that allow the user to control the device and use several assistance tools by making simple static and dynamic hand gestures (e.g., pointing a finger at an object will show a description of it). The system is based on a multi-head neural network, which initially detects and classifies the gestures, and subsequently, depending on the gesture detected, performs a second stage that carries out the corresponding action. This architecture optimizes the resources required to perform different tasks, it takes advantage of the information obtained from an initial backbone to perform different processes in a second stage. To train and evaluate the system, a dataset with about 40k images was manually compiled and labeled including different types of hand gestures, backgrounds (indoors and outdoors), lighting conditions, etc. This dataset contains synthetic gestures (whose objective is to pre-train the system to improve the results) and real images captured using different mobile phones. The comparison made with nearly 50 state-of-the-art methods shows competitive results as regards the different actions performed by the system, such as the accuracy of classification and localization of gestures, or the generation of descriptions for objects and scenes.

H-GOMS: a model for evaluating a virtual-hand interaction system in virtual environments

Article

Full-text available

Jul 2022

The virtual-hand interaction technique is a common input technique in virtual environments (VEs). The current application of virtual-hand interaction in VEs lacks specialized objective evaluation methods, making it difficult to establish a systematic functional goal orientation. To achieve a quantitative evaluation of the virtual-hand interaction system in VEs, we developed the modified evaluation method of goals, operators, methods, and selection rules (H-GOMS). The evaluation model contains five modules: analysis, decomposition, configuration, acquisition, and evaluation. To build the H-GOMS model, the relevant temporal parameters of operators in VEs were measured, and interactive rules were formulated for the configuration module. In addition to establishing the H-GOMS evaluation model, this paper demonstrates the development of a performance evaluation software (HI2ET) based on the Unity engine. We realized automatic retrieval and identification of the interactive behavior information from the software and applied the H-GOMS model algorithm for real-time visualization of the interactive process. The proposed method, with modeling of interactive tasks based on expert users, enables feasible and generally quantifiable performance evaluation for virtual-hand interaction systems in VEs.

A Comparative Study of Interaction Time and Usability of Using Controllers and Hand Tracking in Virtual Reality Training

Article

Full-text available

Sep 2021

Virtual Reality (VR) technology is frequently applied in simulation, particularly in medical training. VR medical training often requires user input either from controllers or free-hand gestures. Nowadays, hand gestures are commonly tracked via built-in cameras from a VR headset. Like controllers, hand tracking can be used in VR applications to control virtual objects. This research developed VR intubation training as a case study and applied controllers and hand tracking for four interactions—namely collision, grabbing, pressing, and release. The quasi-experimental design assigned 30 medical students in clinical training to investigate the differences between using VR controller and hand tracking in medical interactions. The subjects were divided into two groups, one with VR controllers and the other with VR hand tracking, to study the interaction time and user satisfaction in seven procedures. System Usability Scale (SUS) and User Satisfaction Evaluation Questionnaire (USEQ) were used to measure user usability and satisfaction, respectively. The results showed that the interaction time of each procedure was not different. Similarly, according to SUS and USEQ scores, satisfaction and usability were also not different. Therefore, in VR intubation training, using hand tracking has no difference in results to using controllers. As medical training with free-hand gestures is more natural for real-world situations, hand tracking will play an important role as user input for VR medical training. This allows trainees to recognize and correct their postures intuitively, which is more beneficial for self-learning and practicing.

MM-Hand: 3D-Aware Multi-Modal Guided Hand Generative Network for 3D Hand Pose Synthesis

Preprint

Full-text available

Oct 2020

Estimating the 3D hand pose from a monocular RGB image is important but challenging. A solution is training on large-scale RGB hand images with accurate 3D hand keypoint annotations. However, it is too expensive in practice. Instead, we have developed a learning-based approach to synthesize realistic, diverse, and 3D pose-preserving hand images under the guidance of 3D pose information. We propose a 3D-aware multi-modal guided hand generative network (MM-Hand), together with a novel geometry-based curriculum learning strategy. Our extensive experimental results demonstrate that the 3D-annotated images generated by MM-Hand qualitatively and quantitatively outperform existing options. Moreover, the augmented data can consistently improve the quantitative performance of the state-of-the-art 3D hand pose estimators on two benchmark datasets. The code will be available at https://github.com/ScottHoang/mm-hand.

The effects of target location on musculoskeletal load, task performance, and subjective discomfort during virtual reality interactions

Article

Apr 2020
APPL ERGON

The objective of this study was to evaluate the effect of different target locations on musculoskeletal loading and task performance during virtual reality (VR) interactions. A repeated-measures laboratory study with 20 participants (24.2 � 1.5 years; 10 males) was conducted to compare biomechanical exposures (joint angle, moment, and muscle activity in the neck and shoulder), subjective discomfort, and task performance (speed and accuracy) during two VR tasks (omni-directional pointing and painting tasks) among different vertical target locations (ranged from 15 � above to 30 � below eye height). The results showed that neck flexion/extension angle and moment, shoulder flexion angle and moment, shoulder abduction angle, muscle activities of neck and shoulder muscles, and subjective discomfort in the neck and shoulder significantly varied by target locations (p's < 0.001). The target locations at 15 � above and 30 � below eye height demonstrated greater shoulder flexion (up to 52 �), neck flexion moment (up to 2.7Nm), anterior deltoid muscle activity, and subjective discomfort in the neck and shoulder as compared to the other locations. This result indicates that excessive vertical target locations should be avoided to reduce musculoskeletal discomfort and injury risks during VR interactions. Based on relatively lower biomechanical exposures and trade-off between neck and shoulder postures, vertical target location between eye height and 15 � below eye height could be recommended for VR use.

8. Towards personalized virtual reality touring through cross-object user interfaces

Chapter

Full-text available

Sep 2019

Real-time adaptation is one of the most important problems that currently require a solution in the field of personalized human-computer interaction. For conventional desktop system interactions, user behaviors are acquired to develop models that support context-aware interactions. In virtual reality interactions, however, users operate tools in the physical world but view virtual objects in the virtual world. This dichotomy constrains the use of conventional behavioral models and presents difficulties to personalizing interactions in virtual environments. To address this problem, we propose the cross-object user interfaces (COUIs) for personalized virtual reality touring. COUIs consist of two components: a Deep Learning algorithm-based model using convolutional neural networks (CNNs) to predict the user’s visual attention from the past eye movement patterns to determine which virtual objects are likely to be viewed next, and delivery mechanisms that determine what should when and where be displayed on the user interface. In this chapter, we elaborate on the training and testing of the prediction model and evaluate the delivery mechanisms of COUIs through a cognitive walk-through approach. Furthermore, the implications for using COUIs to personalize interactions in virtual reality (and other environments such as augmented reality and mixed reality) are discussed.

The manipulation system

Context in source publication

Similar publications

Citations