Manuela Veloso's research works | Jpmorgan Chase & Co., NY and other places

An example showing 50 jobs before and after computing a schedule...

An example of a single job with stochastic duration and CPU usage. The...

This is an example of our Capacity Planning and Scheduling Problem...

Our constraint programming approach with a deterministic estimator. All...

COSPiS: Our proposed approach using constraint programming and SAA...

Capacity planning and scheduling for jobs with uncertainty in resource usage and duration

Article

Full-text available

Jun 2024

Organizations around the world schedule jobs (programs) regularly to perform various tasks dictated by their end users. With the major movement toward using a cloud computing infrastructure, our organization follows a hybrid approach with both cloud and on-prem servers. The objective of this work is to perform capacity planning, i.e., estimate reso...

HiddenTables & PyQTax: A Cooperative Game and Dataset For TableQA to Ensure Scale and Data Privacy Across a Myriad of Taxonomies

Preprint

Jun 2024

A myriad of different Large Language Models (LLMs) face a common challenge in contextually analyzing table question-answering tasks. These challenges are engendered from (1) finite context windows for large tables, (2) multi-faceted discrepancies amongst tokenization patterns against cell boundaries, and (3) various limitations stemming from data c...

Progressive Inference: Explaining Decoder-Only Sequence Classification Models Using Intermediate Predictions

Preprint

Jun 2024

This paper proposes Progressive Inference - a framework to compute input attributions to explain the predictions of decoder-only sequence classification models. Our work is based on the insight that the classification head of a decoder-only Transformer model can be used to make intermediate predictions by evaluating them at different points in the...

Counterfactual Metarules for Local and Global Recourse

Preprint

May 2024

We introduce T-CREx, a novel model-agnostic method for local and global counterfactual explanation (CE), which summarises recourse options for both individuals and groups in the form of human-readable rules. It leverages tree-based surrogate models to learn the counterfactual rules, alongside 'metarules' denoting their regions of optimality, provid...

Accelerating Cutting-Plane Algorithms via Reinforcement Learning Surrogates

Article

Mar 2024

Discrete optimization belongs to the set of N P-hard problems, spanning fields such as mixed-integer programming and combinatorial optimization. A current standard approach to solving convex discrete optimization problems is the use of cutting-plane algorithms, which reach optimal solutions by iteratively adding inequalities known as cuts to refine...

FairWASP: Fast and Optimal Fair Wasserstein Pre-processing

Article

Full-text available

Mar 2024

Recent years have seen a surge of machine learning approaches aimed at reducing disparities in model outputs across different subgroups. In many settings, training data may be used in multiple downstream applications by different users, which means it may be most effective to intervene on the training data itself. In this work, we present FairWASP,...

Efficient Event Series Data Modeling via First-Order Constrained Optimization

Conference Paper

Nov 2023

FlowMind: Automatic Workflow Generation with LLMs

Conference Paper

Nov 2023

Multi-Modal Financial Time-Series Retrieval Through Latent Space Projections

Conference Paper

Nov 2023

From Pixels to Predictions: Spectrogram and Vision Transformer for Better Time Series Forecasting

Conference Paper

Nov 2023

Figure 1: Replanning scenario in a navigation domain where the driver...

Figure 2: Navigation example previously shown in Figure 1. Blue cells...

Generating Replanning Goals Through Multi-Objective Optimization in Response to Execution Observation

Chapter

Full-text available

Sep 2023

In some applications, planning-monitoring systems generate plans and monitor their execution by other agents. During execution, agents might deviate from these plans for various reasons. The deviation from the expected behavior will be observed by the planning-monitoring system, which will replan in order to provide the agent a new suggested plan....

Towards multi‐agent reinforcement learning‐driven over‐the‐counter market simulations

Article

Sep 2023

We study a game between liquidity provider (LP) and liquidity taker agents interacting in an over‐the‐counter market, for which the typical example is foreign exchange. We show how a suitable design of parameterized families of reward functions coupled with shared policy learning constitutes an efficient solution to this problem. By playing against...

REFRESH: Responsible and Efficient Feature Reselection guided by SHAP values

Conference Paper

Aug 2023

Explainable Reinforcement Learning: A Survey and Comparative Review

Article

Aug 2023

Explainable reinforcement learning (XRL) is an emerging subfield of explainable machine learning that has attracted considerable attention in recent years. The goal of XRL is to elucidate the decision-making process of reinforcement learning (RL) agents in sequential decision-making settings. Equipped with this information, practitioners can better...

Exploring the Effectiveness of GPT Models in Test-Taking: A Case Study of the Driver's License Knowledge Test

Preprint

Aug 2023

Large language models such as Open AI's Generative Pre-trained Transformer (GPT) models are proficient at answering questions, but their knowledge is confined to the information present in their training data. This limitation renders them ineffective when confronted with questions about recent developments or non-public documents. Our research prop...

XSkill: Cross Embodiment Skill Discovery

Preprint

Jul 2023

Human demonstration videos are a widely available data source for robot learning and an intuitive user interface for expressing desired behavior. However, directly extracting reusable robot manipulation skills from unstructured human videos is challenging due to the big embodiment difference and unobserved action parameters. To bridge this embodime...

Figure 1. Iterative procedure of Benders decomposition, alternating...

Figure 2. Iterative procedure of Surrogate-MP.

Figure 3. Convergence rates of a baseline BD, and Surrogate-MP with...

Figure 4. Count of instances with faster convergence between...

Figure 5. Convergence rates for different levels of informed surrogate...

Towards Accelerating Benders Decomposition via Reinforcement Learning Surrogate Models

Preprint

Full-text available

Jul 2023

Stochastic optimization (SO) attempts to offer optimal decisions in the presence of uncertainty. Often, the classical formulation of these problems becomes intractable due to (a) the number of scenarios required to capture the uncertainty and (b) the discrete nature of real-world planning problems. To overcome these tractability issues, practitione...

Combining Heuristic Search and Linear Programming to Compute Realistic Financial Plans

Article

Jul 2023

Defining financial goals and formulating actionable plans to achieve them are essential components for ensuring financial health. This task is computationally challenging, given the abundance of factors that can influence one’s financial situation. In this paper, we present the Personal Finance Planner (PFP), which can generate personalized financi...

Differentially Private Synthetic Data Using KD-Trees

Preprint

Jun 2023

Creation of a synthetic dataset that faithfully represents the data distribution and simultaneously preserves privacy is a major research challenge. Many space partitioning based approaches have emerged in recent years for answering statistical queries in a differentially private manner. However, for synthetic data generation problem, recent resear...

Financial Time Series Forecasting using CNN and Transformer

Preprint

Full-text available

Apr 2023

Time series forecasting is important across various domains for decision-making. In particular, financial time series such as stock prices can be hard to predict as it is difficult to model short-term and long-term temporal dependencies between data points. Convolutional Neural Networks (CNN) are good at capturing local patterns for modeling short-...

HiddenTables and PyQTax: A Cooperative Game and Dataset For TableQA to Ensure Scale and Data Privacy Across a Myriad of Taxonomies

Conference Paper

Jan 2023

Fast Learning of Multidimensional Hawkes Processes via Frank-Wolfe

Preprint

Dec 2022

Hawkes processes have recently risen to the forefront of tools when it comes to modeling and generating sequential events data. Multidimensional Hawkes processes model both the self and cross-excitation between different types of events and have been applied successfully in various domain such as finance, epidemiology and personalized recommendatio...

Advising Agent for Service-Providing Live-Chat Operators

Chapter

Dec 2022

Call centers, in which human operators attend clients using textual chat, are very common in modern e-commerce. Training enough skilled operators who are able to provide good service is a challenge. We propose a methodology for the development of an assisting agent that provides online advice to operators while they attend clients. The agent is eas...

Learn to explain yourself, when you can: Equipping Concept Bottleneck Models with the ability to abstain on their concept predictions

Preprint

Nov 2022

The Concept Bottleneck Models (CBMs) of Koh et al. [2020] provide a means to ensure that a neural network based classifier bases its predictions solely on human understandable concepts. The concept labels, or rationales as we refer to them, are learned by the concept labeling component of the CBM. Another component learns to predict the target clas...

Towards learning to explain with concept bottleneck models: mitigating information leakage

Preprint

Nov 2022

Concept bottleneck models perform classification by first predicting which of a list of human provided concepts are true about a datapoint. Then a downstream model uses these predicted concept labels to predict the target label. The predicted concepts act as a rationale for the target prediction. Model trust issues emerge in this paradigm when soft...

Online Learning for Mixture of Multivariate Hawkes Processes

Conference Paper

Oct 2022

Towards Multi-Agent Reinforcement Learning driven Over-The-Counter Market Simulations

Preprint

Oct 2022

We study a game between liquidity provider and liquidity taker agents interacting in an over-the-counter market, for which the typical example is foreign exchange. We show how a suitable design of parameterized families of reward functions coupled with associated shared policy learning constitutes an efficient solution to this problem. Precisely, w...

ASPiRe:Adaptive Skill Priors for Reinforcement Learning

Preprint

Sep 2022

We introduce ASPiRe (Adaptive Skill Prior for RL), a new approach that leverages prior experience to accelerate reinforcement learning. Unlike existing methods that learn a single skill prior from a large and diverse dataset, our framework learns a library of different distinction skill priors (i.e., behavior priors) from a collection of specialize...

Online Learning for Mixture of Multivariate Hawkes Processes

Preprint

Aug 2022

Online learning of Hawkes processes has received increasing attention in the last couple of years especially for modeling a network of actors. However, these works typically either model the rich interaction between the events or the latent cluster of the actors or the network structure between the actors. We propose to model the latent structure o...

Synthetic document generator for annotation-free layout recognition

Article

Aug 2022

Analyzing the layout of a document to identify headers, sections, tables, figures etc. is critical to understanding its content. Deep learning based approaches for detecting the layout structure of document images have been promising. However, these methods require a large number of annotated examples during training, which are both expensive and t...

Differentially Private Learning of Hawkes Processes

Preprint

Jul 2022

Hawkes processes have recently gained increasing attention from the machine learning community for their versatility in modeling event sequence data. While they have a rich history going back decades, some of their properties, such as sample complexity for learning the parameters and releasing differentially private versions, are yet to be thorough...

Structure and Semantics Preserving Document Representations

Conference Paper

Jul 2022

FIGURE 1 | Interaction setup. Figure is only meant for illustrative...

FIGURE 3 | Chronological scenario timeline (to approximate scale) along...

FIGURE 4 | Snapshots from the experimental sessions. (A), (B): JATT...

FIGURE 5 | Distribution of children profiles during interaction with...

“Sequencing Matters”: Investigating Suitable Action Sequences in Robot-Assisted Autism Therapy

Article

Full-text available

Mar 2022

Social robots have been shown to be promising tools for delivering therapeutic tasks for children with Autism Spectrum Disorder (ASD). However, their efficacy is currently limited by a lack of flexibility of the robot’s social behavior to successfully meet therapeutic and interaction goals. Robot-assisted interventions are often based on structured...

Figure 1: XRL taxonomy and its relationship to the RL process.

Figure 2: Example object saliency map (b), natural language explanation...

A Survey of Explainable Reinforcement Learning

Preprint

Full-text available

Feb 2022

Explainable reinforcement learning (XRL) is an emerging subfield of explainable machine learning that has attracted considerable attention in recent years. The goal of XRL is to elucidate the decision-making process of learning agents in sequential decision-making settings. In this survey, we propose a novel taxonomy for organizing the XRL literatu...

Bandit Sampling for Multiplex Networks

Preprint

Feb 2022

Graph neural networks have gained prominence due to their excellent performance in many classification and prediction tasks. In particular, they are used for node classification and link prediction which have a wide range of applications in social networks, biomedical data sets, and financial transaction graphs. Most of the existing work focuses pr...

Structure with Semantics: Exploiting Document Relations for Retrieval

Preprint

Jan 2022

Retrieving relevant documents from a corpus is typically based on the semantic similarity between the document content and query text. The inclusion of structural relationship between documents can benefit the retrieval mechanism by addressing semantic gaps. However, incorporating these relationships requires tractable mechanisms that balance struc...

Towards Robust Representations of Limit Orders Books for Deep Learning Models

Article

Jan 2022

Simulation Intelligence: Towards a New Generation of Scientific Methods

Preprint

Full-text available

Dec 2021

The original "Seven Motifs" set forth a roadmap of essential methods for the field of scientific computing, where a motif is an algorithmic method that captures a pattern of computation and data movement. We present the "Nine Motifs of Simulation Intelligence", a roadmap for the development and integration of the essential algorithms necessary for...

Simulation Intelligence: Towards a New Generation of Scientific Methods

Article

Full-text available

Dec 2021

The original "Seven Motifs" set forth a roadmap of essential methods for the field of scientific computing, where a motif is an algorithmic method that captures a pattern of computation and data movement. We present the "Nine Motifs of Simulation Intelligence", a roadmap for the development and integration of the essential algorithms necessary for...

Figure 2: Cross section of the Bayesian network in plate notation. The...

Figure 3: Layout Recognition Model Architecture. A feature extraction...

Figure 4: Examples of synthetic documents. The generated documents...

Figure 5: Synthetic tabular data used for training cell recognition....

Figure 6: Low-quality noisy documents generated to simulate defects due...

Synthetic Document Generator for Annotation-free Layout Recognition

Preprint

Full-text available

Nov 2021

Analyzing the layout of a document to identify headers, sections, tables, figures etc. is critical to understanding its content. Deep learning based approaches for detecting the layout structure of document images have been promising. However, these methods require a large number of annotated examples during training, which are both expensive and t...

Deep video prediction for time series forecasting

Conference Paper

Nov 2021

Learning to classify and imitate trading agents in continuous double auction markets

Conference Paper

Nov 2021

ABIDES-gym: gym environments for multi-agent discrete event simulation and application to financial markets

Conference Paper

Nov 2021

Visual time series forecasting: an image-driven approach

Conference Paper

Nov 2021

Tradeoffs in streaming binary classification under limited inspection resources

Conference Paper

Nov 2021

Parameterized Explanations for Investor / Company Matching

Preprint

Oct 2021

Matching companies and investors is usually considered a highly specialized decision making process. Building an AI agent that can automate such recommendation process can significantly help reduce costs, and eliminate human biases and errors. However, limited sample size of financial data-sets and the need for not only good recommendations, but al...

ABIDES-Gym: Gym Environments for Multi-Agent Discrete Event Simulation and Application to Financial Markets

Preprint

Full-text available

Oct 2021

Model-free Reinforcement Learning (RL) requires the ability to sample trajectories by taking actions in the original problem environment or a simulated version of it. Breakthroughs in the field of RL have been largely facilitated by the development of dedicated open source simulators with easy to use frameworks such as OpenAI Gym and its Atari envi...

Figure 2: (A) Original LOB data with 10 levels on ask and bid side...

Figure 3: Spatial-temporal Representation in mid-price-centred Moving...

Figure 4: Confusion matrices for corresponding experimental results of...

Towards Robust Representation of Limit Orders Books for Deep Learning Models

Preprint

Full-text available

Oct 2021

The success of machine learning models is highly reliant on the quality and robustness of representations. The lack of attention on the robustness of representations may boost risks when using data-driven machine learning models for trading in the financial markets. In this paper, we focus on representations of the limit order book (LOB) data and d...

How Robust are Limit Order Book Representations under Data Perturbation?

Preprint

Full-text available

Oct 2021

The success of machine learning models in the financial domain is highly reliant on the quality of the data representation. In this paper, we focus on the representation of limit order book data and discuss the opportunities and challenges for learning representations of such data. We also experimentally analyse the issues associated with existing...

Tradeoffs in Streaming Binary Classification under Limited Inspection Resources

Preprint

Oct 2021

Institutions are increasingly relying on machine learning models to identify and alert on abnormal events, such as fraud, cyber attacks and system failures. These alerts often need to be manually investigated by specialists. Given the operational cost of manual inspections, the suspicious events are selected by alerting systems with carefully desig...

Figure 2: An example of the market data reproduced for a simulated...

Figure 3: One-minute log returns distribution

Figure 4: Ten-minute log returns distribution

Figure 5: Neural network model architecture diagram

Learning to Classify and Imitate Trading Agents in Continuous Double Auction Markets

Preprint

Full-text available

Oct 2021

Continuous double auctions such as the limit order book employed by exchanges are widely used in practice to match buyers and sellers of a variety of financial instruments. In this work, we develop an agent-based model for trading in a limit order book and show (1) how opponent modelling techniques can be applied to classify trading agent archetype...

Intelligent Execution through Plan Analysis

Conference Paper

Full-text available

Sep 2021

Search-based Planning with Learned Behaviors for Navigation among Pedestrians

Conference Paper

Sep 2021

Artificial intelligence research in finance: discussion and examples

Article

Sep 2021

Artificial intelligence (AI) is a science and engineering discipline that is highly relevant to financial services, given the significant amount and diversity of data generated (and consumed) as those services are delivered worldwide. Global banks process billions of international payments each day, while equity exchanges handle trillions of orders...

Factored Models for Multiscale Decision-Making in Smart Grid Customers

Article

Sep 2021

Active participation of customers in the management of demand, and renewable energy supply, is a critical goal of the Smart Grid vision. However, this is a com-plex problem with numerous scenarios that are difficult to test in field projects. Rich and scalable simulations are required to develop effective strategies and poli-cies that elicit desira...

Exploiting Symmetry in Human Robot-Assisted Dressing Using Reinforcement Learning

Chapter

Sep 2021

In this work, we address the problem of symmetry transfer in human-robot collaborative tasks, i.e., how certain actions can be extended to their symmetrical by exploiting symmetries in their execution. We contribute an approach capable of considering the symmetry inherent to a given task, such as the human or robot’s lateral symmetry, abstracting t...

Search Reduction through Conservative Abstract-Space Based Heuristic

Article

Sep 2021

The efficiency of heuristic search depends dramatically on the quality of the heuristic function. For an optimal heuristic search, heuristics that estimate cost-to-goal better typically lead to faster searches. For a sub-optimal heuristic search such as weighted A*, the search speed depends more on the correlation between the heuristic and the true...

Adapting a Rapidly-Exploring Random Tree for Automated Planning

Article

Aug 2021

Rapidly-exploring random trees (RRTs) are data structures and search algorithms designed to be used in continuous path planning problems. They are one of the most successful state-of-the-art techniques as they offer a great degree of flexibility and reliability. However, their use in other search domains has not been thoroughly analyzed. In this wo...

A Low-Cost Compliant Gripper Using Cooperative Mini-Delta Robots for Dexterous Manipulation

Conference Paper

Full-text available

Jul 2021

Visual Time Series Forecasting: An Image-driven Approach

Preprint

Jul 2021

In this work, we address time-series forecasting as a computer vision task. We capture input data as an image and train a model to produce the subsequent image. This approach results in predicting distributions as opposed to pointwise values. To assess the robustness and quality of our approach, we examine various datasets and multiple evaluation m...

Optimal Planning Over Long and Infinite Horizons for Achieving Independent Partially-Observable Tasks That Evolve Over Time

Article

Jul 2021

We focus on long-sighted planning for a class of problems with multiple independent tasks that are partially observable and evolve over time. An example problem that falls into this class is a robot waiting multiple tables, referred to as tasks, in a restaurant where customers' satisfaction is partially observable and evolves over time. Our recent...

A Theoretical and Algorithmic Analysis of Configurable MDPs

Article

May 2021

This paper analyzes, from theoretical and algorithmic perspectives, a class of problems recently introduced in the literature of Markov decision processes—configurable Markov decision processes. In this new class of problems we jointly optimize the probability transition function and associated optimal policy, in order to improve the performance of...

Speeding Up Search-Based Motion Planning via Conservative Heuristics

Article

May 2021

Weighted A* search (wA*) is a popular tool for robot motionplanning. Its efficiency however depends on the quality of heuristic function used. In fact, it has been shown that the correlation between the heuristic function and the true costto-goal significantly affects the efficiency of the search, when used with a large weight on the heuristics. Mo...

Fig. 1. A restaurant setting with 3 tables and one robot.

Fig. 2. The robot operates in a restaurant with 3 tables, T1, T2, and...

Waiting Tables as a Robot Planning Problem

Preprint

Full-text available

May 2021

We present how we formalize the waiting tables task in a restaurant as a robot planning problem. This formalization was used to test our recently developed algorithms that allow for optimal planning for achieving multiple independent tasks that are partially observable and evolve over time [1], [2].

Iterative Bounding MDPs: Learning Interpretable Policies via Non-Interpretable Methods

Article

Full-text available

May 2021

Current work in explainable reinforcement learning generally produces policies in the form of a decision tree over the state space. Such policies can be used for formal safety verification, agent behavior prediction, and manual inspection of important features. However, existing approaches fit a decision tree after training or use a custom learning...

Computing Opportunities to Augment Plans for Novel Replanning during Execution

Article

May 2021

Traditionally, planning provides for execution plans as sequences of actions with preconditions and effects. Execution monitoring identifies failure conditions when the preconditions of an action do not match the state. Interestingly, planning proceeds by consuming a given initial state and abandoning reasoning about any facts not true in that stat...

Figure 1: The process of building an information vector as the chat...

Figure 2: A screenshot of the operator interface. The left-hand side is...

Figure 3: TLX questionnaires data (lower is better).

Figure 4: Time performance data (in minutes, decimal notation).

Figure 5: Demographic data of the subjects who played the role of...

Advising Agent for Service-Providing Live-Chat Operators

Preprint

Full-text available

May 2021

Call centers, in which human operators attend clients using textual chat, are very common in modern e-commerce. Training enough skilled operators who are able to provide good service is a challenge. We suggest an algorithm and a method to train and implement an assisting agent that provides on-line advice to operators while they attend clients. The...

Figure 5: Topic size effect on classifier performance.

Comparison of Classification Results -F1 Scores Dataset Transformer No...

Domain-agnostic Document Representation Learning Using Latent Topics and Metadata

Article

Full-text available

Apr 2021

Fine-tuning a pre-trained neural language model with a task specific output layer is the de facto approach of late when dealing with document classification. This technique is inadequate when labeled examples are unavailable at training time and when the metadata artifacts in a document must be exploited. We address these challenges by generating d...

Playing with Food: Learning Food Item Representations Through Interactive Exploration

Chapter

Mar 2021

A key challenge in robotic food manipulation is modeling the material properties of diverse and deformable food items. We propose using a multimodal sensory approach to interact and play with food that facilitates the ability to distinguish these properties across food items. First, we use a robotic arm and an array of sensors, which are synchroniz...

Iterative Bounding MDPs: Learning Interpretable Policies via Non-Interpretable Methods

Preprint

Full-text available

Feb 2021

Current work in explainable reinforcement learning generally produces policies in the form of a decision tree over the state space. Such policies can be used for formal safety verification, agent behavior prediction, and manual inspection of important features. However, existing approaches fit a decision tree after training or use a custom learning...

Theory and Analysis of Optimal Planning over Long and Infinite Horizons for Achieving Independent Partially-Observable Tasks that Evolve over Time

Preprint

Full-text available

Feb 2021

We present the theoretical analysis and proofs of a recently developed algorithm that allows for optimal planning over long and infinite horizons for achieving multiple independent tasks that are partially observable and evolve over time.

Deep Video Prediction for Time Series Forecasting

Preprint

Feb 2021

Time series forecasting is essential for decision making in many domains. In this work, we address the challenge of predicting prices evolution among multiple potentially interacting financial assets. A solution to this problem has obvious importance for governments, banks, and investors. Statistical methods such as Auto Regressive Integrated Movin...

Playing with Food: Learning Food Item Representations through Interactive Exploration

Preprint

Full-text available

Jan 2021

A key challenge in robotic food manipulation is modeling the material properties of diverse and deformable food items. We propose using a multimodal sensory approach to interact and play with food that facilitates the ability to distinguish these properties across food items. First, we use a robotic arm and an array of sensors, which are synchroniz...

ViziTex: Interactive Visual Sense-Making of Text Corpora

Conference Paper

Jan 2021

Visual Forecasting of Time Series with Image-to-Image Regression

Preprint

Nov 2020

Time series forecasting is essential for agents to make decisions in many domains. Existing models rely on classical statistical methods to predict future values based on previously observed numerical information. Yet, practitioners often rely on visualizations such as charts and plots to reason about their predictions. Inspired by the end-users, w...

Simulating and classifying behavior in adversarial environments based on action-state traces: an application to money laundering

Preprint

Full-text available

Nov 2020

Many business applications involve adversarial relationships in which both sides adapt their strategies to optimize their opposing benefits. One of the key characteristics of these applications is the wide range of strategies that an adversary may choose as they adapt their strategy dynamically to sustain benefits and evade authorities. In this pap...

Domain-independent generation and classification of behavior traces

Preprint

Full-text available

Nov 2020

Financial institutions mostly deal with people. Therefore, characterizing different kinds of human behavior can greatly help institutions for improving their relation with customers and with regulatory offices. In many of such interactions, humans have some internal goals, and execute some actions within the financial system that lead them to achie...

Localization and Force-Feedback with Soft Magnetic Stickers for Precise Robot Manipulation

Conference Paper

Oct 2020

Robust Document Representations using Latent Topics and Metadata

Preprint

Oct 2020

Task specific fine-tuning of a pre-trained neural language model using a custom softmax output layer is the de facto approach of late when dealing with document classification problems. This technique is not adequate when labeled examples are not available at training time and when the metadata artifacts in a document must be exploited. We address...

Get real: realism metrics for robust limit order book market simulations

Conference Paper

Oct 2020

Generating synthetic data in finance: opportunities, challenges and pitfalls

Conference Paper

Oct 2020

SURF: improving classifiers in production by learning from busy and noisy end users

Conference Paper

Oct 2020

Recommending missing and suspicious links in multiplex financial networks

Conference Paper

Oct 2020

Simulating and classifying behavior in adversarial environments based on action-state traces: an application to money laundering

Conference Paper

Oct 2020

Trading via image classification

Conference Paper

Oct 2020

Risk-sensitive reinforcement learning: a martingale approach to reward uncertainty

Conference Paper

Oct 2020

SURF: Improving classifiers in production by learning from busy and noisy end users

Preprint

Oct 2020

Supervised learning classifiers inevitably make mistakes in production, perhaps mis-labeling an email, or flagging an otherwise routine transaction as fraudulent. It is vital that the end users of such a system are provided with a means of relabeling data points that they deem to have been mislabeled. The classifier can then be retrained on the rel...

Paying down metadata debt: learning the representation of concepts using topic models

Preprint

Oct 2020

We introduce a data management problem called metadata debt, to identify the mapping between data concepts and their logical representations. We describe how this mapping can be learned using semisupervised topic models based on low-rank matrix factorizations that account for missing and noisy labels, coupled with sparsity penalties to improve loca...

AI pptX: Robust Continuous Learning for Document Generation with AI Insights

Preprint

Full-text available

Oct 2020

Business analysts create billions of slide decks, reports and documents annually. Most of these documents have well-defined structure comprising of similar content generated from data. We present 'AI pptX', a novel AI framework for creating and modifying documents as well as extract insights in the form of natural language sentences from data. AI p...

Calibration of Shared Equilibria in General Sum Partially Observable Markov Games

Preprint

Jun 2020

Training multi-agent systems (MAS) to achieve realistic equilibria gives us a useful tool to understand and model real-world systems. We consider a general sum partially observable Markov game where agents of different types share a single policy network, conditioned on agent-specific information. This paper aims at i) formally understanding equili...

Risk-Sensitive Reinforcement Learning: a Martingale Approach to Reward Uncertainty

Preprint

Jun 2020

We introduce a novel framework to account for sensitivity to rewards uncertainty in sequential decision-making problems. While risk-sensitive formulations for Markov decision processes studied so far focus on the distribution of the cumulative reward as a whole, we aim at learning policies sensitive to the uncertain/stochastic nature of the rewards...

Efficient Robot Planning for Achieving Multiple Independent Partially Observable Tasks That Evolve over Time

Article

Jun 2020

We focus on domains where a robot is required to accomplish a set of tasks that are partially observable and evolve independently of each other according to their dynamics. An example domain is a restaurant setting where a robot waiter should take care of an ongoing stream of tasks, namely serving a number of tables, including delivering food to th...

Guaranteeing Reproducibility in Deep Learning Competitions

Preprint

Full-text available

May 2020

To encourage the development of methods with reproducible and robust training behavior, we propose a challenge paradigm where competitors are evaluated directly on the performance of their learning procedures rather than pre-trained agents. Since competition organizers re-train proposed methods in a controlled setting they can guarantee reproducibi...

Optimal action sequence generation for assistive agents in fixed horizon tasks

Article

Full-text available

Apr 2020

Agents providing assistance to humans are faced with the challenge of automatically adjusting the level of assistance to ensure optimal performance. In this work, we argue that identifying the right level of assistance consists in balancing positive assistance outcomes and some (domain-dependent) measure of cost associated with assistive actions. T...

Some people aren't worth listening to: periodically retraining classifiers with feedback from a team of end users

Preprint

Apr 2020

Document classification is ubiquitous in a business setting, but often the end users of a classifier are engaged in an ongoing feedback-retrain loop with the team that maintain it. We consider this feedback-retrain loop from a multi-agent point of view, considering the end users as autonomous agents that provide feedback on the labelled data provid...

Latent Bayesian Inference for Robust Earnings Estimates

Preprint

Apr 2020

Equity research analysts at financial institutions play a pivotal role in capital markets; they provide an efficient conduit between investors and companies' management and facilitate the efficient flow of information from companies, promoting functional and liquid markets. However, previous research in the academic finance and behavioral economics...

Heuristics for Link Prediction in Multiplex Networks

Preprint

Apr 2020

Link prediction, or the inference of future or missing connections between entities, is a well-studied problem in network analysis. A multitude of heuristics exist for link prediction in ordinary networks with a single type of connection. However, link prediction in multiplex networks, or networks with multiple types of connections, is not a well u...

Improving the On-Vehicle Experience of Passengers Through SC-M*: A Scalable Multi-Passenger Multi-Criteria Mobility Planner

Article

Full-text available

Jan 2020

The rapid growth in urban population poses significant challenges to moving city dwellers in a fast and convenient manner. This paper contributes to solving the challenges from the viewpoint of passengers by improving their on-vehicle experience. Specifically, we focus on the problem: Given an urban public transit network and a number of passengers...

Get Real: Realism Metrics for Robust Limit Order Book Market Simulations

Preprint

Full-text available

Dec 2019

Machine learning (especially reinforcement learning) methods for trading are increasingly reliant on simulation for agent training and testing. Furthermore, simulation is important for validation of hand-coded trading strategies and for testing hypotheses about market structure. A challenge, however, concerns the robustness of policies validated in...

Proofs_appendix (1).pdf

Data

Dec 2019

Manuela Veloso's research while affiliated with Jpmorgan Chase & Co. and other places

What is this page?

Publications (725)

Citations