José Luís Oliveira
University of Aveiro | UA

About

292

Publications

67,110

Reads

2,712

Citations

Publications

Fig. 2 Visual representation of dataset components. a Tissue image (H&E...

Fig. 3 Visual representation of outcomes with IoU threshold set to 0.5....

Fig. 4 Diagram of different dataset configurations. The first...

Fig. 10 Class-balanced accuracy considering background label

Fig. 11 Class-balanced accuracy without considering background label

A Data Augmentation Methodology to Reduce the Class Imbalance in Histopathology Images

Article

Full-text available

Mar 2024

Deep learning techniques have recently yielded remarkable results across various fields. However, the quality of these results depends heavily on the quality and quantity of data used during the training phase. One common issue in multi-class and multi-label classification is class imbalance, where one or several classes make up a substantial porti...

MMIR software architecture, with three core components. A project...

Example of segmented annotations using images. a Image Annotation...

Adaptable multichannel array format for multiple segmented annotation...

Process to convert a npz file or an image annotation to the COCO...

Plugin life cycle. The process begins with system initiation and...

MMIR: an open-source software for the registration of multimodal histological images

Article

Full-text available

Mar 2024

Background Multimodal histology image registration is a process that transforms into a common coordinate system two or more images obtained from different microscopy modalities. The combination of information from various modalities can contribute to a comprehensive understanding of tissue specimens, aiding in more accurate diagnoses, and improved...

Fig. 3. Overview of the registration flow between the four entities to...

A Federated Authentication Schema among Multiple Identity Providers

Article

Full-text available

Mar 2024

Single Sign-On (SSO) methods are the primary solution to authenticate users across multiple web systems. These mechanisms streamline the authentication procedure by avoiding duplicate developments of authentication modules for each application. Besides, these mechanisms also provide convenience to the end-user by keeping the user authenticated when...

MONTRA2: A web platform for profiling distributed databases in the health domain

Article

Jan 2024

DyPrune: Dynamic Pruning Rates for Neural Networks

Chapter

Dec 2023

Neural networks have achieved remarkable success in various applications such as image classification, speech recognition, and natural language processing. However, the growing size of neural networks poses significant challenges in terms of memory usage, computational cost, and deployment on resource-constrained devices. Pruning is a popular techn...

TAG-DTA: Binding-region-guided strategy to predict drug-target affinity using transformers

Article

Oct 2023

Gene expression differences between HFpEF and Control groups. (A) Bar...

Functional classification of the DEGs. (A) Density plot showing the...

microRNA expression differences between HFpEF and Control groups. (A)...

Correlation between each mRNA and corresponding miRNA(s). (A) Venn...

Relative mRNA expression of HAPLN1 and NPPB genes in human cardiac...

Myocardial RNA Sequencing Reveals New Potential Therapeutic Targets in Heart Failure with Preserved Ejection Fraction

Article

Full-text available

Jul 2023

Heart failure with preserved ejection fraction (HFpEF) represents a global health challenge, with limited therapies proven to enhance patient outcomes. This makes the elucidation of disease mechanisms and the identification of novel potential therapeutic targets a priority. Here, we performed RNA sequencing on ventricular myocardial biopsies from p...

FSM-DDTR: End-to-end feedback strategy for multi-objective De Novo drug design using transformers

Article

Jul 2023

The design of compounds that target specific biological functions with relevant selectivity is critical in the context of drug discovery, especially due to the polypharmacological nature of most existing drug molecules. In recent years, in silico-based methods combined with deep learning have shown promising results in the de novo drug design chall...

A 20-Year Journey of Tracing the Development of Web Catalogues for Rare Diseases

Chapter

Jun 2023

Rare diseases are affecting over 350 million individuals on a worldwide scale. However, studying such diseases is challenging due to the lack of individuals compliant with the study protocols. This unavailability of information raises some challenges when defining the best treatments or diagnosing patients in the early stages. Multiple organization...

Optimizing Variant Calling for Human Genome Analysis: A Comprehensive Pipeline Approach

Chapter

Full-text available

Jun 2023

The identification of genetic variations in large cohorts is a critical issue to identify patient cohorts, disease risks, and to develop more effective treatments. To help this analysis, we improved a variant calling pipeline for the human genome using state-of-the-art tools, including GATK (Hard Filter/VQSR) and DeepVariant. The pipeline was teste...

A Multimodal Image Registration System for Histology Images

Conference Paper

Full-text available

Jun 2023

SecureFASTA: Ensuring privacy and trust when sharing genomic data

Conference Paper

Full-text available

Jun 2023

A FAIR Approach to Real-World Health Data Management and Analysis

Conference Paper

Full-text available

Jun 2023

A Reliable and Secure Method for Sharing Genomic Data

Article

Full-text available

May 2023

Genomics has significantly impacted the field of medicine, with advances in DNA sequencing leading to personalized medicine and a deeper understanding of the genomic basis of various diseases. The ability to share genomic data is crucial for advancing this field and developing new approaches to understanding the genome. However, the sensitive natur...

Methodology to identify a gene expression signature by merging microarray datasets

Article

Full-text available

Apr 2023

A vast number of microarray datasets have been produced as a way to identify differentially expressed genes and gene expression signatures. A better understanding of these biological processes can help in the diagnosis and prognosis of diseases, as well as in the therapeutic response to drugs. However, most of the available datasets are composed of...

Venn-diagram showing the number of NAD-binding proteins obtained from...

Classification of the NAD-binding proteins according to the protein...

Venn-diagram representing the protein-protein interactions from STRING,...

Protein–protein interactions of NUDIX proteins from the NAD-binding...

Docking results for NAD ligand on TRPC3 as a target (D6RC49 and...

Study of NAD-interacting proteins highlights the extent of NAD regulatory roles in the cell and its potential as a therapeutic target

Article

Full-text available

Mar 2023

Nicotinamide adenine dinucleotide (NAD) levels are essential for the normal physiology of the cell and are strictly regulated to prevent pathological conditions. NAD functions as a coenzyme in redox reactions, as a substrate of regulatory proteins, and as a mediator of protein-protein interactions. The main objectives of this study were to identify...

Classifying and discovering genomic sequences in metagenomic repositories

Article

Full-text available

Jan 2023

The taxonomic and functional composition of microbial communities from environmental, agricultural, and therapeutic settings is increasingly being studied using metagenomic methodologies in large-scale genomic applications. This has led to exponential growth in the field and has impacted on healthcare, pharmacology and biotechnology. However, with...

FIGURE 1. Vulnerability Assessment model.

FIGURE 2. Traditional and Agile software development life cycle.

FIGURE 3. Distribution of the number of tools by type of application...

Code scanning solutions comparison, the recommended tools for easy...

Dynamic scanning solutions comparison, the recommended tool for easy...

Open Source Solutions for Vulnerability Assessment: A Comparative Analysis

Article

Full-text available

Jan 2023

As software applications continue to become more complex and attractive to cyber-attackers, enhancing resilience against cyber threats becomes essential. Aiming to provide more robust solutions, different approaches were proposed for vulnerability detection in different stages of the application life-cycle. This article explores three main approach...

Fig. 1 Concept of collection events, populations and variables. Adapted...

Concepts involved in the process of addressing a research question in...

Concepts involved to enact interoperability across multiple data...

Towards an Interoperable Ecosystem of Research Cohort and Real-world Data Catalogues Enabling Multi-center Studies

Article

Full-text available

Dec 2022

Objectives: Existing individual-level human data cover large populations on many dimensions such as lifestyle, demography, laboratory measures, clinical parameters, etc. Recent years have seen large investments in data catalogues to FAIRify data descriptions to capitalise on this great promise, i.e. make catalogue contents more Findable, Accessible...

Querying Semantic Catalogues of Biomedical Databases

Article

Full-text available

Dec 2022

Background Secondary use of health data is a valuable source of knowledge that boosts observational studies, leading to important discoveries in the medical and biomedical sciences. The fundamental guiding principle for performing a successful observational study is the research question and the approach in advance of executing a study. However, in...

Figure 2. UI mockup proposal of a treemap visualisation.

Figure 4. UI mockup proposal of a temporal chart visualisation...

Figure 5. UI mockup proposal of a dendrogram visualisation.

Figure 8. UI mockup of a temporal chart visualisation (graph-level view).

Semantic Data Visualisation for Biomedical Database Catalogues

Article

Full-text available

Nov 2022

Biomedical databases often have restricted access policies and governance rules. Thus, an adequate description of their content is essential for researchers who wish to use them for medical research. A strategy for publishing information without disclosing patient-level data is through database fingerprinting and aggregate characterisations. Howeve...

Portuguese Twitter Dataset on COVID-19

Conference Paper

Nov 2022

BIcenter-AD: Harmonising Alzheimer’s Disease cohorts using a common ETL tool

Article

Full-text available

Nov 2022

Background Many scientific studies have sought to obtain a better understanding of specific medical conditions. Concerning Alzheimer’s Disease, there is a lack of reliable diagnostics and this can be related to the availability of only small-scale ongoing biomarker studies and longitudinal cohorts including these subjects. Aiming to generate more s...

The NAD Interactome, Identification of Putative New NAD-Binding Proteins

Chapter

Oct 2022

Nicotinamide adenine dinucleotide (NAD) is an essential metabolite in normal cellular physiology and its deregulation may lead to several pathological conditions. NAD interacts with a vast number of proteins, acting as a coenzyme, as a substrate and regulating the interaction between proteins. The goals of this study were to characterize the protei...

On the Estimation of Depression Through Social Mining

Chapter

Sep 2022

This paper describes the work conducted by the Bioinformatics group of the Institute of Electronics and Engineering Informatics of University of Aveiro through several participations in the CLEF eRisk shared tasks related to the estimation of the level of depression. The eRisk initiative fosters Natural Language Processing research for the automati...

Preserving Privacy when Querying OMOP CDM Databases

Chapter

Full-text available

Aug 2022

Anonymisation is currently one of the biggest challenges when sharing sensitive personal information. Its importance depends largely on the application domain, but when dealing with health information, this becomes a more serious issue. A simpler approach to avoid inadequate disclosure is to ensure that all data that can be associated directly with...

Automatic Classification of Stigmatizing Articles of Mental Illness: The Case of Portuguese Online Newspapers

Chapter

Aug 2022

The stigma related to mental health continues to be present in online newspapers, where mental diseases are often used metaphorically to refer to entities or situations outside the clinical of mental health. This project explores the implementation of Artificial Intelligence and Natural Language Processing techniques for the task of automatically c...

Correction to: Designing optimized drug candidates with Generative Adversarial Network

Article

Full-text available

Aug 2022

Visualising Time-evolving Semantic Biomedical Data

Conference Paper

Jul 2022

Combining heterogeneous patient-level data into tranSMART to support multicentre studies

Conference Paper

Jul 2022

A secure architecture for exploring patient-level databases from distributed institutions

Conference Paper

Jul 2022

The general workflow. This model is composed of an Encoder–Decoder (A...

Data preprocessing of the SMILES string. A Acetylsalicylic Acid using...

The detailed structure of the Encoder (A) and Decoder (B). This model...

General schema of LSTM-based Predictor architecture. This regression...

Comparison of the predicted pIC 50 distributions for the original data...

Designing optimized drug candidates with Generative Adversarial Network

Article

Full-text available

Jun 2022

Drug design is an important area of study for pharmaceutical businesses. However, low efficacy, off-target delivery, time consumption, and high cost are challenges and can create barriers that impact this process. Deep Learning models are emerging as a promising solution to perform de novo drug design, i.e., to generate drug-like molecules tailored...

DTITR: End-to-end drug-target binding affinity prediction with transformers

Article

Full-text available

Jun 2022

The accurate identification of Drug-Target Interactions (DTIs) remains a critical turning point in drug discovery and understanding of the binding process. Despite recent advances in computational solutions to overcome the challenges of in vitro and in vivo experiments, most of the proposed in silico-based methods still focus on binary classificati...

Dictionary-based encoding followed by one-hot encoding applied to the...

CNN-FCNN binding affinity prediction model

CNN-FCNN model predictions against the true values for the Davis kinase...

PSSM Motifs - LGrad-RAM\documentclass[12pt]{minimal}...

Explainable deep drug–target representations for binding affinity prediction

Article

Full-text available

Jun 2022

Background Several computational advances have been achieved in the drug discovery field, promoting the identification of novel drug-target interactions and new leads. However, most of these methodologies have been overlooking the importance of providing explanations to the decision-making process of Deep Learning architectures. In this research s...

Fig. 1. Examples of WSI modalities. (a) Brightfield image, (b)...

Fig. 3. Categorizing of image analysis levels according to...

Fig. 4. Examples of different types of variation in tissue images. On...

Bio-imaging software used in DP comparison.

Software tools and platforms in Digital Pathology: a review for clinicians and computer scientists

Article

Full-text available

Jun 2022

At the end of the twentieth century, new technology was developed that allowed an entire tissue section to be scanned on an objective slide. Originally called virtual microscopy, this technology it is now known as Whole Slide Imaging (WSI). WSI presents new challenges for reading, visualization, storage, and analysis. For this reason, several techn...

Discovery of Biomedical Databases Through Semantic Questioning

Chapter

Full-text available

May 2022

Many clinical studies are greatly dependent on an efficient identification of relevant datasets. This selection can be performed in existing health data catalogues, by searching for available metadata. The search process can be optimised through questioning-answering interfaces, to help researchers explore the available data present. However, when...

Figure 3 General schema of LSTM-based Predictor architecture. SMILES...

Figure 4 Comparison of the predicted pIC50 distributions for the...

Figure 7 Scatter plots from applying the Predictor for the binding...

Figure 8 Distribution of the predicted pIC50 values for the unbiased...

Figure 11 Distribution of the predicted pIC50 values for different...

Designing Optimized Drug Candidates With Generative Adversarial Network

Preprint

Full-text available

Mar 2022

Figure 1. Data analysis. Schematic representation showing the pipeline...

Figure 3. Gene ontology results for 122 potential NAPRT RNA binding...

Figure 4. Network of interactions between 17 genes targeted by more...

Transcription factors and RNA binding proteins with a significant...

NAPRT Expression Regulation Mechanisms: Novel Functions Predicted by a Bioinformatics Approach

Article

Full-text available

Dec 2021

The nicotinate phosphoribosyltransferase (NAPRT) gene has gained relevance in the research of cancer therapeutic strategies due to its main role as a NAD biosynthetic enzyme. NAD metabolism is an attractive target for the development of anti-cancer therapies, given the high energy requirements of proliferating cancer cells and NAD-dependent signali...

Figure 2: BIcenter task editor. ETL Task editor of BIcenter, where it...

Figure 3: Usagi Mapper component. Configuration view for the Usagi...

Figure 4: Data transformations. Illustration of cohort raw data (first...

Harmonising Alzheimer's Disease Cohorts using a Common ETL Tool

Preprint

Full-text available

Dec 2021

Background: Many scientific studies have sought to obtain better understanding of specific medical conditions. Concerning Alzheimer’s Disease, there is a lack of reliable diagnostics and this can be related to the availability of only small-scale ongoing biomarker studies and longitudinal cohorts including these subjects. Aiming to generate more su...

BIcenter: A collaborative Web ETL solution based on a reflective software approach

Article

Dec 2021

The continuous growth of new sources of information has led to an unprecedented increase in the data collected. The dimensionality and heterogeneity of these data requires efficient strategies for searching, accessing and integrating from multiple repositories. The techniques underlying this goal are usually known as Extraction, Transformation and...

A methodology for cohort harmonisation in multicentre clinical research

Article

Oct 2021

Many clinical trials and scientific studies have been conducted aiming for better understanding of specific medical conditions. However, these studies are often based on a small number of participants due to the difficulty in finding people with similar medical characteristics and available to participate in the studies. This is particularly critic...

Fig. 1. The general framework contains 4 DL modules: an unbiased...

Fig. 2. Predictor architecture for the BBB permeability: The ECFP...

Fig. 3. AA2AR QSAR scatter plot and evaluation metrics: MSE, Q 2 and CCC

Comparison of different combinations of descriptor and oversampling...

Comparison of the non-dominated solutions obtained for each...

Optimizing blood–brain barrier permeation through deep reinforcement learning for de novo drug design

Article

Full-text available

Jul 2021

Motivation The process of placing new drugs into the market is time-consuming, expensive and complex. The application of computational methods for designing molecules with bespoke properties can contribute to saving resources throughout this process. However, the fundamental properties to be optimized are often not considered or conflicting with ea...

General architecture for template‐based question answering over...

General architecture for question answering over knowledge bases based...

Systematic review of question answering over knowledge bases

Article

Full-text available

Jun 2021

Abstract Over the years, a growing number of semantic data repositories have been made available on the web. However, this has created new challenges in exploiting these resources efficiently. Querying services require knowledge beyond the typical user’s expertise, which is a critical issue in adopting semantic information solutions. Several propos...

A Two-Stage Workflow to Extract and Harmonize Drug Mentions from Clinical Notes into Observational Databases

Article

Jun 2021

Background: The content of the clinical notes that have been continuously collected along patients' health history has the potential to provide relevant information about treatments and diseases, and to increase the value of structured data available in Electronic Health Records (EHR) databases. EHR databases are currently being used in observatio...

A Comparative Analysis of Data Platforms for Rare Diseases

Conference Paper

Full-text available

Jun 2021

Improvements in lymphocytes detection using deep learning with a preprocessing stage

Conference Paper

Full-text available

Jun 2021

An Architecture to Define Cohorts over Medical Imaging Datasets

Conference Paper

Full-text available

Jun 2021

Easing the Questioning of Semantic Biomedical Data

Conference Paper

Jun 2021

A Recommender System to Help Refining Clinical Research Studies

Chapter

Full-text available

May 2021

The process of refining the research question in a medical study depends greatly on the current background of the investigated subject. The information found in prior works can directly impact several stages of the study, namely the cohort definition stage. Besides previous published methods, researchers could also leverage on other materials, such...

Absolutist words validated by Al-Mosaiwi et al. [30].

Cross-evaluation of social mining for classification of depressed online personas

Article

Full-text available

May 2021

With the continuous increase in the use of social networks, social mining is steadily becoming a powerful component of digital phenotyping. In this paper we explore social mining for the classification of self-diagnosed depressed users of Reddit as social network. We conduct a cross evaluation study based on two public datasets in order to understa...

Leveraging Clinical Notes for Enhancing Decision-Making Systems with Relevant Patient Information

Chapter

Mar 2021

Personalised treatment is usually needed for hospitalised patients afflicted by secondary illnesses that demand daily medication. Even though clinical guidelines were designed to consider those circumstances exist, current decision-support features fail to assimilate detailed relevant patient information. This creates opportunities for the developm...

Figure 1: The system architecture, which follows a client-server model....

Figure 2: Client component architecture. It is represented the client...

Figure 3: Individual patient view. This figure represents: a) the...

Figure 4: The Protocol editor view. This figure represents: a) the...

Figure 5: Part of the RESTFull web service response for a GET request...

A Software Solution for Clinical Protocol Management

Preprint

Full-text available

Mar 2021

Clinical treatments are mostly the result of consecutive success of medical procedures. The patterns in those procedures lead to creation of clinical guidelines which are currently essential to have better health treatments. The use of electronic health record systems (EHR) helps the patient management, but it fails in the treatment guidance due to...

Machine Learning for Depression Screening in Online Communities

Chapter

Jan 2021

Social media writings have been explored over the last years, in the context of mental health, as a potential source of information for extending the so-called digital phenotyping of a person. In this paper we present a computational approach for the classification of depressed social media users. We conducted a cross evaluation study based on two...

Bilingual Emotion Analysis on Social Media throughout the COVID19 Pandemic in Portugal

Conference Paper

Jan 2021

A semi-automatic methodology for analysing distributed and private biobanks

Article

Full-text available

Dec 2020

Privacy issues limit the analysis and cross-exploration of most distributed and private biobanks, often raised by the multiple dimensionality and sensitivity of the data associated with access restrictions and policies. These characteristics prevent collaboration between entities, constituting a barrier to emergent personalized and public health ch...

Lifelog Moment Retrieval Web Application

Conference Paper

Oct 2020

UA.PT Bioinformatics at ImageCLEF 2020: Lifelog Moment Retrieval Web based Tool

Conference Paper

Full-text available

Sep 2020

This paper describes the participation of the Bioinformatics group of the Institute of Electronics and Engineering Informatics of University of Aveiro in the ImageCLEF lifelog task, more specifically in the Lifelog Moment Retrieval (LMRT) sub-task. In our first participation last year we tackled the LMRT challenge with an automatic approach. Follow...

File Forgery Detection Using a Weighted Rule-Based System

Chapter

Sep 2020

The society is becoming increasingly dependent on digital data sources. However, our trust on the sources and its contents is only ensured if we can also rely on robust methods that prevent fraudulent forgery. As digital forensic experts are continually dealing with the detection of forged data, new fraudulent approaches are emerging, making it dif...

SCALEUS-FD architecture and implementation technologies. At the file...

SCALEUS-FD: A FAIR Data Tool for Biomedical Applications

Article

Full-text available

Aug 2020

The Semantic Web and Linked Data concepts and technologies have empowered the scientific community with solutions to take full advantage of the increasingly available distributed and heterogeneous data in distinct silos. Additionally, FAIR Data principles established guidelines for data to be Findable, Accessible, Interoperable, and Reusable, and t...

Venn diagram of common GenBank identifiers. The Venn diagram presents...

MDS plot before and after batch-adjustment. MDS was performed using the...

Gene Ontology significant results. Gene Ontology significant results of...

Protein-protein interactions of up-regulated protein coding genes. a...

Merging microarray studies to identify a common gene expression signature to several structural heart diseases

Article

Full-text available

Jul 2020

Background: Heart disease is the leading cause of death worldwide. Knowing a gene expression signature in heart disease can lead to the development of more efficient diagnosis and treatments that may prevent premature deaths. A large amount of microarray data is available in public repositories and can be used to identify differentially expressed...

Fig. 3. Rearrangements map generated with GTO using the pipeline:...

Fig. 4. Rearrangements map generated with GTO using the pipeline:

GTO: A toolkit to unify pipelines in genomic and proteomic research

Article

Full-text available

Jul 2020

Next-generation sequencing triggered the production of a massive volume of publicly available data and the development of new specialised tools. These tools are dispersed over different frameworks, making the management and analyses of the data a challenging task. Additionally, new targeted tools are needed, given the dynamics and specificities of...

Multi-language Concept Normalisation of Clinical Cohorts

Conference Paper

Jul 2020

A Recommender System to Help Discovering Cohorts in Rare Diseases

Conference Paper

Full-text available

Jul 2020

Social Media Mining for Postpartum Depression Prediction

Article

Jun 2020

This study investigated the feasibility of a postpartum depression predictor based on social media writings. The current broad use of social media networks generates a large amount of digital data, which, when coupled with artificial intelligence methods, have the potential to disclose significant health related insights. In this paper we explore t...

A Recommender System Based on Cohorts' Similarity

Article

Jun 2020

Aiming to better understand the genetic and environmental associations of Alzheimer's disease, many clinical trials and scientific studies have been conducted. However, these studies are often based on a small number of participants. To address this limitation, there is an increasing demand of multi-cohorts studies, which can provide higher statist...

Towards a More Reproducible Biomedical Research Environment: Endorsement and Adoption of the FAIR Principles

Chapter

May 2020

The FAIR guiding Principles for scientific data management and stewardship are a fundamental enabler for digital transformation and transparent research. They were designed with the purpose of improving data quality, by making it Findable, Accessible, Interoperable and Reusable. While these principles have been endorsed by both data owners and regu...

Understanding Depression from Psycholinguistic Patterns in Social Media Texts

Chapter

Apr 2020

The World Health Organization reports that half of all mental illnesses begin by the age of 14. Most of these cases go undetected and untreated. The expanding use of social media has the potential to leverage the early identification of mental health diseases. As data gathered via social media are already digital, they have the ability to power up...

Image Selection based on Low Level Properties for Lifelog Moment Retrieval

Conference Paper

Full-text available

Jan 2020

The increasing number of mobile and wearable devices is dramatically changing the way we collect data about person’s life. These devices allow recording our daily activities and behavior in several forms, e.g., text, images, bio-signals, or video. However, many times, the collected data includes low quality or irrelevant contents, feeding lifeloggi...

GTO: a toolkit to unify pipelines in genomic and proteomic research

Preprint

Full-text available

Jan 2020

Enhancing Decision-making Systems with Relevant Patient Information by Leveraging Clinical Notes

Conference Paper

Jan 2020

Diagram of the OM methodology pipeline. A Reference Set of an organism...

OM methodology ROC curves. ROC curves are obtained by OM application...

OM methodology ROC curves with normalized data. ROC curves are obtained...

Handling Noise in Protein Interaction Networks

Article

Full-text available

Oct 2019

Protein-protein interactions (PPIs) can be conveniently represented as networks, allowing the use of graph theory for their study. Network topology studies may reveal patterns associated with specific organisms. Here, we propose a new methodology to denoise PPI networks and predict missing links solely based on the network topology, the organizatio...

UA.PT Bioinformatics at ImageCLEF 2019: Lifelog Moment Retrieval based on Image Annotation and Natural Language Processing

Conference Paper

Sep 2019

Exploring the Value of Electronic Health Records from Multiple Datasets

Chapter

Aug 2019

During the last decades, most European countries dedicated huge efforts in collecting and maintaining Electronic Health Records (EHR). With the continuous grow of these datasets, it became obvious that its secondary use for research may lead to new insights about diseases and treatments outcomes.

Users workflow - for the study manager (in white boxes), and for the...

TASKA: A modular task management system to support health research studies

Article

Full-text available

Jul 2019

Background: Many healthcare databases have been routinely collected over the past decades, to support clinical practice and administrative services. However, their secondary use for research is often hindered by restricted governance rules. Furthermore, health research studies typically involve many participants with complementary roles and respon...

GenericCDSS - A Generic Clinical Decision Support System

Conference Paper

Jun 2019

Patient data discovery platforms as enablers of biomedical and translational research: A systematic review

Article

May 2019

Background: The global shift from paper health records to electronic ones has led to an impressive growth of biomedical digital data along the past two decades. Exploring and extracting knowledge from these data has the potential to enhance translational research and lead to positive outcomes for the population's health and healthcare. Obective:...

EMIF Catalogue: A collaborative platform for sharing and reusing biomedical data

Article

Mar 2019

Objective: The collaboration and knowledge exchange between researchers are often hindered by the nonexistence of accurate information about which databases may support research studies. Even though a considerable amount of patient health information does exist, it is usually distributed and hidden in many institutions. The goal of this project is...

Handling Noise in Protein Interaction Networks

Preprint

Full-text available

Jan 2019

Protein-protein interactions (PPI) can be conveniently represented as networks, allowing the use of graph theory in their study. Network topology studies may reveal patterns associated to specific organisms. Here we propose a new methodology to denoise PPI networks and predict missing links solely based on the network topology, the Organization Mea...

FAIRness in Biomedical Data Discovery

Conference Paper

Jan 2019

Health monitoring systems through smartphones: a systematic review and users’ expectations (Preprint)

Preprint

Oct 2018

BACKGROUND With the current society’s lifestyle people became more concerned and started seeking for solutions that may help them to monitor their health conditions. Traditional monitoring systems present some limitations and today’s smartphones appear to be a good tool as they are unobtrusive and discrete. Additionally, they can continuously colle...

Figure 1. Flow-chart describing the selection of the studies for the...

Figure 2: Distribution of rejected papers resulting from the full-text...

Figure 3: Number of unique returned papers by year

Figure 4: Study fields of the selected papers

Figure 5: Source of the health-related data in percentage

Passive sensing of health outcomes through smartphones: a systematic review of current solutions and possible limitations (Preprint)

Article

Full-text available

Oct 2018

Background: Technological advancements, together with the decrease in both price and size of a large variety of sensors, has expanded the role and capabilities of regular mobile phones, turning them into powerful yet ubiquitous monitoring systems. At present, smartphones have the potential to continuously collect information about the users, monit...

Smartphone as data collector in health monitoring

Conference Paper

Oct 2018

Sensing health and well-being parameters from citizens and patients has been an increasing concern in our society. However, since the traditional data collection methods rely mostly on dedicated and expensive equipments, in the recent years, the potential of smartphones has been largely investigated because of its unobtrusiveness and embedded senso...

Automated ICD-9-CM medical coding of diabetic patient's clinical reports

Conference Paper

Oct 2018

The assignment of ICD-9-CM codes to patient's clinical reports is a costly and wearing process manually done by medical personnel, estimated to cost about $25 billion per year in the United States. To develop a system that automates this process has been an ambition of researchers but is still an unsolved problem due to the inherent difficulties in...

Simplifying the Digitization of Clinical Protocols for Diabetes Management

Conference Paper

Jun 2018

Services Orchestration and Workflow Management in Distributed Medical Imaging Environments

Conference Paper

Full-text available

Jun 2018

A FAIR Marketplace for Biomedical Data Custodians and Clinical Researchers

Conference Paper

Jun 2018

Fighting Fire with Fire: Computational Prediction of Microbial Targets for Bacteriocins

Chapter

Mar 2018

MONTRA: An agile architecture for data publishing and discovery

Article

Mar 2018

Background and Objective Data catalogues are a common form of capturing and presenting information about a specific kind of entity (e.g. products, services, professionals, datasets, etc.). However, the construction of a web-based catalogue for a particular scenario normally implies the development of a specific and dedicated solution. In this paper...

A Methodology to Perform Semi-automatic Distributed EHR Database Queries

Conference Paper

Jan 2018

The proliferation of electronic health databases has resulted in the existence of a wide collection of diversified clinical digital data. These data are fragmented over dispersed databases in different clinical silos around the world. The exploration of these electronic health records (EHRs) is essential for clinical and pharmaceutical research and...

A Computational Pipeline for Sepsis Patients’ Stratification and Diagnosis

Conference Paper

Jan 2018

A Modular Workflow Management Framework

Conference Paper

Full-text available

Jan 2018

Task management systems are crucial tools in modern organizations, by simplifying the coordination of teams and their work. Those tools were developed mainly for task scheduling, assignment, follow-up and accountability. Then again, scientific workflow systems also appeared to help putting together a set of computational processes through the pipel...

A Methodology for Fine-Grained Access Control in Exposing Biomedical Data

Article

Jan 2018

Biomedical data integration and processing is a very sensitive issue and a main barrier for research, since it normally implies dealing with private clinical information. To overcome this problem, we propose a solution based on multiple levels of data visibility, combined with a fine-grained access control over the shared data. Through our proposal...

Biomedical Informatics - How to Choose the Best Tool for Each Task

Article

Jan 2018

The ever-increasing number of bioinformatics software tools that are publicly available, is leading to greater expectations about its regular use in clinical practice. However, from the end-users' perspective, they face many time the challenge of choosing the right tool for each task, from a panoply of solutions that have been developed over the ye...

COEUS 2.0: An automated platform to integrate and publish biomedical data as nanopublications

Article

Dec 2017

Publishing, analysing or properly accessing the abundant information resulting largely from experimental studies in the biomedical domain are current challenges for the research community. Problems with the extraction of relevant information, redundant data, and lack of associations or provenance are good examples of the main concerns. The innovati...

Figure 1. Semantic-based architecture for scientific information...

Figure 2. Annotation model: sample extraction of the integration and...

Figure 3. Relation model: sample extraction of the integration and...

Figure 4. Validation workflow overview. (1) Dataset is extracted from...

Figure 5. Knowledge base sample annotation model. The annotators...

A semantic-based workflow for biomedical literature annotation

Article

Full-text available

Nov 2017

Computational annotation of textual information has taken on an important role in knowledge extraction from the biomedical literature, since most of the relevant information from scientific findings is still maintained in text format. In this endeavour, annotation tools can assist in the identification of biomedical concepts and their relationships...

A Sequence-Based Mesh Classifier for the Prediction of Protein-Protein Interactions

Article

Nov 2017

The worldwide surge of multiresistant microbial strains has propelled the search for alternative treatment options. The study of Protein-Protein Interactions (PPIs) has been a cornerstone in the clarification of complex physiological and pathogenic processes, thus being a priority for the identification of vital components and mechanisms in pathoge...

Knowledge federation architecture, integrating distributed patient...

Simplified registry publication workflow.

Patient registry model overview. Facioscapulohumeral Muscular Dystrophy...

Linked Registries Web application interface.

Linked Registries: Connecting Rare Diseases Patient Registries through a Semantic Web Layer

Article

Full-text available

Oct 2017

Patient registries are an essential tool to increase current knowledge regarding rare diseases. Understanding these data is a vital step to improve patient treatments and to create the most adequate tools for personalized medicine. However, the growing number of disease-specific patient registries brings also new technical challenges. Usually, thes...

Figure 1. Software development process: including the several stages of...

Figure 2. Example of a strategy for SCM workflow based on Git. It is an...

Figure 3. The deployment of each new release should follow three...

General guidelines for biomedical software development

Article

Full-text available

Jul 2017

Most bioinformatics tools available today were not written by professional software developers, but by people that wanted to solve their own problems, using computational solutions and spending the minimum time and effort possible, since these were just the means to an end. Consequently, a vast number of software applications are currently availabl...

De-identification service life-cycle. This diagram depicts a sequence...

Web application for de-identification of medical imaging studies (top –...

This figure shows one sample from each ultrasound equipment used to...

Overall system performance in the first version (1st approach) and...

A De-Identification Pipeline for Ultrasound Medical Images in DICOM Format

Article

Full-text available

Apr 2017

Clinical data sharing between healthcare institutions, and between practitioners is often hindered by privacy protection requirements. This problem is critical in collaborative scenarios where data sharing is fundamental for establishing a workflow among parties. The anonymization of patient information burned in DICOM images requires elaborate pro...

Intelligent Generator of Big Data Medical Imaging Repositories

Article

Feb 2017

The production of medical imaging data has grown tremendously in the last decades. Nowadays, even small institutions produce a considerable amount of studies. Furthermore, the general trend in new imaging modalities is to produce more data per examination. As a result, the design and implementation of tomorrow's storage and communication systems mu...