Nacéra Bennacer Seghouani
Laboratoire Interdisciplinaire des Sciences du Numérique

Professor

About

Reads: 9,129

Citations: 390

Nacéra Bennacer Seghouani is a professor in the Computer Science Department of CentraleSupélec and a research member of LRI (Laboratoire de Recherche en Informatique). Her research currently focuses on information extraction (multilingual text, unreconciled entities, structured data as graphs, ...), learning user profiles in social networks from heterogeneous data, and large network analysis.

Skills and Expertise

Semantic Web

Semantic Web Technologies

Information Extraction

Knowledge Representation

Natural Language Processing

Artificial Intelligence

Ontologies

Information Retrieval

Knowledge Acquisition

Conceptual Modeling

Publications

Weakly Supervised Named Entity Recognition for Carbon Storage Using Deep Neural Networks

Chapter

Nov 2022

Applying transfer learning based on pre-trained language models has become popular in Natural Language Processing. In this paper, we present a weakly supervised Named Entity Recognition system that uses a pre-trained BERT model and applies two consecutive fine-tuning steps. We aim to reduce the amount of human labour required for annotating data by...

Sampling-based Label Propagation for Balanced Graph Partitioning

Conference Paper

Apr 2022

Inferring psychological traits from spending categories and dynamic consumption patterns

Article

Full-text available

Dec 2021

In recent years there has been a growing interest in analyzing human behavioral data generated by new technologies. One type of digital footprint that is universal across the world, but that has received relatively little attention to date, is spending behavior. In this paper, using the transaction records of 1306 bank customers, we investigated th...

Smart Crawling: A New Approach toward Focus Crawling from Twitter

Preprint

Full-text available

Oct 2021

Twitter is a social network that offers a rich and interesting source of information that is challenging to retrieve and analyze. Twitter data can be accessed using a REST API. The available operations allow retrieving tweets on the basis of a set of keywords but with limitations such as the number of calls per minute and the size of results. Besides, ther...

BGRAP: Balanced GRAph Partitioning Algorithm for Large Graphs

Article

Jun 2021

The definition of effective strategies for graph partitioning is a major challenge in distributed environments, since effective graph partitioning can considerably improve the performance of large graph data analytics computations. In this paper, we propose a multi-objective and scalable Balanced GRAph Partitioning (BGRAP) algorithm, based...

A Graph Partitioning Algorithm for Edge or Vertex Balance

Book

Sep 2020

A Graph Partitioning Algorithm for Edge or Vertex Balance

Chapter

Sep 2020

PROCLAIM: An Unsupervised Approach to Discover Domain-Specific Attribute Matchings from Heterogeneous Sources

Chapter

Aug 2020

Schema matching is a critical problem in many applications where the main goal is to match attributes coming from heterogeneous sources. In this paper, we propose PROCLAIM (PROfile-based Cluster-Labeling for AttrIbute Matching), an automatic, unsupervised clustering-based approach to match attributes of a large number of heterogeneous sources. We d...

PROCLAIM: An Unsupervised Approach to Discover Domain-Specific Attribute Matchings from Heterogeneous Sources

Conference Paper

Full-text available

Apr 2020

Profile Reconciliation Through Dynamic Activities Across Social Networks

Chapter

May 2019

Since today’s online social media serve diverse purposes such as social and professional networking, photo and blog sharing, it is not uncommon for people to have multiple profiles across different social networks. Finding or reconciling these profiles would allow the creation of a holistic view of different facets of a person’s life that can be us...

Determining the interests of social media users: two approaches

Article

Full-text available

Jul 2018

Although social media platforms serve diverse purposes, from social and professional networking to photo sharing and blogging, people frequently use them to share their thoughts and opinions and, most importantly, their interests (e.g., politics, economy, sports). Understanding the interests of social media users is key to many applications that need...

Modeling Strategies for Storing Data in Distributed Heterogeneous NoSQL Databases: 37th International Conference, ER 2018, Xi'an, China, October 22–25, 2018, Proceedings

Chapter

Sep 2018

In this work, we propose HerM (Heterogeneous Distributed Model), a NoSQL data modeling approach which supports the use of multiple heterogeneous NoSQL systems in a distributed environment. We define the conceptual elements necessary for data modeling, and we identify optimized data distribution patterns. We implemented a flexible framework, where w...

A Frequent Named Entities-Based Approach for Interpreting Reputation in Twitter

Article

Full-text available

Jun 2018

Twitter is a social network that provides a powerful source of data. The analysis of these data raises many challenges, among which stands out the opportunity to find the reputation of a product, a person or any other entity of interest. Several approaches for sentiment analysis have been proposed in the literature to assess the general opinion expresse...

LIAISON: ReconciLIAtion of individuals profiles across social networks

Chapter

Full-text available

Nov 2017

Social Networking Sites, such as Twitter and LinkedIn, are clear examples of the impact that the Web 2.0 has on people around the world, because they target an aspect of life that is extremely important to anyone: social relationships. The key to building a social network is the ability to find people that we know in real life, which, in turn,...

FRISK: A Multilingual Approach to Find twitteR InterestS via wiKipedia

Conference Paper

Full-text available

Oct 2017

Several studies have shown that the users of Twitter reveal their interests (i.e., what they like) while they share their opinions, preferences and personal stories.

Interpreting Reputation Through Frequent Named Entities in Twitter

Conference Paper

Oct 2017

Twitter is a social network that provides a powerful source of data. The analysis of these data raises many challenges, among which stands out the opportunity to find the reputation of a product, of a person, or of any other entity of interest. Several tools for sentiment analysis have been built in order to calculate the general opinion of an entit...

Eliminating Incorrect Cross-Language Links in Wikipedia

Conference Paper

Full-text available

Oct 2017

Many Wikipedia articles that cover the same topic in different language editions are interconnected via cross-language links that enable the understanding of topics in multiple languages, as well as cross-language information retrieval applications. However, cross-language links are added manually by the users of Wikipedia and, as such, are often i...

Wikipedia-based extraction of key information from resumes

Conference Paper

May 2017

LIAISON: reconciLIAtion of Individuals Profiles Across SOcial Networks

Article

Jan 2017

Toponym disambiguation in online social network profiles

Conference Paper

Full-text available

Nov 2015

In online social networks individuals are given the option to reveal on their online profiles some personal information about themselves including, among others, their home location that, if specified, is typically referred to with a toponym. A toponym disclosed by an individual on her profile, or self-reported toponym (e.g., "London"), is often am...

A Multilingual Approach to Discover Cross-Language Links in Wikipedia (Web Information Systems Engineering)

Conference Paper

Full-text available

Oct 2015

Wikipedia is a well-known public and collaborative encyclopaedia consisting of millions of articles. Initially in English, the popular website has grown to include versions in over 288 languages. These versions and their articles are interconnected via cross-language links, which not only facilitate navigation and understanding of concepts in multi...

A Multi-Level Framework for Validation of Ontology-Driven and Community-Based Web Services Composition

Article

Jul 2014

This paper proposes an Ontology-driven and Community-based Web Services (OCWS) framework which aims at automating discovery, composition and execution of web services. The purpose is to validate and to execute a user’s request built from the composition of a set of OCWS descriptions and a set of user constraints. The defined framework separates cle...

Matching User Profiles Across Social Networks

Conference Paper

Full-text available

Jun 2014

Social Networking Sites, such as Facebook and LinkedIn, are clear examples of the impact that the Web 2.0 has on people around the world, because they target an aspect of life that is extremely important to anyone: social relationships. The key to building a social network is the ability to find people that we know in real life, which, in turn,...

Réconciliation des profils dans les réseaux sociaux

Article

Full-text available

Jan 2014

With the arrival of Web 2.0, we are witnessing a proliferation of social networking services that place the user at the center of attention. These services make it possible to share resources (YouTube, Flickr, Del.icio.us), to exchange information and to build personal or professional relationships (Facebook, LinkedIn) or...

REISA : enrichissement contrôlé de bases de connaissances à partir de documents annotés

Article

Jan 2014

Controlled Knowledge Base Enrichment from Web Documents

Conference Paper

Nov 2012

The Linked Open Data initiative has led to more and more RDF data sources being published on the Web. However, these data sources contain relatively little information compared to the documents available on the surface Web. Many annotation tools have been proposed in the last decade for the automatic construction and enrichment of knowledge bases. But...

REISA : Enrichissement contrôlé de bases de connaissances à partir de documents annotés

Conference Paper

Full-text available

Jun 2012

Abstract: Thanks to Linked Open Data, more and more RDF sources are being made available on the web. However, although these sources are large and evolving, they contain relatively little information compared with the volume of information contained in semi-structured documents. Many tools aim...

A Multi-Level Framework for Validation of Ontology-Driven and Community-Based Web Services Composition

Article

Mar 2012

Controlled enrichment of knowledge bases from annotated semi-structured documents

Article

Jan 2012

Conference Paper

Full-text available

Sep 2011

In the past few years, recommender systems and semantic web technologies have become main subjects of interest in the research community. In this paper, we present a domain independent semantic similarity measure that can be used in the recommendation process. This semantic similarity is based on the relations between the individuals of an ontology...

A Collaborative and Semantic-based Approach for Recommender Systems

Conference Paper

Full-text available

Jan 2011

The constant growth of the Internet has made recommender systems very useful to guide users coping with a large amount of data. In this paper, we present a domain independent collaborative and semantic-based recommender system which uses distinct and complementary modules. The approach targets users with various interests and is based on: (i) a col...

Supporting Semantic Search on Heterogeneous Semi-structured Documents

Conference Paper

Full-text available

Jun 2010

This paper presents SHIRI-Querying, an approach for semantic search in semistructured documents. We propose a solution to tackle incompleteness and imprecision of annotations at querying time. This solution relies on two elementary reformulation types that use the notion of aggregation and the documents structure. We present the dynamic algorithm (...

Supporting Information Retrieval in RSS Feeds.

Conference Paper

Apr 2010

Une approche pour la recherche sémantique de l'information dans les documents semi-structurés hétérogènes

Article

Full-text available

Mar 2010

This paper presents SHIRI-Querying, an approach for semantic information retrieval in semi-structured documents. We propose a solution to overcome the incompleteness and imprecision of annotations at query time. This solution relies on two types of elementary reformulations that exploit the notion of aggregation...

An approach to semantic information retrieval in heterogeneous semi-structured documents

Article

Jan 2010

Incremental Ontology-Based Extraction and Alignment in Semi-Structured Documents

Conference Paper

Full-text available

Aug 2009

SHIRI is an ontology-based system for the integration of semi-structured documents related to a specific domain. The system's purpose is to allow users to access relevant parts of documents as answers to their queries. SHIRI uses RDF/OWL for representation of resources and SPARQL for their querying. It relies on an automatic, unsupervised and ontol...

Aggregative and Neighboring Approximations to Query Semi-Structured Documents

Conference Paper

Jan 2009

A Framework for the Semantic Composition of Web Services Handling User Constraints

Conference Paper

Full-text available

Sep 2008

In this paper, we present a model of composition for semantic web services based on Statecharts. The environment is based on a distributed architecture and on semantic and uniform ontology-based and community service descriptions. Thus, the community constitutes a unified, homogeneous and consistent access interface for existing heterogeneous e-...

A Framework for the Semantic Composition of Web Services Handling User Constraints

Article

Sep 2008

In this work, we present a framework for the semantic composition of web services based on Statecharts and uniform community service descriptions. Our model is a two-step process. In the first step, we derive the execution model of the user's query. The execution model is specified in the Statecharts formalism, whereas the user's query is described in...

Contextual and Metadata-based Approach for the Semantic Annotation of Heterogeneous Documents

Article

Full-text available

Jun 2008

In this paper, we present SHIRI-Annot, an automatic, ontology-driven and unsupervised approach for the semantic annotation of documents which contain more or less structured parts. The aim of this approach is to build an integration system called SHIRI which allows the user access to documents related to a specific domain. In this system, the...

Metadata- and Ontology-Based Semantic Web Mining

Chapter

Jan 2008

The increasing volume of data available on the Web makes information retrieval a tedious and difficult task. The vision of the Semantic Web introduces the next generation of the Web by establishing a layer of machine-understandable data, e.g., for software agents, sophisticated search engines and Web services. The success of the Semantic Web crucia...

A Statechart-Based Model for the Semantic Composition of Web Services

Conference Paper

Full-text available

Jul 2007

In this paper, we present a semantic web services composition framework based on a distributed architecture and on semantic and uniform ontology-based service descriptions. Central to our work is the use of the concept of community of services. A community constitutes a unified, homogeneous and consistent access interface to existing and heterogeneo...

Extraction Contextuelle de Concepts Ontologiques pour le Web Sémantique

Conference Paper

Full-text available

Jul 2007

Analyses and Fundamental ideas for a Relation Extraction Approach

Conference Paper

Full-text available

May 2007

Relation extraction is a difficult open research problem with important applications in several fields such as knowledge management, web mining, ontology building, intelligent systems, etc. In our research, we focus on extracting relations among the ontological concepts in order to build a domain ontology. In this paper, firstly, we answer some cru...

Contextual Concept Discovery Algorithm

Conference Paper

Full-text available

May 2007

In this paper, we focus on the ontological concept extraction and evaluation process from HTML documents. In order to improve this process, we propose an unsupervised hierarchical clustering algorithm, namely "Contextual Concept Discovery" (CCD), which is an incremental use of the partitioning algorithm K-means and is guided by a structural context. O...

Contextualization Vs Decontextualization: Impact on the Experts' concept Evaluation

Article

Mar 2007

Semantic Web Services Composition Framework

Article

Full-text available

Feb 2007

In this paper, we present a semantic web services composition framework based on a distributed architecture and on semantic and uniform ontology-based service descriptions. Central to our framework is the use of the concept of community of services. This concept provides means to describe services with the same language based on the community ontol...

WebDocEnrich : Enrichissement Sémantique Flexible de Documents Semi-Structurés

Conference Paper

Jan 2007

Abstract. We present an automatic approach for the semantic enrichment of HTML documents that exploits a description of the domain, more precisely a set of concepts, their properties, their relations, the associated cardinalities and their contextual characteristics, to semantically enrich the content of the documents. The process...

Context-based Hierarchical Clustering for the Ontology Learning

Conference Paper

Dec 2006

Ontologies provide a common layer which plays a major role in supporting information exchange and sharing. In this paper, we focus on the ontological concept extraction process from HTML documents. In order to improve this process, we propose an unsupervised hierarchical clustering algorithm namely "contextual ontological concept extraction" (COCE)...

A New Extraction Concept Based on Contextual Clustering

Conference Paper

Nov 2006

Ontologies provide a common layer that plays a major role in supporting information exchange and sharing. The proliferation of ontologies relies strongly on the automation of their building, integration and deployment processes. In this paper, we present an integrated framework involving complementary dimensions to drive the (semi) automatic acquisition conc...

Contextual Ontological Concepts Extraction

Conference Paper

Full-text available

Oct 2006

Ontologies provide a common layer which plays a major role in supporting information exchange and sharing. In this paper, we focus on the ontological concept extraction process from HTML documents. We propose an unsupervised hierarchical clustering algorithm namely “Contextual Ontological Concept Extraction” (COCE) which is an incremental use of a...

Extraction de concepts guidée par le contexte

Article

Sep 2006

Formalizing Mappings for OWL Spatiotemporal Ontologies

Conference Paper

Sep 2006

Ontology mappings provide a common layer which allows distributed applications to share and exchange semantic information. Providing mechanized ways of mapping ontologies is a challenging issue, and the main problems to be faced are related to structural and semantic heterogeneity. The complexity of these problems increases in the presence of spatio...

Metadata- and Ontology- Based Semantic Web Mining

Article

Full-text available

Feb 2006

The increasing volume of data available on the Web makes information retrieval a tedious and difficult task. The vision of the Semantic Web introduces the next generation of the Web by establishing a layer of machine-understandable data e.g. for software agents, sophisticated search engines and Web services. The success of the Semantic Web cruciall...

A framework for retrieving conceptual knowledge from Web pages

Conference Paper

Full-text available

Dec 2005

Ontologies provide a common layer which plays a major role in supporting information exchange and sharing. Their proliferation relies strongly on the automation of ontology building, integration and deployment processes. In this paper we introduce an integrated framework involving different and complementary dimensions to drive the (semi) autom...

Processus d'extraction de connaissances à partir du Web

Article

Oct 2005

Découverte des connaissances à partir des pages Web

Article

Jan 2005

Semantic Mappings in Description Logics for Spatio-Temporal Database Schema Integration

Article

Full-text available

Jan 2005

The interoperability problem arises in heterogeneous systems where different data sources coexist and there is a need for meaningful information sharing. One of the most representative realms of diversity of data representation is the spatio-temporal domain. Spatio-temporal data are most often described according to multiple and greatly diverse perce...

Representing and Reasoning for Spatiotemporal Ontology Integration

Conference Paper

Full-text available

Oct 2004

The World-Wide Web hosts many autonomous and heterogeneous information sources. In the near future each source may be described by its own ontology. The distributed nature of ontology development will lead to a large number of local ontologies covering overlapping domains. Ontology integration will then become an essential capability for effective...

Ontology Discovery from Web Pages : Application to Tourism

Article

Full-text available

Sep 2004

This paper deals with the automation of the ontology building process from HTML pages. Our methodology is based on the complementary use of two approaches. The first approach is based on Aussenac-Gilles methodology and requires feedback from the user to propose and refine concepts and their relationships. The second one exploits Web pages structure a...

Using OWL to Describe Pedagogical Resources

Conference Paper

Aug 2004

In this paper we present an example of a part of a pedagogical ontology for a grande école. OWL is intended to help users to formalize ontologies and to be a support for the semantic Web, for example to enable the interchange of resources and the inference of knowledge while querying these resources.

Formalizing For Querying Learning Objects Using OWL

Conference Paper

Aug 2004

The World Wide Web offers an increasing amount of complex and rich educational Web resources that are available for free in various domains. Unfortunately, it is difficult today to have a Web agent that answers precisely a simple query. Semantic Web aims to make Web resources meaningful to automated agents. Ontologies are proposed to provide a form...

Building a Pedagogical Ontology to Improve Information Retrieval

Article

Jun 2004

Probabilistic validation using worst event driven and importance sampling simulation

Conference Paper

Nov 1994

Probabilistic validation is an approach for the validation of highly dependable and complex systems. It relies on a partial analysis of a system model and tries to prove that the failed event occurrences have a sufficiently low probability. We define a probabilistic validation method using worst event driven and importance sampling simulation. Th...

Probabilistic Validation of a Remote Procedure Call Protocol.

Conference Paper

Jun 1994

The classical validation approach tries to prove that failed events (events that do not verify a user property) will never occur. Probabilistic validation relies on a partial analysis of a system model and tries to prove that the failed event occurrences have a sufficiently low probability. An incorrect behavior is a very rare event: it is the conseq...

Semantic Mappings in Description Logics for Spatio-Temporal Database Schema Integration

Article

Network

James Hendler
Rensselaer Polytechnic Institute
Alexander Maedche
Karlsruhe Institute of Technology
Maurizio Lenzerini
Sapienza University of Rome
Uli Sattler
The University of Manchester
Amit Sheth
University of South Carolina

Basav Roychoudhury
Indian Institute of Management Shillong
Veronica Rossano
Università degli Studi di Bari Aldo Moro
Maryam Hazman
Climate Change Information Center and Expert Systems, Agricultural Research Center (ARC), Egypt
Yudith Cardinale
Simon Bolívar University
Adrien Coulet
University of Lorraine