Services of Table 1 in the Pin × Pout space

Source publication

Top-k dominant web services under multi-criteria matching

Article

Full-text available

Apr 2009

As we move from a Web of data to a Web of services, enhancing the capabilities of the current Web search engines with effective and efficient techniques for Web services retrieval and selection becomes an important issue. Traditionally, the relevance of a Web service advertisement to a service request is determined by com-puting an overall score th...

Context 1

... the example of Table 1. Let S m i R,S = (si.Pin, si.Pout) denote the match vector under criterion fm i for the input and output parameters of service S. Figure 1 draws the degrees of match si as an instance in the Pin × Pout space for all services and criteria. For example, a1 corresponds to the de- grees of match of service A under fm 1 and, hence, has coordinates (0.96, 0.92). ...

View in full-text

Context 2

... (Cont'd). Consider object C with instances c1, c2 and c3 shown in Figure 1. Instance c1 is dominated by a1, a2 and a3, whereas it dominates b1, b3, d1, d2 and d3. ...

View in full-text

Context 3

... the current object U , the algorithm first searches for objects that fully dominate it. For example, in the case of the data set of Figure 1, with a single dominance check between bmax and amin, we can conclude that all instances b1, b2 and b3 are dominated by a1, a2 and a3. According to property 2, only objects with F (vmin) > F (umax) need to be checked. ...

View in full-text

Context 4

... step searches for individual instances v that dominate U . For ex- ample, in Figure 1, a dominance check between dmax (which co- incides with d1) and c1 shows that all instances d1, d2, and d3 are dominated by c1. As before, only instances with F (v) > F (umax) are considered. ...

View in full-text

Context 5

... it is dominated, the score of u is again increased by 1/M , and the threshold is checked. In Figure 1, this is the case with d3 and bmin. ...

View in full-text

Context 6

... in fact, is a known problem faced by the skyline computation approaches as well. As the dimensionality increases, it becomes increasingly more difficult to find instances dominating other instances; hence, many unnecessary dominance Figure 10: Effect of corr under low (left) and high (right) vari- ance var checks are performed. A possible work-around is to group together related service parameters so as to decrease the dimensionality of the match objects. ...

View in full-text

Context 7

... possible work-around is to group together related service parameters so as to decrease the dimensionality of the match objects. For the same reasons, a similar effect is observed in Figure 10. For correlated data sets, where many successful dom- inance checks occur, the computational cost for all methods drops close to zero. ...

View in full-text

Context 8

... an application favors more accurate results, then T KM seems as an excellent solution. If the time factor acts as the driving decision point, then T KDD should be favored, since it provides high quality results (see Table 2) almost instantly (see Figures 9 and 10). ...

View in full-text

Survey and evaluation of Web search engine hit counts as research tools in computational linguistics

Article

Full-text available

Dec 2017

In recent years, many studies on computational linguistics have employed the Web as source for research. Specifically, the distribution of textual data in the Web is used to drive linguistic analyses in tasks such as information extraction, knowledge acquisition or natural language processing. For these purposes, commercial Web search engines are c...

MusicPedia: Retrieving and Merging- Interlinking Music Metadata

Article

Full-text available

Aug 2011

The rapid change of computers from isolated machines to networks and the need of people to exchange information lead us to the World Wide Web (WWW). Nowadays a lot of people are spending lot of hours in WWW searching information for every aspect of life. This increase of information in WWW, increase also the difficulty to find and access the inform...

Effect of document expansion using web documents for spoken documents retrieval

Article

Full-text available

Jan 2010

This paper describes a method for spoken document retrieval using Web document expansion. This technique improves document retrieval performance by expanding the target spoken documents using Web data. In this research, two types of indexes are built. One is made from transcriptions of the spoken documents; the other is made from Web documents that...

Using Wikipedia Knowledge and Query Types in a New Indexing Approach for Web Search Engines

Thesis

Full-text available

Oct 2014

Falah Al-akashi

The Web is comprised of a vast quantity of text. Modern search engines struggle to index it independent of the structure of queries and type of Web data, and commonly use indexing based on Web‘s graph structure to identify high-quality relevant pages. However, despite the apparent widespread use of these algorithms, Web indexing based on human feed...

A Framework for Extracting Information from Semi-Structured Web Data Sources

Research

Full-text available

Jun 2015

Mahmood Shakir Hammoodi

Nowadays, many users use web search engines to find and gather information. User faces an increasing amount of various semi-structured information sources. The issue of correlating, integrating and presenting related information to users becomes important. When a user uses a search engine such as Yahoo and Google to seek a specific information, the...

Distributed probabilistic top-k dominating queries over uncertain databases

Article

Full-text available

Jul 2023
KNOWL INF SYST

In many real-world applications such as business planning and sensor data monitoring, one important, yet challenging, task is to rank objects (e.g., products, documents, or spatial objects) based on their ranking scores and efficiently return those objects with the highest scores. In practice, due to the unreliability of data sources, many real-world objects often contain noises and are thus imprecise and uncertain. In this paper, we study the problem of probabilistic top-k dominating (PTD) query on such large-scale uncertain data in a distributed environment, which retrieves k uncertain objects from distributed uncertain databases (on multiple distributed servers), having the largest ranking scores with high confidences. In order to efficiently tackle the distributed PTD problem, we propose a MapReduce framework for processing distributed PTD queries over distributed uncertain databases. In this MapReduce framework, we design effective pruning strategies to filter out false alarms in the distributed setting, propose cost-model-based index distribution mechanisms over servers, and develop efficient distributed PTD query processing algorithms. Extensive experiments have demonstrated the efficiency and effectiveness of our proposed distributed PTD approaches on both real and synthetic data sets through various experimental settings.

Cloud services security-driven evaluation for multiple tenants

Article

Full-text available

Jun 2021
CLUSTER COMPUT

Cloud Computing has become a reliable solution for outsourcing business data and operation with its cost-effective and resource-efficient services. A key part of the success of the cloud is the multi-tenancy architecture, where a single instance of a service can be shared between a large number of users, also known as tenants. Service selection for multiple tenants presents a real challenge that has not been properly addressed in the literature so far. Most of the existing cloud services selection approaches are designed for a single-user, and hence are inefficient when applied to the case of a large group of users with different, and often, conflicting requirements. In this paper, we propose a multi-tenant cloud services evaluation framework, whereby service selection is carried out per group of tenants that can belong to different service classes, rather than per a single user. We formulate the cloud services selection for multi-tenants as a complex multi-attribute large-group decision-making (CMALGDM) problem. Skyline method is initially applied to reduce the search space by eliminating the dominated services regardless of tenants’ requirements. Tenants are clustered based on their profiles characterized by different personal, service, and environmental features. Each tenant is assigned a weight to reflect its importance in the decision-making. The weight of a tenant is determined locally based on its closeness to the group decision and globally by combining its local weight with its corresponding cluster weight to reflect its total contribution to the overall decision-making. The final ranking of the alternatives is guided by a dynamic consensus process to reach a final solution with the highest level of agreement. The proposed framework supports multiple types of information, including deterministic data, interval numbers, and fuzzy numbers, to realistically represent the heterogeneity and uncertainty of security information.

Learning a Generalized Matrix from Multi-graphs Topologies Towards Microservices Recommendations

Chapter

Jan 2021

This paper presents a methodology that combines latent factor models with graph-based models. The proposed recommendation system identifies a recommended item as a node of a graph. More specifically, the topology of the graph and the paths between the nodes are considered as critical features regarding the associations between them. Furthermore, in the current approach, these structural features are considered as feedback. These structural features are extracted from a pool of several application graphs which are afterwards generalized into a unified matrix of proximities. The main reason for the use of this structural feedback is to generate recommendations and discover unobserved relations using matrix factorization techniques. The approach is tested on a data set that consists of cloud-native microservices graphs.

Data mining service recommendation based on dataset features

Article

Full-text available

Sep 2019

Quality of service (QoS)-based web service selection has been studied in the service computing community for some time. However, characteristics of the input dataset that is going to be processed by the web service are not usually considered in the selection process, even though they might have impact on QoS values of the service, e.g. latency on processing a bigger dataset is higher than that on a smaller dataset, one service takes longer time to process a certain dataset than another service. To address this issue, in this work, we take into consideration the dataset features in the QoS-based service recommendation process and we focus on data mining services because their QoS values could be highly dependent on dataset features. We propose two approaches for data mining service recommendations and compare their performances. In the first approach, we use a meta-learning algorithm to incorporate dataset features in the recommendation process and study the use of different machine learning algorithms (both classification models and regression models) as meta-learners in recommending data mining services for the given dataset. We also investigate the impact of the number of dataset features on the performance of the meta-learners. In the second approach, we propose a novel technique of using factor analysis for web service recommendation. We use decomposition technique to identify latent features of the input dataset and then recommend services by exploiting these latent variables. Our proposed approach of web service recommendation based on latent features was shown to be a more robust model with an accuracy of 85% compared to meta-feature-based recommendation.

Open-data-driven embeddable quality management services for map-based web applications

Article

Full-text available

Apr 2019

Various map-centered web services facilitate citizens’ lives. Web-map applications exist for many years already. Due to simplification and improvement of technologies supporting WebGIS, map-based services become more popular and important nowadays. Data quality assurance for such services is a significant challenge. Since many of such applications intensively use open data, approaches focused on open solutions are required. This work proposes a data-quality concept, which is based on intrinsic and comparable approaches. OpenStreetMap (OSM) allows intrinsic data evaluation. Moreover, it is used as a reference dataset for quality assessment of public-sector-information Open Data layers. Equidistant point (EDP)-based statistics enables to filter out low-quality Open Data features. A data-type model carries out the inventory of OSM data. The comparison of raster web-map tile file sizes and calculation of a simplified data quality indicator make it possible to specify acceptable data quality levels. Embeddable instances of quality assurance web services incorporate data features with acceptable quality. This work provides all required software and data for the deployment of such services under liberal licenses. Concrete instructions allow users to adopt the proposed solutions for their platforms. Some generic use cases illustrate the advantages of the introduced shared web services.

Sliding window top-k dominating query processing over distributed data streams

Article

Full-text available

Dec 2016
DISTRIB PARALLEL DAT

Preference query processing is important for a wide range of applications involving distributed databases, such as network monitoring, web-based systems, and market analysis. In such applications, data objects are generated frequently and massively, which presents an important and challenging problem of continuous query processing over distributed data stream environments. A top-k dominating query, which has been receiving much research attention recently, returns the k data objects that dominate the highest number of data objects in a given dataset, and due to its dominance-based ranking function, we can easily obtain superior data objects. An emerging requirement in distributed stream environments is an efficient technique for continuously monitoring top-k dominating data objects. Despite of this fact, no study has addressed this problem. In this paper, therefore, we address the problem of continuous top-k dominating query processing over distributed data stream environments. We present two algorithms that monitor the exact top-k dominating data and efficiently eliminate unqualified data objects for the result, which reduces both communication and computation costs. In addition to these algorithms, we present an approximate algorithm that further reduces both communication and computation costs. Extensive experiments on both synthetic and real data have demonstrated the efficiency and scalability of our algorithms.

Approches vers des modèles unifiés pour l'intégration de bases de connaissances

Thesis

Full-text available

Sep 2016

Maria Koutraki

Ma thèse a comme but l’intégration automatique de nouveaux services Web dans une base de connaissances. Pour chaque méthode d’un service Web, une vue est calculée de manière automatique. La vue est représentée comme une requête sur la base de connaissances. L’algorithme que nous avons proposé calcule également une fonction de transformation XSLT associée à la méthode qui est capable de transformer les résultats d’appel dans un fragment conforme au schéma de la base de connaissances. La nouveauté de notre approche c’est que l’alignement repose seulement sur l’alignement des instances. Il ne dépend pas des noms des concepts ni des contraintes qui sont définis par le schéma. Ceci le fait particulièrement pertinent pour les services Web qui sont publiés actuellement sur le Web, parce que ces services utilisent le protocole REST. Ce protocole ne permet pas la publication de schémas. En plus, JSON semble s’imposer comme le standard pour la représentation des résultats d’appels de services. À différence du langage XML, JSON n’utilise pas de noeuds nommés. Donc les algorithmes d’alignement traditionnels sont privés de noms de concepts sur lesquels ils se basent.

A Survey on Quantitative Evaluation of Web Service Security

Conference Paper

Full-text available

Jul 2016

Extended Comb Needle Model for Energy Efficient Data Aggregation in Random Wireless Sensor Networks

Article

Full-text available

Jun 2016

Background/Objectives: Energy conservation in Wireless Sensor Network is essential to enhance its life. A sensor node consumes more energy for communication than performing data gathering or data processing. Data aggregation minimizes the data size for communication. Methods/Statistical Analysis: The Comb Needle model is available in literature to perform data aggregation for grid networks (regular deployment). Extended the Basic Comb Needle Model in randomly deployed sensor networks. The simple random network with Comb Needle Model is compared with simple random network without Comb Needle Model. The theoretical analysis and simulation study shows that Extended Comb Needle Model performs better data aggregation. Findings: When we apply the Proposed Model in random network, the communication cost, overhead, and energy consumption are significantly reduced. The simulation results for the proposed Extended Comb Needle Model prove that the energy consumption and overall communication costs are substantially minimized. The simulation comparison is done for simple random network with and without Comb Needle Model in terms of communication cost, energy consumption, delay, packet loss, packet delivery ratio, and throughput. We found that the communication cost is decreased from 82% to 58%. the average energy consumption is decreased from 80% to 40%. Delay is decreased from 76% to 20%. Packet loss in decreased from 67% to 12%. Packet Delivery Ratio is increased from 82 % to 87%. And throughput is increased from 70% to 90%. Application/Improvements: Proposed Model optimizes WSN performance in terms of better packet delivery ratio, improved throughput, minimized energy consumption and reduced delay. Simulation results as well as theoretical analysis affirm the same.

An integrated QoE and QoS based approach for web service selection

Conference Paper

Full-text available

Jan 2016

An exponential increase in the number of web services over the last few years increases the importance of the service selection task for choosing the best among a group of web services with similar functionalities. Most of the web service selection approaches are service provider perspective based on non-functional properties such as performance, reliability etc. (such attributes known as Quality of Service or QoS). But it has been observed that in any decision support issues in selection, users' feedback (known as Quality of Experience or QoE) plays a crucial role. In this paper, an integrated model has been proposed, based on both QoE and QoS, where the best service selection is made based not only on current QoS values of the services but also users' past experience of using them. Further, the case study has been provided and the results have been analyzed by inducing users' ratings as a QoE factor along with QoS parameters. The results show that the proposed approach, augmented by user's feedback, improves the quality of selection.

Services of Table 1 in the Pin × Pout space

Contexts in source publication

Similar publications

Citations