ArticlePDF Available

Rule based Hybrid Weighted Fuzzy Classifier for Tumor Data

Authors:

Abstract and Figures

Examination of gene based information has turned out to be so essential in biomedical industry for assurance of basic ailments. A fuzzy rule based classification is a standout amongst the most mainstream approaches utilized as a part of example arrangement issues. The fuzzy rule based classifier creates an arrangement of fuzzy if-then decides that empower exact non-straight order of information designs. In spite of the fact that there are different techniques to create fluffy if-then guidelines, the advancement of lead producing process is as yet an issue. Here, we introduce a half and half weighted fluffy order framework in which few fluffy if-then principles are chosen by methods for offering weights to preparing designs. Further, we utilize a genetic algorithm (GA) to streamline the classifier for quality articulation investigation
Content may be subject to copyright.
Copyright © 2018 Authors. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted
use, distribution, and reproduction in any medium, provided the original work is properly cited.
International Journal of Engineering & Technology, 7 (4.19) (2018) 104-108
International Journal of Engineering & Technology
Website: www.sciencepubco.com/index.php/IJET
Research paper
Rule based Hybrid Weighted Fuzzy Classifier for Tumor Data
D. Winston Paul1*, S. Balakrishnan2, A.Velusamy3
1, 2, 3 Department of Information Technology,
Sri Krishna College of Engineering and Technology, Coimbatore, Tamilnadu, India
*Corresponding Author Email: 1winstonpauld@skcet.ac.in
Abstract
Examination of gene based information has turned out to be so essential in biomedical industry for assurance of basic ailments. A fuzzy
rule based classification is a standout amongst the most mainstream approaches utilized as a part of example arrangement issues. The
fuzzy rule based classifier creates an arrangement of fuzzy if-then decides that empower exact non-straight order of information designs.
In spite of the fact that there are different techniques to create fluffy if-then guidelines, the advancement of lead producing process is as
yet an issue. Here, we introduce a half and half weighted fluffy order framework in which few fluffy if-then principles are chosen by
methods for offering weights to preparing designs. Further, we utilize a genetic algorithm (GA) to streamline the classifier for quality
articulation investigation
Keywords: Data mining, classificaton, Bioinformatics, Fuzzy sytems ,genetic algorithms, weighted rule.
1. Introduction
In the process of DNA analysis, microarray studies have received
a lot of attention, industrially as well as in research. The gene
expression analysis plays an inevitable role of both identification
and prediction of DNA sequences. Microarray trials can either
screen every quality a few times under fluctuating conditions or
examine the qualities in a solitary situation however in various
sorts of tissue [1], [2].
Here, we center around the arrangement of quality articulations.
“Information mining has pulled in a lot of consideration in the
data business and in the public arena in general as of late, because
of the wide accessibility of gigantic measures of information and
the up and coming requirement for transforming such information
into helpful data and learning [5]. Arrangement is an undertaking
that happens much of the time in regular day to day existence.
Basically it includes separating up objects with the goal that each
is relegated to one of various commonly thorough and select
classifications known as classes [6].
In this paper, we utilize mixture weighted fluffy manage based
characterization framework for dissecting microarray articulation
information. Fluffy frameworks in view of fluffy if-then principles
have been connected to different control issues. One favorable
position of such fluffy framework is their interpretability. As of
late fluffy manage based framework have likewise been connected
to design characterization issues [4]. These are numerous ways to
deal with consequently create fluffy if-then guidelines from
numerical information for design grouping issues, for example,
Michigan, Pittsburgh or Iterative lead learning approaches. In the
proposed work, we enhance the Michigan based fuzzy rule
scheme by adding weights for training patterns. Thus we can
reduce the number of rules generated exponentially there by
reducing number of attributes involved. Further, using genetic
algorithm (GA) [11] we can make “a compact classifier for gene
expression analysis”.
2. Background
2.1. Fuzzy Rule Based Classification
Pattern classification normally is a directed procedure where, in
light of known information, anticipate choice for new information.
The known information is called preparing set and the new
information is called test information. Different techniques have
been proposed for fluffy order. Let us assume that our pattern
classification problem is an n-dimensional problem with C classes
and m given training patterns
Yp= (yp1, yp2, …, ypm), p=1, 2, …, m.
“Without loss of generality, each attribute of the given training
patterns is normalized into a unit interval [0, 1]; That is, the
pattern space is an n-dimensional unit hypercube [0, 1]n. In this
study we use “fuzzy if-then rules of the following type as a base of
our fuzzy rule-based classification systems:
Rule Rj: If y1 is Aj1 and ... and yn is Ajn then
Class Cj with CFj, j=1, 2, …, N
where Rj is the label of the jth fuzzy if-then rule, Aj1, … ,Ajn are
antecedent fuzzy sets on the unit interval [0, 1], Cj is a consequent
class (i.e. one of the given classes), CFj is the grade of certainty of
the fuzzy if-then rule Rj, and N is the total number of fuzzy if-then
rules”. As antecedent fuzzy sets, we use triangular fuzzy sets as
in Fig.1and Fig.2 where we show various partitions of the unit
interval into a number of fuzzy sets [3].
The Fig.1 shows triangular membership function and Fig.2
Shows the triangular membership function based on five fuzzy
set. So, it contains five classes and 25 rules are generated (if the
number of attribute is 2) for each attribute.
International Journal of Engineering & Technology
105
Fig. 1: Triangular membership Function
Fig. 2: Five Fuzzy Set Membership Function
2.2. Fuzzy if-then Rule Generation
Step 1: Calculate βclass h (Rj) for Class h as
Where, µj(yp) = µj1(yp1).…. µjn(ypm).
P=1, 2, …, m. h=1,2,…,C.
Step 2: Find Class ĥ that has the maximum value of βclass h (Rj),
where βclass h (Rj) is the “average of the compatibility grades of
training patterns in Class h with the fuzzy if-then rule Rj and
Nclass h is the number of training patterns which their
corresponding class is Class h and each of the fuzzy rules in the
final classification has a certainty grade, which denotes the
strength of that fuzzy rule”. This number is calculated according
to,
Where,
2.3. Fuzzy Rule Mapping
When a rule set S is given, an input pattern Y=(y1,y2,…,yn) is
classified by a single winner rule Rj in S, which is determined as
follow:
That is the winner rule has the maximum product of
compatibility and the certainty grade CFj. The classification is
rejected if no fuzzy if-then rule is compatible with the input
pattern Yp. The generation of each fuzzy rule is accepted only if
its consequent class is the same as its corresponding random
pattern class. Otherwise, the generated fuzzy rule is rejected.
3. Related Work
There are several approaches for classifying gene expression data.
In [1], the clinical research approach method is used to categorize
different types of cancer tissues and to identify new cancer
classes. The approach [1] used here is based on gene expression
monitoring by DNA microarrays and are applied to human
leukemia’s as test case”. The gene expression databases are pillars
of the research [2] and mining from these databases is a major
task.
In section II, we have explained the generic fuzzy classification
system as expressed in [3], [7]. An “n-dimensional classification
problem is analyzed using a hybrid fuzzy method”. The hybrid
fuzzy classification system is formed by combining the fuzzy
classification scheme and genetic algorithm [10]-[12]. The “fuzzy
classification scheme is used to generate the fuzzy if-then rules,
which increases the number of rules exponentially with the
number of attributes involved”.
In order to reduce the rule generation, the modern genetic
algorithm is used. Some other methods have also been applied
using rule weights and weights for training patterns to reduce the
rules by avoiding unimportant attributes [7], [8]. Applying
weights for training patterns prove to be better than rule weights
[7]. However, optimization is not achieved completely.
The main contribution of this paper is hybridizing the evolutionary
genetic algorithm with weighted training patterns for optimizing
the rule generation. As a result, the searching complexity of rule
mapping process is reduced.
4. Hybrid Weighted Fuzzy Method
In our fuzzy rule generation approach, the weight is given for each
training pattern. The initial population is generated from the
weighted training patterns and is subjected to fitness evaluation.
To achieve more compactness for classifier, the current population
is further subjected to genetic operations.
4.1. Weights for Training Patterns
The motivation behind weight is to give need for all preparation
designs. The heaviness of preparing examples can be seen as a
critical factor of the examples. There are different techniques
accessible for weight task, for example, class-based weighting
strategy, cover based weighting technique and arbitrary weighting
technique and so on. Here we underline on class based weighting
technique.
The class-based weighting technique is to make an inclination
toward the grouping of examples from a specific class. For
instance, if the predisposition is toward the arrangement of Class1
designs, characterization frameworks are relied upon to accurately
order Class 1 designs regardless of whether the quantity of
misclassification/dismissal is substantial for different classes.
In this “weighting method, a weight for the pattern Yp is
determined by following equation”:
4.2. Rules from Weighted Training Patterns
Let us assume that we have m training patterns Yp= (yp1,yp2, …,
ypm), p=1,2, …,m. Then the rule procedure is,
[Step 1]: Calculate βclass h (Rj) for Class h as
h=1,2, …. C
106
where, µj(yp) = µj1(yp1).…. µjn(ypm).
[Step 2]: Find Class ĥ that has the maximum value of βclass h
(Rj),
The remaining procedures are same as explained earlier.
4.3. Architecture
Fig. 3: Hybrid weighted fuzzy classifier
The Fig.3 shows the overall structure of the hybrid weighted fuzzy
if-then rule based classification system. The known classified data
is taken as input for training process. The weight is multiplied
with each training pattern and the summation is given for internal
input for generating rules. The rules are stored in rule pool and
each rule is submitted to fitness evaluation and then for genetic
operation, to remove worst rules. Finally, the unknown test data is
given for classification.
4.4. Genetic Algorithm
Genetic algorithms are roused by Darwin's hypothesis of
development. It is a cutting edge calculation for taking care of any
enhancement issue. Answer for an issue illuminated by hereditary
calculations utilizes a developmental procedure. Calculation
begins with arbitrarily producing the arrangement of conceivable
answers for an advancement issue called populace. Arrangements
from one populace are taken and used to shape another populace.
This is inspired by an expectation, that the new populace will be
superior to the old one. At that point arrangements, which are
chosen to frame new arrangements, are chosen by their wellness -
the more appropriate they are the more shots they need to imitate.
This is rehashed until the point when some condition is fulfilled.
The hereditary tasks are determination, hybrid and transformation.
The new guidelines are created from the principles in the present
populace utilizing hereditary activities. To perform hybrid task,
two fluffy if-then standards are arbitrarily chosen from the present
populace and the better administer with the higher wellness
esteem is picked as a parent string. A couple of parent strings is
picked by repeating this strategy twice. From the chose match of
parent strings, two new strings are produced by a hybrid activity.
The crossover operator is applied to each pair of parent strings
with a pre-specified crossover probability pc.
After new strings are generated, the mutation operation will be
taken for replacing the current string with the pre-specified
mutation probability pm.
4.5. Crossover Operation
Two parents deliver two posterity. Quite possibly the
chromosomes of the two guardians are replicated unmodified as
posterity. Quite possibly the chromosomes of the two guardians
are arbitrarily recombined (hybrid) to frame posterity. For the
most part the possibility of hybrid is in the vicinity of 0.6 and 1.0.
Fig. 4: Example of Crossover
The Fig.4 shows the single point crossover operation. The third bit
of the parent strings is crossover to produce new child strings. The
main goal of the cross over is to decompose two distinct solutions
and then randomly mixes their parts to form novel solutions.
4.6. Mutation Operation
There is a chance that a gene of a child is changed randomly.
Generally the chance of mutation is low (e.g. 0.001).
Fig 5: Example of Mutation
In Fig.5 the original binary string is mutated by switching few bits
from 1 to 0 or from 0 to 1. The selection, crossover and mutation
are iterated until a pre-specified number Nreplace of new strings
are generated.
4.7. Algorithm Summary
The Fig.6 describes the flow of processes involved in the
classification system. Initially, a set of rules are generated from
the weighted training patterns and then an iterative process is
taken for checking fitness and genetic operations until the
stopping condition is satisfied.
Fig. 6: Flow Chart
International Journal of Engineering & Technology
107
The steps involved in hybrid weighted fuzzy rule based classifier
are following:
Step 1: “Parameter Specification : Specify the number of fuzzy if-
then rules Nrule, the number of replaced rules Nreplace, the
crossover probability pc, the mutation probability pm, weights for
training patterns wp, and the stopping condition
Step 2: Weight assignment: Assign weights (wp) for each training
pattern
Step 3: Initialize: Randomly generate Nrules from weighted
training patterns as an initial population
Step 4: Check Fitness: Evaluate the fitness of each rule in the
current population, and test if the end condition is satisfied, stop,
return the best solution, and exit
Step 5: Genetic Operations: Create a new population based on
current population by following genetic operations
Selection: Select two rules randomly from the current population
according to their fitness (the better fitness, the bigger chance to
be selected)
4.8. Cross Over
With a crossover probability cross over the selected parent rules to
form new offspring (children/rules). If no crossover was
performed, offspring is the exact copy of parents
4.9. Mutation
With a mutation probability mutate new offspring at each locus
Step 6: Replace: Place current population using new generated
offspring population for a further run of the algorithm
Step 7: Repeat: Go to step 2”.
4.10. Fitness Evolution
The fitness value of each run is assessed by grouping all the given
preparing designs utilizing the arrangement of fluffy if-then
standards in the present populace. The wellness estimation of the
fluffy if-then control is assessed by the accompanying wellness
work:
fitness (Rj) = NCP(Rj) wp . NMP(Rj)
where, “NCP(Rj) denotes the number of correctly classified
training patterns by rules Rj and NMP(Rj) is the number of
misclassified training patterns”.
4.11. Cost and Accuracy Evaluation
The cost of misclassification or rejection of fuzzy classification
system is calculated as,
Where, “Cost(S) is the cost of misclassification or rejection made
by a fuzzy classification system S, m is the number of training
patterns, wp is the weight of the training pattern yp, and zp(S) is a
binary variable that is determined according to the classification
result of the training pattern xp by S: zp(S) = 0 if yp is correctly
classified by S and zp(S)=1 otherwise[7]”.
The accuracy of the hybrid weighted fuzzy rule based classifier is
calculated as,
Accuracy, a=t/n
Where,
t-> number of samples correctly classified.
n-> total number of sample cases.
5. Experimental Results
We have developed a hybrid weighted fuzzy rule based system
trained by genetic algorithm. The colon cancer and breast cancer
datasets are taken for experimental process. The number of classes
used is 2. The colon dataset contain 6 attributes and 400 patterns.
The breast cancer dataset contains 523 patters and 9 attributes. For
result, we have used the yes or no classes. It is clear that the
method with weights for training patterns can obtain better
classification and fewer rules. The table 1 shows the parameters of
genetic algorithm used for the hybrid weighted fuzzy classifier.
Table 1: Parameter Specification
Parameter
Value
Number of rules, Nrule
20
Crossover probability, Pc
0.9
Mutation probability, Pm
0.1
Number of replaced rules, Nreplace
4
Stopping Condition Cycle
50
6. Conclusion
In this paper, a hybrid weighted fuzzy rule based classification
system is developed with the combination of assigning weights for
training patterns, fuzzy rule generation procedure and genetic
algorithm. The Michigan based procedure is used to construct the
structure. The compactness of classifier also good compared with
existing system. The experimental result shows the hybrid
weighted fuzzy rule based classifier gives higher performance than
the conventional methods.
References
[1] T. R. Golub, D. K. Slonim, P. Tamayo, C. Huard, M. Gaasenbeek,
J. P. Mesirov, H. Coller, M. L. Loh, J. R. Downing, M. A.
Caligiuri, C. D. Bloomfield, and E. S. Lander, “Molecular
classification of cancer: Class discovery and class prediction by
gene expression monitoring,” Science, vol. 286, pp. 531537, 1999.
[2] Pascale Anderle, Manuel Duval, Sorin Draghici, Alexander Kuklin,
Timothy G. Littlejohn, Juan F.Medrano, David Vilanova, and
Matthew Alan Roberts Gene Expression Databases and Data
Mining” Biotechniques. 2003 Mar; Suppl: 36-44.
[3] Gerald Schaefer, Tomoharu Nakashima, “Data Mining of Gene
Expression Data by Fuzzy and Hybrid Fuzzy Methods” IEEE
Trans. Information Technology in Biomedicine, vol. 14, pp. 23-29,
2010.
[4] H. Ishibuchi and T. Nakashima, “Improving the performance of
fuzzy classifier systems for pattern classification problems with
continuous attributes,” IEEE Trans. Ind. Electron., vol. 46, no. 6,
pp. 10571068, Dec. 1999.
[5] Jiawei Han and Micheline Kamber, “Data Mining: Concepts and
Techniques”, 2nd edition, ISBN: 978-55860-901-3, Elsevier.
[6] Max Bramer, “Principles of Data Mining”, ISBN: 978-81-8489-
166-9, springer, 2007.
[7] Tomoharu Nakashima, Yasuyuki Yokota, Hisao Ishibuchi,
“Constructing fuzzy classification systems from weighted training
patterns”, in proc. 19th European conf. on modeling and simulation,
vol 3, pp.2386-2391, 2004.
[8] H. Ishibuchi and T. Nakashima, “Effect of rule weights in fuzzy
rule-based classification systems,” IEEE Trans. Fuzzy Syst., vol. 9,
no. 4,pp. 506515, Aug. 2001.
[9] P.Woolf and Y.Wang, “A fuzzy logic approach to analyzing gene
expression data,” Physiol. Genomics, vol. 3, pp. 9–15, 2000.
[10] C. Z. Janikow, “A genetic algorithm for optimizing fuzzy decision
trees,” in Proc. 6th Int. Conf. Genetic Algorithms, Univ. Pittsburgh,
Pittsburgh, PA, July 1519, 1995, pp. 421428.
[11] S.Sheeba Rani, R.Maheswari, V.Gomathy and P.Sharmila, “Iot
driven vehicle license plate extraction approach” in International
Journal of Engineering and Technology(IJET) , Volume.7, 2018, pp
457-459, April 2018
[12] M.A.Lee, H.Takagi, “Dynamic control of genetic algorithms using
fuzzy logic techniques,” in Pmc.Int.Conf. Genetic
Algorithm,Urbana-Champaign,lL,July 1993,pp.76-83.
[13] Zhun-Ga Liu, Quan Pan, Jean Dezert, “Hybrid Classification
System for Uncertain Data”, IEEE Transactions on Systems, Man,
and Cybernetics: Systems ( Volume: 47, Issue: 10, Oct. 2017 ).
[14] Balakrishnan S, K.Aravind, A. Jebaraj Ratnakumar, “A Novel
108
Approach for Tumor Image Set Classification Based On Multi-
Manifold Deep Metric Learning”, International Journal of Pure and
Applied Mathematics, Vol. 119, No. 10c, 2018, pp. 553-562.
[15] A. Jebaraj Rathnakumar, S.Balakrishnan, “Machine Learning based
Grape Leaf Disease Detection”, Jour of Adv Research in
Dynamical & Control Systems, Vol. 10, 08-Special Issue, 2018. Pp.
775-780.
[16] A. Jebaraj Ratnakumar, S. Balakrishnan, S.Sheeba Rani,
V.Gomathi, “A Machine Learning Based IOT Device for E-Health
Monitoring In a Cloud Environment”, Invest Clin. Vol. 58, issue 3,
pp. 287-299, 2017. (Web of Science).
[17] S. Vasu, A.K. Puneeth Kumar, T. Sujeeth, Dr.S. Balakrishnan, “A
Machine Learning Based Approach for Computer Security”, Jour of
Adv Research in Dynamical & Control Systems. Vol.10, 11-Special
issue, 2018, pp. 915- 919.
[18] Balakrishnan, S., Janet, J., Sujatha, K., & Rani, S. (2018). An
Efficient and Complete Automatic System for Detecting Lung
Module. Indian Journal Of Science And Technology, 11(26).
doi:10.17485/ijst/2018/v11i26/130559
Chapter
Agriculture has a significant role in cultural life, and Agriculture has a significant role in the economies of many countries. To come up with the best possible conclusion from this study. It's essential to pay attention to critical factors, including energy, water availability, labour, and a correct watering plan for crops. Researchers in this study were interested in building a smartphone application that would allow farmers to operate an IoT-based automated irrigation system remotely. Paddy field photos were used to create a deep learning model called Paddy Field Radial Basis Function Networks (PF-RBFNs). The model tells the farmer how much water will be needed in a certain field area for irrigation. A real-time picture dataset and a raspberry pi-based hardware model were used to test this approach. The model was compared to three different deep learning models: LSTMs, RNNs, and GANs. It was found that this proposed PF-RBFNs model has a 93% accuracy rate.
Chapter
Full-text available
Data mining procedures have been broadly used to mine learned data from medicinal information bases. Sentiment Mining is a procedure of programmed extraction of learning by method for the conclusion of others about some specific item, theme or Issue. Sentiment analysis implies deciding the subjectivity, extremity (positive/negative) and extremity quality and so forth, with a bit of text. Clustering is the methodology of making a get together of dynamic things into classes of near articles. In this paper, we are proposing a two way clustering algorithm for breast cancer disease. Apriori hybrid algorithm and K-Modes Algorithm is used to cluster the opinions effectively and to improve the performance in the online data set. Apriori-Hybrid is the mix of count Apriori and Apriori-TID, which can mastermind the huge item sets and improve the accuracy of collection of dangerous development. It can moreover uncover understanding into the basic part that enables each malady type to suffer and thrive, which in this way help in the early revelation of the threatening development. We propose Apriori-Hybrid as an extemporized calculation for tumour characterization.
Chapter
The paper is devoted to capture of vessel traffic data for numerical modeling in scientific research. Data capture from radar, video camera, GPS and special Internet sites is discussed. Data capture and data processing from www.marinetraffic.com are shown. Some problems and its solution are watched, and information system working results with data from Vladivostok port are demonstrated.KeywordsVessel traffic controlShip motionAISShip traceData processing
Chapter
The work is devoted to the problem of determining the indoor object location in cases where the signal of satellite (GPS, GLONASS, etc.) is not available. In this case, alternative sources of navigation data are used to position objects: inertial navigation devices (accelerometers, gyroscopes), Bluetooth and Wi-Fi data transmission devices, atmospheric pressure sensor, magnetometer, etc. The use of Bluetooth beacons for indoor positioning has many advantages. Firstly, it is the possibility of arbitrary configuration of sensors in the room due to their small size and autonomy. Secondly, it is possible to build a system based on inexpensive and widespread equipment that does not require special professional skills of the personnel. The paper discusses the problem of positioning an object in three-dimensional space according to the data of Bluetooth devices located in the room and forming a multi-position beacon surveillance system. It is noted that for the successful operation of such a system, it is necessary to estimate in advance the characteristic error in estimating the coordinates of positioned objects and if necessary, change the configuration of the beacons. A model interpretation of the positioning problem in the form of a system of algebraic linear equations is proposed. Such a representation allows one to construct a priori theoretical estimates of the error in determining the object coordinates to determine areas of space where the positioning accuracy is insufficient. The paper presents the expected accuracy calculations data of solving the problem in various typical situations and the results of field experiments that confirm the calculations. In general, the study is optimistic about the prospects for solving 3D indoor positioning problems using Bluetooth beacons.KeywordsIndoor navigationBeacon systemDistance measurementBluetoothPositioningLeast squares method
Chapter
In this paper, we propose an approach to verifying the correctness of mixed production rules (MPR) in intelligent systems for diagnosing industrial equipment (IE) using the rule evaluation that is necessary to determine the degree of similarity between two rules. Complexity, relevance, and significance of the rules serve as the rule evaluation. Using rule evaluations, it is easy to detect incorrect and redundant rules in the knowledge base (KB) of IE diagnostic systems under conditions of heterogeneous information. The examples of main structural errors that occur in MPR systems (redundancy, cyclicity, incompleteness) are considered in details. The approach proposed is firstly aimed at simplifying the KB containing MPR that are added to the intelligent system of IE diagnostics which results in usefulness and accessibility for the knowledge interpretation by an expert; secondly, it will allow structuring the fuzzy knowledge and facilitating the organization of output control.
Article
Full-text available
A mobile network is an evolving network in which surveillance activities are carried out between mobile users, and information is shared and collected by investing in a single connection between mobile users. Older ad hoc routing protocols are inappropriate or have a detrimental effect on the collection or exchange of information at nodes such as rarely connected mobile social networks. In recent years, some routing protocols (notably social network-based routing) are expected to increase transmission speeds on mobile social networks, but most nodes do not take load into account, and there is a risk that 'they are used in an unstable manner with electricity between nodes. This article provides a power routing protocol that limits the power consumption of nodes. In addition, various other routing protocols are compared and the baud rate is kept within acceptable limits.
Article
Full-text available
Customer satisfaction is playing a vital role for the success of every business. Today’s, business everywhere the world makes out that the buyer is the king. Thus, the present study has been carried out to the customer satisfaction towards TVS Bikes, the present study is proposed to analyze the respondent’s satisfaction levels towards TVS Bikes with the attention of various satisfaction factors in the area of Salem district.
Article
Full-text available
This study examines the being of a Semi Month Effect (SME) in NSE Shariah index using data mining classifications. The current study usages the daily return data of the NSE Shariah Index for the data ranging between January 2009 and March 2019. The outcomes confirm the presence of seasonality Anomalies in NSE Shariah index stock yields in India. The results are also reliable with the Anomalies does exist in the stock market in India, Shariah index is inefficient, and hence, investors can time their share investments to expand earnings.
Article
Full-text available
Jute products come in a very stylish and have different variants available in the market. It is measured one of the best long-lasting substances for belongings. The present study has been carried out the Consumer preference towards Jute Products in Coimbatore district. To predict or estimate the standard error of the mean (MSE) = the sum of the difference and the square bias. Thus, if the aggregation bias is negligible relative to the small relative magnitude, the aggregation results in a lower MSE for the lead using the Decision Trees algorithm with Binary classification. In addition, because there are many units, the subunits can be combined with one scene with a lower MSE. In this case, clustering is essential for transfer to MSE. Once the lists are organized, you can specify a general format for each category for that purpose Garrett ranking method has been applied, to find the respondent’s preference levels towards jute products with the attention of various preference factors in the area of Coimbatore District.
Article
Full-text available
Financial performance is a subjective measure of how well a firm can use assets from its primary mode of business and generate revenues. The primary objective of financial evaluation bring clear picture of the financial position of a company. The main objective of this study has been undertaken to compare the financial position and performance of major player in two and three wheeler automobile segments by assessing five years balance sheet from 2015 to 2019. In PAT it increased 16.45% from FG18 to FG19. The case study details Bajaj Auto's history and focuses on the company's financial performance. It also gives a note on the Indian automotive industry. This allows students to analyze relationships, common sizes, common bases and DuPont models to aid in the analysis of the business impact assessment of a company, Bajaj Auto Ltd. You can analyze your financial performance. The Secondary Data have been collected from various sources. Different financial ratios and Pearson Correlation Coefficient test have been employed to study the financial performance of Bajaj Auto Limited and Hero Moto Corp Limited
Article
Full-text available
Machine Learning (ML) can be considered as a subfield of Artificial Intelligence since those calculations can be viewed as building squares to influence Personal Computers (PC) to figure out how to act more insightfully by some way or another summing up instead of simply putting away and recovering information things like a database framework and different applications would do. Machine learning is a successful system which can be utilized as a part of the field of Computer Security. In this paper, we exhibit the use of machine figuring out how to PC security, especially to interruption location. We apply NBTree learning calculations for the assignment of recognizing interruptions. We at that point remark on the appropriateness of the NBTree calculation for the interruption location errand in view of its high precision and high review. We at long last express the convenience of machine figuring out how to the field of PC security and furthermore remark on the security of machine learning itself.
Article
Full-text available
In digital era, Big Data and Deep learning are two rapidly growing hot trends. In recent years, we have witnessed major and noticeable advances in classification and representation learning brought by Deep learning techniques and/or algorithms. Deep Learning models supports inferences and discover structure at multiple levels. One of the hot and active research topics in recent decades is image classification. In this paper, we propose a modified Multi-manifold Deep Metric Learning (MMDML) approach used for image set classification of any cancer diseases. Initially, each image set (cancer image) is actually modeled as a manifold that is then passed into multiple levels of deep neural networks and mapped into another feature space. Specifically, a deep network is actually class-specific to ensure that different classes have various parameters within their networks. Then, the maximum manifold margin criterion is used to discover the parameters of this manifold. At testing phase, these International Journal of Pure and Applied Mathematics Volume 119 No. 10 2108, 553-562 ISSN: 1311-8080 (printed version); ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu Special Issue ijpam.eu 553 class-specific deep networks are used to compute the actual similarity between testing image set and all training classes. Lastly, the least distance is commonly used for classification. In this paper, we use generalized exponential loss function for measuring the degree of fit. As a result, the proposed technique is capable of achieving sophisticated performance on three common cancer disease types’ datasets such as, brain cancer, bone cancer and breast cancer by leveraging both discriminative and class-specific information simultaneously.
Article
Full-text available
The objective of the research paper is to capture the design a Vehicle Number Plate Identification System which can be used to identify and read the license number of any vehicle. The basic process involves taking the image of the front/rear of the vehicle which then gets processed and ultimately the number gets displayed on the LCD. This system can be used for a wide variety of installations and establishments such as entry points of schools, colleges, offices and parking spaces. The camera takes an image which is then processed in PC. The result achieved from this is the license number of the vehicle which is then transmitted via the Wi-Fi module and ultimately displayed on the LCD. The hardware especially can be made more rugged and compact so as to handle all the elements of nature and various environments that it can be used in.
Article
Full-text available
In classification problem, several different classes may be partially overlapped in their borders. The objects in the border are usually quite difficult to classify. A hybrid classification system (HCS) is proposed to adaptively utilize the proper classification method for each object according to the K-nearest neighbors (K-NNs), which are found in the weighting vector space obtained by self-organizing map (SOM) in each class. If the K-close weighting vectors (nodes) are all from the same class, it indicates that this object can be correctly classified with high confidence, and the simple hard classification will be adopted to directly classify this object into the corresponding class. If the object likely lies in the border of classes, it implies that this object could be difficult to classify, and the credal classification working with belief functions is recommended. The credal classification allows the object to belong to both singleton classes and sets of classes (meta-class) with different masses of belief, and it is able to well capture the potential imprecision of classification thanks to the meta-class and also reduce the errors. Fuzzy classification is selected for the object close to the border and hard to clearly classify, and it associates the object with different classes by different membership (probability) values. HCS generally takes full advantage of the three classification ways and produces good performance. Moreover, it requires quite low computational burden compared with other K-NNs-based methods due to the use of SOM. The effectiveness of HCS is demonstrated by several experiments with synthetic and real datasets.
Book
Full-text available
The growing interest in data mining is motivated by a common problem across disciplines: how does one store, access, model, and ultimately describe and understand very large data sets? Historically, different aspects of data mining have been addressed independently by different disciplines. This is the first truly interdisciplinary text on data mining, blending the contributions of information science, computer science, and statistics. The book consists of three sections. The first, foundations, provides a tutorial overview of the principles underlying data mining algorithms and their application. The presentation emphasizes intuition rather than rigor. The second section, data mining algorithms, shows how algorithms are constructed to solve specific problems in a principled manner. The algorithms covered include trees and rules for classification and regression, association rules, belief networks, classical statistical models, nonlinear models such as neural networks, and local "memory-based" models. The third section shows how all of the preceding analysis fits together when applied to real-world data mining problems. Topics include the role of metadata, how to handle missing data, and data preprocessing.
Article
Full-text available
Although cancer classification has improved over the past 30 years, there has been no general approach for identifying new cancer classes (class discovery) or for assigning tumors to known classes (class prediction). Here, a generic approach to cancer classification based on gene expression monitoring by DNA microarrays is described and applied to human acute leukemias as a test case. A class discovery procedure automatically discovered the distinction between acute myeloid leukemia (AML) and acute lymphoblastic leukemia (ALL) without previous knowledge of these classes. An automatically derived class predictor was able to determine the class of new leukemia cases. The results demonstrate the feasibility of cancer classification based solely on gene expression monitoring and suggest a general strategy for discovering and predicting cancer classes for other types of cancer, independent of previous biological knowledge.
Article
Full-text available
We have developed a novel algorithm for analyzing gene expression data. This algorithm uses fuzzy logic to transform expression values into qualitative descriptors that can be evaluated by using a set of heuristic rules. In our tests we designed a model to find triplets of activators, repressors, and targets in a yeast gene expression data set. For the conditions tested, the predictions made by the algorithm agree well with experimental data in the literature. The algorithm can also assist in determining the function of uncharacterized proteins and is able to detect a substantially larger number of transcription factors than could be found at random. This technology extends current techniques such as clustering in that it allows the user to generate a connected network of genes using only expression data.
Article
Microarray studies and gene expression analysis have received tremendous attention over the last few years and provide many promising avenues toward the understanding of fundamental questions in biology and medicine. Data mining of these vasts amount of data is crucial in gaining this understanding. In this paper, we present a fuzzy rule-based classification system that allows for effective analysis of gene expression data. The applied classifier consists of a set of fuzzy if-then rules that enable accurate nonlinear classification of input patterns. We further present a hybrid fuzzy classification scheme in which a small number of fuzzy if-then rules are selected through means of a genetic algorithm, leading to a compact classifier for gene expression analysis. Extensive experimental results on various well-known gene expression datasets confirm the efficacy of our approaches.