ArticlePDF Available

Scoring and Analysis of Likert Scale: Few Approaches

Authors:
  • Indian Maritime University, Kolkata Campus
Journal of Knowledge Management and Information Technology
Scoring and Analysis of Likert Scale: Few Approaches
Satyendra Nath Chakrabartty
Former Director of Kolkata Campus of Indian Maritime University
Abstract
To improve upon summative scoring of Likert Scale, two alternate methods of
scoring have been proposed in this paper. The proposed methods of scoring are
based on weighted sums where weights are taken as empirical probabilities and
enable us to find cardinal scores for individuals as well as items as expected
values satisfying conditions of additivity and linearity. Comparison of the
methods was undertaken with an empirical data. Proposed methods of scoring
Likert Scale increased reliability of the questionnaire in terms of Cronbach’s
alpha, tended to introduce independency among the items, helped in better
exploring and interpreting the factors. Summated Likert scores did not follow
Normal distribution but each of the proposed method of scoring passed test of
normality. Thus, the transformed scores offer platform for undertaking almost all
type of analysis being done for continuous quantitative variable following
Normal distribution.
Key words: Likert-scale, Cardinal scores, Quantitative analysis, Reliability.
1. Introduction
Likert (1932) proposed a summated scale for the assessment of survey
respondent’s attitudes. Likert scaling presumes the existence of an underlying (or
latent or natural) continuous variable whose value characterizes the respondents’
attitudes and opinions. Likert scales are quite popular and are widely used in
different areas like psychology, sociology, health care, marketing, attitude,
preference, customers’ quality perceptions or expectations, and of subjective
well-being in health care, etc.
Individual item in Likert scale usually has odd number of response categories
say 5 or 7. Descriptive response alternatives could be Strongly approved,
Approved, Undecided, Disapproved, and Strongly disapproved. The response
categories are usually assigned numbers like 1, 2, 3, 4, and 5 or by -2, -1, 0, 1 and
2 or any linear transformation of such numbers. However assigning successive
integer values to scale categories has also been criticized for not being realistic.
Major limitations of Likert type data are:
a) Respondents may avoid using extreme response categories( Central tendency
bias) or agree with statements as presented (Acquiescanal bias ) or try to
DOS 4.08.15 DOA 4.02.15 Page | 31, Vol 1, Issue - 2
Scoring and Analysis of Likert Scale: Few Approaches
portray themselves or their organization in a more favourable light ( Social
desirability bias)
b) One cannot assume that respondents perceive all pairs of adjacent levels as
equidistant. Mid-point or neutral point or zero point is a perception.
c) There is no hard and fast rule to decide on number of response alternatives.
Number of response categories need not be an odd number. Five- or seven-
point formats appear to be the most prevalent.
d) Distance between response alternatives is assumed to be equidistant. If
response categories are a, b, c, d, e the equidistant property can be achieved
by either of the following two approaches.
* = viz. 1, 2, 3, 4 5 or 0.1, 0.2, 0.3, 0.4, 0.5
*
= 
viz. 1, 2, 4, 8, 16
But verification of such assumptions is usually not done through data.
Assigning successive integer values to scale categories has also been
criticized.
e) Non- Additivity: Naming the response classes by successive numbers does
not mean we can do all further mathematical operations to the data generated
from administration of such a questionnaire. Consider the hypothetical
example where 50% respondents endorsed the response category ‘Most
agree’ and the rest 50% went for “Most disagree”. Then the average score is
neutral, but clearly, they are at two opposite poles. Thus, addition of
responses of Likert items may not be meaningful. If addition is not
meaningful then further statistics like mean, SD, Correlation, analysis like
regression, estimation, testing etc. may not be meaningful. Moreover, there
could be various response distributions even for a fixed number of response
categories where the distributions have equal mean but different SDs.
f) Generated responses: A major problem with Likert items is the generated
responses consisting of a count of responses in each category. The resulting
analysis is inherently limited, primarily to a frequency table, typically with
relative and cumulative relative frequencies. However, assumption is often
made that the Likert item is interval in nature and means and other statistics
are commonly computed. It could be argued that this assumption is incorrect.
The data generated by an instrument based on Likert items simply cannot be
subjected to the more robust, more powerful and more subtle analysis
available with quantitative data. Use of an ordinal scale implies a statement
of “greater than” or “less than” without stating how much greater or lesser
g) Continuous or discrete: Scores of Likert Items sometimes are treated as if
they are not discrete. Few researchers are of the opinion that Likert scale
(from strongly agree to strongly disagree) is ordinal and have to deal with
this scale as a discrete scale. This implies that the mean, variance and
covariance etc. computed from responses to a Likert scale may not be
meaningful.
h) Reliability: Reliability assessment might use the correlation between the
item score and the total score or use a test-retest procedure. In any event, the
Page | 32, Vol -1, Issue -2
Journal of Knowledge Management and Information Technology
items not correlated with the total would be discarded. However, when items
tend to measure various dimensions of the underlying trait, item correlations
may be poor and if this is observed during analysis stage, one may not like to
discard an item.
i)Ordinal or Interval: The level of scaling obtained from Likert procedure is
rather difficult to determine. The scale is clearly at least ordinal. Response
categories tend to be sequential but not linear. Count of each category of
response represents an ordinal variable. In order to achieve an interval scale,
the properties on the scale variable have to correspond to differences in the
trait on the natural variable. In other words, distance between any pair of
response categories must be same. But it seems unlikely that the categories
formed by the misalignment of the five responses will all be equal, the
interval scale assumption seems unlikely.
j) Normality: Assumption of Normality is generally not observed from data
generated from Likert Scale. As a result, statistical analysis in the parametric
set up cannot be meaningfully undertaken from data set generated from
Likert scale.
Thus, need is felt to have new methods of scoring Likert scale so that the new
scores are cardinal and have one to one correspondence with Real number system
or a subset of it and enable us to perform analysis undertaken with quantitative
data in parametric set up.
2. Literature review
Wu (2007) considered example of an injury scale of five categories represented
by none, minor, moderate, severe, and fatal, the degree of injury seriousness
between severe and fatal is more significant than that between none and minor.
Thus, successive integers to the scale categories may not reflect the realistic
differences in injury seriousness between or among scale categories.
While Jacoby et al (1971) suggest only three response categories, Dillman, et al.
(2009) recommend that four or five categories should be used and Fink (1995)
suggested five to seven. Foddy (1994) concludes that a minimum of seven
categories is required to ensure scale validity and reliability. Nine-point format
were used by researchers like Lee & Soutar (2010) and even a 15-point format
was used by Chaiken & Eagly (1983).
Stevens (1946) mentioned that in the strict sense, statistics involving mean and
SD ought not to be used with ordinal scales since mean and SD are in error to the
extent that the successive intervals on the scale are unequal in size or not-
equidistant. However, Muraki (1990) observed that if the data fits the
Polytomous Rasch Model and fulfill the strict formal axioms of the said model, it
may be considered as a basis for obtaining interval level estimates of the
continuum. Chien-Ho Wo (2007) observed that transformation of scale data
based on Snell’s (1964) scaling procedure does not do much to pass the
normality test. From item response theory, it can be seen that even large ordinal
scales can be radically non-linear.
DOS 4.08.15 DOA 4.02.15 Page | 33, Vol 1, Issue - 2
Scoring and Analysis of Likert Scale: Few Approaches
3.Objective
The paper proposes two new methods of scoring Likert scale which result in
cardinal scores with one to one correspondence with Real number system or a
subset of it and also to discuss advantages of such scoring methods and to
compare among the methods primarily through empirical exploration.
4. Formal description
Suppose there are n respondents who answered each of the m-items of a Likert
questionnaire where each item has k-numbers of response categories.
Let  be a general element of the basic data matrix of order n X m where n-
individuals are in rows and m-items are in columns.  represents score of the
i-th individual for the j-th item. Value of  ranges between 1 to k i.e. 1 to 5
for a 5-point scale
Note 
=1 = Sum of scores of all individuals for the j-th item (Item Score
for the j-th item)

=1 = Sum of scores of all the items for i-th individual i.e. total score of
the i-th individual (Individual score)
 = Sum of scores of all the individuals on all the items i.e. total test score
In addition, one can have another matrix ((  )) of order m X k showing
frequency of i-th item to j-th response category. A row total will indicate
frequency of that item and will be equal to the sample size (n) .Similarly, a
column total will indicate total number of times that response category was
chosen by all the respondents. Grand total will be equal to sample size X number
of items ().
Multinomial model for Likert responses: A k-dimensional random variable X
with components 1,2, , has multinomial distribution or X follows
Multinomial with parameters (n; p1, p2 , ..., pk ) if the probability density function
is given by:
{1=1, . =}= !
1!2!….. !1
1 2
2 .
where pi 0 and 1+2+ = 1 and 1+2+. . =
The multinomial distribution Mult (n, 1,2, . . , ) is the joint distribution
of the k -random variables Xi. It is therefore a multivariate, discrete distribution
with mean and variance as follows:
[ ]= and  ( )=(1 )
Page | 34, Vol -1, Issue -2
Journal of Knowledge Management and Information Technology
Thus, a Likert item with k- response categories can be viewed as throwing a k-
faced dice, i.e. each throw of the dice may result into 1 or 2 or 3 or 4 or 5 for a 5-
point scale. So responses to a Likert item can be assumed to follow a multinomial
distribution
Maximum likelihood estimate of a Multinomial distribution is =
for j =
1, 2, .k . Thus, estimate of is simply the observed relative frequency of
outcome j i.e. empirical probability. Reproduction theorem says that sum of
independent multinomial random vectors with identical vectors of choice
probabilities follow a multinomial distribution:
5. Methodology: Three approaches were adopted viz.
Approach 1: with total score of an individual as sum of his/her score on each
item i.e. usual summative Likert scores. Here individual score lies between
  (for k point scale) and is discrete in nature. Similarly item scores is
also discrete.
Approach 2: assigning uniform weights i.e. w1, w2, w3, w4, and w5 to the
response categories which will remain unchanged for all items. Here,
=1 =
1 where is the empirical probability of the i-th response category[ i = 1, 2,
…..,k] and is calculated as = 
=1
 where number of items are 1, 2, …. m.
In other words, weight for the i-th response category is the ratio of total
frequency of the category (over all items) and grand total of the Item Response
Categories frequency matrix ().
Here, each item follows a Multinomial distribution with parameters
(.1,2,3,4,5). Mean and variance of the i-the response category =
. and ( 1 ) respectively. Correlation between i-th and j-th
response category is
(1 )(1 ) .
Total score of i-th item is taken as 
=1 where  denotes frequency of
the j-th response category of the i-th item and denotes weight of the j-th
response category. Similarly, score of the j-th response category is 
=1 .
Score of the i-th individual for the j-th item  is equal to if he/she
responded to the t-th response category of the j-th item (t= 1,2,3,4,5 for a five
point scale). Thus, both individual scores and item scores are in terms of
probabilities or expected values and hence each provides measurements of
continuous variable.
Approach 3: with different weights to different response categories of different
items, so that sum of weights for each item is equal to one. Here, weight to the i-
DOS 4.08.15 DOA 4.02.15 Page | 35, Vol 1, Issue - 2
Scoring and Analysis of Likert Scale: Few Approaches
th item and j-th response category is defined as  = 

=1
i.e. ratio of cell
frequency and total frequency of the item. . Clearly sum of weights for each item
is equal to one and  =. However, sum of weights for a response
category of all items is different from one. Score of an individual in the i-th item
will be 
=1 and score of j-th response category item will
be
=1 .
Here, each item will follow multinomial distribution with different values of
parameters. Item scores are in terms of expectations. However, both item scores
and individual scores are continuous variables.
Alternatively, one can find different weights to different item response category
combinations so that sum of weights (probabilities) of all cells is equal to one by
choosing
 =
 Clearly ∑ ∑  = 1.
Here, transformed cells scores follow multinomial distribution with larger
number of parameters. However, weights in this method are linearly related to
weights proposed in Approach 3. In fact, weights in this approach are equal to
weights from approach 3 divided by number of items.
Weighted scores are in terms of expected values and hence provide
measurements of a continuous variable.
6. Calculation of weights
Data : A test consisting of five Likert type items each with five response
alternatives was administered to 100 respondents where “Strongly agree ” was
assigned 5 and “ Strongly disagree” was assigned 1.
Empirical calculation of weights for various approaches is shown using the Item
Response Categories frequency matrix derived from the data. Here, number of
items ()= 5 , each item had 5 response categories . = 5 and sample
size =100.
Table – 1
Calculation of weights: Approach – 2
Items
*RC - 1
*RC - 2
*RC - 3
*RC - 4
*RC- 5
TOTAL
1
19
32
35
11
3
100
2
7
33
34
19
7
100
3
14
17
36
27
6
100
4
10
14
38
30
8
100
5
4
31
37
20
8
100
TOTAL
54
127
180
107
32
500
Weights to response
categories
0.108
0.254
0.36
0.214
0.064
1.00
Page | 36, Vol -1, Issue -2
Journal of Knowledge Management and Information Technology
RC – i denotes i-th Response category
Here 1= 54
500 ,2= 127
500 and so on
Table – 2
Calculation of weights: Approach - 3
Items
*RC- 1
*RC-2
*RC-3
*RC-4
Total
1
0.19
0.32
0.35
0.11
1.00
2
0.07
0.33
0.34
0.19
1.00
3
0.14
0.17
0.36
0.27
1.00
4
0.10
0.14
0.38
0.30
1.00
5
0.04
0.31
0.37
0.20
1.00
Here, weight for the first response category of Item 1 is 11 divided by total
frequency of the item i.e. sample size. Thus, 11 = 19
100 and so on
7. Observations
In approach 2 and 3:
Weights are taken as probabilities.
Weights are obtained from data considering the frequencies or probabilities
of Item Response categories without involving assumptions of continuous
nature or linearity or normality for the observed variables or the underlying
variable being measured.
It was assumed that there is no item with zero discriminating value i.e. there
is no item with equal frequency for each response category and there is no
item where all individuals recoded their response to only one response
category. The assumption is reasonable since items with zero discriminating
values are excluded as per method of test constructions.
Item scores and individual scores are obtained as expected values and hence
provides measurement of continues variables satisfying conditions of
linearity since
E (x + y) = E(x) + E(Y)
E (αx) = αE(x)
E (αx +βy) = αE(x) +βE(y)
Ranking of individuals are invariant under linear transformation for each of
approach 1, 2 and 3
Distribution of item scores follows multinomial distribution due to
reproduction property of multinomial random variables for approach 2 and 3.
Sum of item scores will follow multinomial distribution if the items are
independent
For large sample size, individual scores and item score may tend to follow
normal distribution under each method. Primafacy, each of approach 2 and 3
DOS 4.08.15 DOA 4.02.15 Page | 37, Vol 1, Issue - 2
Scoring and Analysis of Likert Scale: Few Approaches
is likely to tend to normality but the proposition need to be tested
empirically.
The metric data and linearity of weighted scores by any of the approach 2
and 3 enable us to generate data that is cardinal (quantitative) to permit
calculation of all descriptive statistics and also undertake relevant estimation,
testing of hypothesis, relevant analysis used in multivariate statistics.
8. Analysis
Following analysis was undertaken based on empirical data described above:
8.1 Mean, Variance and Reliability of the questionnaire (Cronbach alpha) for the
three approaches are shown in Table – 3 below:
Table – 3
Mean Variance and Reliability for various Approaches
Description
Approach - 1
Approach - 2
Approach – 3
Test mean
14.36
3.658
3.8464
Test variance
5.9904
0.6357
0.6893
Mean of Item – 1
2.47
0.66484
0.6613
Mean of Item – 2
2.86
0.72744
0.7384
Mean of Item – 3
2.94
0.7406
0.7758
Mean of Item – 4
3.12
0.77472
0.8744
Mean of Item – 5
2.97
0.7582
0.7965
Variance of Item – 1
1.0291
0.1347
0.1085
Variance of Item – 2
1.0604
0.0998
0.0709
Variance of Item – 3
1.2364
0.1209
0.1608
Variance of Item – 4
1.1456
0.1084
0.1861
Variance of Item – 5
0.9891
0.0919
0.0815
Cronbach’s alpha
0.110552
0.157417
0.147983
Observations
*Approach 2 and Approach 3 resulted in reduction of test average and test
variance i.e. the weighted scores made the data more homoscedastic.
*Scores as weighted sum resulted in higher values of alpha. Value of Cronbach’s
Alpha was highest for Approach 2 followed by Approach 3.
8.2 Rank correlation of individual score: Spearman ρ between individual scores
obtained by various approach are shown in Table 4.
Page | 38,Vol -1, Issue -2
Journal of Knowledge Management and Information Technology
Table - 4
Rank Correlation Matrix (Spearman ρ)
Observations
All the rank correlations are found to be significant.
Low value of rank correlation between Approach 1 and other approaches
indicate that ranks of individuals are different for different approaches.
However high value of Spearman between Approach 2 and 3 indicates that
individual ranks are almost unchanged. In other words, ranks of respondents
as obtained from Approach 2 remained more or less same when their ranks
are computed by Approach 3.
8.3 Item Correlation matrix for each approach are shown in Table 5, 6 and 7
Table – 5
Item Correlation Matrix for Approach - 1
*: Significant at 1% level
Table – 6
Item Correlation Matrix for Approach - 2
Items
1
2
3
4
5
1
1.00
0.051
(-)0.007
(-)0.074
0.038
2
1.00
0.009
0.111
0.098
3
1.00
0.052
0.156
4
1.00
(-)0.044
5
1.00
Table – 7
Item Correlation Matrix for Approach - 3
Items
1
2
3
4
5
1
1.0
0.037
0.024
(-) 0.005
(-) 0.055
2
1.0
0.117
0.143
0.101
3
1.0
(-) 0.019
0.189
4
1.0
(-) 0.120
5
1.0
Approach 1
Approach 2
Approach 3
Approach 1
1.0
0.444
0.306
Approach 2
1.0
0.909
Approach 3
1.0
Items
1
2
3
4
5
1
1.0
0.168
0.096
0.003
0.123
2
1.0
(-) 0.007
0.033
0.045
3
1.0
*(-)0.330
(-) 0.11
4
1.0
0.172
5
1.0
DOS 4.08.15 DOA 4.02.15 Page | 39, Vol 1, Issue - 2
Scoring and Analysis of Likert Scale: Few Approaches
Observations
*Weighted sum used in Approach 2 and Approach 3 resulted in changes of
magnitudes and signs of item-correlations
*Significant correlation between item 3 and 4 as obtained in Approach 1 became
insignificant in Approach 2 and also in Approach 3.
* No significant correlations were found in Approach 2 and Approach 3 which
imply that the weighted scores made the items more or less independent
8.4 Test of Normality
Attempt was made to test whether total score of respondents follow Normal
distribution using Anderson Darling test for Normality. It is one of most
powerful statistical test for detecting departures from Normality. The underlying
null hypothesis is that the variable under consideration is normally distributed. A
large p-value corresponding to the test statistic (p > 0.05) would indicate
normality. The test statistic is
Table – 8
Values of test statistic and associated p-values
Value of test
statistics
p – values
Remarks
Approach 1
1.1153
0.0061
H0 is rejected
Approach 2
0.4796
0.483315
H0 is accepted
Approach 3
0.6824
0.07256
H0 is accepted
Observations
*It can be inferred that scores of respondents did not follow Normal distribution
for Approach 1. In other words, summated Likert scores did not follow Normal
distribution. This highlights limitation of Likert data. As a result, a number of
statistical analysis, testing and estimation procedures which presume normality of
data cannot be performed with usual summated scores of Likert type data
*Respondents scores followed Normal distribution for Approach 2 and 3 i.e.
non-linear transformations resulted in the desired property of normality. Thus,
the transformed scores offer platform for undertaking almost all type of analysis
being done for continuous quantitative variable following Normal distribution.
* Higher p- value at Approach 2 in comparison to Approach 3 indicates better
normality in case of Approach 2.
( )( ) ( )( )( )
iNi YFYF
N
i
NAD +
+
= 1
1lnln
12
Page | 40, Vol -1, Issue -2
Journal of Knowledge Management and Information Technology
Similar approach was adapted to test whether item scores follow normal
distribution or not. The test results showed that item scores are not normally
distributed under any of the above said three approaches
8.5 Correlation matrix for the approaches: Correlation of individual scores
obtained by various approach are shown in Table – 9.
Table – 9
Correlations between a pair of approaches
Approach 1
Approach 2
Approach 3
Approach 1
1.0
0.445
0.307
Approach 2
1.0
0.927
Approach 3
1.0
Observations
*Maximum correlation was found between Approach 2 and Approach 3.
8.6 Factor structures of the approaches
Factor Analysis with orthogonal vari-max rotation was undertaken with item
correlation matrix under each approach. The results are as follows
Table 10
Results of Factor Analysis
Factor
Eigen values
Percentage of
Variance explained
Cumulative
percentage of
variance explained
Remarks
APPROACH - 1
1
1.382
27.638
27. 638
Two factors
explaining 52.507%
of variance
2
1.243
24.869
52.507
3
0.958
19.154
71.661
4
0.787
15.741
87.403
5
0.63
12.59
100
APPROACH - 2
1
1.204
24.077
24.077
Three factors
explaining 66.867%
of variance
2
1.105
22.10
46.177
3
1.035
20.69
66.867
4
0.891
17.829
84.696
5
0.765
15.304
100
APPROACH - 3
1
1.277
25.544
25.544
Three factors
explaining 68.755%
of variance
2
1.152
23.042
48.586
3
1.008
20.169
68.755
4
0.825
16.492
85.246
5
0.738
14.754
100
DOS 4.08.15 DOA 4.02.15 Page | 41, Vol 1, Issue - 2
Scoring and Analysis of Likert Scale: Few Approaches
Observations
*Approach 1 gives two factors, combined effect of which explains only 52.50%
of variance
* Each of Approach 2 and Approach 3 gives three factors explaining
cumulatively 66.87% to 68.76% of variance respectively
*The results appear to be in line with item correlation matrix under each
approach where each correlation was found to be insignificant except one in
Approach 1 and also high correlation observed between Approach 2 and
Approach 3
Thus, the non-linear transformations tended to introduce independency of items
9. Limitations
Application of the proposed scoring of Likert items and Scale should take into
account the following facts:
*The methods take no account of the experiment design behind the data.
*The methods are not applicable for items with zero discriminating value.
*Irregularities in data should be within tolerance.
* Test of Normality may be undertaken before application of the proposed
methods of Scoring since distribution of individual score obtained from
Approach-2 or Approach-3 is yet to be established.
10. Conclusions
Weighted scores where weights are data driven and proportional to probabilities
helps to find total score of a respondent and also total score of an item as
expected values and enable us to perform usual analysis for a continuous
quantitative variable. Computation of weights considered the frequencies or
probabilities of Item Response categories without involving assumptions of
continuous nature or linearity or normality for the observed variables or the
underlying variable being measured.
It was assumed that there is no item with zero discriminating value i.e. there is no
item with equal frequency for each response category and there is no item where
all individuals recoded their response to only one response category. The
assumption is reasonable since items with zero discriminating values are
excluded as per method of test constructions.
Scores in terms of expected values resulted in higher reliability of the
questionnaire. Approach 2 registered highest value of Cronbach’s alpha among
the three approaches. Such scores tended to introduce independency among the
items. The Approach 2 and 3 resulted in a situation where the five items were
almost independent and accordingly gave higher number of independent factors.
Page | 42, Vol -1, Issue -2
Journal of Knowledge Management and Information Technology
Thus, scores as per the proposed methods helped in better exploring and
interpreting the factors.
Usual Likert type score did not follow normal distribution. However, weighted
scores as per Approach 2 and also for Approach 3 resulted in the desired property
of normality and are suitable for use in methods of analysis requiring assumption
of normality. Thus, the proposed scores offer platform for undertaking almost all
type of analysis being done for continuous quantitative variable following
Normal distribution. For example, individual scores obtained through Approach
2 or Approach 3 tended to satisfy assumptions of AVOVA, regression analysis,
t-test for testing equality of means, F-test for testing equality of variances,
Discriminant analysis, etc. Proposed methods of Scoring conform better to
Normality.
Thus, scoring of Likert Scale as per Approach 2 or Approach 3 has many
desirable properties and avoids some of the major limitations of usual summative
scores. However, high correlation (over 0.9) was found between Approach 2
and Approach 3. This may imply possible use of Approach 2 instead of
Approach 3 primarily because of easiness to computation. Thus, Scoring method
as proposed in Approach -2 is recommended for Likert-type data for clear
theoretical advantages and easiness in calculations with minimum processing
hour.
References
[1]. Chaiken, S., & Eagly, A. H. (1983) "Communication Modality as a
Determinant of Persuasion: The Role of Communicator Salience". Journal of
Personality and Social Psychology, Vol 45, No. 2, pp 241-256.
[2]. Chien-Ho Wu (2007) An Empirical Study on the Transformation of Likert-
scale Data to Numerical Scores, Applied Mathematical Sciences, Vol. 1,
2007, no. 58, 2851 2862
[3]. Dillman, D. A., Smyth, J. D. & Christian, L. M. (2009) Internet, mail and
mixed-mode surveys: The tailored design method, John Wiley & Sons Inc.,
Hoboken, N.J.
[4]. Fink, A. (1995) How to ask survey questions, Sage Publications, Thousand
Oaks
[5]. Foddy, W. (1994) Constructing questions for interviews and questionnaires:
Theory and practice in social research, Cambridge University Press,
Cambridge.
[6]. Jacoby Jacob, Matell Michael S (1971) Three-point likert scales are good
enough, Journal of Marketing Research; Nov 1971; Vol 8 pg. 495-500
[7]. Lee, J. A. & Soutar, G. (2010) "Is Schwartz's Value Survey an Interval Scale,
and Does It Really Matter?" Journal of Cross-Cultural Psychology,Vol 41,
No 1, pp 76-86.
DOS 4.08.15 DOA 4.02.15 Page | 43, Vol 1, Issue - 2
Scoring and Analysis of Likert Scale: Few Approaches
[8]. Likert R (1932). A Technique for the Measurement of Attitudes. Archives of
Psychology; p. 140.
[9]. Muraki, E (1990) Fitting a Polytomous Item Response Model to Likert
Type Data. Applied Psychological Measurement, Vol. 14, No. 1, March, pp
59 71
[10]. Snell, E. (1964) A Scaling Procedure for Ordered Categorical Data,
Biometrics,
20(3), 592-607.
[11]. Stevens, S. S. (1951). Mathematics, measurement and Psychophysics. In
Handbook of Experimental Psychology. S. S. Stevens (ed.), New York: John
Wiley & Sons pp. 149.
[12]. Wu, Chien-Ho (2007) An Empirical Study on the Transformation of Likert-
scale Data to Numerical Scores. Applied Mathematical Sciences, Vol. 1,
2007, no. 58, 2851 - 2862
Author Profile
Prof. Satyendra Nath Chakrabartty is an M. Stat. (Specialisation - Psychometry)
from Indian Statistical Institute and was Director, Kolkata Campus of Indian
Maritime University. His current research interests include multi-dimensional
measurements and their properties to assess overall progress or overall distance
from the set of goals along with identification of critical areas. He also works on
estimation of true scores, true score variance, reliability of a battery of tests under
classical theory approach, Likert type tests and Non-parametric Reliability and
introducing linearity among non-linear relationships etc.
Page | 44, Vol -1, Issue -2
... The collected data are categorized and presented according to the objectives of the study. The study used ccomposite index (Sava, 2016), computing Likert scales (Chakrabartty, 2014) and multiple regression model (Field, 2009). The findings of the study are interpreted based on empirical literatures (conducted in international and national context) as well as Lee's pushpull, Todaro's migration model and remittance as an alternative for rural development theoretical perspectives. ...
Article
Full-text available
Remittance is becoming prominent source of family income in Nepal. This study thus, analyzes effect of remittance on household welfare. We adopted cross-sectional study design to collect data from 777 randomly selected respondents residing in Chautara Sngachwokgadhi (Mountain region), Galkot (Hill) and Mithila (Tarai) municipalities of Nepal. We used a reliable questionnaire tool having 0.8 cronbach alpha, and we visited the respondents from 6th June- 18th October 2022. The study found that the remittance has positive effect on household welfare of the remittance recipient households. They have good access to households, educational, financial and health facilities. Utilization of remittance helped to increase family income, helped to improve family economic situation and livelihood, helped to reduce family poverty and social exclusion, helped to create self-employment/employment and help to upgrade rural economy in the study area. However, remittance has failed to increase agriculture production and distributions (domestic household hazard) and also failed to increase entrepreneurship development in the local levels. Therefore, the empirical findings of the study can be a reference for developing evidence based policy to the concerned state actor and non-state stakeholders for minimizing public moral hazard and domestic household hazards caused by remittance.
... The interval range of the 5-likert scale ( Table 2) was calculated according to the principle of the grouped data frequency distribution formula (34). The mean of their response scores for each variable represented their level of satisfaction with pharmaceutical warehousing activities and human and material resource management practices, whereas the SD represented their deviation from the central value (35,36). ...
Article
Full-text available
A pharmaceutical warehouse is part of the pharmaceutical supply chain and is essential to maintaining the quality and efficacy of veterinary pharmaceuticals for successful animal health service delivery. However, poor storage conditions, improper handling, and inappropriate use and disposal constitute challenges for veterinary supplies in animal health services. Therefore, this study aimed to assess the existing practices and challenges in warehouse management in government veterinary clinics and private veterinary drug wholesalers in Ethiopia. A cross-sectional study was conducted on 37 veterinary health facilities in four selected zones (south Gondar, west Gondar, central Gondar, and west Gojam zones) and Bahir Dar administrative city. Zones were selected using a simple random sampling technique. Data was collected using a structured questionnaire, pre-defined and tested observational checklists, and semi-structured interview guides. Descriptive statistics were used to analyze the quantitative data, while qualitative data was analyzed using a thematic approach. The study revealed the presence of poor stock management practices, such as the absence of standard operating procedures for warehouse activities in ~59.5% of facilities surveyed. In none of the surveyed facilities, bin cards and system software utilization were satisfactory. The absence of disposal guidelines was detected in 83.8% of the facilities, and the practice of timely disposal of expired drugs was not satisfactory. Compared to the government veterinary clinics, private veterinary drug wholesalers had better storage practices (86.25%) following theoretical recommendations. The storage conditions in government clinics were rated poor at 48.3% (>80%, which is the limit to the acceptable rate for good storage conditions). The challenges of inadequate infrastructure, a lack of qualified staff, problems with the availability and affordability of pharmaceutical products, insufficient regulatory practice, and budget constraints were identified. A holistic approach involving related stakeholders should be followed to improve the existing challenges and the sector's efficiency.
... To assess the level of satisfaction, six factors were identified that can influence student satisfaction. The impact of each factor was assessed using a Likert scale from 0 to 4 [17]. ...
Article
Full-text available
After the outbreak of the pandemic in 2019 and the outbreak of war in the country in 2022, educational institutions at different levels of Ukraine switched to a mixed format of the educational process and were forced to look for modern approaches and technologies for organizing the education of students. This study examines the implementation of microlearning technology using online courses developed on the Moodle platform. Microlearning is a modern learning technology that involves short, intensive training modules focused on the development of specific theoretical knowledge and practical skills. Available online courses, which provide the ability to create and deliver different types of educational content, focus mainly on the formation of necessary knowledge and skills of students, but do not take into account their individual needs and interests in the learning process, and pay little attention to their satisfaction with education in modern conditions. This article investigates the impact of microlearning technology using online courses on students' satisfaction with learning. To determine the level of student satisfaction, an online survey was conducted among 61 students enrolled in the specialty 051 "Economics" programs, which include Business Economics, International Economics, Economic Cybernetics, and Digital Economy, at the National University of Life and Environmental Sciences of Ukraine. All these students were studying using microlearning technology with the use of online courses. As a result, that the level of student satisfaction with learning using this technology is most influenced by such factors as the availability of learning resources, consideration of individual abilities and needs in the online course, opportunities for interaction and communication with the teacher, as well as the format of learning materials and acquired knowledge. Accordingly, these factors should be taken into account when developing online courses and implementing microlearning technologies in the educational process.
... The hand-given questionnaires were filled up by 180 respondents who were visited from January 26th 2022, to February 30th 2022. We ran SPSS version 25 and applied frequency tabulation, central tendency, summative analysis (Chakrabartty, 2014), composite index (Sava, 2016) and multiple regression models (Field, 2009) for organizing, summarizing, describing and generalizing data. We brought theoretical insights from decentralization, multi-level governance and new public management theories during the interpretation. ...
Article
Full-text available
Good governance is a state management system which offers well-public service deliveries. This study aimed to explain good governance practices in Godawari Municipality in Lalitpur district, Nepal. We applied the post-positivism research paradigm and institutional/exit poll survey research design. The data were generated from 180 sample respondents who were elected leaders, administrative staff, local intellectuals, and service receivers in the municipality, and these were selected purposively and randomly. We applied reliable self-administered questionnaires (0.91> 0.78 Cronbach’s alpha value) consisting of seven indicators: accountability, transparency, participation, the rule of law, corruption, responsiveness, and effectiveness and efficiency. This study is explained through the theoretical insights from decentralization and the new public management theories. This study found that good governance practices in Godawari Municipality were satisfactory and fair. Participation of local youths in the local government is remarkable, and the women elected representatives are more the men. The educational status of the respondents is good, and their level of education and the transparency score in the municipality are positively correlated. Service receivers perceive that political leaders and administrative staffs are mainly responsible for corruption. There exist between the principles of good governance and the practice in the study area, which nine possible implications of the research can address.
... However, the use of the Likert response method in combination with self-ratings has been criticized for failing to provide evidence of construct validity (Chakrabartty, 2014). Lombardi et al. (2018) pointed out that students may lack objectivity or may bias their responses due to perceived social pressure. ...
... However, the use of the Likert response method in combination with self-ratings has been criticized for failing to provide evidence of construct validity (Chakrabartty, 2014). Lombardi et al. (2018) pointed out that students may lack objectivity or may bias their responses due to perceived social pressure. ...
Chapter
Full-text available
To address the challenges originating from changes in the global market and with technological progress, sub-Saharan Africa (SSA) is adopting more holis-tic education systems that offer lifelong competence for workers in the twenty-first century. This transition requires integration of complex cognitive and social/inter-personal competencies, such as critical thinking, teamwork, cultural and diversity awareness, multilingualism, and the use of digital technologies into the traditional educational curricula. However, the transition from traditional to holistic curricula is complicated. Issues include how twenty-first century skills are defined in SSA, how they can be taught, how they can be integrated into curricula, and how they can be assessed. A review of the literature on assessments was first conducted in order to review approaches and tools used to measure twenty-first century skills in SSA. Five assessment approaches were identified: scenario-based, questionnaire, video recording and direct observation, portfolio, and technology-based. Seven tools that met study criteria were examined along five dimensions: purpose, type or form, target population, context, and specific skills, in order to determine their utility for assessment of twenty-first century skills. Findings indicate that five of the seven assessment tools support summative purposes while two support formative assessment. Further to this, two tools were designed for large-scale assessment and three targeted adolescents. In terms of method, scenario-based and self-report were the most common approaches used to collect information on twenty-first century skills in SSA. Notably, the outcomes of scenario-based assessments provided compelling evidence of proficiencies, demonstrating the method's efficiency in task creation , analysis, and scoring rubrics that provide clear distinctions across performance levels.
... The questionnaire consists of 42 questions on a Likert scale [45], which must be answered with a value of 1 point to say very disagree, and 5 points for strongly agree, and before they can answer the question, they were conducted to watch and learn what is metaverse and how metaverse works, by watching several introduction and guideline movie about metaverse (Microsoft [46], Microsoft [47], Facebook Inc. [48]). The movie watching is needed because there were limitations due to the absence of established metaverse systems that could be tried directly. ...
Article
Full-text available
Currently several industries are starting to try to apply metaverse in various possible implementations, such as manufacturing, health, business, education and training, architecture, and entertainment. For business in a smaller context, metaverse can be used to interact with other users in virtual meetings and predicted to be able to replace the current concept of online communication using video conferencing. The question is are the employees have intention to work within metaverse environment in the future, and what will be the barrier and the driver for employees to work within metaverse environment To answer this question, a Partial Least Squares Structural Equation Modelling (PLS-SEM) analysis methodology was carried out using a modified dual factor model approach. In this study it is also proposed to add environmental factors which are also a part in a decision-making process. The research result shows that the application of the metaverse in the company does not necessarily need to be driven by external factors. Instead, the company's independence determines its adaptation to the technology. From this study, it was obtained that the factors in the dual factor model had a significant or no significant effect on the intention to work within metaverse. By validity, reliability, and path coefficient tests on research model proposed, it is determined the readiness and interest of employees to switch to work within metaverse.
... The reliability value of the instrument is 0.8774 for teaching effectiveness, 0.876 for knowledge management, and 0.8922 for teamwork. Measurement scale using the Likert scale [43]. ...
Article
Full-text available
Background Ethiopia has scaled up medical education to improve access to healthcare which presented challenges to maintaining training quality. We conducted a study to assess the clinical competence of graduating medical students and the associated factors. Methods and materials A pretest assessment of a quasi-experimental study was conducted in 10 medical schools with a sample size of 240 students. We randomly selected 24 students per school. Clinical competence was assessed in a 12-station objective structured clinical examination. The clinical learning environment (CLE), simulation training, and practice exposure were self-rated. Mean scores for clinical competence, and satisfaction in the CLE and simulation training were calculated. Proportions of students with practice exposure, and who agreed on CLE and simulation items were done. Independent t-tests were used to look at competence differences among subgroups. Bivariate and multiple linear regression models were fitted for the outcome variable: competence score. A 95% statistical confidence interval and p-value < 0.05 were used for making statistical decisions. A 75% cut-off score was used to compare competence scores. Results Graduating medical students had a mean competence score of 72%. Low scores were reported in performing manual vacuum aspiration (62%), lumbar puncture (64%), and managing childbirth (66%). Female students (73%) had a significantly higher competence score than males (70%). Higher cumulative grade point average (CGPA), positive appraisal of the CLE, and conducting more clinical procedures were associated with greater competence scores. Nearly half of the students were not satisfied with the clinical practice particularly due to the large student number and issues affecting the performance assessment. About two-thirds of the students were not satisfied with the sufficiency of models and equipment, and the quality of feedback during simulation training. Nearly one-third of the students never performed lumbar puncture, manual vacuum aspiration, and venipuncture. Conclusions Medical students had suboptimal clinical competence. A better clinical learning environment, higher cumulative GPA, and more practice exposure are associated with higher scores. There is a need to improve student clinical practice and simulation training. Strengthening school accreditation and graduates’ licensing examinations is also a way forward.
Article
Full-text available
This study presents a systematic process to evaluate pivotal factors influencing technology transfer within the Thailand context, incorporating the perceptions of both technology adopters and developers. Utilizing a rigorous triangulation of methods, including preliminary assessments, extensive interviews, and a systematically structured questionnaire, the Evaluation Matrix of Technology Transfer (EMTT) was formulated. The EMTT encompasses six fundamental components: 1) Knowledge/Know-how, 2) Artifacts, 3) User Insight, 4) Marketing, 5) Intellectual Property, and 6) Technology Transfer Management. Notably, among these, Artifacts emerged as paramount. Divergences in perspectives between adopters and developers became evident. While adopters underscored the alignment of research outputs with user requirements, developers accentuated the importance of adept management in technology transfer. In addition, a discernable discrepancy was observed in six evaluative aspects; adopters placed a premium on the R&D prowess of researchers, whereas developers highlighted the value of research collaboration with the industrial sector. Collectively, this robust assessment paradigm offers pertinent insights, underscoring the imperatives for judicious decision-making and fostering efficacious technology transfer processes within Thailand.
Article
Full-text available
Reports 2 studies, using a total of 304 university students, in which a likable or unlikable communicator delivered a persuasive message via writing, audiotape, or videotape. In both studies the likable communicator was more persuasive in video- and audiotape than in writing, but the unlikable communicator was more persuasive in writing. Thus, communicator likability was a significant determinant of persuasion only in the broadcast modalities. Other findings suggest that Ss process more communicator cues when exposed to video- and audiotape messages than when exposed to written ones and that communicator-based (rather than message-based) cognitions predicted opinion change primarily in video and audiotape conditions rather than in written ones. It is concluded that video- and audiotapes enhance communicator-related information, so that communicator characteristics exert a disproportionate effect on persuasion when messages are broadcast. Findings are also discussed in relation to "vividness" phenomena. (40 ref) (PsycINFO Database Record (c) 2012 APA, all rights reserved)
Article
This study examined the application of the MML-EM algorithm to the parameter estimation problems of the normal ogive and logistic polytomous response models for Likert-type items. A rating-scale model was devel oped based on Samejima's (1969) graded response model. The graded response model includes a separate slope parameter for each item and an item response parameter. In the rating-scale model, the item re sponse parameter is resolved into two parameters: the item location parameter, and the category threshold parameter characterizing the boundary between re sponse categories. For a Likert-type questionnaire, where a single scale is employed to elicit different re sponses to the items, this item response model is ex pected to be more useful for analysis because the item parameters can be estimated separately from the threshold parameters associated with the points on a single Likert scale. The advantages of this type of model are shown by analyzing simulated data and data from the General Social Surveys. Index terms: EM algorithm, General Social Surveys, graded response model, item response model, Likert scale, marginal maximum likelihood, polytomous item response model, rating-scale model.
Article
Researchers often assume the numerical ratings approach used to measure values, such as Schwartz’s Value Survey (SVS), conforms to an interval scale. Correspondence analysis was used to examine this assumption by analyzing SVS data obtained from four Anglo (Australia, New Zealand, United Kingdom, and United States) and two Asian (South Korea and China) countries. The analysis suggested the SVS did not exhibit the characteristics of an interval scale, with responses across all countries producing larger intervals at the low end of the scale and smaller intervals from the mid to high end of the scale. Further analysis suggested there were significant differences in the traditional SVS means and the means suggested by the correspondence analysis. However, when correlations and Euclidian distances between SVS and correspondence analysis scores were examined, they were very high, suggesting the lack of interval scaling was unlikely to affect the relationships between the SVS value types and other constructs.
Article
This paper presents a method of determining numerical scores for the categories of subjective scales. The scores so determined are suitable for use in methods of analysis dependent upon assumptions of normality. The exact solution necessitates an iterative procedure but an approximate solution is adequate for most practical purposes. The approximate solution is easily obtained.
Article
The success of any interview or questionnaire depends upon good question design, yet most of the available literature has been devoted to interview techniques, rather than question formulation. This practical book provides a coherent, theoretical basis for the construction of valid and reliable questions for interviews and questionnaires. The theoretical framework used in the book provides a set of principles that, when followed, will increase the validity and reliability of verbal data collected for social research. Dr Foddy outlines the problems which can arise when framing questions with clarity and commonsense. He has written a wide ranging, useful book for survey practitioners working in the social sciences.
Article
The nine-volume Survey Kit is designed to help readers prepare and conduct surveys and become better users of survey results. All the books in the series contain instructional objectives, exercises and answers, examples of surveys in use, illustrations of survey questions, guidelines for action, checklists of "dos and don'ts," and annotated references. This volume, second in the series, is designed to guide the reader to prepare and use reliable and valid survey questions. The first objective is to help the user understand a survey's cultural, psychological, economic, and political contexts. The survey developer is encouraged to ask valid questions that make sense to the respondent, and are concrete, with well-constructed sentences and careful word choice. The user is led to ask questions correctly through the use of meaningful response categories, appropriately grouped. Also discussed is applying special questioning techniques as needed. The following chapters are included: (1) "Asking Questions: A Matter of Context"; (2) "Keep Questions Closed or Open Them Up?"; (3) "Responses: Choices and Measurement"; and (4) "Knowledge, Attitudes, and Behavior: Additional Tips When Creating Survey Questions." A list of 15 annotated additional readings is attached. (Contains 32 examples and 4 tables.) (SLD)
Article
The author presents a discussion of the significance and role of mathematics and mathematical models in scientific investigation and especially in relation to psychological measurement. The major divisions of his discussion are entitled the mathematical model, numerals and measurements, psychophysics and psychophysical methods, probability, and measures and indicants. 67-item bibliography. (PsycINFO Database Record (c) 2012 APA, all rights reserved)
Article
The application of statistical methods to data analysis requires that the data set concerned should follow some particular assumptions. For example, AVOVA assumes that the response variable is normally dis-tributed within groups, and the variances in the different groups are identical. However such assumptions are generally not observed by data collected through Likert Scales. This paper presents a computation pro-cedure for transforming Likert-scale data into numerical scores that bet-ter follow the assumption of normality, based on the scaling procedure proposed by E. J. Snell. We have also conducted an empirical study to investigate the effects of the proposed transformation on data analysis. Finally this paper addresses the decision on whether or not that Likert-scale data should be transformed to scores that are more compliant to statistical assumptions.
Article
The project conceived in 1929 by Gardner Murphy and the writer aimed first to present a wide array of problems having to do with five major "attitude areas"--international relations, race relations, economic conflict, political conflict, and religion. The kind of questionnaire material falls into four classes: yes-no, multiple choice, propositions to be responded to by degrees of approval, and a series of brief newspaper narratives to be approved or disapproved in various degrees. The monograph aims to describe a technique rather than to give results. The appendix, covering ten pages, shows the method of constructing an attitude scale. A bibliography is also given.