Question
Asked 4th Jan, 2022

What statistical tool (data analysis method) should I use when I would like to see the relationship between a yes-no and a Likert scale variable?

Hi! I'm a fourth-year college student. This is my first time doing quantitative research with nominal and ordinal data.
I would like to ask for your help and/or advice regarding the statistical tool I should use to examine the relationship between students' technology access, measured by 21 statements answerable with yes or no, and student attitudes, measured by statements rated on a Likert scale.
In addition, I would like to ask for any ideas on how I could interpret the results or their relationship, given that both variables (technology access and student attitude) have three indicators each.
Thank you so much.

Most recent answer

Qijia Liao
University of Liverpool
If your dependent variable is a binary (yes/no) question, choose binary logistic regression; if your dependent variable is continuous (or an ordinal variable treated as continuous), use a linear regression model.
Next, for your independent variables: if the data are nominal, you can convert yes to 1 and no to 0 (dummy coding); for the remaining IVs with ordinal data (a 1-5 scale), you don't need to do anything extra. For the ordinal data, however, you should first check data quality with a reliability analysis, e.g. the SPSS reliability test (rule of thumb: Cronbach's alpha greater than 0.7).
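To make the regression route concrete, here is a minimal sketch in Python (numpy only) of fitting a binary logistic regression by Newton-Raphson, treating a yes/no access item as the DV and a 1-5 Likert score as the IV. All data are simulated, and the variable names and the effect size are assumptions for illustration, not part of the original question.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200

# Simulated data: a 1-5 Likert attitude score (IV) and a yes/no
# access item (DV) generated with a mild positive relationship.
attitude = rng.integers(1, 6, size=n).astype(float)
p_true = 1 / (1 + np.exp(-(-1.5 + 0.5 * attitude)))
access = rng.binomial(1, p_true)

# Fit logistic regression by Newton-Raphson (the log-likelihood is concave).
X = np.column_stack([np.ones(n), attitude])   # intercept + predictor
beta = np.zeros(2)
for _ in range(25):
    mu = 1 / (1 + np.exp(-X @ beta))          # fitted probabilities
    grad = X.T @ (access - mu)                # score vector
    hess = X.T @ (X * (mu * (1 - mu))[:, None])
    beta += np.linalg.solve(hess, grad)

print(f"intercept = {beta[0]:.2f}, slope = {beta[1]:.2f}")
```

The slope is on the log-odds scale, so exp(slope) is the odds ratio for answering yes associated with a one-point increase in attitude.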

All Answers (24)

Sal Mangiafico
Rutgers, The State University of New Jersey
Are you combining the 21 yes-no questions into a single number? Or looking at each question separately? ... Are you combining the Likert-type items into a single number (scale)?
David L Morgan
Portland State University
I agree with Salvatore S. Mangiafico that whether you are combining your items into a scale is an important question. At a minimum, you could assess the feasibility of combining the Likert-scored items into a scale by using coefficient alpha.
Note, however, that alpha does not apply to binary items. In that case, you could appeal to "face validity" to argue that these items make up a score on something like "technology usage."
D. Eastern Kang Sim
University of California, San Diego
If you are interested in response patterns, consider latent class (profile) analysis. It might yield deeper insight into how participants fall into heterogeneous subgroups.
Ronán Michael Conroy
Royal College of Surgeons in Ireland
David L Morgan – I've heard this before, that alpha does not apply to binary items. In fact, the KR20 coefficient, which was developed for binary items, is mathematically equivalent to alpha.
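This equivalence is easy to verify numerically. The sketch below (Python; the 0/1 responses are simulated, so the numbers are only illustrative) computes Cronbach's alpha using population item variances and KR-20 from the item proportions; for binary items the two coincide, because a 0/1 item's variance is exactly p(1 - p).

```python
import numpy as np

rng = np.random.default_rng(1)
# Hypothetical 0/1 responses: 100 respondents x 21 yes/no items,
# all driven by one latent trait plus noise.
latent = rng.normal(size=(100, 1))
items = (latent + rng.normal(size=(100, 21)) > 0).astype(float)

k = items.shape[1]
total = items.sum(axis=1)

# Cronbach's alpha with population (ddof=0) item variances
alpha = k / (k - 1) * (1 - items.var(axis=0).sum() / total.var())

# KR-20 replaces each item variance with p*(1-p), which for a
# 0/1 item is the same quantity, so the results match exactly.
p = items.mean(axis=0)
kr20 = k / (k - 1) * (1 - (p * (1 - p)).sum() / total.var())

print(f"alpha = {alpha:.6f}, KR-20 = {kr20:.6f}")
```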
David L Morgan
Portland State University
Ronán Michael Conroy thank you for that information
Mohialdeen Alotumi
Sana'a University
For correlating your ordinal scale (i.e., student attitude) with your binary scale (i.e., technology access), you could report the Spearman correlation coefficient. The following might be of interest.
Chalmers, R. P. (2018). On misconceptions and the limited usefulness of ordinal alpha. Educational and Psychological Measurement, 78(6), 1056–1071. https://doi.org/10.1177/0013164417727036
de Winter, J. C. F., Gosling, S. D., & Potter, J. (2016). Comparing the Pearson and Spearman correlation coefficients across distributions and sample sizes: A tutorial using simulations and empirical data. Psychological Methods, 21(3), 273–290. https://doi.org/10.1037/met0000079
Good luck,
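As a quick sketch of that suggestion (Python with scipy; the data are simulated and the variable names are mine, not from the original survey), dummy-code the yes/no item as 1/0 and feed both variables to a Spearman correlation:

```python
import numpy as np
from scipy.stats import spearmanr

rng = np.random.default_rng(2)
n = 120
access = rng.binomial(1, 0.6, size=n)   # yes/no item dummy-coded as 1/0
# Simulated 1-5 attitude score, nudged upward for the "yes" group
attitude = np.clip(rng.integers(1, 6, size=n) + access, 1, 5)

rho, pval = spearmanr(access, attitude)
print(f"Spearman rho = {rho:.3f}, p = {pval:.4g}")
```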
Sal Mangiafico
Rutgers, The State University of New Jersey
Mohialdeen Alotumi , I don't know whether this applies to the original question, but on the topic of measuring the association between a dichotomous variable and an ordinal variable, I would recommend the Glass rank biserial correlation (Rg) over measures like the Spearman or Kendall correlation.
One simple reason is that Rg is designed for this purpose, whereas correlation is typically used to, well, determine the correlation between two continuous or ordinal variables.
I think this can cause some confusion when using correlation for the effect size of two groups. If we have values for two groups, say, A and B, the natural inclination is to find the correlation between A and B. Whereas to use correlation as the effect size for the difference between the groups, we would need to find the correlation between the combined values of A and B, and the numeric equivalent of the two groups.
Rg is also directly related to the probability that an observation in one group is larger than an observation in the other group. (Compare Cliff's delta, Vargha and Delaney's A, and the common language effect size). So it's actually quite easy to interpret.
Finally, using correlation in this manner returns a result that has a sign opposite of the usual sign of effect size statistics and similar statistics. Typically, if the second group has larger values than the first group, the statistic is negative. You can see this with the t statistic, z statistic, and signed effect size statistics like Cohen's d. Results for Rg should be in accord with this convention, whereas correlation will return the opposite of this convention.
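Since Rg, Cliff's delta, and Vargha and Delaney's A all summarize the same pairwise comparisons between the two groups, the links between them can be checked directly. A minimal simulation (Python; the attitude scores for the two access groups are made up for illustration):

```python
import numpy as np

rng = np.random.default_rng(3)
# Made-up attitude scores for the "no access" and "yes access" groups
no_access = rng.integers(1, 6, size=40)
yes_access = rng.integers(2, 6, size=50)   # shifted upward

# Count every (yes, no) pair: greater, smaller, tied
diff = yes_access[:, None] - no_access[None, :]
greater = int((diff > 0).sum())
less = int((diff < 0).sum())
ties = int((diff == 0).sum())
n_pairs = diff.size

# Cliff's delta; numerically the same as the rank biserial in the
# two-group case. A is the probability-of-superiority version.
delta = (greater - less) / n_pairs
A = (greater + 0.5 * ties) / n_pairs     # Vargha-Delaney A
# The two are exact linear transforms of each other: delta = 2A - 1
print(f"delta = {delta:.3f}, A = {A:.3f}")
```

A is directly interpretable: it is the probability that a randomly chosen "yes" student scores higher than a randomly chosen "no" student, counting ties as half.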
Muhammad Zia Aslam
Superior University
Elaine Robledo I think you can simply combine Student Attitude, take it as a continuous variable, and perform a comparison of means (t-test) across your Yes/No groups. The results might support the common perception that those who have access to technology show better attitudes toward goal achievement. Tq.
Ronán Michael Conroy
Royal College of Surgeons in Ireland
Muhammad Zia Aslam thinks that "you can simply combine Student Attitude and take it as a continuous variable". I don't. You have no reason to believe that the attitudes form a unidimensional scale. (You may have hopes – we all have – but beliefs require data.)
If you want to explore the structure of student attitudes ( I have a fondness for exploring the structure of questionnaires) I recommend Mokken scaling, which is a nonparametric procedure for building one or more unidimensional scales from a pool of items. I'm afraid that the best way of doing it is using R, but the R package concerned – called mokken – is really easy to use and has splendid documentation.
1 Recommendation
David L Morgan
Portland State University
Elaine Robledo There are many ways to assess whether a set of items form a scale. The classic approach is to begin with coefficient alpha, which is a conservative test of whether all the items measure the same underlying construct (i.e., they are highly inter-correlated).
Ronán Michael Conroy
Royal College of Surgeons in Ireland
David L Morgan 's advocacy of alpha must be taken with a caveat. Alpha is the average of all possible split-half correlations. For that reason, alpha can be high when the items are made up of several uncorrelated unidimensional scales. It does not guarantee unidimensionality, and, indeed, assumes unidimensionality. In the case of a scale that measures several constructs, the interpretation of alpha is problematic.
So alpha is like a friend of mine, who claimed that he knew nothing about good music, but could instantly recognise bad music. A low alpha is a useful indicator that your items lack internal consistency, but a high alpha is not an indicator that your items are a meaningful scale.
1 Recommendation
David L Morgan
Portland State University
I agree that alpha is not a guarantee of unidimensionality, but the way I see it, the separate unidimensional scales would have to be highly correlated with each other before they could generate a high value of alpha (I personally consider .8 the best cut-off for alpha, but most journals will accept .7).
1 Recommendation
Ronán Michael Conroy
Royal College of Surgeons in Ireland
This isn't true. Imagine a scale made up of equal numbers of items from two completely uncorrelated scales.
Now imagine a split-half reliability. Each of the halves will contain a number of items from each of the subscales – in fact, only one possible split half will produce a correlation of zero because it will separate the two sets of items perfectly. You can see the problem! The scores on each of the halves will tend to correlate well. See
Huysamen, G.K., 2006. Coefficient alpha: Unnecessarily ambiguous; unduly ubiquitous. SA Journal of Industrial Psychology, 32(4)
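A quick simulation illustrates the point: ten items built from two completely uncorrelated factors, with no cross-loadings, still yield a high alpha. (Sketch in Python; the loadings and noise level are arbitrary choices, not from any real instrument.)

```python
import numpy as np

rng = np.random.default_rng(4)
n = 500
f1 = rng.normal(size=(n, 1))            # factor 1
f2 = rng.normal(size=(n, 1))            # factor 2, independent of f1
# Five items load on each factor; there are no cross-loadings,
# so the item pool is clearly two-dimensional.
items = np.hstack([
    f1 + 0.5 * rng.normal(size=(n, 5)),
    f2 + 0.5 * rng.normal(size=(n, 5)),
])

k = items.shape[1]
alpha = k / (k - 1) * (1 - items.var(axis=0).sum() / items.sum(axis=1).var())
print(f"alpha = {alpha:.3f}")  # high, despite the two uncorrelated factors
```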
As for the threshold values of alpha, they too are folklore. Researchers frequently invoke the authority of Nunnally (Nunnally & Bernstein 1994) to justify an alpha of 0.7 or more as indicating an acceptable level of scale reliability. As Lance points out, Nunnally simply didn't say this (Lance et al. 2006). And it is worth quoting what Nunnally did say:
"In the early stages of research… one saves time and energy by working with instruments that have only modest reliability, for which purpose reliabilities of ·70 or higher will suffice… In contrast to the standards in basic research, in many applied settings a reliability of ·80 is not nearly high enough… In many applied problems, a great deal hinges on the exact score made by a person on a test… In such instances it is frightening to think that any measurement error is permitted. Even with a reliability of ·90, the standard error of measurement is almost one-third as large as the standard deviation of the test scores."
Lance, C.E., Butts, M.M. & Michels, L.C., 2006. The Sources of Four Commonly Reported Cutoff Criteria: What Did They Really Say? Organizational Research Methods, 9(2), pp.202–220.
1 Recommendation
Muhammad Zia Aslam
Superior University
Respected Prof. Ronán Michael Conroy , I really commend your statistically rigorous approach to the issue, BUT how would a fourth-year degree student digest and apply the Mokken scaling procedure using an R package for what is possibly an end-of-semester research assignment? As a commonly accepted measure of the reliability of an existing scale for a latent variable, I still think the alpha coefficient would be good enough to move forward to the main objectives of the study. Tq.
1 Recommendation
David Eugene Booth
Kent State University
Cut to the chase and use logistic regression with the Likert variable as your IV. Follow David L Morgan on scale construction. Good luck, David Booth
2 Recommendations
Ronán Michael Conroy
Royal College of Surgeons in Ireland
David Eugene Booth – thank you for introducing a little clarity into what had become a pretty arcane discussion! And thank you, Muhammad Zia Aslam for pointing out that this poor student has probably enough to do without getting involved in Mokken scaling!
Elaine Robledo
University of Southeastern Philippines
Sal Mangiafico Sorry for the late response.
I would like to look at each question (binary item) separately, since this is also for profiling purposes, e.g. what percentage of the students have smartphones, computers... use only mobile data, or have internet access at home, etc.
I would then like to look into whether these have an effect on the overall attitude of the students regarding online learning (measured by the Likert scale).
I don't have much experience or knowledge in doing stats, so I am hoping that you could suggest a simple analysis method for this kind of situation, or for handling these kinds of data.
Thank you.
Elaine Robledo
University of Southeastern Philippines
Thank you so much for your inputs and suggestions — David L Morgan , D. Eastern Kang Sim , Ronán Michael Conroy , Mohialdeen Alotumi , Muhammad Zia Aslam , Oluwaseyi Ayorinde Mohammed , and David Eugene Booth . I would look into these data analysis methods and try to get back to you if I have found the data analysis method that I would use or if I have further questions.
Again, thank you so much. You are all a big help to the success of our thesis.
Sal Mangiafico
Rutgers, The State University of New Jersey
Elaine Robledo , your question isn't entirely clear to me, but I'll try to make some comments.
  1. Obviously for each of the yes/no questions you can calculate and report the percentage of "yes" answers, e.g. percentage of people who use a computer, smartphone, and so on. It is usually a good idea to report data like this, almost as if it were demographic data.
  2. How you analyze the connection between the yes/no questions and the Likert-type items depends on whether you will combine the Likert-type items into a single scale or treat them individually.
  3. With a binary independent variable and an ordinal or continuous dependent variable, a Wilcoxon-Mann-Whitney test will work well for a hypothesis test, and any of Cliff's delta, Vargha and Delaney's A, or the Glass rank biserial correlation will work as an effect size statistic. (In the end, these three measure the same thing.) ... The tests and effect size statistics here all assess whether an observation in one group is likely to be greater than an observation in the other group. They don't usually address means or medians.
  4. If you are interested in means or medians, there are tests and effect size statistics that may be applicable.
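For point 3 above, here is a minimal sketch (Python with scipy; the scores are simulated and the group labels are hypothetical) that runs the Wilcoxon-Mann-Whitney test and recovers Vargha and Delaney's A and the rank-biserial correlation from the U statistic:

```python
import numpy as np
from scipy.stats import mannwhitneyu

rng = np.random.default_rng(5)
# Simulated attitude-scale scores, split by a yes/no access item
yes_group = rng.integers(2, 6, size=120)   # shifted upward
no_group = rng.integers(1, 5, size=100)

u, pval = mannwhitneyu(yes_group, no_group, alternative="two-sided")

# Effect sizes recovered from U: A = P(yes > no) + 0.5 * P(tie)
n1, n2 = len(yes_group), len(no_group)
A = u / (n1 * n2)                # Vargha-Delaney A
r_rb = 2 * A - 1                 # rank biserial (= Cliff's delta)
print(f"U = {u:.0f}, p = {pval:.4g}, A = {A:.3f}, r_rb = {r_rb:.3f}")
```

Here A reads directly as the probability that a randomly chosen student from the yes group scores higher than one from the no group, counting ties as half.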
Maamir Abdellatif
University of Oran 2 Mohamed Ben Ahmed
The best method is SEM (structural equation modelling), which depends on how the constructs underlying the study variables are interpreted.
Bachir Abdelhamid
Université Mohamed Chérif Messaadia de Souk-Ahras
Hello everybody, I think a t-test.
