How can I decide between using principal components analysis versus factor analysis?

Question

These two methods may appear similar to the user, but aren&#x27;t they quite different, and what would you tell a person who is considering using such methods? Thank you for your expert advises.&#xA0;

Linda Sanner · Accepted Answer

Factor analysis (FA) is a group of statistical methods used to understand and simplify patterns of relationships underlying measured variables (Beavers, Lounsbury, Richards, Huck, Skolits, &#x26; Esquivel, 2013; Fabrigar, Wegener, MacCallum, &#x26; Strahan, 1999; Schmitt, 2011). Factor analysis is a concept that includes both exploratory factor analysis (EFA) and confirmatory factor analysis (CFA) (Jennrich &#x26; Bentler, 2011).
CFA tests whether a known factor model can predict a set of observed data (DeCoster, 1998). Researchers use CFA to verify or confirm hypotheses or theory (Ruscio &#x26; Roche, 2012; Schmitt, 2011), establish the validity of the factor model, compare two models using the same data, test the significance of factor loading, test relationships between factor loadings, test for correlation or lack of correlation of factors, and assess convergent and discriminate validity of measures (DeCoster, 1998).
EFA tests the number of common factors that influence measures and tests the strength and relationship between each common factor to the corresponding measure (DeCoster, 1998). Researchers use EFA to identify the nature of constructs that underlie responses given in a questionnaire, determine sets of items that interconnect, demonstrate the depth and breadth of measurement scales, classify the most important features of a group of items, and generate factor scores that represent the underlying constructs (DeCoster, 1998). Because EFA is a multivariate statistical approach, it is appropriate for reducing the number of factors, examining relationships between categories, and evaluating the construct validity of a measurement scale (Williams et al., 2010).
Exploratory factor analysis involves a series of statistical analysis steps. The first is the planning phase, where it is determined if the data is suitable for EFA by selecting the sample size then after collecting the data, creating a correlation matrix and testing for adequacy. The second step is to extract factors. The third step is to determine the number of factors to retain. The fourth step is factor rotation. The fifth step is to interpret the factor structure.
Principal component analysis (PCA) is a method of factor extraction (the second step mentioned above). Researchers use PCA when they want to reduce the number of variables while retaining as much of the original variance as possible (Conway &#x26; Huffcutt, 2003).
REFERNCES
Beavers, A. S., Lounsbury, J. W., Richards, J. K., Huck, S. W., Skolits, G J., &#x26; Esquivel, S. L. (2013). Practical considerations for using exploratory factor analysis in educational research. Practical Assessment, Research &#x26; Evaluation, 18(6), 1-13. Retrieved from http://www.pareonline.net/pdf/v18n6.pdf
Conway, J. M., &#x26; Huffcutt, A. I. (2003). A review and evaluation of exploratory factor analysis practices in organizational research. Organizational Research Methods, 6, 147-168. doi:10.1177/1094428103251541
DeCoster, J. (1998). Overview of Factor Analysis. Retrieved from http://www.stat-help.com/factor.pdf
Fabrigar, L. R., Wegener, D. T., MacCallum, R. C. &#x26; Strahan, E J. (1999). Evaluating the use of exploratory factor analysis in psychological research. Psychological Methods, 4, 272-299. doi:1082-989X/99/S3.00
Jennrich, R. I., &#x26; Bentler, P. M. (2011). Exploratory bi-factor analysis. Psychometrika, 76, 537-549. foi:10.1007/s11336-011-9218-4
Ruscio, J., &#x26; Roche, B. (2012). Determining the number of factors to retain in exploratory factor analysis using comparison data of known factorial structure. Psychologocial Assessment, 24(2), 282-292. doi:10.1037/a0025697
Schmitt, T. A. (2011). Current methodological considerations in exploratory and confirmatory factor analysis. Journal of Psychoeducational Assessment, 29(4), 304-321. doi:10.1177/0734282911406653

Rita Rueff-Lopes · Answer

In factor analysis normally you already have a model where the objective is to predict observed variables from theoretical latent factors whereas in principal component analysis the objective is to extract linear composites of observed variables.

Raid Amin · Answer

Thank you Rita, Thank you Farhat.
If you think that &#x22;there may be some underlying theoretical relationship&#x22;, but you are unsure of it, would you still choose Factor Analysis of PCA?
Say, you suspect that certain cancer rates are somehow associated with air pollution. Could you use a FA model where you &#x22;throw in&#x22; all variables, with the goal to see if the cancer variables somehow appear in certain factors with air pollution?

Cyril Iaconelli · Answer

Dear Raid,
I would say that FA is more for the determination underlying&#xA0;variables which explains why two other variables are correlated. While PCA is more on the distribution&#xA0;of individuals explained by principal component (i.e. by correlation between factors).
I would say that the choice depends on what you are the most interested factors or individual.
Finally I found that the PCA of the package FactoMineR (in R) is&#xA0;the best compromise for multivariate analysis:
http://factominer.free.fr/classical-methods/principal-components-analysis.html
Best regards,
Cyril

Aleksandar Savi&#x107; · Answer

You didn&#x27;t wrote what kind of data you have.
For example, my data are mainly spectroscopic, thus always check physical meaning of extracted components (factor). (option for SPSS: check, scores, save as variables).
In other cases, look up the percentage of explained variance higher is (sometmes) better. For example when apply high kappa for promax in case of fluoresnce emission spectral components become &#x22;over-fitted&#x22; and gaining hiht percentage.&#xA0;
Also check if something is changing in qualitative meaning when changing the methods. Up to date, only once or twice I got different grouping of variables (some HPCL data) applying PCA and FA (with all possible options).&#xA0;

Yoilan Fimia-Le&#xF3;n · Answer

Principal components analysis is only a data reduction method. It was common many decades ago when computers were slow. I know it is the default method in many statistical applications but factor analysis seems to be superior.
You can take a look to the following article where more information about this technique is provided:
Costello, A. B., &#x26; Osborne, J. W. (2005). Best Practices in Exploratory Factor Analysis: Four Recommendations for Getting the Most From Your Analysis. Practical Assessment, Research &#x26; Evaluation, 10(7). Retrieved from http://pareonline.net/getvn.asp?v=10&#x26;n=7
If you need further guidance don&#x27;t hesitate to contact me

Raid Amin · Answer

My main interest in factor analysis is to study relationships between several types of diseases in the population, and how such variables are related to other variables from different fields. Then I aim to output factor scores and use those in a cluster analysis. Can this also be done with PCA?

Cyril Iaconelli · Answer

I would say that PCA and FA are not a (good) tools to find correlation between different variables.
This analysis try to explain several variables in one factor (or component).
For sure, two variables explaining the same factor (or component) should be correlated, but I don&#x27;t think that is the aim of this kind of analysis.
Maybe a simple correlation matrix would help you better than those analysis ? (please find the link below to compute a correlation matrix on R)
Regards
http://www.statmethods.net/stats/correlations.html

Raid Amin · Answer

Correlation analysis is not what I want here. I want to do a cluster analysis on factor scores.

Raid Amin · Answer

Thank you for the detailed answer here, &#xA0;Linda. I posted it to my class.

Linda Sanner · Answer

Happy to help, Raid.

Rogelio Ladr&#xF3;n de Guevara Cort&#xE9;s · Answer

They are actually different tehcniques based on different assumptions and used for different objectives. PCA is only a geometric or statistical trasnformation of data in order to get &#xA0;new&#xA0;synthetic variables, while FA suppose a model with some assumptions about the data generation. I can provide you the link to a publication where we compare both techniques in the&#xA0;financial context. I hope this helps
Estimation of the underlying structure of systematic risk wi...

Raid Amin · Answer

Thank you very much, Rogelio. I will read the article.

Rogelio Ladr&#xF3;n de Guevara Cort&#xE9;s · Answer

My pleasure Dr. Amin, best regards.

Nisha Arora · Answer

&#xA0;A more detailed discussion on the comparision between the two can be seen here:
http://stats.stackexchange.com/questions/1576/what-are-the-differences-between-factor-analysis-and-principal-component-analysi

Deleted profile · Answer

The decision of whether to use EFA or PCA can only be made when the goals of a study are clearly known and specified.
If the goal of a study is to obtain linear composites of observed variables that retain as much variance as possible, then PCA is the correct procedure.
On the other hand, if the goal is to determine interpretable constructs that maximally explain Covariances among a set of observed variables, then EFA is the correct procedure.
Source Byrne, B. M. (2005, P.28). Factor analytic models: Viewing the structure of an assessment instrument from three perspectives. Journal of personality assessment, 85(1), 17-32.
Factor Analytic Models: Viewing the Structure of an Assessme...

Deleted profile · Answer

Please Follow the Link&#xA0;
https://www.researchgate.net/post/EFA_or_CFA
https://www.researchgate.net/post/Factor_analysis_Vs_PCA
https://jalt.org/test/PDF/Brown29.pdf
http://activisiongamescience.github.io/2016/02/09/Principal-Component-Analysis-vs-Exploratory-Factor-Analysis/
http://www.theanalysisfactor.com/the-fundamental-difference-between-principal-component-analysis-and-factor-analysis/
http://psych.wisc.edu/henriques/pca.html
http://support.minitab.com/en-us/minitab/17/topic-library/modeling-statistics/multivariate/principal-components-and-factor-analysis/differences-between-pca-and-factor-analysis/
https://stats.stackexchange.com/questions/1576/what-are-the-differences-between-factor-analysis-and-principal-component-analysi

Raid Amin · Answer

Thank you all for your valuable input to this thread. It shows many &#x22;reads&#x22; by many people so far, so this question may have been in place.

Sarah Coriat · Answer

Thank you for asking this question in 2014 as I am having the same one today in 2017 ;-).&#xA0;

Raid Amin · Answer

Hi Sarah,
Just by looking at the many counts of people who have viewed the responses to my question could be an indication that this topic is still not taught well (or not understood well).

Raid Amin · Answer

Thank you for your insightful contribution above, Paul.&#xA0;

Raid Amin · Answer

I have been encountering some interesting challenges when using factor analysis. If as a first step, we obtain a factor analysis, and then we output factor scores from the first few factors, then how do we use the factor scores in a cluster analysis in the next step?&#xA0;
Specifically: Let&#x27;s say that we want to use the factor scores from Factor 1. The loadings are large for some variables and small for other variables; some are positive and some are negative. What will we &#x22;see&#x22; contained in the factor scores for Factor 1? If we will use the factor scores in a cluster analysis that can identify High clusters and also Low clusters, what exactly do such clusters mean, as realted to the original Factor 1 variables?
Has anyone here done such a cluster analysis? Please share with us your thoughts.

Raid Amin · Answer

Thank you for your detailed response to my question, Paul. While FA is widely taught by Psychology Departments, it is less often found in statistics programs. &#xA0;

Timothy A Ebert · Answer

Neither are taught in entomology departments. Of course there aren&#x27;t many options when your crowning achievement is 4 replicates.

There was a class at UC Davis in the late 80&#x27;s in multivariate analysis that was required as part of getting a Minor in that subject. I know we went over PCA, but maybe not FA. I don&#x27;t remember what textbook we used. The next encounter was about 5 years later when I had a large data set for my Ph.D.. I spent many happy hours stuffing my data through most of the procedures in the SAS-Stat user manual.

Hamza Bouguerra · Answer

Thank you all for your relevant answers.

How can I decide between using principal components analysis versus factor analysis?

Most recent answer

Popular answers (1)

Top contributors to discussions in this field

All Answers (29)

Similar questions and discussions

Related Publications

Related Publications

Creating Scales from Questionnaires: PROC VARCLUS vs. Factor Analysis

Statistical Approaches in Literature: An Application of Principal Component Analysis and Factor Analysis to Analyze the Different Arrangements about the Quran's Suras
Preprint
Full-text available
Nov 2021

A comparative study on principal component analysis and factor analysis for the formation of association rule in data mining domain
Conference Paper
Full-text available
May 2014