Chongming Yang
Research Support Center
FHSS College
Mixture of Distributions
Classification Techniques
Latent Class Analysis (categorical indicators)
Latent Profile Analysis (continuous indicators)
Finite Mixture Modeling (multivariate normal variables)
Disadvantages of Multistep Practice
Multistep practice
Run classification model
Save membership variable
Model the saved membership variable together with other variables
Disadvantages
Biases in parameter estimates
Biases in standard errors
Significance
Confidence Intervals
Aim
Identify heterogeneous classes/groups
Estimate class probabilities
Identify good indicators of classes
Relate covariates to Classes
Probabilistic Model
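Assumption: Conditional independence of the indicators u within each class, so that their interdependence is explained by the latent class variable C, much as a factor explains item correlations in a factor analysis model
Probability of an item response pattern:
$$P(u_1, u_2, \ldots, u_r) = \sum_{k=1}^{K} P(c = k)\, P(u_1 \mid c = k)\, P(u_2 \mid c = k) \cdots P(u_r \mid c = k)$$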
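The conditional independence assumption can be illustrated with a minimal sketch for binary items: within each class the item probabilities are multiplied, and the class-specific products are then weighted by the class probabilities and summed. The function name, argument layout, and all numeric values are hypothetical and chosen only for illustration.

```python
from math import prod


def pattern_probability(pattern, class_probs, item_probs):
    """Probability of one binary response pattern under an LCA model.

    pattern:     list of 0/1 responses, one per item
    class_probs: class_probs[k] = P(c = k)
    item_probs:  item_probs[k][j] = P(u_j = 1 | c = k)
    """
    total = 0.0
    for pk, probs in zip(class_probs, item_probs):
        # Conditional independence: multiply item probabilities within a class.
        within_class = prod(p if u == 1 else 1.0 - p for u, p in zip(pattern, probs))
        total += pk * within_class
    return total


# Hypothetical two-class, three-item model (values are made up).
class_probs = [0.6, 0.4]
item_probs = [[0.9, 0.8, 0.7], [0.2, 0.1, 0.3]]
p = pattern_probability([1, 1, 0], class_probs, item_probs)
```

Summing this probability over all possible response patterns gives 1, which is a quick sanity check on any set of parameter values.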
LCA Parameters
Number of Classes -1
Item Probabilities -1
Logit Scale
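Mean = 0
$$P(c_{ik} = 1 \mid x_i) = \frac{e^{\alpha_{ck} + \gamma_{ck} x_i}}{\sum_{j=1}^{J} e^{\alpha_{cj} + \gamma_{cj} x_i}}$$
where \alpha_{ck} is the intercept and \gamma_{ck} the slope on covariate x_i for class k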
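As a rough illustration of the logit-scale parameterization, the sketch below turns class intercepts and covariate slopes into class probabilities with a multinomial logistic (softmax) function. It assumes a single covariate and a reference class whose intercept and slope are fixed at 0 for identification. The function name and the numeric values are hypothetical.

```python
from math import exp


def class_probabilities(x, intercepts, slopes):
    """Multinomial logistic class probabilities for one covariate value x.

    intercepts[k] and slopes[k] are the logit-scale parameters of class k.
    Assumption: the reference class is given intercept 0 and slope 0.
    """
    logits = [a + g * x for a, g in zip(intercepts, slopes)]
    largest = max(logits)  # subtract the maximum for numerical stability
    weights = [exp(value - largest) for value in logits]
    total = sum(weights)
    return [w / total for w in weights]


# Hypothetical two-class model; the second class is the reference class.
probs = class_probabilities(x=1.0, intercepts=[0.5, 0.0], slopes=[1.0, 0.0])
```

With two classes this reduces to ordinary logistic regression of class membership on x.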
Posterior Probability
(membership/classification of cases)
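Posterior class probabilities follow from Bayes' rule: the prior class probability times the class-specific probability of the observed responses, divided by the sum of that product over classes. The sketch below assumes binary items and known parameter values. Its names and numbers are hypothetical.

```python
from math import prod


def posterior_probabilities(pattern, class_probs, item_probs):
    """P(c = k | u) for one case, given binary responses in `pattern`."""
    joint = [
        pk * prod(p if u == 1 else 1.0 - p for u, p in zip(pattern, probs))
        for pk, probs in zip(class_probs, item_probs)
    ]
    total = sum(joint)
    return [value / total for value in joint]


# Hypothetical two-class, three-item model (values are made up).
post = posterior_probabilities(
    [1, 1, 0],
    class_probs=[0.6, 0.4],
    item_probs=[[0.9, 0.8, 0.7], [0.2, 0.1, 0.3]],
)
# Modal assignment: the class with the largest posterior probability.
assigned_class = max(range(len(post)), key=post.__getitem__)
```

Modal assignment discards the uncertainty in `post`, which is one source of the biases that arise when a saved membership variable is analyzed in a separate step.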
Estimation
Maximum likelihood estimation via the Expectation-Maximization (EM) algorithm
E (expectation) step: compute average posterior probabilities for each class and item
M (maximization) step: estimate class and item parameters
Iterate EM to maximize the likelihood of the parameters
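The E and M steps above can be sketched for binary items as follows. This is a bare-bones illustration, not the routine used by any particular software package: it uses a single random start, a fixed number of iterations, and a small clamp to keep probabilities away from 0 and 1. The function name, the defaults, and the toy data are all hypothetical.

```python
import random
from math import log, prod


def em_lca(data, n_classes, n_iter=200, seed=0):
    """Fit a latent class model to rows of 0/1 responses with plain EM."""
    rng = random.Random(seed)
    n_obs, n_items = len(data), len(data[0])
    class_probs = [1.0 / n_classes] * n_classes
    item_probs = [[rng.uniform(0.25, 0.75) for _ in range(n_items)]
                  for _ in range(n_classes)]
    loglik_trace = []
    for _ in range(n_iter):
        # E step: posterior class probabilities for every case.
        posteriors, loglik = [], 0.0
        for row in data:
            joint = [pk * prod(p if u == 1 else 1.0 - p for u, p in zip(row, probs))
                     for pk, probs in zip(class_probs, item_probs)]
            total = sum(joint)
            loglik += log(total)
            posteriors.append([value / total for value in joint])
        loglik_trace.append(loglik)
        # M step: update class and item probabilities from posterior weights.
        for k in range(n_classes):
            weight = max(sum(post[k] for post in posteriors), 1e-12)
            class_probs[k] = weight / n_obs
            for j in range(n_items):
                endorsed = sum(post[k] * row[j] for post, row in zip(posteriors, data))
                # Clamp so later products and logs stay finite.
                item_probs[k][j] = min(max(endorsed / weight, 1e-6), 1.0 - 1e-6)
    return class_probs, item_probs, loglik_trace


# Made-up toy data: two rough response clusters on four binary items.
data = ([[1, 1, 1, 1]] * 30 + [[1, 1, 1, 0]] * 5 + [[1, 0, 1, 1]] * 5
        + [[0, 0, 0, 0]] * 20 + [[0, 0, 1, 0]] * 5 + [[0, 1, 0, 0]] * 5)
class_probs, item_probs, loglik_trace = em_lca(data, n_classes=2)
```

In practice several random starts are advisable, because EM can stop at a local maximum of the likelihood.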
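Pearson chi-square
$$\chi^2 = \sum \frac{(o - e)^2}{e}$$
Chi-square based on likelihood ratio
$$\chi^2_{LR} = 2 \sum o \log(o / e)$$
where o and e are the observed and model-expected frequencies of each response pattern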
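Both fit statistics compare observed response-pattern frequencies with the frequencies expected under the model. A minimal sketch follows, with made-up frequencies and hypothetical function names. Patterns with an observed count of 0 contribute nothing to the likelihood-ratio sum.

```python
from math import log


def pearson_chi2(observed, expected):
    """Pearson chi-square: sum of (o - e)^2 / e over response patterns."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


def lr_chi2(observed, expected):
    """Likelihood-ratio chi-square: 2 * sum of o * log(o / e)."""
    return 2.0 * sum(o * log(o / e) for o, e in zip(observed, expected) if o > 0)


# Made-up observed and model-expected frequencies for four response patterns.
observed = [30, 10, 10, 50]
expected = [25.0, 15.0, 15.0, 45.0]
x2 = pearson_chi2(observed, expected)
g2 = lr_chi2(observed, expected)
```

When many response patterns have small expected frequencies, the two statistics can disagree noticeably and neither is well approximated by a chi-square distribution.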
Determine Number of Classes
Substantive theory (parsimonious, interpretable)
Predictive validity
Auxiliary variables / covariates
Statistical information and tests
Bayesian Information Criterion (BIC)
Entropy
Testing K against K-1 Classes
Vuong-Lo-Mendell-Rubin likelihood-ratio test
Bootstrapped likelihood ratio test
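As an illustration of the BIC comparison, the sketch below counts free parameters for an LCA with binary items (class probabilities plus item probabilities) and computes BIC = -2 log L + p log(n). The log-likelihood values in the example are invented. In real use they would come from fitted models with different numbers of classes. Function names are hypothetical.

```python
from math import log


def lca_free_parameters(n_classes, n_items):
    """Free parameters with binary items: (K - 1) class probabilities + K * r item probabilities."""
    return (n_classes - 1) + n_classes * n_items


def bic(loglik, n_params, n_obs):
    """Bayesian Information Criterion; lower values indicate a better trade-off."""
    return -2.0 * loglik + n_params * log(n_obs)


# Invented log-likelihoods for 1- to 3-class models, 4 binary items, 500 cases.
logliks = {1: -1400.0, 2: -1310.0, 3: -1305.0}
bics = {k: bic(ll, lca_free_parameters(k, 4), 500) for k, ll in logliks.items()}
best_k = min(bics, key=bics.get)
```

Here the three-class model improves the log-likelihood only slightly, so the penalty for its extra parameters leaves the two-class model with the lowest BIC.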
Quality of Classification
Entropy
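A commonly reported version of this index is the relative entropy, which rescales the entropy of the posterior class probabilities so that 1 means perfectly separated classes and 0 means classification no better than chance. The sketch below assumes that definition and takes a list of posterior probability rows, one per case. The function name and example values are hypothetical.

```python
from math import log


def relative_entropy(posteriors):
    """1 - sum_i sum_k(-p_ik * log p_ik) / (n * log K), for n cases and K classes."""
    n_obs, n_classes = len(posteriors), len(posteriors[0])
    uncertainty = sum(-p * log(p) for row in posteriors for p in row if p > 0)
    return 1.0 - uncertainty / (n_obs * log(n_classes))


# Made-up posterior probabilities for four cases and two classes.
posteriors = [[0.95, 0.05], [0.90, 0.10], [0.10, 0.90], [0.50, 0.50]]
entropy = relative_entropy(posteriors)
```

The last case, with posterior probabilities of 0.5 and 0.5, is the one that pulls the index down the most.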
Determine Quality of Indicators
Good indicators
Item response probability is close to 0 or 1 in each class
Bad indicators
Item response probability is high in more than one class, like cross-loading in factor analysis
Item response probability is low in all classes, like low loading in factor analysis
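The two "bad indicator" patterns above can be screened for mechanically once item response probabilities have been estimated. The sketch below is only a rough aid: the cutoffs of 0.7 for "high" and 0.3 for "low" are arbitrary assumptions, not values from the slides, and the function name and example probabilities are hypothetical.

```python
def flag_indicators(item_probs, high=0.7, low=0.3):
    """Flag items whose response probabilities fail to separate the classes.

    item_probs[k][j] = P(u_j = 1 | c = k). Cutoffs are illustrative assumptions.
    """
    n_items = len(item_probs[0])
    flags = []
    for j in range(n_items):
        column = [probs[j] for probs in item_probs]
        if sum(p >= high for p in column) > 1:
            flags.append("high in more than one class")
        elif all(p <= low for p in column):
            flags.append("low in all classes")
        else:
            flags.append("no flag")
    return flags


# Made-up estimates for two classes and three items.
item_probs = [[0.9, 0.80, 0.1],
              [0.1, 0.85, 0.2]]
flags = flag_indicators(item_probs)
```

An item with "no flag" is not automatically a good indicator. Probabilities near 0.5 in every class, for example, pass this screen but still discriminate poorly.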
LCA Examples
LCA
LCA with covariates
Class predicts a categorical outcome
Parental Acceptance
Feel people in your family understand you
Feel you want to leave home
Mixture SEM
See mixture growth modeling