潜在类别模型

在统计学中，潜在类别模型（latent class model，LCM ），简称潜类模型，将一组观察到的（通常是离散的）多变量变量与一组潜变量联系起来。LCM是一种潜变量模型。因为潜在变量是离散的，所以它被称为潜类模型。类的特征在于条件概率模式，其指示变量对特定值的可能性。

潜在类别分析 （latent class analysis， LCA ），简称潜类分析，是结构方程建模的子集，用于在多变量分类数据中查找子类型。这些子类型称为“潜类”（latent class）。

假设有如下情况，研究人员可能会选择使用LCA来理解数据：想象一下，在一系列患有X，Y和Z疾病的患者中测量了症状，并且疾病X与症状a， b和c的存在相关，疾病Y与症状为b，c，d的存在相关，以及疾病Z与症状为a，c和d的存在相关。

LCA将尝试检测潜在类别（疾病实体）的存在，从而创建与症状关联的模式。与因子分析一样，LCA也可用于根据条件的最大似然类别成员对其进行分类。 ^[1]

因为解决LCA的标准是实现潜在的类别，在这些潜在类别中，一种症状与另一种症状之间不再有任何关联（因为该类是引起其关联的疾病），以及患者所具有的一组疾病（或类别a） case是一个成员）导致症状关联，症状将是“有条件独立”，即，以类成员身份为条件，他们不再相关。 ^[1]

模型

在每一个潜类中，所观察到的变量统计独立的。这是一个重要方面。通常观察到的变量，在统计学上是相关的。通过引入的潜在可变的、独立的恢复在这个意义上，在类变数是独立的(当地的独立性的)。然后，我们说，该协会之间所观察到的变量是解释的类的潜变量(McCutcheon,1987年)。

在一种形式中，潜在类模型被写为

p_{i_{1},i_{2},\ldots ,i_{N}}\approx \sum _{t}^{T}p_{t}\,\prod _{n}^{N}p_{i_{n},t}^{n},

其中 $T$ 是的数量潜类和 $p_{t}$ 被所谓招聘或无条件的概率，应该总和。 $p_{i_{n},t}^{n}$ 是的边际或有条件的概率。

对一个双向潜在的类型，形式是

p_{ij}\approx \sum _{t}^{T}p_{t}\,p_{it}\,p_{jt}.

这种双向模型与概率潜在语义分析和非负矩阵分解有关。

^ ^1.0 ^1.1 Lazarsfeld，PF和Henry，NW（1968） 潜在结构分析 。波士顿：霍顿米夫林
^ Cheung, Kwok-Wai; Tsui, Kwok-Ching; Liu, Jiming. Extended latent class models for collaborative recommendation. IEEE Transactions on Systems, Man and Cybernetics, Part A: Systems and Humans. 2004, 34 (1): 143–148. doi:10.1109/TSMCA.2003.818877.
^ Eaves, L. J., Silberg, J. L., Hewitt, J. K., Rutter, M., Meyer, J. M., Neale, M. C., & Pickles, A. Analyzing twin resemblance in multisymptom data: genetic applications of a latent class model for symptoms of conduct disorder in juvenile boys. Behavior Genetics. 1993, 23 (1): 5–19. doi:10.1007/bf01067550.
^ Bermingham, M. L., Handel, I. G., Glass, E. J., Woolliams, J. A., de Clare Bronsvoort, B. M., McBride, S. H., Skuce, R. A., Allen, A . R., McDowell, S. W. J., & Bishop, S. C. Hui and Walter's latent-class model extended to estimate diagnostic test properties from surveillance data: a latent model for latent data. Scientific Reports. 2015, 5. doi:10.1038/srep11861.

Linda M. Collins; Stephanie T. Lanza. Latent class and latent transition analysis for the social, behavioral, and health sciences. New York: Wiley. 2010. ISBN 978-0-470-22839-5.
Allan L. McCutcheon. Latent class analysis. Quantitative Applications in the Social Sciences Series No. 64. Thousand Oaks, California: Sage Publications. 1987. ISBN 978-0-521-59451-6.
Leo A. Goodman. Exploratory latent structure analysis using both identifiable and unidentifiable models. Biometrika. 1974, 61 (2): 215–231. doi:10.1093/biomet/61.2.215.
Paul F. Lazarsfeld, Neil W. Henry. Latent Structure Analysis. 1968.

Statistical Innovations, Home Page （页面存档备份，存于互联网档案馆）, 2016. Website with latent class software (Latent GOLD 5.1), free demonstrations, tutorials, user guides, and publications for download. Also included: online courses, FAQs, and other related software.
The Methodology Center, Latent Class Analysis, a research center at Penn State, free software, FAQ
John Uebersax, Latent Class Analysis （页面存档备份，存于互联网档案馆）, 2006. A web-site with bibliography, software, links and FAQ for latent class analysis