日韩久久久精品,亚洲精品久久久久久久久久久,亚洲欧美一区二区三区国产精品 ,一区二区福利

Weka EM 協方差

系統 2002 0

Weka EM covariance

description 1:

Dear All,

??? I am trying to find out what is the real meaning of the minStdDev parameter in the EM clustering algorithm. Can anyone help me?

??? I have not looked at the code, but I suspect that the minStdDev is used as the first estimate of the covariance of a Gaussian in the mixture?model. Am I correct?

??? I have found the equations or perhaps similar equations to the ones used to calculate the parameters for a Gaussian mixture model in the EM algorithm and there are three, which have these functions:

??? The first one calculates the probability of each Gaussian.
??? The second calculates the mean of each Gaussian
??? The third calculates the covariance matrix of each Gaussian

??? But this means to start off with there has to be an initial guess at the parameters for the Gaussian mixture model ie the probability or weighting factor for each Gaussian is needed, as is the mean and Covariance matrix.

???? If I am wrong how is the EM algorithm initiated ie how is the initial guess at the mixture model arrived at? Does minStdDev have any part to play in it? Also is a full covariance matrix calculated in the EM algorithm or are just the standard deviations or variances calculated, ie are right elliptical Gaussians used?

???? I am guessing that the random number generator is used to pick one or more data points at random as initial values for the means.

???? This question really follows up on my previous postings about differences between Mac and PC using the EM algorithm and worries about the stability of the algorithm. I was (naively) using the default value of 1.0E-6. However after a reply to a previous posting I have tried scaling the data to be between -1 and +1 and alsozero mean and unit SD. When I try these scaled data sets Mac and PC produce the same result. So I realised that ought to think about the value of minStdDev.?

????? Many thanks for your help in advance.

John Black

description 2:

EM in java is a naive implementation. That is, it treats each ?
attribute independently of the others given the cluster (much the same ?
as naive Bayes for classification). Therefore, a full covariance ?
matrix is not computed, just the means and standard deviations of each ?
numeric attribute.

The minStdDev parameter is there simply to help prevent numerical ?
problems. This can be a problem when multiplying large densities ?
(arising from small standard deviations) when there are many singleton ?
or near-singleton values. The standard deviation for a given attribute ?
will not be allowed to be less than the minStdDev value.

EM is initialized with the best result out of 10 executions of ?
SimpleKMeans (with different seed values).

Hope this helps.

Cheers,
Mark.

Weka EM 協方差


更多文章、技術交流、商務合作、聯系博主

微信掃碼或搜索:z360901061

微信掃一掃加我為好友

QQ號聯系: 360901061

您的支持是博主寫作最大的動力,如果您喜歡我的文章,感覺我的文章對您有幫助,請用微信掃描下面二維碼支持博主2元、5元、10元、20元等您想捐的金額吧,狠狠點擊下面給點支持吧,站長非常感激您!手機微信長按不能支付解決辦法:請將微信支付二維碼保存到相冊,切換到微信,然后點擊微信右上角掃一掃功能,選擇支付二維碼完成支付。

【本文對您有幫助就好】

您的支持是博主寫作最大的動力,如果您喜歡我的文章,感覺我的文章對您有幫助,請用微信掃描上面二維碼支持博主2元、5元、10元、自定義金額等您想捐的金額吧,站長會非常 感謝您的哦!!!

發表我的評論
最新評論 總共0條評論
主站蜘蛛池模板: 肇源县| 广安市| 乌什县| 镇坪县| 古丈县| 鹿邑县| 新和县| 惠州市| 元江| 伊宁市| 楚雄市| 岐山县| 福州市| 漾濞| 镇雄县| 枝江市| 麻栗坡县| 枞阳县| 武功县| 靖江市| 施甸县| 隆德县| 沂南县| 尚义县| 竹北市| 嵊泗县| 察雅县| 昂仁县| 万山特区| 丹东市| 密山市| 江西省| 松滋市| 北流市| 晋州市| 禄劝| 鄂尔多斯市| 克什克腾旗| 宁安市| 怀安县| 南投县|