
Jointly Training Deep Boltzmann Machines

(6th-Dec-2020)


• Classic DBMs require greedy unsupervised pretraining and, to perform classification well, a separate MLP-based classifier on top of the hidden features they extract. This approach has several undesirable properties:

• It is hard to track performance during training, because we cannot evaluate properties of the full DBM while training the first RBM. Thus, it is hard to tell how well our hyperparameters are working until quite late in the training process.

• Software implementations of DBMs need many different components: CD training of the individual RBMs, PCD training of the full DBM, and training based on back-propagation through the MLP.

• The MLP on top of the Boltzmann machine loses many of the advantages of the Boltzmann machine's probabilistic model, such as the ability to perform inference when some input values are missing.
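To make the first stage of this classic pipeline concrete, here is a minimal sketch of contrastive-divergence (CD-1) pretraining for a single binary RBM, the component that would be stacked greedily before PCD training of the full DBM. This is an illustrative NumPy example, not the authors' implementation; the function and variable names (cd1_step, n_visible, n_hidden) are assumptions made for this sketch.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def cd1_step(W, b, c, v0, lr=0.01):
    """One CD-1 update on a mini-batch v0 of binary visible vectors.

    W: (n_visible, n_hidden) weights; b: visible biases; c: hidden biases.
    Illustrative sketch of the standard CD-1 update, not the book's code.
    """
    # Positive phase: hidden probabilities given the data.
    ph0 = sigmoid(v0 @ W + c)
    h0 = (rng.random(ph0.shape) < ph0).astype(float)  # sample hidden states
    # Negative phase: one Gibbs step back to visibles, then hiddens again.
    pv1 = sigmoid(h0 @ W.T + b)
    ph1 = sigmoid(pv1 @ W + c)
    # Gradient approximation: data correlations minus model correlations.
    batch = v0.shape[0]
    W += lr * (v0.T @ ph0 - pv1.T @ ph1) / batch
    b += lr * (v0 - pv1).mean(axis=0)
    c += lr * (ph0 - ph1).mean(axis=0)
    return W, b, c

# Toy demonstration on random binary data.
n_visible, n_hidden = 6, 4
W = 0.01 * rng.standard_normal((n_visible, n_hidden))
b = np.zeros(n_visible)
c = np.zeros(n_hidden)
v = (rng.random((8, n_visible)) < 0.5).astype(float)
for _ in range(100):
    W, b, c = cd1_step(W, b, c, v)
```

Note how even this single stage uses its own training rule; a full classic DBM pipeline would additionally need PCD code for the joint model and back-propagation code for the MLP classifier, which is exactly the software complexity the paragraph above criticizes.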



