DR.GEEK

The Second wave of neural networks

(19th-April-2021)


• Another major accomplishment of the connectionist movement was the successful use of back-propagation to train deep neural networks with internal representations, along with the popularization of the back-propagation algorithm (Rumelhart et al., 1986a; LeCun, 1987). The algorithm has waxed and waned in popularity, but as of this writing it is the dominant approach to training deep models; a minimal sketch of the idea follows this list.

• The second wave of neural networks research lasted until the mid-1990s. Ventures based on neural networks and other AI technologies began to make unrealistically ambitious claims while seeking investments.

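To make the idea concrete, here is a minimal back-propagation sketch on the XOR problem, written in NumPy. The network shape (one hidden layer of four tanh units), the learning rate, and the number of training steps are illustrative assumptions, not details from the cited papers.

```python
import numpy as np

# Toy XOR data: a problem a single-layer perceptron cannot solve
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)

rng = np.random.default_rng(0)
W1 = rng.normal(scale=0.5, size=(2, 4))   # input -> hidden weights
b1 = np.zeros(4)
W2 = rng.normal(scale=0.5, size=(4, 1))   # hidden -> output weights
b2 = np.zeros(1)

sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))
lr = 0.5

for step in range(5000):
    # Forward pass: hidden representation, then prediction
    h = np.tanh(X @ W1 + b1)
    p = sigmoid(h @ W2 + b2)

    # Backward pass: push the error gradient from the output back through each layer
    d_p = (p - y) / len(X)                 # gradient of mean squared error w.r.t. p
    d_z2 = d_p * p * (1 - p)               # through the output sigmoid
    d_W2, d_b2 = h.T @ d_z2, d_z2.sum(axis=0)
    d_h = d_z2 @ W2.T                      # propagate to the hidden layer
    d_z1 = d_h * (1 - h ** 2)              # through tanh
    d_W1, d_b1 = X.T @ d_z1, d_z1.sum(axis=0)

    # Gradient-descent update
    W1 -= lr * d_W1; b1 -= lr * d_b1
    W2 -= lr * d_W2; b2 -= lr * d_b2

print(np.round(p, 2))  # predictions should move toward [0, 1, 1, 0]
```

The same forward/backward pattern scales to the deep models discussed below; modern frameworks simply automate the gradient bookkeeping shown here by hand.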

The third wave of neural networks

• The third wave of neural networks research began with a breakthrough in 2006. Geoffrey Hinton showed that a kind of neural network called a deep belief network could be efficiently trained using a strategy called greedy layer-wise pretraining (Hinton et al., 2006).

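As a rough illustration of the greedy layer-wise idea, the sketch below pretrains a stack of layers one at a time, training each layer as a small autoencoder on the codes produced by the layer beneath it. Note the hedge: this substitutes simple tied-weight autoencoders for the restricted Boltzmann machines used in Hinton's deep belief networks, and the layer sizes, learning rate, and random data are purely illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

def train_autoencoder(X, n_hidden, lr=0.1, steps=200):
    """Train a one-hidden-layer autoencoder on X; return the encoder weights."""
    n_in = X.shape[1]
    W = rng.normal(scale=0.1, size=(n_in, n_hidden))
    b = np.zeros(n_hidden)   # encoder bias
    c = np.zeros(n_in)       # decoder bias
    for _ in range(steps):
        h = sigmoid(X @ W + b)        # encode
        r = sigmoid(h @ W.T + c)      # decode with tied weights
        d_r = (r - X) / len(X)        # gradient of mean reconstruction error
        d_zr = d_r * r * (1 - r)      # through the decoder sigmoid
        d_h = d_zr @ W
        d_zh = d_h * h * (1 - h)      # through the encoder sigmoid
        grad_W = X.T @ d_zh + d_zr.T @ h   # tied weights: encoder + decoder terms
        W -= lr * grad_W
        b -= lr * d_zh.sum(axis=0)
        c -= lr * d_zr.sum(axis=0)
    return W, b

def greedy_pretrain(X, layer_sizes):
    """Pretrain layers one at a time, each on the previous layer's codes."""
    params, codes = [], X
    for n_hidden in layer_sizes:
        W, b = train_autoencoder(codes, n_hidden)
        params.append((W, b))
        codes = sigmoid(codes @ W + b)   # feed the codes to the next layer
    return params

X = rng.random((256, 32))                # stand-in for real unlabeled data
stack = greedy_pretrain(X, [16, 8, 4])   # weights used to initialize a deep net
```

The pretrained weights would then initialize a deep network that is fine-tuned with ordinary back-propagation, which was the key trick that made deep models trainable in 2006.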

At present time

• The third wave began with a focus on new unsupervised learning techniques and the ability of deep models to generalize well from small datasets, but today there is more interest in much older supervised learning algorithms and the ability of deep models to leverage large labeled datasets.


Increasing Dataset Sizes

• One may wonder why deep learning has only recently become recognized as a crucial technology, even though the first experiments with artificial neural networks were conducted in the 1950s. Deep learning has been used successfully in commercial applications since the 1990s, but until recently it was often regarded as more of an art than a technology, something only an expert could use. It is true that some skill is required to get good performance from a deep learning algorithm.

Fortunately, the amount of skill required decreases as the amount of training data increases. The learning algorithms reaching human performance on complex tasks today are nearly identical to the learning algorithms that struggled to solve toy problems in the 1980s, though the models we train with these algorithms have undergone changes that simplify the training of very deep architectures. The most important new development is that today we can provide these algorithms with the resources they need to succeed. Figure 1.8 shows how the size of benchmark datasets has increased remarkably over time.


Increasing Model Sizes

• Another key reason that neural networks are wildly successful today after enjoying comparatively little success since the 1980s is that we have the computational resources to run much larger models today. One of the main insights of connectionism is that animals become intelligent when many of their neurons work together.

• An individual neuron or small collection of neurons is not particularly useful.

• Biological neurons are not especially densely connected. As seen in figure 1.10, our machine learning models have had a number of connections per neuron that was within an order of magnitude of even mammalian brains for decades.






