= MNIST database
{c}
{title2=1998}
{wiki}
= MNIST
{c}
{synonym}
70,000 28x28 grayscale images of hand-written digits 0-9, i.e. 10 categories, split into 60,000 training images and 10,000 test images.
This is THE "<OG>" <computer vision dataset>.
Playing with it is the de-facto <computer vision> <hello world>.
But it is important to note that as of the 2010s, the benchmark had become too easy for many applications.
The dataset can be downloaded from http://yann.lecun.com/exdb/mnist/[]:
``
wget \
http://yann.lecun.com/exdb/mnist/train-images-idx3-ubyte.gz \
http://yann.lecun.com/exdb/mnist/train-labels-idx1-ubyte.gz \
http://yann.lecun.com/exdb/mnist/t10k-images-idx3-ubyte.gz \
http://yann.lecun.com/exdb/mnist/t10k-labels-idx1-ubyte.gz
``
but doing so is kind of pointless, as the four files use some crazy custom binary format that packs all images, or all labels, into a single file each. OMG!
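That binary format (IDX, as documented on the download page) is simple enough to read by hand: a 4-byte magic number whose third byte is the data type code and whose fourth byte is the number of dimensions, then one big-endian 32-bit size per dimension, then the raw bytes. The following is a minimal sketch using only the Python standard library. The helper names `parse_idx`, `load_idx_gz` and `image_rows` are made up for illustration, and it assumes unsigned byte data (type code `0x08`), which is what the MNIST files use.

```python
import gzip
import struct


def parse_idx(raw):
    """Parse an unsigned-byte IDX buffer into (shape, flat_data)."""
    # Header: two zero bytes, a data type code, the number of dimensions.
    zero, dtype, ndim = struct.unpack(">HBB", raw[:4])
    if zero != 0 or dtype != 0x08:
        raise ValueError("not an unsigned-byte IDX buffer")
    # Each dimension size is a 4-byte big-endian unsigned integer.
    header_end = 4 + 4 * ndim
    shape = struct.unpack(f">{ndim}I", raw[4:header_end])
    data = raw[header_end:]
    # Sanity check: the payload must hold exactly prod(shape) bytes.
    expected = 1
    for size in shape:
        expected *= size
    if len(data) != expected:
        raise ValueError(f"expected {expected} data bytes, got {len(data)}")
    return shape, data


def load_idx_gz(path):
    """Read a gzip-compressed IDX file such as the ones downloaded above."""
    with gzip.open(path, "rb") as f:
        return parse_idx(f.read())


def image_rows(shape, data, index):
    """Return image number `index` as a list of rows of pixel values 0-255."""
    _count, rows, cols = shape
    start = index * rows * cols
    return [
        list(data[start + r * cols:start + (r + 1) * cols])
        for r in range(rows)
    ]


# Example usage, assuming the files were downloaded to the current directory:
#   shape, pixels = load_idx_gz("train-images-idx3-ubyte.gz")
#   _, labels = load_idx_gz("train-labels-idx1-ubyte.gz")
#   first_image = image_rows(shape, pixels, 0)
#   first_label = labels[0]
```

For the training images the shape should come out as `(60000, 28, 28)`, and for the training labels as `(60000,)`, with one label byte per image.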
OK-ish data explorer: https://knowyourdata-tfds.withgoogle.com/#tab=STATS&dataset=mnist
\Image[http://web.archive.org/web/20230430064700im_/https://i.stack.imgur.com/7q9Zg.png]
{title=<MNIST> image 1 of a '0'}
\Image[http://web.archive.org/web/20230430064700im_/https://i.stack.imgur.com/RemMm.png]
{title=<MNIST> image 21 of a '0'}
\Image[http://web.archive.org/web/20230430064700im_/https://i.stack.imgur.com/qoTGE.png]
{title=<MNIST> image 3 of a '1'}