Content-based image retrieval is the set of techniques for retrieving
relevant images from an image database on the basis of automatically-
derived image features. The need for efficient content-based image re-
trieval has increased tremendously in many application areas such as
biomedicine, the military, commerce, education, and Web image clas-
sification and searching. In the biomedical domain, content-based im-
age retrieval can be used in patient digital libraries, clinical
diagnosis, searching of 2-D electrophoresis gels, and pathology slides.
I started my work on content-based image retrieval in 1995 when I was
with Stanford University. The project was initiated by the Stan- ford
University Libraries and later funded by a research grant from the
National Science Foundation. The goal was to design and implement a
computer system capable of indexing and retrieving large collections of
digitized multimedia data available in the libraries based on the media
contents. At the time, it seemed reasonable to me that I should discover
the solution to the image retrieval problem during the project. Experi-
ence has certainly demonstrated how far we are as yet from solving this
basic problem.