When humans see an image, their brain can easily tell what the image is about, but a computer cannot do it easily.
Computer vision researchers worked on this a lot and they considered it impossible until now! With the advancement in Deep
learning techniques, availability of huge datasets and computer power, we can build models that can generate captions for an
image. Image Caption Generator is a popular research area of Deep Learning that deals with image understanding and a
language description for that image. Generating well-formed sentences requires both syntactic and semantic understanding of
the language.