Knowledge and Semantics from Visual Data

Semantic Scene Understanding, Fine-Grained Image Captioning, Visual Question Answering, Scene Graphs