Large-Scale Mining And Retrieval Of Visual Data In A Multimodal Context