iV&L-MM '16: Proceedings of the 2016 ACM workshop on Vision and Language Integration Meets Multimedia Fusion


SESSION: Paper Session 1

  • Tinne Tuytelaars

Exploiting Scene Context for Image Captioning

  • Rakshith Shetty
  • Hamed R.-Tavakoli
  • Jorma Laaksonen

SESSION: Paper Session 2

  • Kate Saenko

News Event Understanding by Mining Latent Factors From Multimodal Tensors

  • Chun-Yu Tsai
  • Ruilin Xu
  • Robert E. Colgan
  • John R. Kender

Cross-modal Classification by Completing Unimodal Representations

  • Thi Quynh Nhi Tran
  • Hervé Le Borgne
  • Michel Crucianu

Semantic Indexing of Wearable Camera Images: Kids'Cam Concepts

  • Alan F. Smeaton
  • Kevin McGuinness
  • Cathal Gurrin
  • Jiang Zhou
  • Noel E. O'Connor
  • Peng Wang
  • Brian Davis
  • Lucas Azevedo
  • Andre Freitas
  • Louise Signal
  • Moira Smith
  • James Stanley
  • Michelle Barr
  • Tim Chambers
  • Cliona Ní Mhurchu

SESSION: Keynote 2

  • Katerina Pastra

Jointly Representing Images and Text: Dependency Graphs, Word Senses, and Multimodal Embeddings

  • Frank Keller

SESSION: Paper Session 3

  • Stephanie Weirich

Multimodal and Crossmodal Representation Learning from Textual and Visual Features with Bidirectional Deep Neural Networks for Video Hyperlinking

  • Vedran Vukotić
  • Christian Raymond
  • Guillaume Gravier

User Video Summarization Based on Joint Visual and Semantic Affinity Graph

  • Zhuo Lei
  • Ke Sun
  • Qian Zhang
  • Guoping Qiu

Disinformation in Multimedia Annotation: Misleading Metadata Detection on YouTube

  • Payal Bajaj
  • Mridul Kavidayal
  • Priyanshu Srivastava
  • Md Nadeem Akhtar
  • Ponnurangam Kumaraguru

SESSION: Keynote 3

  • Marie-Francine Moens

Beyond Language and Vision, Towards Truly Multimedia Integration

  • Tat-Seng Chua