iV&L-MM '16: Proceedings of the 2016 ACM workshop on Vision and Language Integration Meets Multimedia Fusion
SESSION: Paper Session 1
- Tinne Tuytelaars
Exploiting Scene Context for Image Captioning
- Rakshith Shetty
- Hamed R.-Tavakoli
- Jorma Laaksonen
SESSION: Paper Session 2
- Kate Saenko
News Event Understanding by Mining Latent Factors From Multimodal Tensors
- Chun-Yu Tsai
- Ruilin Xu
- Robert E. Colgan
- John R. Kender
Cross-modal Classification by Completing Unimodal Representations
- Thi Quynh Nhi Tran
- Hervé Le Borgne
- Michel Crucianu
Semantic Indexing of Wearable Camera Images: Kids'Cam Concepts
- Alan F. Smeaton
- Kevin McGuinness
- Cathal Gurrin
- Jiang Zhou
- Noel E. O'Connor
- Peng Wang
- Brian Davis
- Lucas Azevedo
- Andre Freitas
- Louise Signal
- Moira Smith
- James Stanley
- Michelle Barr
- Tim Chambers
- Cliona Ní Mhurchu
SESSION: Keynote 2
- Katerina Pastra
Jointly Representing Images and Text: Dependency Graphs, Word Senses, and Multimodal Embeddings
- Frank Keller
SESSION: Paper Session 3
- Stephanie Weirich
Multimodal and Crossmodal Representation Learning from Textual and Visual Features with Bidirectional Deep Neural Networks for Video Hyperlinking
- Vedran Vukotić
- Christian Raymond
- Guillaume Gravier
User Video Summarization Based on Joint Visual and Semantic Affinity Graph
- Zhuo Lei
- Ke Sun
- Qian Zhang
- Guoping Qiu
Disinformation in Multimedia Annotation: Misleading Metadata Detection on YouTube
- Payal Bajaj
- Mridul Kavidayal
- Priyanshu Srivastava
- Md Nadeem Akhtar
- Ponnurangam Kumaraguru
SESSION: Keynote 3
- Marie-Francine Moens
Beyond Language and Vision, Towards Truly Multimedia Integration
- Tat-Seng Chua