Authors: K. Selçuk Candan, Sethuraman Panchanathan, Balakrishnan Prabhakaran, Hari Sundaram, Wu-Chi Feng, Nicu Sebe, Borko Furht, Jin Li, Maria Luisa Sapino
Venue: Scottsdale, Arizona, USA
URL: http://www.acmmm11.org
General Chairs
K. Selçuk Candan, Arizona State University, AZ, USA
Sethuraman Panchanathan, Arizona State University, AZ, USA
Balakrishnan Prabhakaran, University of Texas at Dallas, TX, USA
Technical Program Chairs
Hari Sundaram, Arizona State University, AZ, USA
Wu-Chi Feng, Portland State University, OR, USA
Nicu Sebe, University of Trento, IT
Workshop Chairs
Borko Furht, Florida Atlantic University, FL, USA
Jin Li, Microsoft Research
Maria Luisa Sapino, University of Torino, IT
Introduction to ACM Multimedia 2011
We are delighted to report, on behalf of the entire organizing committee, that the 19th ACM International Conference on Multimedia, ACM Multimedia 2011 (MM'11), was held from November 28th to December 1st, 2011, in Scottsdale, Arizona, USA, to great success.
ACM Multimedia (MM) is the flagship conference of the Special Interest Group on Multimedia (SIGMM), which profiles cutting-edge scientific developments and showcases innovative industrial multimedia technologies and applications. The conference aims to promote intellectual exchanges and interactions among scientists, engineers, students, multimedia users, and artists through various events, including keynote talks from leaders in the area, oral and poster sessions focused on research challenges and solutions, workshops in up-and-coming key areas of research, technical and industrial demonstrations of prototypes and commercial products, tutorials, research and industrial panels, a doctoral symposium, mentoring events, scientific competitions (including an open source software competition and a multimedia grand challenge competition), and interactive art exhibits.
Our key motivation while organizing the MM'11 conference was to find innovative ways to design an "inclusive" conference program: lowering the barriers between various MM sub-communities, boosting the cross-fertilization of ideas among the contributors and attendees across the various MM events, and maximizing the return-on-investment for the MM'11 participants. Examples of this new approach include the following:
-
New plenary poster sessions, where all contributors (i.e., authors of long and short research papers, of workshop papers, and contributors to all other MM'11 events) are invited to share poster versions of their contributions with the rest of the MM'11 community. These plenary poster sessions allow conference participants to get a quick idea of interesting things happening in the multiple parallel sessions they cannot clone themselves to attend!
-
Workshops were aligned with the other MM'11 events, instead of being held on a separate "workshops day" where many workshop participants never got to know the main conference and vice-versa. Our aim was to integrate workshops (which were chosen, in the first place, to represent emerging topics that complement the areas covered by the main technical program) organically with the other conference events and encourage broader participation by registrants in all conference and workshop programs.
-
These innovations, aimed at eliminating barriers in the program, had to be supported by corresponding innovations in the MM'11 registration policies. Thus, we instituted an "all-in-one" registration fee structure, which covered attendance at all MM'11 events, including presentation and poster sessions, panels, demonstrations, tutorials, and workshops.
-
By keeping the overall registration fee lower than in recent years and by shaving one day off the conference program, we also reduced the overall participation cost for most of the MM'11 attendees.
A travel grants program, generously supported by the National Science Foundation (NSF) and SIGMM, also helped us lower the barrier to participation for many students; a full 30% of the registrants to MM'11 were students. A number of mentorship activities were organized at MM'11: a women's mentoring event, organized and sponsored by SIGMM; a Doctoral Symposium program, which (in addition to having regular panels and presentations as before) opened its doors to all student authors who wanted to present posters at the event; a panel on "Job Opportunities and Career Perspective for Fresh Graduates of the Multimedia Community"; and a new "vis-a-vis meeting with researchers" social event, where graduate students could meet, exchange ideas with, and receive guidance from internationally recognized researchers in their research area.
Of course, apart from the above, MM'11 also continued with programs that proved to be extremely successful in the past. We continued with the well-established and highly successful Open Source Software Competition, with special emphasis this year on instructional open source software designed for educational use in teaching multimedia-related courses at undergraduate or graduate level. As in previous years, the Multimedia Grand Challenges competition attracted challenges from many leaders of the multimedia industry, including HP, Technicolor, Nokia, Yahoo, Huawei, and 3D Life, and proposals from all over the world. A report from the Grand Challenges can be found in a separate article. Similarly, this year's industrial exhibits program, which complemented the MM technical demonstrations program, focused on cutting-edge research prototypes, including system and product demonstrations from many industrial leaders, such as IBM, FX Palo Alto Labs, Microsoft Research, Exalead, and Yacast. The panels program emphasized opportunities and challenges faced by researchers, industry, and open-source communities in multimedia and thus covered timely topics, such as "Smart Games", "Towards Synergy Between the Open Source and the Research Multimedia Communities", and "Innovating the Multimedia Experience".
We are enthused to report that MM'11 included three exciting keynote talks by three industry and academic leaders in multimedia research: Alex Pentland, Head of the MIT Human Dynamics Lab; Genevieve Bell, Director of the Interaction and Experience Research at the Intel Labs; and Arnaud Robert, Senior Vice President of Technology at The Walt Disney Studios. We are proud that MM'11 hosted the prestigious SIGMM Technical Achievement Award presentation to Prof. Shih-Fu Chang (Columbia University) and his award acceptance speech.
The Award for the Best Paper of MM'11 was presented by the technical program chairs to F. Yu, R. Ji and S. Chang for their paper "Active Query Sensing for Mobile Location Search". After a tough competition, the best student paper award of 2011 was shared by two papers. It was granted to W. Wu, A. Arefin, G. Kurillo, B. Agarwal, K. Nahrstedt and R. Bajcsy for their paper "Color-plus-Depth Level-of-Details in 3D Teleimmersive Video - A Psychophysical Approach" and to R. Garg, A. Varma, and M. Wu for their paper "Seeing ENF: Natural Time Stamp for Digital Video via Optical Sensing and Signal Processing". The best technical demo was also chosen after long deliberation, and the award was presented to David S Monaghan, James O'Sullivan, and Noel O'Connor for their demo "Low-cost Creation of a 3D Interactive Museum Exhibition".
We would like to acknowledge all who have contributed to the success of MM'11. First of all, we would like to thank all authors who submitted papers to the technical program, various workshops, and other events of MM'11. We also thank the authors of the accepted papers who presented their work at MM'11 and the panelists and keynote speakers who agreed to participate in the conference to discuss current and future challenges in the field of multimedia and to propose innovative solutions. We are grateful to the members of the various program committees and external reviewers who have helped put together a high-quality program and would like to acknowledge members of the various MM'11 organizing committees and many student volunteers for their invaluable help at every step of the process. We would like to thank the staff of ACM and Sheridan for their continuous support and the Conference Management Toolkit Team (CMT) at Microsoft for letting us use CMT for handling the submission and review workflows of MM'11. Finally, we would like to thank our sponsors (as of this writing), Google, IBM, Microsoft, FxPal, Technicolor, Qualcomm, Springer, Yahoo!, Arizona State University, and the University of Texas at Dallas, who have extended their generous support to MM'11. We would also like to thank the National Science Foundation (NSF) and SIGMM for their generous support for the MM'11 student travel award program.
The Conference Program
We had an exciting technical program at ACM Multimedia 2011. The process of selecting the technical program included several innovations. These innovations, guided by the recent report of a select SIG Multimedia committee, included the following: MM'11 moved to technical areas instead of tracks; each submitted paper had a primary area and an optional secondary area, set by the authors at the time of submission; and an author rebuttal phase was introduced.
Figure 1: Cross-linkages between primary and secondary areas
The ten areas were as follows:
-
Multi-modal Integration and Understanding in the Imperfect World
-
Media Analysis and Search
-
Scalability in Media Processing, Analysis, and Applications
-
Multimedia Systems and Middleware
-
Media Transport and Sharing
-
Multimedia Security
-
Media Authoring and Production
-
Location-based and Mobile Multimedia
-
Human, Social, and Educational Aspects of Multimedia
-
Arts and Contemporary Digital Culture
The areas - chosen in consultation with the broad SIG Multimedia community - reflect core research areas (e.g. multimedia systems and middleware), as well as different multimedia research contexts (e.g. location-based mobile multimedia). Typically, each area had two area chairs managing the review process; media analysis and search was an exception: due to the large number of submissions (39%), we assigned six area chairs to manage this area. Figure 1 shows the relationship between areas in the papers submitted for review - there is an edge between two areas when a paper has both areas specified in the paper. In the figure, edges with higher strength are more opaque.
Many authors specified both primary and secondary areas, resulting in the assignment of two reviewers from the primary area and one reviewer from the secondary area. In the minority of cases where there was no secondary area, we assigned all three reviewers from the primary area. The selection of reviewers from two different areas allowed for a cross-disciplinary evaluation of the submitted paper; in past conferences, a paper submitted to one of the four tracks was evaluated exclusively by reviewers from that track. Additionally, to increase the responsibility of the area chairs (ACs) and to give more credit to their work, we indicated on each accepted paper the name of the AC who supervised the reviewing process and recommended the paper for acceptance.
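The reviewer assignment rule described above can be written down as a short Python sketch. The function name `reviewer_areas` and the choice to treat a secondary area identical to the primary one as "no secondary area" are illustrative assumptions, not part of the actual MM'11 review tooling.

```python
from typing import List, Optional


def reviewer_areas(primary: str, secondary: Optional[str] = None) -> List[str]:
    """Return the area from which each of a paper's three reviewers is drawn.

    Hypothetical helper: two reviewers come from the primary area and one
    from the secondary area; with no secondary area, all three come from
    the primary area.
    """
    if secondary is None or secondary == primary:
        # No distinct secondary area: all three reviewers from the primary area.
        return [primary, primary, primary]
    # Otherwise: two reviewers from the primary area, one from the secondary.
    return [primary, primary, secondary]


# Example with two of the ten MM'11 areas.
print(reviewer_areas("Media Analysis and Search", "Multimedia Security"))
print(reviewer_areas("Multimedia Security"))
```

Under this rule, every paper that names a secondary area receives at least one review from outside its primary area, which is what makes the cross-disciplinary evaluation possible.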
The conference was highly competitive, with low acceptance rates. We received a total of 666 submissions, comprising 335 long papers and 331 short papers. After the initial notification, authors had one week to rebut the criticisms of their papers. After the rebuttal phase, the area chairs led a discussion with the reviewers on the merits of each paper, taking the author rebuttals into account. In the end, we accepted 58 long papers, for an acceptance rate of 17.3%, and 120 short papers, for an acceptance rate of 36.3%. We additionally recommended 52 long papers to appear as short papers. Figure 2 shows the distribution of primary areas for all papers. We solicited nominations from each area for the best paper and best student paper competitions, and the technical program chairs selected three of the nominated papers for the best paper and best student paper session.
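As a quick check of the figures quoted above, the acceptance rates follow directly from the submission and acceptance counts. The helper name `acceptance_rate` is an illustrative assumption; the counts are the ones reported in this section.

```python
def acceptance_rate(accepted: int, submitted: int) -> float:
    """Acceptance rate as a percentage, rounded to one decimal place."""
    return round(100.0 * accepted / submitted, 1)


# Counts reported for MM'11.
long_submitted, short_submitted = 335, 331
long_accepted, short_accepted = 58, 120

total_submitted = long_submitted + short_submitted              # 666
long_rate = acceptance_rate(long_accepted, long_submitted)      # 17.3
short_rate = acceptance_rate(short_accepted, short_submitted)   # 36.3

print(total_submitted, long_rate, short_rate)
```

The 52 long papers recommended to appear as short papers are left out of this calculation, since the reported short-paper rate of 36.3% corresponds to 120 acceptances out of the 331 short submissions.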
Figure 2: Distribution of papers' primary areas
We would like to thank all of our area chairs and reviewers who volunteered a significant amount of their time to ensure a high quality program.
The Workshops
This year, ACM Multimedia experimented with a new workshop format in which the workshops were co-located with ACM Multimedia and ran in parallel with the conference sessions. The new format influenced the selection process: among the 28 very strong proposals we received, we could select only 11 workshops. We conducted the selection process in accordance with the SIGMM group guidelines, choosing workshops whose themes are distinct from (and complement) the areas of the main conference. This gave us the following very rich workshops program, spanning from relatively established topics at ACM MM (as is the case for those workshops in their 3rd edition) to topics that are very new to the ACM MM community:
-
Workshop on Music Information Retrieval with User-Centered and Multimodal Strategies (MIRUM'11)
-
Workshop on Multimedia in Forensics and Intelligence (MiFor'11)
-
Workshop on Automated Media Analysis and Production for Novel TV Services (AIEMPro 2011)
-
Workshop on Social Media (WSM11)
-
Workshop on Social and Behavioral Networked Media Access (SBNMA'11)
-
Workshop on Multimedia Technologies for Distance Learning (MTDL'11)
-
Workshop on Interactive Multimedia on Mobile and Portable Devices (IMMPD'11)
-
Joint Workshop on Modeling and Representing Events (J-MRE'11)
-
Part 1: Workshop on Events in Multimedia (EiMM11)
-
Part 2: Workshop on Sparse Representation for Event Detection in Multimedia (SRED'11)
-
Joint Workshop on Human Gesture and Behavior Understanding (J-HGBU'11)
-
Part 1: Workshop on Social Signal Processing (SSPW'11)
-
Part 2: Workshop on Multimedia access to 3D Human Objects (MA3HO'11)
-
Workshop on Medical Multimedia Analysis and Retrieval (MMAR)
-
Workshop on Ubiquitous Meta User Interfaces (Ubi-MUI'11)
We would like to thank all the organizers who submitted workshop proposals, and in particular the organizers of the workshops that appeared in the program. We are aware that the new workshop format required a significant amount of synchronization work from the organizers, who had to align their internal deadlines and schedules with those of the main conference. We greatly appreciate their collaboration, which resulted in a very interesting and successful workshop program. We hope that all ACM MM attendees enjoyed the program, and that the new format will increase the appeal of the workshops and significantly boost intellectual exchange.
Papers
SESSION: Technical achievement award and best paper candidates
SESSION: Human, social, and educational aspects of multimedia
-
Ramanathan Subramanian, Victoria Yanulevskaya, Nicu Sebe:
Can computers learn from humans to see better?: inferring scene semantics from viewers' eye movements
-
Axel Carlier, Guntur Ravindra, Vincent Charvillat, Wei Tsang Ooi:
Combining content-based analysis and crowdsourcing to improve user interaction with zoomable video
-
Lexing Xie, Apostol Natsev, John R. Kender, Matthew Hill, John R. Smith:
Visual memes in social media: tracking real-world news in YouTube videos
-
Bo Geng, Linjun Yang, Chao Xu, Xian-Sheng Hua, Shipeng Li:
The role of attractiveness in web image search
SESSION: Location-based and mobile multimedia
-
Yang Wang, Tao Mei, Jingdong Wang, Houqiang Li, Shipeng Li:
JIGSAW: interactive mobile visual search with multimodal queries
-
An-Jung Cheng, Yan-Ying Chen, Yen-Ta Huang, Winston H. Hsu, Hong-Yuan Mark Liao:
Personalized travel recommendation by mining people attributes from community-contributed photos
-
Zhijie Shen, Sakire Arslan Ay, Seon Ho Kim, Roger Zimmermann:
Automatic tag generation and ranking for sensor-rich outdoor videos
-
Shu Shi, Cheng-Hsin Hsu, Klara Nahrstedt, Roy Campbell:
Using graphics rendering contexts to enhance the real-time video coding for mobile cloud gaming
SESSION: Multi-modal integration and understanding in the imperfect world
-
Jinfeng Zhuang, Tao Mei, Steven C.H. Hoi, Xian-Sheng Hua, Shipeng Li:
Modeling social strength in social media community via kernel-based learning
-
Wei Jiang, Alexander C. Loui:
Audio-visual grouplet: temporal audio-visual interactions for general video concept classification
-
Zechao Li, Meng Wang, Jing Liu, Changsheng Xu, Hanqing Lu:
News contextualization with geographic and visual information
-
Zhenyong Fu, Horace H.S. Ip, Hongtao Lu, Zhiwu Lu:
Multi-modal constraint propagation for heterogeneous image clustering
SESSION: Arts and contemporary digital culture
SESSION: Media transport and sharing
-
Minhui Zhu, Sebastien Mondet, Géraldine Morin, Wei Tsang Ooi, Wei Cheng:
Towards peer-assisted rendering in networked virtual environments
-
Yao Liu, Fei Li, Lei Guo, Yang Guo, Songqing Chen:
BlueStreaming: towards power-efficient internet P2P streaming to mobile devices
-
Ke Liang, Roger Zimmermann, Wei Tsang Ooi:
Peer-assisted texture streaming in metaverses
-
Yan Huang, Zhenhua Li, Gang Liu, Yafei Dai:
Cloud download: using cloud utilities to achieve high-quality content distribution for unpopular videos
SESSION: Media authoring and production 1
-
Alberto Piacenza, Fabrizio Guerrini, Nicola Adami, Riccardo Leonardi, Julie Porteous, Jonathan Teutenberg, Marc Cavazza:
Generating story variants with constrained video recombination
-
Xirong Li, Efstratios Gavves, Cees G.M. Snoek, Marcel Worring, Arnold W.M. Smeulders:
Personalizing automated image annotation using cross-entropy
-
Song Tan, Chong-Wah Ngo, Hung-Khoon Tan, Lei Pang:
Cross media hyperlinking for search topic browsing
-
Chun-Wei Liu, Tz-Huan Huang, Ming-Hsu Chang, Ken-Yi Lee, Chia-Kai Liang, Yung-Yu Chuang:
3D cinematography principles and their applications to stereoscopic media processing
SESSION: Media analysis and search 1
-
Xiangyu Chen, Xiaotong Yuan, Shuicheng Yan, Jinhui Tang, Yong Rui, Tat-Seng Chua:
Towards multi-semantic image annotation with graph regularized exclusive group lasso
-
Martha Larson, Christoph Kofler, Alan Hanjalic:
Reading between the tags to predict real-world size-class for visually depicted objects in images
-
Zhigang Ma, Yi Yang, Feiping Nie, Jasper Uijlings, Nicu Sebe:
Exploiting the entire feature space with sparsity for automatic image annotation
-
Ju-Chiang Wang, Yu-Chin Shih, Meng-Sung Wu, Hsin-Min Wang, Shyh-Kang Jeng:
Colorizing tags in tag cloud: a novel query-by-tag music search system
SESSION: Media authoring and production 2
-
Rodrigo Laiola Guimarães, Pablo Cesar, Dick C.A. Bulterman, Vilmos Zsombori, Ian Kegel:
Creating personalized memories from social events: community-based support for multi-camera recordings of school concerts
-
Frank Nack, Ichiro Ide:
Why did the prime minister resign?: generation of event explanations from large news repositories
-
Christophe Lino, Marc Christie, Roberto Ranon, William Bares:
The director's lens: an intelligent assistant for virtual cinematography
-
Vivek K. Singh, Jiebo Luo, Dhiraj Joshi, Phoury Lei, Madirakshi Das, Peter Stubler:
Reliving on demand: a total viewer experience
SESSION: Media analysis and search 2
-
Sheng-hua Zhong, Yan Liu, Yang Liu:
Bilinear deep learning for image classification
-
Dayong Wang, Steven C.H. Hoi, Ying He, Jianke Zhu:
Retrieval-based face annotation by weak label regularized local coordinate coding
-
Xinmei Tian, Yijuan Lu, Linjun Yang, Qi Tian:
Learning to judge image search results
-
Olivier Le Meur, Thierry Baccino, Aline Roumy:
Prediction of the inter-observer visual congruency (IOVC) and application to image ranking
SESSION: Multimedia systems and middleware 1
-
Dao T.P. Quynh, Ying He, Xiaoming Chen, Jiazhi Xia, Qian Sun, Steven C.H. Hoi:
Modeling 3D articulated motions with conformal geometry videos (CGVs)
-
Qianqian Xu, Tingting Jiang, Yuan Yao, Qingming Huang, Bowei Yan, Weisi Lin:
Random partial paired comparison for subjective video quality assessment via hodgerank
-
Wei Song, Dian Tjondronegoro, Michael Docherty:
Saving bitrate vs. pleasing users: where is the break-even point in mobile video quality?
-
Peijia Zheng, Jiwu Huang:
Implementation of the discrete wavelet transform and multiresolution analysis in the encrypted domain
SESSION: Media analysis and search 3
-
Jingkuan Song, Yi Yang, Zi Huang, Heng Tao Shen, Richang Hong:
Multiple feature hashing for real-time large scale near-duplicate video retrieval
-
Xianming Liu, Hongxun Yao, Rongrong Ji, Pengfei Xu, Xiaoshuai Sun, Qi Tian:
Learning heterogeneous data for hierarchical web video classification
-
Xiao-Yong Wei, Zhen-Qun Yang:
Coached active learning for interactive video search
-
Jin Yuan, Zheng-Jun Zha, Yao-Tao Zheng, Meng Wang, Xiangdong Zhou, Tat-Seng Chua:
Learning concept bundles for video search with complex queries
SESSION: Multimedia systems and middleware 2
-
Pengpeng Ni, Ragnhild Eg, Alexander Eichhorn, Carsten Griwodz, Pål Halvorsen:
Flicker effects in adaptive video streaming to handheld devices
-
Yao Liu, Lei Guo, Fei Li, Songqing Chen:
An empirical evaluation of battery power consumption for streaming data transmission to mobile devices
-
Jui-Hsin Lai, Chieh-Li Chen, Po-Chen Wu, Chieh-Chi Kao, Shao-Yi Chien:
Tennis real play: an interactive tennis game with models from real videos
-
Xiangwen Chen, Minghua Chen, Baochun Li, Yao Zhao, Yunnan Wu, Jin Li:
Celerity: a low-delay multi-party conferencing solution
SESSION: Media analysis and search 4
-
Wenbin Tang, Rui Cai, Zhiwei Li, Lei Zhang:
Contextual synonym dictionary for visual object retrieval
-
Wenhao Lu, Jingdong Wang, Xian-Sheng Hua, Shengjin Wang, Shipeng Li:
Contextual image search
-
Zhang Liu, Chaokun Wang, Yiyuan Bai, Hao Wang, Jianmin Wang:
MUSIZ: a generic framework for music resizing with stretching and cropping
-
Nobuyuki Morioka, Jingdong Wang:
Robust visual reranking via sparsity and ranking constraints
SESSION: Applications
-
Troy McDaniel, Morris Goldberg, Daniel Villanueva, Lakshmie Narayan Viswanathan, Sethuraman Panchanathan:
Motor learning using a kinematic-vibrotactile mapping targeting fundamental movements
-
Xiaohong Xiang, Mohan S. Kankanhalli:
Affect-based adaptive presentation of home videos
-
Naoko Nitta, Noboru Babaguchi:
Example-based video remixing support system
-
Rongrong Ji, Ling-Yu Duan, Jie Chen, Hongxun Yao, Yong Rui, Shih-Fu Chang, Wen Gao:
Towards low bit rate mobile visual search with multiple-channel coding
SESSION: Plenary talk sessions
SESSION: Events
PANEL SESSION: Panels
-
Abdulmotaleb El Saddik:
Serious games
-
Pablo Cesar, Wei Tsang Ooi, Ben Moskowitz, Zohar Babin, Dick Bulterman, Rainer Lienhart, Robert Richter:
Towards synergy between the open source and the research multimedia communities
-
Yu-Ru Lin, Vincent Oria, K. Selçuk Candan, Lyndon Kennedy, Dulce Ponceleon, Hari Sundaram, Rong Yan, Roger Zimmermann:
Job opportunities and career perspective for fresh graduates of the multimedia community
-
Khaled El-Maleh, Haohong Wang, Susie Wee, Heather Yu, James D. Johnston, Zhengyou Zhang:
Innovating the multimedia experience
WORKSHOP SESSION: Workshop overviews
-
Cynthia C.S. Liem, Meinard Müller, Douglas Eck, George Tzanetakis:
1st international ACM workshop on music information retrieval with user-centered and multimodal strategies (MIRUM)
-
Sebastiano Battiato, Sabu Emmanuel, Adrian Ulges, Marcel Worring:
Third ACM international workshop on multimedia in forensics and intelligence (MiFor 2011)
-
Sid-Ahmed Berrani, Alberto Messina:
AIEMPro 2011: the 4th international workshop on automated media analysis and production for novel TV services
-
Steven Chu-Hong Hoi, Michal Jacovi, Ioannis Kompatsiaris, Jiebo Luo, Konstantinos Tserpes:
WSM2011: third ACM workshop on social media
-
Naeem Ramzan, Fei Wang, Charalampos Z. Patrikakis, Peng Cui, Nikolaos Doulamis, Shiqiang Yang, Gordon Sun:
ACM international workshop on social and behavioral networked media access (SBNMA'11)
-
Vasileios Mezaris, Ansgar Scherp, Ramesh Jain, Mohan Kankanhalli, Huiyu Zhou, Jianguo Zhang, Liang Wang, Zhengyou Zhang:
Modeling and representing events in multimedia
-
Maja Pantic, Alex Pentland, Alessandro Vinciarelli, Rita Cucchiara, Mohamed Daoudi, Alberto Del Bimbo:
Joint ACM workshop on human gesture and behavior understanding: (J-HGBU'11)
-
Yu Cao, Jayashree Kalpathy-Cramer, Devrim Ünay:
Medical multimedia analysis and retrieval
-
Ali Asghar Nazari Shirehjini, Sahin Albayrak, Abdulsalam Yassine:
Ubi-MUI 2011 ACM workshop summary
-
Rynson W.H. Lau, Timothy K. Shih, Frederick W.B. Li, Neil Y. Yen:
The third ACM international workshop on multimedia technologies for distance learning (MTDL 2011)
-
Jiebo Luo, Caifeng Shan, Ling Shao, Minoru Etoh:
ACM international workshop on interactive multimedia on mobile and portable devices (IMMPD'11)
TUTORIAL SESSION: Tutorial overviews
-
Alan Hanjalic, Martha Larson:
Frontiers in multimedia search
-
Tao Mei, Ruofei Zhang, Xian-Sheng Hua:
Internet multimedia advertising: techniques and technologies
-
Cees G.M. Snoek, Arnold W.M. Smeulders:
Internet video search
-
Simone Santini:
Semantic computing in multimedia
-
Gaël Richard:
Tutorial on multimedia music signal processing
-
Gerald Friedland:
Acoustic and multimodal processing for multimedia content analysis
-
Xiao-Ping Zhang, Zhu Liu:
Graphical probabilistic modeling and applications in multimedia content analysis
-
Jialie Shen, Meng Wang, Shuicheng Yan, Xian-Sheng Hua:
Multimedia tagging: past, present and future
-
Harish Katti, Mohan Kankanhalli:
Eye-tracking methodology and applications to images and video
SESSION: Grand challenge session
-
Christoph Kofler, Martha Larson, Alan Hanjalic:
Alice's worlds of wonder: exploiting tags to understand images in terms of size and scale
-
Guan-Long Wu, Yu-Chuan Su, Tzu-Hsuan Chiu, Liang-Chi Hsieh, Winston H. Hsu:
Scalable mobile video question-answering system with locally aggregated descriptors and random projection
-
Yu-Heng Lei, Yan-Ying Chen, Lime Iida, Bor-Chun Chen, Hsiao-Hang Su, Winston H. Hsu:
Photo search by face positions and facial attributes on touch devices
-
Chun Chet Tan, Yu-Gang Jiang, Chong-Wah Ngo:
Towards textually describing complex video contents with audio-visual concept classifiers
-
Dimitrios S. Alexiadis, Philip Kelly, Petros Daras, Noel E. O'Connor, Tamy Boubekeur, Maher Ben Moussa:
Evaluating a dancer's performance using kinect-based skeleton tracking
-
Tsung-Hung Tsai, Wen-Huang Cheng, Yung-Huan Hsieh:
Dynamic social network for narrative video analysis
-
Marc Gowing, Philip Kelly, Noel E. O'Connor, Cyril Concolato, Slim Essid, Jean Lefeuvre, Robin Tournemenne, Ebroul Izquierdo, Vlado Kitanovski, Xinyu Lin, Qianni Zhang:
Enhanced visualisation of dance performance from automatically synchronised multimodal recordings
-
Bruno do Nascimento Teixeira, Júlia Epischina Engrácia de Oliveira, Fillipe Dias Moreira de Souza, Tiago Oliveira Cunha, Arnaldo de Albuquerque Araújo, Christiane Okamoto, Lucas Figueiredo, Vinícius de Oliveira Silva, Igor Calil Loures de Oliveira:
News browsing system: multimodal analysis
-
Slim Essid, Yves Grenier, Mounira Maazaoui, Gaël Richard, Robin Tournemenne:
An audio-driven virtual dance-teaching assistant
-
Yoshitaka Ushiku, Tatsuya Harada, Yasuo Kuniyoshi:
Understanding images with natural sentences
-
Wanmin Wu, Ahsan Arefin, Gregorij Kurillo, Pooja Agarwal, Klara Nahrstedt, Ruzena Bajcsy:
A psychophysical approach for real-time 3D video processing
-
Yin-Hsi Kuo, Wen-Yu Lee, Winston H. Hsu, Wen-Huang Cheng:
Augmenting mobile city-view image retrieval with context-rich user-contributed photos
SESSION: Open source software competition
-
Jonathon S. Hare, Sina Samangooei, David P. Dupplaw:
OpenIMAJ and ImageTerrier: Java libraries and tools for scalable multimedia analysis and indexing of images
-
Isaac Esteban, Judith Dijk, Frans C.A. Groen:
From images to 3d models made easy
-
Fabien Cazenave, Vincent Quint, Cécile Roisin:
Timesheets.js: tools for web multimedia
-
Christopher A. Brooks, Markus Ketterl, Adam Hochman, Josh Holtzman, Judy Stern, Tobias Wunden, Kristofor Amundson, Greg Logan, Kenneth Lui, Adam McKenzie, Denis Meyer, Markus Moormann, Matjaz Rihtar, Ruediger Rolf, Nejc Skofic, Micah Sutton, Ruben Perez Vazquez, Benjamin Wulff:
OpenCast Matterhorn 1.1: reaching new heights
-
Sung Hee Park, Andrew Adams, Eino-Ville Talvala:
The FCam API for programmable cameras
-
Jérôme Gorin, Hervé Yviquel, Françoise Prêteux, Mickaël Raulet:
Just-in-time adaptive decoder engine: a universal video decoder based on MPEG RVC
-
Jean Le Feuvre, Cyril Concolato, Jean-Claude Dufourd, Romain Bouqueau, Jean-Claude Moissinac:
Experimenting with multimedia advances using GPAC
-
Sherif Halawa, Derek Pang, Ngai-Man Cheung, Bernd Girod:
ClassX: an open source interactive lecture streaming system
-
Christopher Müller, Christian Timmerer:
A VLC media player plugin enabling dynamic adaptive streaming over HTTP
-
Andrés Barrios, Matías Barrios, Daniel De Vera, Pablo Rodríguez-Bocca, Claudia Rostagnol:
GoalBit: a free and open source peer-to-peer streaming network
-
Werner Bailer, Hermann Fürntratt, Peter Schallauer, Georg Thallinger, Werner Haas:
A C++ library for handling MPEG-7 descriptions
-
Mathias Lux:
Content based image retrieval with LIRe
-
Niels Zeilemaker, Mihai Capotă, Arno Bakker, Johan Pouwelse:
Tribler: P2P media search and sharing
-
Jean Bresson, Carlos Agon, Gérard Assayag:
OpenMusic: visual programming environment for music composition, analysis and research
SESSION: Industrial exhibits 1
-
Julien Law-To, Gregory Grefenstette:
VOVALEAD: a scalable video search engine based on content
-
Raphaël Blouet, Charlotte Juan:
MMSI talk: an applicative use case of quaero media monitoring & social impact
-
Jingdong Wang, Xian-Sheng Hua:
Web-scale image search by color sketch
-
Arthur Lenoir, Rémi Landais:
MuMa: a scalable music search engine based on content analysis
-
Christian Wengert, Tobias Jaeggli, Philippe Messmer, Till Quack, Peter Cech, Cristi Prodan, Tomas Carnecky, Franco Sebregondi, David Wisti:
Kooaba interactive posters
DEMONSTRATION SESSION: Technical demos 1
-
Boqing Gong, Jianzhuang Liu, Xiaogang Wang, Xiaoou Tang:
3D object retrieval with semantic attributes
-
Zhou Ren, Jingjing Meng, Junsong Yuan, Zhengyou Zhang:
Robust hand gesture recognition with kinect sensor
-
Zhijie Shen, Sakire Arslan Ay, Seon Ho Kim:
SRV-TaGS: An Automatic TAGging and Search System for Sensor-Rich Outdoor Videos
-
Kazuhiro Otsuka, Kamil Sebastian Mucha, Shiro Kumano, Dan Mikami, Masafumi Matsuda, Junji Yamato:
A system for reconstructing multiparty conversation field based on augmented head motion by dynamic projection
-
Andreas Zingerle, Tyler Freeman:
Enabling the VJ as performer with rhythmic wearable interfaces
-
David Sadlier, Paul Ferguson, Dian Zhang, Noel E. O'Connor, Hyowon Lee:
InSPeCT: integrated surveillance for port container traffic
-
Svetha Venkatesh, Stewart Greenhill, Dinh Phung, Brett Adams:
Cognitive intervention in autism using multimedia stimulus
-
Takayuki Yamada, Seiichi Gohshi, Isao Echizen:
iCabinet: stand-alone implementation of a method for preventing illegal recording of displayed content by adding invisible noise signals
-
Jian Dong, Yuzhao Ni, Jiashi Feng, Shuicheng Yan:
Purposive hidden-object game (P-HOG) towards imperceptible human computation
-
Genliang Guan, Zhiyong Wang, Xian-Sheng Hua, Dagan Feng:
StoryImaging: a media-rich presentation system for textual stories
-
Ning Zhang, Tao Mei, Xian-Sheng Hua, Ling Guan, Shipeng Li:
TapTell: understanding visual intents on-the-go
-
Britta Meixner, Johannes Köstler, Harald Kosch:
A mobile player for interactive non-linear video
-
Carmelo Velardo, Jean-Luc Dugelay:
Real time extraction of body soft biometric from 3D videos
-
Ahsan Arefin, Zixia Huang, Raoul Rivas, Shu Shi, Wanmin Wu, Klara Nahrstedt:
Tele-immersive gaming for everybody
-
Diana Siwiak, Nicole Lehrer, Michael Baran, Yinpeng Chen, Margaret Duff, Todd Ingalls, Thanassis Rikakis:
A home-based adaptive mixed reality rehabilitation system
-
Derek Pang, Sherif Halawa, Ngai-Man Cheung, Bernd Girod:
ClassX Mobile: region-of-interest video streaming to mobile devices with multi-touch interaction
-
Changhu Wang, Jun Zhang, Bruce Yang, Lei Zhang:
Sketch2Cartoon: composing cartoon images by sketching
-
Beomjoo Seo, Jia Hao, Guanfeng Wang:
Sensor-rich video exploration on a map interface
-
Andrew Au, Jie Liang:
Ztitch: a mobile phone application for 3D scene creation, navigation, and sharing
-
Emi Myodo, Satoshi Ueno, Koichi Takagi, Shigeyuki Sakazawa:
Automatic comic-like image layout system preserving image order and important regions
SESSION: Industrial exhibits 2
SESSION: Technical demos 2
-
Lei Pang, Song Tan, Hung Khoon Tan, Chong-Wah Ngo:
Galaxy browser: exploratory search of web videos
-
Felix X. Yu, Rongrong Ji, Tongtao Zhang, Shih-Fu Chang:
A mobile location search system with active query sensing
-
Florian Mehm, Sandro Hardy, Stefan Göbel, Ralf Steinmetz:
Collaborative authoring of serious games for health
-
Haojie Li, Lei Yi, Jinhui Tang, Xiaohui Wang:
Capturing a great photo via learning from community-contributed photo collections
-
Alberto Piacenza, Fabrizio Guerrini, Nicola Adami, Riccardo Leonardi, Jonathan Teutenberg, Julie Porteous, Marc Cavazza:
Changing video arrangement for constructing alternative stories
-
Hervé Goëau, Alexis Joly, Souheil Selmi, Pierre Bonnet, Elise Mouysset, Laurent Joyeux, Jean-François Molino, Philippe Birnbaum, Daniel Bathelemy, Nozha Boujemaa:
Visual-based plant species identification from crowdsourced data
-
Vivek K. Singh, Jiebo Luo, Dhiraj Joshi, Madirakshi Das, Phoury Lei, Peter Stubler:
Dynamic media show drivable by semantics
-
Steven C.H. Hoi, Pengcheng Wu:
SIRE: a social image retrieval engine
-
Paul B. Beskow, Håkon K. Stensland, Håvard Espeland, Espen A. Kristiansen, Preben N. Olsen, Ståle Kristoffersen, Carsten Griwodz, Pål Halvorsen:
Processing of multimedia data using the P2G framework
-
Qia Wang, Alex Lobzhanidze, Suman Deb Roy, Wenjun Zeng, Yi Shang:
Positionit: an image-based remote target localization system on smartphones
-
David Monaghan, James O'Sullivan, Noel E. O'Connor, Bridget Kelly, Olivier Kazmierczak, Lorraine Comer:
Low-cost creation of a 3D interactive museum exhibition
-
Koichi Mori, Rafael Ballagas, Glenda Revelle, Hayes Raffle, Hiroshi Horii, Mirjana Spasojevic:
Interactive rich reading: enhanced book reading experience with a conversational agent
-
Klaus Schoeffmann, Manfred del Fabro:
Hierarchical video browsing with a 3D carousel
-
Axel Carlier, Arash Shafiei, Julien Badie, Salim Bensiali, Wei Tsang Ooi:
COZI: crowdsourced and content-based zoomable video player
-
Christophe Lino, Marc Christie, Roberto Ranon, William Bares:
A smart assistant for shooting virtual cinematography with motion-tracked cameras
-
Gerald Friedland, Jaeyoung Choi, Adam Janin:
Video2GPS: a demo of multimodal location estimation on Flickr videos
-
Alexis Fesnin, Valerie Gouet-Brunet, Scott Kominen, Vincent Oria, Jichao Sun:
Towards a privacy preserving personal photo album manager with semantic classification, indexing and querying capabilities
-
Yang Cai, Linjun Yang, Wei Ping, Fei Wang, Tao Mei, Xian-Sheng Hua, Shipeng Li:
Million-scale near-duplicate video retrieval system
-
Junfeng He, Tai-Hsu Lin, Jinyuan Feng, Shih-Fu Chang:
Mobile product search with bag of hash bits
SESSION: Oral presentation session
POSTER SESSION: Posters Session
SESSION: Short papers session 1
-
Guifang Duan, Neela Sawant, James Z. Wang, Dean Snow, Danni Ai, Yen-Wei Chen:
Analysis of Cypriot icon faces using ICA-enhanced active shape model representation
-
Diogo Cabral, João Valente, João Silva, Urândia Aragão, Carla Fernandes, Nuno Correia:
A creation-tool for contemporary dance using multimodal video annotation
-
Gianluca Monaci, Tommaso Gritti, Fabio Vignoli, Wouter Walmink, Maarten Hendriks:
Flower power
-
Bauke Freiburg, Jaap Kamps, Cees G.M. Snoek:
Crowdsourcing visual detectors for video search
-
Alexander Reben, Joseph Paradiso:
A mobile interactive robot for gathering structured social video
-
Andreea Danielescu, Ryan P. Spicer, David Tinapple, Aisling Kelliher, Shawn Nikkila, Sean Burdick:
Abstract rendering of human activity in a dynamic distributed learning environment
-
Steve Mann, Ryan Janzen, Jason Huang:
"WaterTouch": an aquatic interactive multimedia sensory table based on total internal reflection in water
-
Abhishek Bhattacharya, Wanmin Wu, Zhenyu Yang:
Quality of experience evaluation of voice communication systems using affect-based approach
-
Mihalis A. Nicolaou, Hatice Gunes, Maja Pantic:
A multi-layer hybrid framework for dimensional emotion classification
-
Catherine H. Vuong, Todd Ingalls, James J. Abbas:
Transforming clinical rehabilitation into interactive multimedia
-
Ramin Tadayon, Ashish Amresh, Winslow Burleson:
Socially relevant simulation games: a design study
-
Ting Yao, Chong-Wah Ngo, Tao Mei:
Context-based friend suggestion in online photo-sharing community
-
Anastasia Gumulia, Bartłomiej Puzon, Naoko Kosugi:
Music visualization: predicting the perceived speed of a composition -- misual project --
-
Xiuzhuang Zhou, Junlin Hu, Jiwen Lu, Yuanyuan Shang, Yong Guan:
Kinship verification from facial images under uncontrolled conditions
-
Mitchell J. Morris, John R. Kender:
VastMM-Tag: a semantic tagging browser for unstructured videos
-
Qiyam Tung, Ranjini Swaminathan, Alon Efrat, Kobus Barnard:
Expanding the point: automatic enlargement of presentation video elements
-
Senthil Kumar, Sreedal Menon, Francis Zane:
Sharing rectangular objects in a video conference
-
Hao Ji, Fei Su:
Biased metric learning for person-independent head pose estimation
-
Dong Liu, Shuicheng Yan, Hong-Jiang Zhang:
Next photo please: towards visually consistent sequential photo browsing
-
Ngo Quang Minh Khiem, Guntur Ravindra, Wei Tsang Ooi:
Towards understanding user tolerance to network latency in zoomable video streaming
-
Marian Ursu, Pedro Torres, Vilmos Zsombori, Michael Frantzis, Rene Kaiser:
Socialising through orchestrated video communication
-
Fernanda Brandi, Eckehard Steinbach:
Perceptual coding of recorded telemanipulation sessions
-
Jacopo Staiano, Bruno Lepri, Ramanathan Subramanian, Nicu Sebe, Fabio Pianesi:
Automatic modeling of personality states in small group interactions
-
Si Liu, Qiang Chen, Jian Dong, Shuicheng Yan, Changsheng Xu, Hanqing Lu:
Snap & play: auto-generate personalized find-the-difference mobile game
-
Aveek Shankar Brahmachari, Sudeep Sarkar:
Fast detection of noisy GPS and magnetometer tags in wide-baseline multi-views
-
Linjun Yang, Yang Cai, Alan Hanjalic, Xian-Sheng Hua, Shipeng Li:
Video-based image retrieval
-
Roberto Yus, Eduardo Mena, Jorge Bernad, Sergio Ilarri, Arantza Illarramendi:
Location-aware system based on a dynamic 3D model to help in live broadcasting of sport events
-
Ziying Tang, Orkun Ozbek, Xiaohu Guo:
Real-time 3D interaction with deformable model on mobile devices
-
Jia Hao, Guanfeng Wang, Beomjoo Seo, Roger Zimmermann:
Keyframe presentation for browsing of user-generated videos on map interfaces
-
Xunyi Yu, Aura Ganz:
Detecting and identifying people in mobile videos
-
Fang-Erh Lin, Yin-Hsi Kuo, Winston H. Hsu:
Multiple object localization by context-aware adaptive window search and search-based object recognition
-
Matthew L. Cooper:
Clustering geo-tagged photo collections using dynamic programming
-
Sam S. Tsai, David Chen, Huizhong Chen, Cheng-Hsin Hsu, Kyu-Han Kim, Jatinder P. Singh, Bernd Girod:
Combining image and text features: a hybrid approach to mobile book spine recognition
-
Michael E. Houle, Vincent Oria, Shin'ichi Satoh, Jichao Sun:
Knowledge propagation in large image databases using neighborhood information
-
Qiang Zhou, Shifeng Chen, Jianzhuang Liu, Xiaoou Tang:
Edge-preserving single image super-resolution
-
Chen Cao, Shifeng Chen, Wei Zhang, Xiaoou Tang:
Automatic motion-guided video stylization and personalization
-
Shiai Zhu, Chong-Wah Ngo, Yu-Gang Jiang:
On the pooling of positive examples with ontology for visual concept learning
-
Yuming Fang, Zhenzhong Chen, Weisi Lin, Chia-Wen Lin:
Saliency-based image retargeting in the compressed domain
-
Jian Yi, Yuxin Peng, Jianguo Xiao:
Mining concept relationship in temporal context for effective video annotation
-
Xiangmin Zhou, Lei Chen, Xiaofang Zhou:
Structure tensor series-based matching for near-duplicate video retrieval
-
Junfeng Jiang, Xiao-Ping Zhang:
A smart video player with content-based fast-forward playback
-
Hongyuan Cai, Jiang Yu Zheng:
Video anatomy: cutting video volume for profile
-
Harlyn Baker, Nelson L. Chang, Arun Paruchuri:
Capture and display for live immersive 3D entertainment
-
Junhao Shi, Mingmin Zhang, Zhigeng Pan:
A real-time bimanual 3D interaction method based on bare-hand tracking
-
Xinming Zhang, Zheng-Jun Zha, Changsheng Xu:
Learning "verb-object" concepts for semantic image annotation
-
Yongqing Sun, Akira Kojima:
A novel method for semantic video concept learning using web images
-
Jun Imura, Teppei Fujisawa, Tatsuya Harada, Yasuo Kuniyoshi:
Efficient multi-modal retrieval in conceptual space
-
Xiangyu Wang, Yong Rui, Mohan S. Kankanhalli:
Up-fusion: an evolving multimedia decision fusion method
-
Zhou Ren, Junsong Yuan, Zhengyou Zhang:
Robust hand gesture recognition based on finger-earth mover's distance with a commodity depth camera
-
Xiaodong Yang, Shuai Yuan, YingLi Tian:
Recognizing clothes patterns for blind people by confidence margin based feature combination
-
Milan Redzic, Conor Brennan, Noel E. O'Connor:
Dual-sensor fusion for indoor user localisation
-
Zhonghua Li, Bingjun Zhang, Ye Wang:
Document dependent fusion in multimodal music retrieval
-
Stevan Rudinac, Alan Hanjalic, Martha Larson:
Finding representative and diverse community contributed images to create visual summaries of geographic areas
-
Luca Del Pero, Philip Lee, James Magahern, Emily Hartley, Kobus Barnard, Ping Wang, Atul Kanaujia, Niels Haering:
Fusing object detection and region appearance for image-text alignment
-
Biao Han, Hao Zhu, Youdong Ding:
Bottom-up saliency based on weighted sparse coding residual
-
Yuzhao Ni, Jian Dong, Jiashi Feng, Shuicheng Yan:
Purposive hidden-object-game: embedding human computation in popular game
-
Mayank Bansal, Harpreet S. Sawhney, Hui Cheng, Kostas Daniilidis:
Geo-localization of street views with aerial image databases
SESSION: Short papers session 2
-
Jitao Sang, Jing Liu, Changsheng Xu:
Exploiting user information for image tag refinement
-
Kong-Wah Wan, Yan-Tao Zheng, Lekha Chaisorn:
Known-item video search via query-to-modality mapping
-
Yang Yang, Yi Yang, Zi Huang, Heng Tao Shen:
Transfer tagging from image to video
-
Srinivasan H. Sengamedu, Subhajit Sanyal, Sriram Satish:
Detection of pornographic content in internet images
-
Chunlei Yang, Jialie Shen, Jianping Fan:
Effective summarization of large-scale web images
-
Gang Yu, Junsong Yuan, Zicheng Liu:
Real-time human action search using random forest based Hough voting
-
Xin-Shun Xu, Xiangyang Xue, Zhi-Hua Zhou:
Ensemble multi-instance multi-label learning approach for video annotation task
-
Ce Li, Jianru Xue, Nanning Zheng, Zhiqiang Tian:
Nonparametric bottom-up saliency detection using hypercomplex spectral contrast
-
LiMin Wang, Yirui Wu, Tong Lu, Kang Chen:
Multiclass object detection by combining local appearances and context
-
Yong Luo, Dacheng Tao, Bo Geng, Chao Xu, Stephen Maybank:
Shared feature extraction for semi-supervised image classification
-
Yangxi Li, Bo Geng, Zheng-Jun Zha, Dacheng Tao, Linjun Yang, Chao Xu:
Difficulty guided image retrieval using linear multiview embedding
-
Tianlong Chen, Shuqiang Jiang, Lingyang Chu, Qingming Huang:
Detection and location of near-duplicate video sub-clips by finding dense subgraphs
-
Yingfei Li, Bo Geng, Zheng-Jun Zha, Yangxi Li, Dacheng Tao, Chao Xu:
Query expansion by spatial co-occurrence for image retrieval
-
Aixin Sun, Sourav S. Bhowmick, Jun-An Chong:
Social image tag recommendation by concept matching
-
Wei Zhang, Yao Lu, Xiangyang Xue, Jianping Fan:
Automatic image annotation with weakly labeled dataset
-
Wei Zhang, Ke Gao, Yongdong Zhang, Jintao Li:
Efficient approximate nearest neighbor search with integrated binary codes
-
Peng Yang, Hui Li, Qingshan Liu, Lin Zhong, Dimitris Metaxas:
Content quality based image retrieval with multiple instance boost ranking
-
Ying Zheng, Steve Gu, Carlo Tomasi:
Detecting motion synchrony by video tubes
-
Teresa Bracamonte, Barbara Poblete:
Automatic image tagging through information propagation in a query log based graph structure
-
Lyndon Kennedy, Malcolm Slaney:
Identifying authoritative sources of multimedia content: mining specificity and expertise from large-scale multimedia databases
-
Kuiyuan Yang, Lei Zhang, Meng Wang, Hong-Jiang Zhang:
Semantic point detector
-
Hsiao-Hang Su, Tse-Wei Chen, Chieh-Chi Kao, Winston H. Hsu, Shao-Yi Chien:
Scenic photo quality assessment with bag of aesthetics-preserving features
-
Hao Xu, Jingdong Wang, Xian-Sheng Hua, Shipeng Li:
Hybrid image summarization
-
Xavier Anguera, Juan Manuel Barrios, Tomasz Adamek, Nuria Oliver:
Multimodal fusion for video copy detection
-
Wang Junqiang, Huadong Ma:
Pedestrian detection with geometric context from a single image
-
Di Niu, Hong Xu, Baochun Li, Shuqiao Zhao:
Risk management for video-on-demand servers leveraging demand forecast
-
Zhi Wang, Lifeng Sun, Shiqiang Yang, Wenwu Zhu:
Prefetching strategy in peer-assisted social video streaming
-
Dan Miao, Wenwu Zhu, Chong Luo, Chang Wen Chen:
Resource allocation for cloud-based free viewpoint video rendering for mobile phones
-
Zhen Wei Zhao, Wei Tsang Ooi:
APRICOD: a distributed caching middleware for fast content discovery of non-continuous media access
-
Sebastiano Battiato, Giovanni Maria Farinella, Enrico Messina, Giovanni Puglisi:
Robust image registration and tampering localization exploiting bag of features based forensic signature
-
Xiangyang Xue, Wei Li, Yue Yin:
Towards content-based audio fragment authentication
-
Rui Min, Jean-Luc Dugelay:
Cap detection for moving people in entrance surveillance
-
Han-Ping Cheng, Yun-Chung Shen, Ja-Ling Wu, Kiyoharu Aizawa:
High efficient distributed video coding with parallelized design for cloud computing
-
Tse-Chung Su, Yun-Chung Shen, Ja-Ling Wu:
Real-time decoding for LDPC based distributed video coding
-
Shingo Uchihashi, Tsutomu Tanzawa:
Mixing remote locations using shared screen as virtual stage
-
Kuan-Ta Chen, Yu-Chun Chang, Po-Han Tseng, Chun-Ying Huang, Chin-Laung Lei:
Measuring the latency of cloud gaming systems
-
Hsiao-Yun Tseng, Yun-Chung Shen, Ja-Ling Wu:
Distributed video coding with compressive measurements
-
Lei Huang, Tian Xia, Ji Wan, Yongdong Zhang, Shouxun Lin:
Personalized portraits ranking
-
Jong-Seok Lee, Lutz Goldmann, Touradj Ebrahimi:
A new analysis method for paired comparison and its application to 3D quality assessment
-
Tommaso Gritti, Gianluca Monaci:
ImagiLight: a vision approach to lighting scene setting
-
Florian Schweiger, Georg Schroth, Michael Eichhorn, Eckehard Steinbach, Michael Fahrmair:
Consensus-based cross-correlation
-
Junyong You, Touradj Ebrahimi, Andrew Perkis:
Modeling motion visual perception for video quality assessment
-
Zhiding Yu, Chunjing Xu, Jianzhuang Liu, Oscar C. Au, Xiaoou Tang:
Automatic object segmentation from large scale 3D urban point clouds through manifold embedded mode seeking
-
Zhiqiang Tian, Jianru Xue, Xuguang Lan, Ce Li, Nanning Zheng:
Key object-based static video summarization
-
Nick C. Tang, Chiou-Ting Hsu, Tsung-Yi Lin, Hong-Yuan Mark Liao:
Example-based human motion extrapolation based on manifold learning
-
Jiangbo Lu, Viet Anh Nguyen, Zeping Niu, Bhavdeep Singh, Zhiping Luo, Minh N. Do:
CuteChat: a lightweight tele-immersive video chat system
-
Haiyang Ma, Deepak Gangadharan, Nalini Venkatasubramanian, Roger Zimmermann:
Energy-aware complexity adaptation for mobile video calls
-
Yanjie Li, Lifeng Sun, Tianfan Xue:
Fast frame-rate up-conversion of depth video via video coding
-
Mu Mu, Johnathan Ishmael, Keith Mitchell, Nicholas Race, Andreas Mauthe:
Multimodal QoE evaluation in P2P-based IPTV systems
-
Steve Mann, Jason Huang, Ryan Janzen, Raymond Lo, Valmiki Rampersad, Alexander Chen, Taqveer Doha:
Blind navigation with a wearable range camera and vibrotactile helmet
-
Snehasis Mukherjee, Sujoy Kumar Biswas, Dipti Prasad Mukherjee:
Recognizing interaction between human performers using 'key pose doublet'
-
Huayou Su, Chunyuan Zhang, Jun Chai, Mei Wen, Nan Wu, Ju Ren:
High-efficient software parallel CAVLC encoder based on programmable stream processor
-
Pengjie Wang, Rynson W.H. Lau, Mingmin Zhang, Jiang Wang, Haiyu Song, Zhigeng Pan:
A real-time database architecture for motion capture data
-
Jay Geagan, Dulce Ponceleon:
Once upon a time, I bought a movie and it played everywhere in my home
SESSION: Short papers session 3
-
Hua Wang, Feiping Nie, Heng Huang, Yi Yang:
Learning frame relevance for video classification
-
Wengang Zhou, Houqiang Li, Yijuan Lu, Qi Tian:
Large scale image search with geometric coding
-
Xianwang Wang, Tong Zhang:
Clothes search in consumer photos via color matching and attribute learning
-
Nakamasa Inoue, Koichi Shinoda:
A fast MAP adaptation technique for GMM-supervector-based video semantic indexing systems
-
Shen-Fu Tsai, Liangliang Cao, Feng Tang, Thomas S. Huang:
Compositional object pattern: a new model for album event recognition
-
Linjun Yang, Alan Hanjalic:
Learning from search engine and human supervision for web image search
-
Bor-Chun Chen, Yin-Hsi Kuo, Yan-Ying Chen, Kuan-Yu Chu, Winston Hsu:
Semi-supervised face image retrieval using sparse coding with identity constraint
-
Hong Lu, Renzhong Wei, Yanran Shen, Xiangyang Xue:
Level influence of spatial pyramid matching in object classification
-
Xin-Shun Xu, Yuan Jiang, Liang Peng, Xiangyang Xue, Zhi-Hua Zhou:
Ensemble approach based on conditional random field for multi-label image and video annotation
-
Yingbin Zheng, Renzhong Wei, Hong Lu, Xiangyang Xue:
Refining local descriptors by embedding semantic information for visual categorization
-
Hongtao Xie, Ke Gao, Yongdong Zhang, Jintao Li, Huamin Ren:
Common visual pattern discovery via graph matching
-
Feng Su, Li Yang, Tong Lu, Gongyou Wang:
Environmental sound classification for scene recognition using local discriminant bases and HMM
-
Yang Liu, Yan Liu, Shenghua Zhong, Keith C.C. Chan:
Semi-supervised manifold ordinal regression for image ranking
-
Bolan Su, Shijian Lu, Chew Lim Tan:
Blurred image region detection and classification
-
Zhongwei Cheng, Lei Qin, Qingming Huang, Shuqiang Jiang, Shuicheng Yan, Qi Tian:
Human group activity analysis with fusion of motion and appearance information
-
Lingqiao Liu, Lei Wang:
Exploring latent class information for image retrieval using the bag-of-feature model
-
Zhiwu Lu, Yuxin Peng:
Combining latent semantic learning and reduced hypergraph learning for semi-supervised image categorization
-
Shayok Chakraborty, Vineeth Balasubramanian, Sethuraman Panchanathan:
Optimal batch selection for active learning in multi-label classification
-
Yuta Nakashima, Noboru Babaguchi:
Extracting intentionally captured regions using point trajectories
-
Chih-Fan Chen, Yu-Chiang Frank Wang:
Exploring self-similarities of bag-of-features for image classification
-
Pengjie Li, Huadong Ma, Anlong Ming:
Non-rigid 3D model retrieval using multi-scale local features
-
Miriam Redi, Bernard Merialdo:
Marginal-based visual alphabets for local image descriptors aggregation
-
Christian Beecks, Anca Maria Ivanescu, Steffen Kirchhoff, Thomas Seidl:
Modeling multimedia contents through probabilistic feature signatures
-
Christian Wengert, Matthijs Douze, Hervé Jégou:
Bag-of-colors for improved image search
-
Mihir Jain, Hervé Jégou, Patrick Gros:
Asymmetric Hamming embedding: taking the best of our bits for large scale image search
-
Xiangang Cheng, Liang-Tien Chia:
Spatially-coherent pyramid matching based on max-pooling
-
Daan T.J. Vreeswijk, Bouke Huurnink, Arnold W.M. Smeulders:
Text and image subject classifiers: dense works better
-
Damian Borth, Adrian Ulges, Thomas Michael Breuel:
Automatic concept-to-query mapping for web-based concept detector training
-
Yueting Zhuang, Yang Liu, Fei Wu, Yin Zhang, Jian Shao:
Hypergraph spectral hashing for similarity search of social image
-
Michele Merler, John R. Kender:
Selecting the best faces to index presentation videos
-
Sheng He, Junwei Han, Xintao Hu, Ming Xu, Lei Guo, Tianming Liu:
A biologically inspired computational model for image saliency detection
-
Xiaoshuai Sun, Hongxun Yao, Rongrong Ji, Xianming Liu, Pengfei Xu:
Unsupervised fast anomaly detection in crowds
-
Sicheng Zhao, Hongxun Yao, Xiaoshuai Sun, Pengfei Xu, Xianming Liu, Rongrong Ji:
Video indexing and recommendation based on affective analysis of viewers
-
Brett Adams, Dinh Phung, Svetha Venkatesh:
Eventscapes: visualizing events over time with emotive facets
-
Minh-Son Dao, Duc-Tien Dang-Nguyen, Francesco G.B. De Natale:
Signature-image-based event analysis for personal photo albums
-
Lin Pang, Juan Cao, Yongdong Zhang, Shouxun Lin:
Leveraging collective wisdom for web video retrieval through heterogeneous community discovery
-
Keiichiro Hoashi, Chihiro Ono, Daisuke Ishii, Hiroshi Watanabe:
Automatic preview generation of comic episodes for digitized comic search
-
Xiangqian Yu, Vincent Oria, Pierre Gouton, Geneviève Jomier:
2D geon based generic object recognition
-
Ying Yuan, Fei Wu, Yueting Zhuang, Jian Shao:
Image annotation by composite kernel learning with group structure
-
Xiaofeng Zhu, Zi Huang, Heng Tao Shen:
Video-to-shot tag allocation by weighted sparse group lasso
-
Zheshen Wang, Mrityunjay Kumar, Jiebo Luo, Baoxin Li:
Extracting key frames from consumer videos using bi-layer group sparsity
-
Xia Li, Yan Song, Yijuan Lu, Qi Tian:
Spatial pooling for transformation invariant image representation
-
Rui Zhang, Lei Zhang, Xin-Jing Wang, Ling Guan:
Multi-feature pLSA for combining visual features in image annotation
-
Yue Gao, Meng Wang, Huanbo Luan, Jialie Shen, Shuicheng Yan, Dacheng Tao:
Tag-based social image search with visual-text joint hypergraph learning
-
Xiaojian Zhao, Guangda Li, Meng Wang, Jin Yuan, Zheng-Jun Zha, Zhoujun Li, Tat-Seng Chua:
Integrating rich information for video recommendation with multi-task rank aggregation
-
David S. Monaghan, Philip Kelly, Noel O'Connor:
Quantifying human reconstruction accuracy for voxel carving in a sporting environment
-
Vladislavs Dovgalecs, Rémi Mégret, Yannick Berthoumieu:
Time-aware co-training for indoors localization in visual lifelogs
-
Yoshitaka Ushiku, Tatsuya Harada, Yasuo Kuniyoshi:
Automatic sentence generation from images
-
Vasant Manohar, Stavros Tsakalidis, Pradeep Natarajan, Rohit Prasad, Prem Natarajan:
Audio-visual fusion using Bayesian model combination for web video retrieval
-
Lamberto Ballan, Marco Bertini, Alberto Del Bimbo, Giuseppe Serra:
Enriching and localizing semantic tags in internet videos
-
Kazuki Sawai, Tomokazu Takahashi, Daisuke Deguchi, Ichiro Ide, Hiroshi Murase:
Scene segmentation of wedding party videos by scenario-based matching with example videos
-
Aibo Tian, Xuemei Zhang, Daniel R. Tretter:
Content-aware photo-on-photo composition for consumer photos
-
Cheng-Te Li, Hsun-Ping Hsieh, Shou-De Lin:
PhotoFeel: feeling your photo collection with graph-based audiovisual flocking
-
Minwoo Park, Jiebo Luo, Andrew Gallagher, Majid Rabbani:
Learning to produce 3D media from a captured 2D video
-
Andreas Girgensohn, Frank Shipman, Lynn Wilcox, Qiong Liu, Chunyuan Liao, Yuichi Oneda:
A tool for authoring unambiguous links from printed content to digital media
-
Jung-Yu Yeh, Min-Chun Hu, Wen-Huang Cheng, Ja-Ling Wu:
Interactive digital scrapbook generation for travel photos based on design principles of typography
-
Patricia Wang, Xiaofeng Tong, Yangzhou Du, Jianguo Li, Wei Hu, Yimin Zhang:
Augmented makeover based on 3D morphable model
-
Yingbo Li, Bernard Merialdo, Mickael Rouvier, Georges Linares:
Static and dynamic video summaries
|