VCAE2010-Welcome

Welcome to VCAE2010, July 8-10, 2010, Xi'an, China

Prof. Shipeng Li, Microsoft Research Asia
The Trends in Multimedia Computing

Prof. Chong-Wah Ngo, City University of Hong Kong
Online Editing of Lecture Videos by Integrating Posture, Gesture and Text

Prof. Qi Tian, The University of Texas at San Antonio, USA
Building Contextual Visual Vocabulary for Large-scale Image Applications

Prof. Xiansheng Hua, Microsoft Research Asia
Content-Based Multimedia Technologies – A Collection of Demonstrations from MSRA

Prof. Jiebo Luo, Kodak Research Laboratories, USA
Prof. Ming Zhou, Microsoft Research Asia
Prof. Feng Wu, Microsoft Research Asia
Prof. Nicu Sebe, University of Trento, Italy
Prof. Tat-Seng Chua, National University of Singapore
Prof. Shuicheng Yan, National University of Singapore
Prof. Ling Shao, The University of Sheffield, UK

Invited Speakers and Abstracts of Speeches:

	Title: The Trends in Multimedia Computing
Speaker: Shipeng Li, Ph.D., Principal Researcher/Research Area Manager, Media Computing Group, Microsoft Research Asia
Abstract: With the rapid development of Web 2.0 and cloud computing concepts and applications, many unprecedented web-based multimedia applications are emerging today, and they pose many new challenges for multimedia research. In this talk, I first summarize the common features of this new wave of multimedia applications, which I call Media 2.0. I use 6 D’s to describe the Media 2.0 principles: Democratized media life cycle; Data-driven media value chain; Decoupled media system; Decomposed media contents; Diminished media boundary between the physical and virtual worlds; and Decentralized media business model. Then I explain the implications of Media 2.0 for multimedia research and how we should choose research topics that could make a big impact. Finally, I use example research projects from MSRA, ranging from media codecs and media systems to media search and media-related advertisement, to demonstrate these ideas. I hope these ideas and principles will inspire the audience to come up with new Media 2.0 research topics and applications in the future.
Bio of Speaker: Dr. Shipeng Li joined Microsoft Research Asia (MSRA) in May 1999. He is now a Principal Researcher and Research Manager of the Media Computing group. He also serves as the Research Area Manager coordinating the multimedia research activities at MSRA. His research interests include Signal and Image Processing; Content-based Analysis; Image and Video Coding; HDTV Technology; Multimedia Streaming and Communications over Internet and Wireless Networks; Scalable Multimedia Representation; Application Level Multicast; Digital Rights Management; Wireless Communication and Networking; P2P Networking; New Media Formats and Systems; Media Advertisement; eHealth; and User Intent Mining. From Oct. 1996 to May 1999, Dr. Li was with the Multimedia Technology Laboratory at Sarnoff Corporation (formerly David Sarnoff Research Center, and RCA Laboratories) as a Member of Technical Staff.
Dr. Li has been actively involved in research and development in broad multimedia areas. He has made several major contributions adopted by the MPEG-4 and H.264 standards. He invented and developed the world's first cost-effective high-quality legacy HDTV decoder in 1998. He started P2P streaming research at MSRA as early as August 2000. He led the building of the first working scalable video streaming prototype across the Pacific Ocean in 2001. He has been an advocate of scalable coding formats and has been instrumental in the SVC extension of the H.264/AVC standard. He first proposed the 6 “D” Media 2.0 concepts that outlined the new directions of next-generation Internet media research (2006). He has authored and co-authored over 200 journal and conference papers and holds 83 granted and 90+ pending US patents in image/video processing, compression and communications, digital television, multimedia and wireless communication.
He is a co-author of the book chapter “MPEG-4 Texture Coding” in “Multimedia Systems and Standards” published by Marcel Dekker, Inc. (2000), the principal author of the book chapter “Image and Video Coding” in “The Wiley Encyclopedia of Telecommunications” published by Wiley and Sons, Inc. (2003), a co-author of the book chapter “Scalable Video Coding for Adaptive Streaming Applications” in “Multimedia over IP and Wireless Networks” published by Academic Press (2007), and a co-author of the book “Video Technology” published by USTC Press (2009). He is a co-editor of “Proceedings of Visual Communications and Image Processing 2005” published by SPIE (2005), and a co-editor of “Lecture Notes in Computer Science: Advances in Multimedia Information Processing - PCM 2008” published by Springer (2008).
Dr. Li is a Senior Member of IEEE. He is the current chair of the Visual Signal Processing and Communications technical committee and a member of the Multimedia Systems and Applications technical committee of the IEEE Circuits and Systems Society. He is a member of the Multimedia Communication technical committee of the IEEE Communication Society and a past member of the Multimedia Signal Processing technical committee of the IEEE Signal Processing Society. He serves as an associate editor of IEEE Transactions on Circuits and Systems for Video Technology and is on the editorial board of the Journal of Visual Communications and Image Representation. He was a special session/tutorial chair of IEEE PCM 2000, a local arrangement chair of IEEE PCM 2001, a technical program chair for VCIP 2005, a general chair of Packet Video 2006, a track chair of ICME 2006, a publicity chair of IEEE ISM 2006, a theme chair of IEEE PSIVT 2006, an award chair of IEEE SiPS 2007, a special session chair of IEEE ICME 2007, a program chair of PCM 2008 and a track chair of ISCAS 2009. He is a general chair of VCIP 2010 and a general chair of CIVR 2010. Dr. Li has served or is serving on over a dozen technical committees of various international conferences on multimedia.
Dr. Li holds guest or adjunct professorships at Sichuan University, Shandong University, Huazhong University of Science and Technology, Shanghai Jiaotong University, the Chinese University of Hong Kong, Nankai University, and Tianjin University. He is also a Ph.D. supervisor at the University of Science and Technology of China and Shanghai Jiaotong University.
Dr. Li received his B.S. and M.S. in Electrical Engineering from the University of Science and Technology of China (USTC) in 1988 and 1991, respectively. He received his Ph.D. in Electrical Engineering from Lehigh University, Bethlehem, PA, in 1996. He was an assistant professor in the Electrical Engineering Department at the University of Science and Technology of China in 1991-1992.
Dr. Li is the only student in the history of the University of Science and Technology of China to have been awarded its top honor, the Guo Mo Ruo Fellowship, twice (1987 and 1991). He was also the first recipient of double Sarnoff Technical Achievement Awards in a single year (Sarnoff, 1997). Dr. Li received the Best Paper Award at VCIP 2007, the Best Poster Paper Award at MMSP 2008 and the Best Paper Award of IEEE Transactions on Circuits and Systems for Video Technology (2009). He also mentored a student who received the Best Student Paper Award at VCIP 2005. In his 10 years with MSRA, Dr. Li has also incubated three MIT TR 35 Award winners: Qian Zhang, Haitao Zheng, and Xian-Sheng Hua.
	Title: Online Editing of Lecture Videos by Integrating Posture, Gesture and Text
Speaker: Prof. Chong-Wah Ngo, City University of Hong Kong
Abstract:
Bio of Speaker: Chong-Wah Ngo received his Ph.D. in Computer Science from the Hong Kong University of Science & Technology (HKUST). He received his MSc and BSc, both in Computer Engineering, from Nanyang Technological University of Singapore. His research interests are in multimedia search and computing. He has served on the technical program committees of numerous multimedia and information retrieval conferences, including ACM Multimedia (MM), ACM SIGIR, the International Conf. on Image and Video Retrieval (CIVR) and the International Conf. on Multimedia and Expo (ICME). In addition, he is on the editorial boards of the Journal of Multimedia Data Engineering and Management and the Journal of Advances in Multimedia. He is the founding leader of VIREO (VIdeo REtrieval grOup). Currently, he also serves as the chairman of ACM (Hong Kong Chapter). Before joining CityU, he was a PhD student under the supervision of Prof. T. C. Pong and Prof. Roland Chin at HKUST. He joined the research group of Prof. Thomas Huang at the Beckman Institute as a post-doctoral visitor for a collaborative project between UIUC and HKUST during 2001. He also worked at the Information Technology Institute (ITI) of Singapore in 1996, and as a summer intern at Microsoft Research Asia in 1999. He was born in Malaysia (Visit Malaysia and then you will find the smile of people and the smile of nature!). His hometown is Pekan Nanas (a perfectly peaceful town), Pontian (the southernmost county of mainland Asia), Johor, Malaysia. He studied at Yu Ming (I) and Foon Yew High School during his primary and secondary school years.
	Title: Building Contextual Visual Vocabulary for Large-scale Image Applications
Speaker: Prof. Qi Tian, The University of Texas at San Antonio, USA
Abstract: Notwithstanding its great success and wide adoption in the Bag-of-Visual-Words representation, a visual vocabulary created from single-image local features is often shown to be ineffective, largely for three reasons. First, many detected local features are not stable enough, resulting in many noisy and non-descriptive visual words in images. Second, a single visual word discards the rich spatial contextual information among the local features, which has been proven to be valuable for visual matching. Third, the distance metric commonly used for generating a visual vocabulary does not take the semantic context into consideration, which leaves the vocabulary prone to noise. To address these three issues, we propose an effective visual vocabulary generation framework containing three novel contributions: 1) we propose an effective unsupervised local feature refinement strategy; 2) we consider local features in groups to model their spatial contexts; 3) we further learn a discriminant distance metric between local feature groups, which we call discriminant group distance. This group distance is further leveraged to induce a visual vocabulary from groups of local features. We name it contextual visual vocabulary, as it captures both the spatial and semantic contexts. We evaluate the proposed local feature refinement strategy and the contextual visual vocabulary in two large-scale image applications: near-duplicate image retrieval on a dataset containing 1.5 million images, and image search re-ranking. Our experimental results show that the contextual visual vocabulary achieves significant improvement over the classic visual vocabulary. Moreover, it outperforms the state-of-the-art Bundled Feature in terms of retrieval precision, memory consumption and efficiency.
Bio of Speaker: Qi Tian is currently an Associate Professor in the Department of Computer Science, the University of Texas at San Antonio (UTSA). During 2008-2009, he took a one-year faculty leave at Microsoft Research Asia (MSRA) in the Media Computing Group (formerly the Internet Media Group) as Lead Researcher. He received his Ph.D. in 2002 from the University of Illinois at Urbana-Champaign, and his B.E. degree in 1992 from Tsinghua University. Dr. Tian’s research interests include multimedia information retrieval and computer vision. He has published about 110 refereed journal and conference papers in these fields. His research projects have been funded by ARO, DHS, HP Lab, SALSI, CIAS, and CAS. He was a co-author of a Best Student Paper at ICASSP 2006, and a co-author of a Best Paper Candidate at PCM 2007. He was a nominee for the 2008 UTSA President Distinguished Research Award. He received the ACM Service Award in 2010. He has served as a program chair, organization committee member, session chair and TPC member for over 120 IEEE and ACM conferences, including ACM Multimedia, SIGIR, ICCV, ICME, ICASSP, ICPR, MIR, VCIP, PCM, etc. He is a founding member of the International Steering Committee of the ACM International Conference on Multimedia Retrieval (ICMR), and an ACM Multimedia Conference Review Committee member. He is an Associate Editor of IEEE Transactions on CSVT, a Guest Co-Editor of IEEE Transactions on Multimedia, the Journal of Computer Vision and Image Understanding, and the EURASIP Journal on Advances in Signal Processing, and is on the Editorial Board of the Journal of Multimedia. He is a Senior Member of IEEE (2003), and a Member of ACM (2004). He currently holds guest professorships at the University of Science and Technology of China (USTC), Zhejiang University and Xidian University, China.
	Title: Content-Based Multimedia Technologies – A Collection of Demonstrations from MSRA
Speaker: Prof. Xian-Sheng Hua, Lead Researcher, Media Computing Group, Microsoft Research Asia
Abstract:
Bio of Speaker: Xian-Sheng Hua received the B.S. and Ph.D. degrees from Peking University, Beijing, China, in 1996 and 2001, respectively, both in applied mathematics. Since 2001, he has been with Microsoft Research Asia, Beijing, where he is currently a Lead Researcher with the Media Computing Group. His current research interests are in the areas of video content analysis, multimedia search, management, authoring, sharing, mining, advertising and mobile multimedia computing. He has authored or co-authored more than 180 publications in these areas and has more than 50 filed patents or pending applications. He is now an adjunct professor at the University of Science and Technology of China, and serves as an Associate Editor of IEEE Trans. on Multimedia, an Associate Editor of ACM Trans. on Intelligent Systems and Technology, an Editorial Board Member of Advances in Multimedia and of Multimedia Tools and Applications, and an editor of Scholarpedia (Multimedia Category). Dr. Hua won the Best Paper Award and Best Demonstration Award at ACM Multimedia 2007, the Best Poster Award at the 2008 IEEE International Workshop on Multimedia Signal Processing, the Best Student Paper Award at the ACM Conference on Information and Knowledge Management 2009, and the Best Paper Award at the International Conference on MultiMedia Modeling 2010. He also won the 2008 MIT Technology Review TR35 Young Innovator Award for his outstanding contribution to enhancing video search, and was named one of the Business Elites of People under 40 to Watch by Global Entrepreneur.

Important links


Association for Computing Machinery	Natural Science Foundation of China	Microsoft Research Asia	Xidian University

Copyright © Xidian University, VCAE2010 Workshop. All rights reserved. Website designed and hosted by XIDIAN-SEE-VIPSL.
