IT training semantic mining technologies for multimedia databases tao, xu li 2009 04 15

551 78 0
IT training semantic mining technologies for multimedia databases tao, xu  li 2009 04 15

Đang tải... (xem toàn văn)

Tài liệu hạn chế xem trước, để xem đầy đủ mời bạn chọn Tải xuống

Thông tin tài liệu

Semantic Mining Technologies for Multimedia Databases Dacheng Tao Nanyang Technological University, Singapore Dong Xu Nanyang Technological University, Singapore Xuelong Li University of London, UK Information science reference Hershey • New York Director of Editorial Content: Senior Managing Editor: Managing Editor: Assistant Managing Editor: Typesetter: Cover Design: Printed at: Kristin Klinger Jamie Snavely Jeff Ash Carole Coulson Amanda Appicello Lisa Tosheff Yurchak Printing Inc Published in the United States of America by Information Science Reference (an imprint of IGI Global) 701 E Chocolate Avenue, Suite 200 Hershey PA 17033 Tel: 717-533-8845 Fax: 717-533-8661 E-mail: cust@igi-global.com Web site: http://www.igi-global.com/reference and in the United Kingdom by Information Science Reference (an imprint of IGI Global) Henrietta Street Covent Garden London WC2E 8LU Tel: 44 20 7240 0856 Fax: 44 20 7379 0609 Web site: http://www.eurospanbookstore.com Copyright © 2009 by IGI Global All rights reserved No part of this publication may be reproduced, stored or distributed in any form or by any means, electronic or mechanical, including photocopying, without written permission from the publisher Product or company names used in this set are for identi.cation purposes only Inclusion of the names of the products or companies does not indicate a claim of ownership by IGI Global of the trademark or registered trademark Library of Congress Cataloging-in-Publication Data Semantic mining technologies for multimedia databases / Dacheng Tao, Dong Xu, and Xuelong Li, editors p cm Includes bibliographical references and index Summary: "This book provides an introduction to the most recent techniques in multimedia semantic mining necessary to researchers new to the field" Provided by publisher ISBN 978-1-60566-188-9 (hardcover) ISBN 978-1-60566-189-6 (ebook) Multimedia systems Semantic Web Data mining Database management I Tao, Dacheng, 1978- II Xu, Dong, 1979- III.Li, Xuelong, 1976QA76.575.S4495 2009 006.7 dc22 2008052436 British Cataloguing in Publication Data A Cataloguing in Publication record for this book is available from the British Library All work contributed to this book is new, previously-unpublished material The views expressed in this book are those of the authors, but not necessarily of the publisher Table of Contents Preface xv Section I Multimedia Information Representation Chapter I Video Representation and Processing for Multimedia Data Mining Amr Ahmed, University of Lincoln, UK Chapter II Image Features from Morphological Scale-Spaces 32 Sébastien Lefèvre, University of Strasbourg – CNRS, France Chapter III Face Recognition and Semantic Features 80 Huiyu Zhou, Brunel University, UK Yuan Yuan, Aston University, UK Chunmei Shi, People’s Hospital of Guangxi, China Section II Learning in Multimedia Information Organization Chapter IV Shape Matching for Foliage Database Retrieval 100 Haibin Ling, Temple University, USA David W Jacobs, University of Maryland, USA Chapter V Similarity Learning for Motion Estimation 130 Shaohua Kevin Zhou, Siemens Corporate Research Inc., USA Jie Shao, Google Inc., USA Bogdan Georgescu, Siemens Corporate Research Inc., USA Dorin Comaniciu, Siemens Corporate Research Inc., USA Chapter VI Active Learning for Relevance Feedback in Image Retrieval 152 Jian Cheng, National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, China Kongqiao Wang, Nokia Research Center, Beijing, China Hanqing Lu, National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, China Chapter VII Visual Data Mining Based on Partial Similarity Concepts 166 Juliusz L Kulikowski, Polish Academy of Sciences, Poland Section III Semantic Analysis Chapter VIII Image/Video Semantic Analysis by Semi-Supervised Learning 183 Jinhui Tang, National University of Singapore, Singapore Xian-Sheng Hua, Microsoft Research Asia, China Meng Wang, Microsoft Research Asia, China Chapter IX Content-Based Video Semantic Analysis 211 Shuqiang Jiang, Chinese Academy of Sciences, China Yonghong Tian, Peking University, China Qingming Huang, Graduate University of Chinese Academy of Sciences, China Tiejun Huang, Peking University, China Wen Gao, Peking University, China Chapter X Applications of Semantic Mining on Biological Process Engineering 236 Hossam A Gabbar, University of Ontario Institute of Technology, Canada Naila Mahmut, Heart Center - Cardiovascular Research Hospital for Sick Children, Canada Chapter XI Intuitive Image Database Navigation by Hue-Sphere Browsing 263 Gerald Schaefer, Aston University, UK Simon Ruszala, Teleca, UK Section IV Multimedia Resource Annotation Chapter XII Formal Models and Hybrid Approaches for Ef.cient Manual Image Annotation and Retrieval 272 Rong Yan, IBM T.J Watson Research Center, USA Apostol Natsev, IBM T.J Watson Research Center, USA Murray Campbell, IBM T.J Watson Research Center, USA Chapter XIII Active Video Annotation: To Minimize Human Effort 298 Meng Wang, Microsoft Research Asia, China Xian-Sheng Hua, Microsoft Research Asia, China Jinhui Tang, National University of Singapore, Singapore Guo-Jun Qi, University of Science and Technology of China, China Chapter XIV Annotating Images by Mining Image Search 323 Xin-Jing Wang, Microsoft Research Asia, China Lei Zhang, Microsoft Research Asia, China Xirong Li, Microsoft Research Asia, China Wei-Ying Ma, Microsoft Research Asia, China Chapter XV Semantic Classification and Annotation of Images 350 Yonghong Tian, Peking University, China Shuqiang Jiang, Chinese Academy of Sciences, China Tiejun Huang, Peking University, China Wen Gao, Peking University, China Section V Other Topics Related to Semantic Mining Chapter XVI Association-Based Image Retrieval 379 Arun Kulkarni, The University of Texas at Tyler, USA Leonard Brown, The University of Texas at Tyler, USA Chapter XVII Compressed-Domain Image Retrieval Based on Colour Visual Patterns 407 Gerald Schaefer, Aston University, UK Chapter XVIII Resource Discovery Using Mobile Agents 419 M Singh, Middlesex University, UK X Cheng, Middlesex University, UK & Beijing Normal University, China X He, Reading University, UK Chapter XIX Multimedia Data Indexing 449 Zhu Li, Hong Kong Polytechnic University, Hong Kong Yun Fu, BBN Technologies, USA Junsong Yuan, Northwestern University, USA Ying Wu, Northwestern University, USA Aggelos Katsaggelos, Northwestern University, USA Thomas S Huang, University of Illinois at Urbana-Champaign, USA Compilation of References 476 About the Contributors 514 Index 523 Detailed Table of Contents Preface xv Section I Multimedia Information Representation Chapter I Video Representation and Processing for Multimedia Data Mining Amr Ahmed, University of Lincoln, UK Video processing and segmentation are important stages for multimedia data mining, especially with the advance and diversity of video data available The aim of this chapter is to introduce researchers, especially new ones, to the “video representation, processing, and segmentation techniques” This includes an easy and smooth introduction, followed by principles of video structure and representation, and then a state-of-the-art of the segmentation techniques focusing on the shot-detection Performance evaluation and common issues are also discussed before concluding the chapter Chapter II Image Features from Morphological Scale-Spaces 32 Sébastien Lefèvre, University of Strasbourg – CNRS, France Multimedia data mining is a critical problem due to the huge amount of data available Efficient and reliable data mining solutions requires both appropriate features to be extracted from the data and relevant techniques to cluster and index the data In this chapter, the authors deal with the first problem which is feature extraction for image representation A wide range of features has been introduced in the literature, and some attempts have been made to build standards (e.g MPEG-7) These features are extracted with image processing techniques, and the authors focus here on a particular image processing toolbox, namely the mathematical morphology, which stays rather unknown from the multimedia mining community, even if it offers some very interesting feature extraction methods They review here these morphological features, from the basic ones (granulometry or pattern spectrum, differential morphological profile) to more complex ones which manage to gather complementary information Chapter III Face Recognition and Semantic Features 80 Huiyu Zhou, Brunel University, UK Yuan Yuan, Aston University, UK Chunmei Shi, People’s Hospital of Guangxi, China The authors present a face recognition scheme based on semantic features’ extraction from faces and tensor subspace analysis These semantic features consist of eyes and mouth, plus the region outlined by three weight centres of the edges of these features The extracted features are compared over images in tensor subspace domain Singular value decomposition is used to solve the eigenvalue problem and to project the geometrical properties to the face manifold They also compare the performance of the proposed scheme with that of other established techniques, where the results demonstrate the superiority of the proposed method Section II Learning in Multimedia Information Organization Chapter IV Shape Matching for Foliage Database Retrieval 100 Haibin Ling, Temple University, USA David W Jacobs, University of Maryland, USA Computer-aided foliage image retrieval systems have the potential to dramatically speed up the process of plant species identification Despite previous research, this problem remains challenging due to the large intra-class variability and inter-class similarity of leaves This is particularly true when a large number of species are involved In this chapter, the authors present a shape-based approach, the innerdistance shape context, as a robust and reliable solution They show that this approach naturally captures part structures and is appropriate to the shape of leaves Furthermore, they show that this approach can be easily extended to include texture information arising from the veins of leaves They also describe a real electronic field guide system that uses our approach The effectiveness of the proposed method is demonstrated in experiments on two leaf databases involving more than 100 species and 1000 leaves Chapter V Similarity Learning for Motion Estimation 130 Shaohua Kevin Zhou, Siemens Corporate Research Inc., USA Jie Shao, Google Inc., USA Bogdan Georgescu, Siemens Corporate Research Inc., USA Dorin Comaniciu, Siemens Corporate Research Inc., USA Motion estimation necessitates an appropriate choice of similarity function Because generic similarity functions derived from simple assumptions are insufficient to model complex yet structured appearance variations in motion estimation, the authors propose to learn a discriminative similarity function to match images under varying appearances by casting image matching into a binary classification problem They use the LogitBoost algorithm to learn the classifier based on an annotated database that exemplifies the structured appearance variations: An image pair in correspondence is positive and an image pair out of correspondence is negative To leverage the additional distance structure of negatives, they present a location-sensitive cascade training procedure that bootstraps negatives for later stages of the cascade from the regions closer to the positives, which enables viewing a large number of negatives and steering the training process to yield lower training and test errors They also apply the learned similarity function to estimating the motion for the endocardial wall of left ventricle in echocardiography and to performing visual tracking They obtain improved performances when comparing the learned similarity function with conventional ones Chapter VI Active Learning for Relevance Feedback in Image Retrieval 152 Jian Cheng, National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, China Kongqiao Wang, Nokia Research Center, Beijing, China Hanqing Lu, National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, China Relevance feedback is an effective approach to boost the performance of image retrieval Labeling data is indispensable for relevance feedback, but it is also very tedious and time-consuming How to alleviate users’ burden of labeling has been a crucial problem in relevance feedback In recent years, active learning approaches have attracted more and more attention, such as query learning, selective sampling, multi-view learning, etc The well-known examples include Co-training, Co-testing, SVMactive, etc In this literature, the authors will introduce some representative active learning methods in relevance feedback Especially they will present a new active learning algorithm based on multi-view learning, named Co-SVM In Co-SVM algorithm, color and texture are naturally considered as sufficient and uncorrelated views of an image SVM classifier is learned in color and texture feature subspaces, respectively Then the two classifiers are used to classify the unlabeled data These unlabeled samples that disagree in the two classifiers are chose to label The extensive experiments show that the proposed algorithm is beneficial to image retrieval Chapter VII Visual Data Mining Based on Partial Similarity Concepts 166 Juliusz L Kulikowski, Polish Academy of Sciences, Poland Visual data mining is a procedure aimed at a selection from a document’s repository subsets of documents presenting certain classes of objects; the last may be characterized as classes of objects’ similarity or, more generally, as classes of objects satisfying certain relationships In this chapter attention will be focused on selection of visual documents representing objects belonging to similarity classes About the Contributors Xuelong Li is the Reader in Cognitive Computing at Birkbeck College, University of London, a Visiting Professor at the Tianjin University, and a Guest Professor at the University of Science and Technology of China *** Amr Ahmed (BEng’93, MSc’98, PhD’04, MBCS’05) is a Senior Lecturer (equivalent to Assistant Professor in the American system), and the Leader of the DCAPI (Digital Contents Analysis, Production, and Interaction: dcapi.lincoln.ac.uk) research group at the Department of Computing and Informatics, University of Lincoln, UK His research focuses on the analysis, production, and interaction with digital contents His research interests include video processing, segmentation, scene understanding, and the integration between computer vision and graphics Amr worked in the industry for several years, including Sharp Labs of Europe, Oxford, as a Research Scientist He was working as a Research Fellow at the University of Surrey before joining the University of Lincoln in 2005 Dr Ahmed is a Member of the British Computer Society (MBCS) He received his Bachelor’s degree in electrical engineering from Ain Shams Univerity, Egypt, in 1993, his M.Sc degree, by research, in Computer and Systems Engineering from Ain Shams Univerity, Egypt, in 1998, and his Ph.D degree in Computer Graphics and Animation from the University of Surrey, U.K., in 2004 E-mail address: Amr.Ahmed@BCS.org Web site: webpages.lincoln.ac.uk/AAhmed Leonard Brown is an Assistant Professor in the Computer Science Department at The University of Texas at Tyler He received his M.S and Ph.D degrees in Computer Science from The University of Oklahoma in 1997 and 2003, respectively Prior to that, he was a Member of Technical Staff-I at AT&T in Oklahoma City His current research interests include multimedia database management systems and image retrieval He has published over a dozen refereed technical articles in journals and conference proceedings, and he was awarded a NASA Summer Faculty Fellowship in 2004 at the Jet Propulsion Laboratory He is a member of ACM and IEEE Jian Cheng, associate professor of Institute of Automation, Chinese Academy of Sciences He received the B.S and M.S degrees in Mathematics from Wuhan University in 1998 and in 2001, respectively In 2004, he got his Ph.D degree in pattern recognition and intelligent systems from Institute of Automation, Chinese Academy of Sciences From 2004 to 2006, he has been working as postdoctoral in Nokia Research Center Then he joined National Laboratory of Pattern Recognition, Institute of Automation His current research interests include image and video retrieval, machine learning, etc Xiaochun Cheng Dr Xiaochun Cheng had his BEng on Computer Software in 1992 and his PhD on Artificial Intelligence in 1996 He has been a senior member of IEEE since 2004 He is the secretary for IEEE SMC UK&RI He is a member of following technical committee of IEEE SMC: Technical Committee on Systems Safety and Security, Technical Committee on Computational Intelligence Yun Fu received the B.Eng in information and communication engineering and M Eng in pattern recognition and intelligence systems from Xian Jiao Tong University (XJTU) in 2001 and 2004, the M.S degree in statistics and the Ph.D degree in Electrical and Computer Engineering (ECE) from University of Illinois at Urbana-Champaign (UIUC) in 2007 and 2008, respectively From 2001 to 2004, he was a 515 About the Contributors research assistant at the AI&R at XJTU From 2004 to now, he is a graduate fellow and research assistant at the Beckman Institute for Advanced Science and Technology, ECE department and Coordinated Science Laboratory at UIUC He was a research intern with Mitsubishi Electric Research Laboratories, Cambridge, MA, in summer 2005; with Multimedia Research Lab of Motorola Labs, Schaumburg, IL, in summer 2006 He jointed BBN Technologies, Cambridge, MA, as a Scientist in 2008 Hossam A Gabbar is an Associate Professor in the Faculty of Energy Systems and Nuclear Science, UOIT, Canada Dr Gabbar obtained his Ph.D in Process Systems Engineering from Okayama University (Japan) He joined Tokyo Institute of Technology and Japan Chemical Innovative Institute (JCII) between 2001-2004 In 2004, he joined Okayama University as an associate professor in the division of Industrial Innovation Sciences From 2007 till 2008, he was a visiting scholar in University of Toronto, in the Mechanical and Industrial Engineering Department He has contributed significantly to the areas of intelligent green production systems and process systems engineering He published in reputable international journals, books, book chapters, patent, and industrial technical reports He is regularly invited to conferences, tutorial, industrial and scientific events as keynote a speaker He is the Editor-in-Chief of Int J of Process Systems Engineering (PSE), editorial board member of the international journal of Resources, Energy and Development (READ), senior member of IEEE, chair of technical committee on Intelligent Green Production Systems, and board member of a numerous conferences and scientific committees His solutions are widely implemented in industry in the areas of energy systems, safety instrumented systems, and process safety management Wen Gao (M’92-SM’05) received the BSc and MSc degrees in computer science from the Harbin University of Science and Technology and the Harbin Institute of Technology, China, in 1982 and 1985, respectively, and the PhD degree in electronics engineering from the University of Tokyo, Japan, in 1991 He is currently a professor in the School of Electronics Engineering and Computer Science, Peking University, China He is the editor-in-chief of the Journal of Computer (in Chinese), associate editor of the IEEE Transactions on Circuit System for Video Technology and IEEE Transaction on Multimedia, and editor of the Journal of Visual Communication and Image Representation He published four books and more than 300 technical articles in refereed journals and proceedings in the areas of multimedia, video compression, face recognition, sign language recognition and synthesis, image retrieval, multimodal interface, and bioinformatics He is a senior member of the IEEE Xin He Mr Xin He received his BSc in Computer Science with First Class Honours from Beijing Union University, P.R China and BSc in Computer Technology with Second Class (Upper Division) Honours from The University of East London, UK in 2003 He received his MSc in Network Centred Computing with Merit from the University of Reading in 2004 He is currently continuing his study as a PhD student in School of Systems Engineering at the University of Reading His study is supported by the Research Endowment Trust Fund (RETF) His research includes artificial intelligence, multi-agent system and software engineering Xian-Sheng Hua received the B.S and Ph.D degrees from Peking University, Beijing, China, in 1996 and 2001, respectively, both in applied mathematics Since 2001, he has been with Microsoft Research Asia, Beijing, where he is currently a Lead Researcher with the internet media group His current research interests include video content analysis, multimedia search, management, authoring, 516 About the Contributors sharing and advertising He has authored more than 130 publications in these areas and has more than 30 filed patents or pending applications HUA is a member of the Association for Computing Machinery and IEEE He is an adjunct professor of University of Science and Technology of China, and serves as an Associate Editor of IEEE Transactions on Multimedia and Editorial Board Member of Multimedia Tools and Applications Hua won the Best Paper Award and Best Demonstration Award in ACM Multimedia 2007 Qingming Huang (M’04-SM’08) He was born on Dec 23, 1965 in Harbin, China He obtained his B.S in computer science in 1988 and Ph.D in computer engineering in 1994, both from Harbin Institute of Technology, China He was a postdotoral fellow in the National University of Singapore from 1995 to 1996, and a member of research staff in Institute for Infocomm Research, Singapore from 1996 to 2002 He joined the Chinese Academy of Sciences (CAS) in 2003 under the Science100 Plan, and is now a professor in the Graduate Universwity of CAS His research interests include multimedia computing, digital video analysis, video coding, image processing, pattern recognition and computer vision He has published over 100 academic papers in various journals and conference proceedings, and holds/files more than 10 patents in US, Singapore and China Thomas S Huang received his Sc.D from MIT in 1963 He is William L Everitt Distinguished Professor in the University of Illinois Department of Electrical and Computer Engineering and the Coordinated Science Lab (CSL); and a full-time faculty member in the Beckman Institute Image Formation and Processing and Artificial Intelligence groups His professional interests are computer vision, image compression and enhancement, pattern recognition, and multimodal signal processing He is a Member of the National Academy of Engineering, a Foreign Member of the Chinese Academies of Engineering and Sciences, and a Fellow of IAPR and OSA, and has received a Guggenheim Fellowship, an A.V Humboldt Foundation Senior U.S Scientist Award, and a Fellowship from the Japan Association for the Promotion of Science He received the IEEE Signal Processing Society’s Technical Achievement Award in 1987 and the Society Award in 1991 He was awarded the IEEE Third Millennium Medal in 2000 Tiejun Huang (M’01) received the BSc and MSc degrees from the Department of Automation, Wuhan University of Technology in 1992, and the PhD degree from the School of Information Technology & Engineering, Huazhong University of Science and Technology, China, in 1999 He was a postdoctorial researcher from 1999 to 2001 and a research faculty member at the Institute of Computing Technology, Chinese Academy of Sciences He was also the associated director (from 2001 to 2003) and the director (from 2003 to 2006) of the Research Center for Digital Media in Graduate School at the Chinese Academy of Sciences He is currently an associate professor in the School of Electronics Engineering and Computer Science, Peking University His research interests include digital media technology, digital library, and digital rights management He is a member of the IEEE David W Jacobs received the B.A degree from Yale University in 1982.  From 1982 to 1985 he worked for Control Data Corporation on the development of data base management systems, and attended graduate school in computer science at New York University From 1985 to 1992 he attended M.I.T., where he received M.S and Ph.D degrees in computer science.  From 1992 to 2002 he was a Research Scientist and then a Senior Research Scientist at the NEC Research Institute in Princeton, New Jersey In 1998 he spent a sabbatical at the Royal Institute of Technology (KTH) in Stockholm.  Since 2002, he has been an Associate Professor of computer science at the University of Maryland, College Park 517 About the Contributors Shuqiang Jiang (M’07) received the MSc degree from College of Information Science and Engineering, Shandong University of Science and Technology in 2000, and the PhD degree from the Institute of Computing Technology, Chinese Academy of Sciences in 2005 He is currently a faculty member at Digital Media Research Center, Institute of Computing Technology, Chinese Academy of Sciences His research interests include multimedia processing and semantic understanding, pattern recognition, and computer vision He has published over 50 technical papers in the area of multimedia Aggelos K Katsaggelos received the Diploma degree in electrical and mechanical engineering from the Aristotelian University of Thessaloniki, Greece, in 1979 and the M.S and Ph.D degrees both in electrical engineering from the Georgia Institute of Technology, in 1981 and 1985, respectively He is currently Professor of EECS at Northwestern University, and the director of the Motorola Center for Seamless Communications He is the editor of Digital Image Restoration (Springer-Verlag 1991), coauthor of Rate-Distortion Based Video Compression (Kluwer 1997), co-editor of Recovery Techniques for Image and Video Compression and Transmission, (Kluwer 1998), co-author of the books Super-Resolution of Images and Video and Joint Source-Channel Video Transmission (both Morgan & Claypool Publishers 2007) He is a Fellow of the IEEE (1998), and the recipient of the IEEE Third Millennium Medal (2000), the IEEE Signal Processing Society Meritorious Service Award (2001), an IEEE Signal Processing Society Best Paper Award (2001), and an IEEE ICME Best Poster Paper Award (2006) Arun Kulkarni, Professor of Computer Science, has been with The University of Texas at Tyler since 1986 His areas of interest include soft computing, database systems, data mining, artificial intelligence, computer vision, image processing, and pattern recognition He has more than sixty refereed papers to his credit, and he has authored two books His awards include the 2005-2006 President’s Scholarly Achievement Award, 2001-2002 Chancellor’s Council Outstanding Teaching award, 1999-2000 Alpha Chi Outstanding Faculty Member, 1997 NASA/ASEE Summer Faculty Fellowship award, 1997 Piper award nominee for The University of Texas at Tyler, and the 1984 Fulbright Fellowship award He has been listed in who’s who in America He has been a software developer for twenty years, and has developed a variety of software packages He has successfully completed eight research grants during the past ten years Dr Kulkarni obtained his Ph.D from the Indian Institute of Technology, Bombay, and was a post-doctoral fellow at Virginia Tech Juliusz L Kulikowski received MSc degree in electronic engineering from the Warsaw Technical University in 1955, CandSc degree from the Moscow Higher School of Technology in 1959, DSc degree from the Warsaw Technical University in 1966 Since 1966 he was a scientific worker in several Institutes of the Polish Academy of Sciences, since 1981 – in the Institute of Biocybernetics and Biomedical Engineering PAS in Warsaw Nominated professor in 1973 He published more than 200 papers in information sciences, signals detection in noise, image processing methods, artificial intelligence, application of computers in medicine and books and monographs in these domains He is the Editor in Chief of a scientific quarterly “Computer Graphics & Vision”, a member of IFIP TC13 on “Human-Computer Interaction”, of IFAC TC on “Stochastic Systems”, a Chairman of Polish National Committee for cooperation with the Committee of Data for Science and Technology CODATA Sébastien Lefevre received in 1999 the M.Sc and Eng degrees in Computer Engineering from the University of Technology of Compiègne, France, and in 2002 the Ph.D degree in Computer Sci- 518 About the Contributors ence from the University of Tours, France He is currently an Assistant Professor in the Department of Computer Science and the Image Sciences, Computer Sciences and Remote Sensing Laboratory - LSIIT, University Louis Pasteur, Strasbourg, France From 1999 to 2002 he was with AtosOrigin as a Research and Development Engineer In 2003, he was with the Polytechnical School of the University of Tours as an Assistant Professor His research interests are related to mathematical morphology and its applications to content-based image and video indexing, color image and video analysis, multispectral and multitemporal image processing Xirong Li received his B.S and M.S both in Computer Science from Tsinghua University, in 2005 and 2007 He was an intern in Microsoft Research Asia from Oct 2005 to Jun 2007 He is currently a PhD student in the Intelligent Systems Lab Amsterdam (ISLA) at University of Amsterdam Zhu Li received the PhD degree in Electrical & Computer Engineering from Northwestern University, Evanston, USA, in 2004 He is an Assistant Professor with the Dept of Computing, Hong Kong Polytechnic University since 2008 He was with the Multimedia Research Lab (MRL), Motorola Labs, 2000~2008, where he was a Principal Staff Engineer He is an IEEE Senior Member, and the elected Vice Chair (2008-2010) of the IEEE Multimedia Communication Technical Committee (MMTC) His research interests include subspace modeling and machine learning in biometrics, multimedia analysis and search, video coding and adaptation, optimization and distributed computing in multimedia networks and systems He has 12 issued or pending US patents, 30+ publications in these areas He received the Best Poster Paper Award at IEEE Int’l Conf on Multimedia & Expo, 2006, and the DoCoMo Labs Innovative Paper Award (Best Paper) at IEEE Int’l Conf on Image Processing, 2007 Haibin Ling received the B.S degree in mathematics and the MS degree in computer science from Peking University, China, in 1997 and 2000, respectively, and the PhD degree from the University of Maryland, College Park, in computer science in 2006 From 2000 to 2001, he was an assistant researcher in the Multi-Model User Interface Group at Microsoft Research Asia From 2006 to 2007, he worked as a postdoctoral scientist at the University of California Los Angeles After that, he joined Siemens Corporate Research as a research scientist Since fall 2008, he has been an Assistant Professor at Temple University Dr Ling’s research interests include computer vision, medical image analysis, human computer interaction, and machine learning He received the Best Student Paper Award at the ACM Symposium on User Interface Software and Technology (UIST) in 2003 Hanqing Lu, professor of Institute of Automation, Chinese Academy of Sciences He got his B.S and M.S from department of computer science and department of electric engineering in Harbin institute of technology in 1982 and 1985 He received his Ph.D from department of electronic and information science in Huazhong University of sciences and technology His current research interests include Image similarity measure, Video Analysis, Multimedia Technology and System and so on He has published over peer-reviewed 200 papers in related journals and conferences He was award by Second Award of National Nature Sciences, Second Award of CAS Nature Sciences and Third Award of Cultural Department of China Wei-Ying Ma received the B.S degree from the National Tsinghua University in Taiwan in 1990, and the M.S and Ph.D degrees from the University of California at Santa Barbara in 1994 and 1997, 519 About the Contributors respectively He is a senior researcher and research manager in Microsoft Research Asia, where he has been leading a research group to conduct research in the areas of information retrieval, web search, data mining, mobile browsing, and multimedia management He has published book chapters and over 100 international journal and conference papers Guo-Jun Qi received the B.S degree from University of Science and Technology of China in Automation, Hefei, Anhui, China, in 2005 His research interests include computer vision, multimedia, and machine learning, especially content-based image/video retrieval, analysis, management and sharing He has been the winner of the best paper award in the 15th ACM International Conference on Multimedia, Augsburg, Germany, 2007 He is now working in Internet Media Group at Microsoft Research Asia as a research intern Mr Qi is the student member of Association for Computing Machinery Simon Ruszala was awarded a Distinction in MSc Industrial Computing from the Nottingham Trent University in 2005, after gaining a BSC (Hons) 1st in Computing Visualisation from the same university A career change to telecommunications after graduation has led to him managing a team of engineers aiding the build of Vodafone UK’s new core network Interests in image retrieval and colour image analysis have led to a number of publications during and after university Gerald Schaefer gained his PhD in Computer Vision from the University of East Anglia He worked at the Colour & Imaging Institute, University of Derby as a Research Associate (1997-1999), as Senior Research Fellow at the School of Information Systems, University of East Anglia (2000-2001), and as Senior Lecturer in Computing at the School of Computing and Informaticcs at Nottingham Trent University (2001-2006) In 2006 he joined the School of Engineering and Applied Science at Aston University His research interests include colour image analysis, physics-based vision, image retrieval, and image coding He has published more than 150 papers in these areas Chunmei Shi is currently working as a dentist in the People’s Hospital of Guangxi, Nanning, China Manuraj Singh Mr Manuraj Singh received his BEng in Engineering with First Class Distinction from Bangalore University, India in 1999 He is currently working as a System Analyst and Lecturer He is also pursuing his study as MPhil student in School of Computer Science at Middlesex University He is member of British Computer Society since 2004 His research includes mobile agents systems, data mining and semantic web Yonghong Tian (S’02-M’05) received the MSc degree from the School of Computer Science, University of Electronic Science & Technology of China in 2000, and the PhD degree from the Institute of Computing Technology, Chinese Academy of Sciences in 2005 He is currently an associate professor in the School of Electronics Engineering and Computer Science, Peking University, China His research interests include machine learning, data mining, semantic-based multimedia analysis, and retrieval He has published over 40 technical papers in the area of multimedia and machine learning He is a member of the IEEE 520 About the Contributors Jinhui Tang is currently a postdoctoral research fellow in School of Computing, National University of Singapore He received his B.E and PhD degrees in July 2003 and July 2008 respectively, both from the University of Science and Technology of China From Jun 2006 to Feb 2007, he worked as a research intern in Internet Media group at Microsoft Research Asia And from Feb 2008 to May 2008, he worked as a research intern in School of Computing at National University of Singapore His current research interests include content-based image retrieval, video content analysis and pattern recognition Dr Tang is a member of ACM and a student member of IEEE Kongqiao Wang received his Ph.D in 1999 in signal and information processing from University of Science and Technology of China (USTC) Currently, he is the Visual Systems team leader at Nokia Research Center Beijing and a part-time professor at USTC Kongqiao has received over 40 granted/ pending patents, published four book chapters, and authored or co-authored more than 40 conference and journal papers His research interests include visual computing technologies, data mining and machine learning Meng Wang is currently an Associate Researcher in Microsoft Research Asia He received the B.E degree in the Special Class for the Gifted Young and Ph.D degree in the Department of Electronic Engineering and Information Science from the University of Science and Technology of China (USTC), Hefei, China, in 2003 and 2008, respectively His current research interests include multimedia content analysis, computer vision and pattern recognition Dr Wang is a member of ACM Xin-Jing Wang received her PhD degree from Tsinghua University in 2005 She is now an associate researcher in Microsoft Research Asia Her primary research interests include image retrieval, image understanding, and pattern recognition Ying Wu received the B.S from Huazhong University of Science and Technology, Wuhan, China, in 1994, the M.S from Tsinghua University, Beijing, China, in 1997, and the Ph.D in electrical and computer engineering from the University f Illinois at Urbana-Champaign (UIUC), Urbana, Illinois, in 2001 In 2001, he joined the Department of ECE at Northwestern University, Evanston, Illinois, as an assistant professor where he is currently an associate professor of Electrical Engineering and Computer Science His research interests include computer vision, image and video analysis, pattern recognition, machine learning, multimedia data mining, and human-computer interaction He is an associate editor of SPIE Journal of Electronic Imaging and an associate editor of IAPR Journal of Machine Vision and Applications He received the Robert T Chien Award at UIUC in 2001, and the NSF CAREER award in 2003 He is a senior member of the IEEE Rong Yan, Apostol Natsev, and Murray Campbell are with IBM Research Junsong Yuan is currently a Ph.D candidate in Electrical Engineering and Computer Science department of the Northwestern University His research interests include computer vision, multimedia data mining, and machine learning During the summer of 2008, 2007 and 2006, he was a research intern with the Communication and Collaboration Systems group, Microsoft Research, Redmond, WA, Kodak Research Labs, Rochester, NY, and Motorola Labs, Schaumburg, IL, respectively He received his M.Eng from the National University of Singapore in 2005 From 2003 to 2004, he was a research 521 About the Contributors assistant in the Institute for Infocomm Research (I2R) in Singapore He was enrolled in the Special Program for the Gifted Young of Huazhong University of Science and Technology and received his B.S in Communication Engineering in 2002 He was awarded the national outstanding student and the Hu-Chunan fellowship in 2001, by the Ministry of Education in P.R.China Yuan Yuan is currently a Lecturer at the Aston University, United Kingdom She received her BEng degree from the University of Science and Technology of China and PhD degree from the University of Bath, United Kingdom She published more than forty papers in journals and conferences on visual information processing, compression, retrieval etc She is an associate editor of the International Journal of Image and Graphics (World Scientific) She was on program committees of several IEEE/ACM conferences She is a reviewer for several IEEE transactions, other international journals and conferences She is a member of the IEEE Lei Zhang received his PhD degree from Tsinghua University in 2001 He is a researcher and a project lead in Microsoft Research Asia His current research interests include search relevance ranking, web-scale image retrieval, social search, and photo management and sharing He is a member of IEEE and a member of ACM Huiyu Zhou received his BEng degree in radio technology from Huangzhong University of Science and Technology, P R China He was awarded an MSc degree in biomedical engineering from the University of Dundee and recently the PhD degree in computer vision from Heriot-Watt University, Edinburgh, Scotland Currently, he is a research fellow in Brunel University, UK His research interests include computer vision, human motion analysis, intelligent systems and human-computer interface Since 2002 he has more than 50 papers published in international and national journals and conferences Shaohua Kevin Zhou received his Ph.D degree in Electrical Engineering from University of Maryland at College Park in 2004 He then joined Siemens Corporate Research, Princeton, New Jersey, as a research scientist and currently he is a project manager He has general research interests in signal/image/video processing, computer vision, pattern recognition, machine learning, and statistical inference and computing, with applications to biomedical image analysis (especially biomedical image context learning), biometrics and surveillance (especially face and gait recognition), etc He has written two monographes: the lead author of the book entitled Unconstrained Face Recognition and a coauthor of the book entitled Recognition of Humans and Their Activities Using Video, edited a book on Analysis and Modeling of Faces and Gestures, published over 60 book chapters and peer-reviewed journal and conference papers, and possessed over 30 provisional and issued patents He was identified as Siemens Junior Top Talent in 2006 522 523 Index Symbols 2D multi-resolution hidden Markov model (2DMHMM) 352, 369 2D shapes 106 3-dimensional colour histogram 407 3D MARS 265 3D surfaces 105 A ABIR system 379 ABIR system, architecture of 385 abstract semantics 351 accuracy 24 active learning 152,  301 active learning, sample selection strategies in 302 active learning-based video annotation 307 active learning algorithm 152 active learning for relevance feedback 152,  154 active learning in video annotation 298 active learning methods for video annotation 298 active learning techniques 302 active learning with SVM 307 active video annotation 298 activity diagram 446 adaptation 65 agent class 443 AgentPassport Class 443 Aglets 433 algebraic filters 37 angular second moment 389 annotation of images 350 annotation time models 286 ANSI/ISA 238 ANSI/ISA-S88 238 ANSI/ISA-S88 structure 239 approximated matching 467 articulated database 121 articulated shape and models 224 articulation insensitivity 110 articulation invariant signatures 114 association-based image retrieval (ABIR) 379,  384 associative storage and retrieval 393 asynchronous 430 auto-annotation models 367 automatic image annotation 363 autonomous 430 average match percentile (AMP) 414 B bag-of-words model 359 based process modeling 242 beach/ocean images 269 Bhattacharyya distance 135 bidirectional associative memories (BAMs) 393, 396 binary images 68 biological process engineering 236 biological research process hierarchy 241 biomedical imaging 67 biometrics 69 blob-based representation 365 block, definition 410 block-based approach 355 block-based differences 16 block colour co-occurrence matrix 413 boosting 136 Copyright © 2009, IGI Global, distributing in print or electronic forms without written permission of IGI Global is prohibited Index BoostMotion 143 browsing 276 browsing interface 286 browsing time statistics 286 C CAPE-ModE 256 CargoElement Class 442 CBIR components 396 CBIR queries 384 CBIR system 379 CBIR system, schematic diagram of 382 centers of similarity 177 centralized system 421 change measures 10 chord 424 chromatic transition class-based LRV (CLRV) 362 classical image-based face recognition algorithms 81 CMU PIE database 91 Co-EM algorithm 157 Co-SVM algorithm 152,  154, 158 Co-SVM algorithm, flowchart of 158 Co-SVM algorithm, pseudo-code of 160 co-testing 152 co-training 152, 185 color 212 color feature 386 colour histograms 412 colour information 408 colour visual pattern image coding (CVPIC) 407, 408, 409 colour visual patterns 407 compatibility 431 compressed-domain image retrieval 407 compressed data, shot-boundary detection from 22 compressed video data compression algorithm 407 computational complexity 25 computer-aided foliage identification 100 computer-aided foliage image retrieval systems 100 computer-aided operation engineering environment 256 524 computer-aided process engineering environment 256 computing power 211 concept selection 311 constant brightness 134 content-based image retrieval (CBIR) 153,  380, 350 content-based image retrieval (CBIR) field 324 content-based image retrieval (CBIR) systems 379 content-based image retrieval and categorization 68 content-based video semantic analysis 211 content addressable network (CAN) 425 contrast 389 correlation 390 cross-media relevance model 367 CVPIC, edge patterns used 409 CVPIC data 407 CVPIC retrieval, block colour co-occurrence matrix 413 CVPIC retrieval, by colour and shape 410 CVPIC retrieval, uniform/non-uniform colour histograms 412 D database images 380 database management systems (DBMSs) 380 data mining solutions 32 data partition based approach 452 data sets 370 dataset selection 331 DCT-based detection techniques 22 DCT coefficient 153 decentralized system 421 decentralized unstructured P2P systems 421 density 303 descriptors 213 detailed semantics 351 detection-rate 24 detectors 213 dilation 34 dimensionality reduction, generalized feature extraction for 461 directed graphs, learning with 190 discrete cosine transform (DCT) Index discrete Fourier transform (DFT) 392 discriminant simplex analysis 460 discriminative similarity function 132, 136 dissolve 10 distributed agent environment (DAE) 432 distributed computing 336 diversity 304 document analysis 68 document archives 380 document images 68 E echocardiogram, experiments on 144 echocardiography 130 echocardiography tracking 146 edge-based detection of gradual transitions 20 edge-based differences 14 effective data-driven approach 331 efficient algorithms 63 eigenface 94 eigenface-based face recognition 82 eigenfaces 83 electronic field guide prototype 121 electronic field guide system 120 engineering formal language (EFL) 239, 245, 247 entropy 390 erosion 34 error-rate 24 Euclidean distance 336 event detection 211, 225 eyes detection 89 F face images in ORL database 82 face modeling 80 face recognition 80 face recognition and semantic features 80 face recognition scheme 80 fade-in fade-out false detection 24 false positive 24 fault tolerant 431 feature extraction 386 feature selection 138 Fisher’s linear discriminant (FLD) 81, 84 Fisherface 86,  95 Fisherface-based face recognition 84 foliage database retrieval 100 foliage image retrieval 103,  120 formal methods 245 formal methods, limitations of 246 formal methods examples 246 formal methods to engineering design & operation 246 formal models 272 Fourier transform (FT) 392 Fourier transform domain features 392 frame, definition frequency-based annotation 277 FrontClient Class 442 FT coefficients 393 G Gaussian random fields 187 generalized BAM- bus structure 397 generalized BAM-ring structure 397 generalized BAM- tree structure 396 generalized bi-directional associative memory (GBAM) 379,  404 generalized feature extraction for dimensionality reduction 461 generic semantic descriptors 89 generic similarity function 134 generic similarity function, categorization of 134 generic similarity function, difficulty from 136 geodesic distances for 3D surfaces 105 global approach 355 global color histograms 387 global consistency assumption 194 global consistency method 188 global features 15, 212 global histogram-based techniques 17 global morphological features 51 Google images 338 gradual-transition detection from “uncompressed data” 18 gradual transition gradual transitions, edge-based detection of 20 gradual transitions, motion-based detection of 20 525 Index gradual transitions, other combined techniques for detection 21 granulometry 32 graph-based methods 186 graph-based model 458 graph-based semi-supervised learning methods 195 graph construction 190 graph embedding 458 GRF method 195 Grid-based representation 365 group of pictures (GOP) H hamming distance 336 hard-cut transition harmonic functions method 187 hash code based image retrieval 336 hash code filtering 336 hash code generation algorithm 335 hash code generation process 335 hash codes 333 Hasse diagram 179 hierarchical aspect model 352 hierarchical image classification 358 hierarchical video representation 214 high-dimensional space, indexing and querying 463 high-level feature extraction 299 high level video semantic analysis and understanding 225 histogram-based differences 15 histogram-based similarity function 135 HSI-color space transformation 386 hue, saturation, intensity (HSI) 386 hue-sphere 265 hue-sphere browsing 263 hue-value co-ordinate system 266 human-based interface systems, knowledge structure of 258 human effort 298 human factor classifications 258 human factors, modeling 258 human memory 379 hybrid approaches 272 hybrid systems 427 hybrid techniques 23 526 I identity transition image/video database 183 image/video semantic analysis 183, 191 image annotation approaches 278 image auto-annotation 327 image composition levels 176 image database navigation 267 image database visualisation 265 image data sets 402 image features 32 image matching 131 image processing 167 image representation 364 image representations for annotation 366 image retrieval 152, 411 image retrieval algorithm 120 images, annotating 323, 350 independent component analysis (ICA) 81 indexing images 386 indexing solutions 450 index tree structure 451 inner-distance 107 inner-distance, articulation insensitivity of 108 inner-distance and its computation 107 inner-distances and part structures 113 inner-distance shape context (IDSC) 115 inner similarity compactness 174 input video data, types of 10 integral image 224 intensity-based similarity function 134 inter-coding interactive retrieval, application to 282 Intermediate semantic modeling 355 intra-coding intuitive image database navigation 263 inverse Fourier transform 392 IPGen class 442 K Karhunnen-Loeve transformation (KLT) 84 kd-tree example 454 key frame 214 key frame extraction 214 keypoint-based representation 366 Index knowledge-based process modeling approach 244 knowledge acquisition and validation 237 knowledge structuring and representation 237 L laplacianface 88,  95 large-scale image repositories 380 laser alignment operation 254 latent dirichlet allocation (LDA) model 325, 352 latent space representation 359 learned similarity function 130 learning-based annotation 279 learning-based visual tracking 143 linear associative matrices 395 linear methods 460 linear neighborhood propagation (LNP) 188 linkage relationship vector (LRV) 361 local and global consistency method 188 local features 13, 213 locality sensitive hashing (LSH) 464 local learning regularization (LL-Reg) 189 local linear embedding (LLE) 81 local morphological features 49 local semantic concepts 356 location-sensitive cascade training 141 location-sensitive cascade training algorithm 143 long-term memory (LTM) 394 low-level modeling 354 low-level scene representation 358 LSH for Euclidean space 465 LSH for hamming space 465 LUFT computation 468 LUFT examples 469 LUFT space kd-tree partition 469 M machine learning algorithms 157 machine learning tools 132 manifold-ranking 312 manual image annotation 272 manual image annotation and retrieval 272 manual image annotation methods 275 manual image annotation systems, examples of 274 mathematical morphology, basics of 33 maximum a posteriori (MAP) criterion 358 medical image management 380 midstream content access 408 Min-Cut 187 mining image search 323 mining search results 328 mobile agent 433 mobile agent based resource discovery system 437 mobile agents 419 mobile agents, resource discovery using 419 mobile agent system 429 mobile agent system interoperability facility (MASIF) 431 modality-specific similarity function 135 model construction of process systems 237 modeless annotation approach 328 MoFlo, EFL for 253 MoFlo, operation engineering of 248 MoFlo-based biological process 240 MoFlo biological process engineering 240 MoFlo cell sorter 255 MoFlo cell sorter, hazardous operation scenarios of 255 MoFlo cytomation system 242 MoFlo function modeling 243 MoFlo operation 254 MoFlo operation synthesis 249 MoFlo process design 256 MoFlo process model formalization 241 MoFlo structure model 252 MoFlo system architecture 243 MoFlo system components 242 MoFlo system modeling 244 MoFlo systems 236,  241,  262 morphological features, multidimensional extensions of 54 morphological filters 42 morphological scale-spaces 32 morphological scale-spaces, image features from 32 motion-based detection of gradual transitions 20 527 Index motion-based detection techniques 23 motion compensation techniques motion estimation 130,  145 motion information motion vectors mouth detection 89 Moving Pictures Experts Group (mpeg) mpeg mpeg-7 32 mpeg standards multi-aspect similarity measures 173 multi-concept annotation 308 multi-concept multi-modality active learning 309 multi-concept multi-modality active learning approach 311 multi-concept multi-modality active learning scheme 298 multi-label image classification 353 multi-modality learning 308 multi-topic retrieval system, browsing interface 284 multi-topic retrieval system, shot information dialog 284 multi-topic retrieval system, tagging interface 283 multi-view based active learning 157 multi-view learning 152 multi-view scheme 159 multidimensional scaling (MDS) 105,  264 multimedia capture 449 multimedia database management system (MMDBMS) 380, 396 multimedia data indexing 449 multimedia data indexing and management 450 multimedia data mining 1,  32 multimedia data mining, video processing multimedia data mining, video representation multimedia data storage 450 multimedia libraries 380 multimedia modeling and visualization 256 multimedia objects, modeling framework of 255 multimedia query processing 453 multimedia resources 419 multimedia technology 350 528 multiscale representation using morphological filters 42 mutual information (MI) 135 Mythread class 443 N nearest-neighbor query 401 nearest neighbor search 464 neighborhood assumption 192 NN search with kd-tree 455 noisy input and recalled images 402 non-linear appearance algorithms 81 non-uniform colour histograms 412 nonlinear methods 459 nonparametric learning 457 normalized cross correlation (NCC) 135 normalized Laplacian 184 O object based video understanding and representation 217 object composition 351 object detection 217 object representation and indexing 222 object silhouette and contour 224 object tracking 219 off-line processing 11 OntologyImpl class 442 Ontology interface 440 OntologyMole 442 OntologyServer class 442 opening and closing 37 operation design execution system architecture 250 operation design framework 249 operation ontology structure 251 operation reliability 255 operator interface system 257 ORL database 82,  94,  97 ORL database, examples of 93 ORL experiments 93 P P2P systems 419 parametric learning 457 parametric noise model 134 Index partial differential equation (PDE) 194 partial input and recalled images 402 partial similarity concepts 166 partial similarity of images 176 pastry 427 PDE-based diffusion 197 peer-to-peer (P2P) systems 419 peer discovery algorithm 433 performance measures 370 performing visual tracking 130 pie database 92,  96 pie experiments 91 pixel-based differences 13 plant/process object oriented modeling methodology (poom) 236 plant species identification 100 platform classes 440 point-set, indexing and querying 466 points-based method 223 polysemy 359 POOM 236 POOM-based hierarchical structure unit model 244 positive scaling coefficient 169 positivity 303 precision-recall performance 470 preprocessing 386 primitive geometric shapes 223 principal component analysis (pca) 81 probabilistic latent semantic analysis (PLSA) model 352 probabilistic latent space models 368 probability densities of object appearance 224 probability density 195 process engineering 238 process engineering framework 237 processing domain 11 process life cycle activities 237 process operation design 238 proposed interface system modeling 257 proposed modeless image annotation approach 330 psla graphical model 360 pyramid matching 467 Q QBIC 411 query-by-example (QBE) 380 query-by-example (QBE) scheme 324 query image 401 query modification technique 153 R r-tree insertion example 452 radial features 393 Real-Time 11, 333 recognition-by-components (RBC) model 104 red, blue, green (RGB) color space 386 red ellipses 89 regions of interest (ROI) 177 related graph-based semi-supervised learning methods 195 relational support vector classifier (RSVC) 362 relative partial similarity 179 relevance feedback (rf) 152, 153 relevance feedback, active learning for 154 relevance feedback algorithms 153 relevance feedback in image retrieval 152 remote sensing 67 reproducing kernel hilbert space (RKHS) 185 residual resource discovery 419 resource discovery service 419 resource discovery systems, technologies used 420 RGB-color space 386 risk reduction 302 robustness 65 S salient objects 358 sample selection 313 scalable mobile and reliable technology (SMART) 432 scale-invariant feature transform (SIFT) 366 scene, definition scene analysis 216 scene classification 358 scene modeling 354 selective sampling 154 529 ... trademark Library of Congress Cataloging-in-Publication Data Semantic mining technologies for multimedia databases / Dacheng Tao, Dong Xu, and Xuelong Li, editors p cm Includes bibliographical.. .Semantic Mining Technologies for Multimedia Databases Dacheng Tao Nanyang Technological University, Singapore Dong Xu Nanyang Technological University, Singapore Xuelong Li University of... University, Singapore Xuelong Li Email: xuelong@dcs.bbk.ac.uk University of London, UK Section I Multimedia Information Representation  Chapter I Video Representation and Processing for Multimedia

Ngày đăng: 05/11/2019, 15:45

Từ khóa liên quan

Mục lục

  • Semantic Mining Technologies for Multimedia Databases

  • Table of Contents

  • Detailed Table of Contents

  • Preface

  • Video Representation and Processing for Multimedia Data Mining

  • Image Features from Morphological Scale-Spaces

  • Face Recognition and Semantic Features

  • Shape Matching for Foliage Database Retrieval

  • Similarity Learning for Motion Estimation

  • Active Learning for Relevance Feedback in Image Retrieval

  • Visual Data Mining Based on Partial Similarity Concepts

  • Image/Video Semantic Analysis by Semi-Supervised Learning

  • Content-Based Video Semantic Analysis

  • Applications of Semantic Mining on Biological Process Engineering

  • Intuitive Image Database Navigation by Hue-Sphere Browsing

  • Formal Models and Hybrid Approaches for Efficient Manual Image Annotation and Retrieval

  • Active Video Annotation: To Minimize Human Effort

  • Annotating Images by Mining Image Search

  • Semantic Classification and Annotation of Images

  • Association-Based Image Retrieval

Tài liệu cùng người dùng

  • Đang cập nhật ...

Tài liệu liên quan