Page 25 - profile2012.indd
P. 25

Research Laboratories  研究群



                                                           多媒體技術實驗室








 Multimedia Technology   Laboratory




 Research Faculty
 Research Faculty

 Hong-Yuan Mark Liao  Chu-Song Chen  Wen-Liang Hwang  Chun-Shien Lu  Tyng-Luh Liu
 Research Fellow  Research Fellow  Research Fellow  Associate  Research Fellow  Research Fellow





 Group Profile
 In  the  past  two  decades,  multimedia  technology  influences   corders, to perform evidence mining on these videos is more   puter vision. Besides, due to the considerable growing of data amount in the
 many aspects of our daily life.  Besides biotechnology and na-  challenging.  We shall start by addressing the multiple-camera   Internet age, training in large-scale (and possibly noisy) datasets becomes a
 notechnology, multimedia technology has been considered   people counting problem as well as visual knowledge transfer   significant issue. Furthermore, instead of observing the world only with color
 one of the three most promising industries of the twenty-first   among a heterogeneous collection of surveillance camcorders.  images in common viewing angles, 3D imaging (providing further depth in-
 century.  Multimedia research covers a broad scope of tech-  formation) and flying camera (providing more un-common viewing angles
 niques and rich applications, including those working on mu-  B. Compressive Sensing and Sparse Representation  from bird’s eye views) could also bring us chances for developing novel ap-  Multimedia shapes
 sic, video, image, text, and 3-D animation.    plications in the near future. High-level visual concepts, such as aesthetics,
 Compressed Sensing/Sampling (CS) is a revolutionary technol-  have also been shown the possibility of being tackled by machine learning.                           our future.
 In the upcoming few years, we would continue to devote our   ogy of simultaneously sensing and compressing signals, and   To address the above issues, we will study several topics toward understand-
 research efforts in advancing the key fields in multimedia, in-  builds a new sampling theorem beyond the Nyquist rate. It   ing visual information from multi-perspectives: (1) object detection, recogni-
 cluding multi-perspective computer vision, compressive sens-  enables to finish joint data acquisition and compression with   tion, and segmentation from visual saliency, (2) tracking and interacting with
 ing/sparse representation, video forensics, etc.  In what follows,   slight cost at the encoder (for resource-limited mobile devices   flying cameras, (3) on-line aesthetic value assessment when shooting, and
 we shall describe in details some key fields.  and sensors) but shift major computational overhead to the   (4) deriving the 3D structure of conventional camera images. The research
 decoder.  Based  on  the  assumption  of  signal’s  sparsity, CS,  in   outcomes are expected to be helpful in making computers understand hu-
 A. Video Forensics  theory, can perfectly reconstruct the original signal from (far)   man intension, assisting human with better-quality and more-safety life, and
 fewer measurements via convex optimization or greedy algo-  supporting robot to see and understand the world better.
 Since the 911 attacks on the United States, counter-terrorism   rithms. This completely new idea makes CS a hot topic in signal
 strategies have been given a high priority in many countries.   processing-related fields since its first appearance in 2006. Fur-
 Surveillance camcorders are now almost ubiquitous in modern   thermore, for the problems that are inherent sparse or can be
 cities. As a result, the amount of recorded data is enormous,   sparsified, CS have been adopted in broad areas.
 and it is time-consuming to search the digital video content
 manually.  In this next few years, we shall put part of our ef-  Undoubtedly, this emerging area opens opportunities for the
 fort on video forensics, in which a major proportion of related   study of fundamental issues and application-oriented prob-
 research work is to perform mining for criminal evidence in   lems. In the future, we will plan to study the following topics:
 videos recorded by a heterogeneous collection of surveillance   (1) Fast Compressed Image Sensing (CIS); (2) Fast Orthogonal
 camcorders.  This is a new interdisciplinary field, and people   Matching Pursuit (FOMP); (3) Multiple input systems exploiting
 working in the field need video processing skills as well as an   sparse representation (e.g., microphone array signal process-
 in-depth knowledge of forensic science; hence the barrier for   ing); and (4) single-pass codeword learning for sparse repre-
 entering the field is high.  Mining surveillance videos directly   sentation.
 for criminal evidence is very different from conventional crime
 scene investigations.  In the latter, detectives need to actually   C. Multi-perspective computer vision
 visit the crime scene, check all available details and collect as
 much physical evidence as possible.  By contrast, to conduct   Making computers capable of perceiving the real-world visual
 crime scene investigations directly from surveillance videos,   information from various clues is challenging because of high-
 forensic experts need to develop software that facilitates the   complexity conceptions, changing environments, free motion,
 automatic detection, tracking, and recognition of objects in the   high articulations, and so on. As many visual concepts are diffi-
 videos.  Since the videos are captured by heterogeneous cam-  cult to be summarized in simple and plain rules, (statistical) ma-
 chine learning has played an important role in the past decade
 (as witnessed in the main conferences such as CVPR, ICCV, and
 NIPS), and is still expected to be vital to the progress of com-



 研究群
 24  Research Laboratories
 24
                                                                                                                 25
                                                                                                                 25
   20   21   22   23   24   25   26   27   28   29   30