Core technology for multimedia signal processing and productization, intelligent embedded multimedia information processing platform
20100301 - 20111231
Dr Shen-chang CHAO
The proposal will develop core multimedia technologies for machine processing to achieve better content understanding and rendering.
• Design and development of high-performance, reliable technology IP:
1. An embedded intelligent multimedia information processing platform that supports 4-channel D1 or single-channel 1080i H.264 video compression and video scene analysis simultaneously on a single DSP chip.
2. Visual scene analysis algorithms, such as background estimation, object segmentation, object tracking using Feature Density Approximation (FDA), behavior analysis, and fusion of video compression and scene analysis, which leverages the motion vector information from video compression to dramatically reduce the computational overhead of object tracking.
3. An enhanced impulsive sound detection algorithm and a new word spotting algorithm for 20 sensitive keywords.
4. Fusion of sound and video scene analysis.
5. Automatic detection and recognition of intrusion or violence scenes in real time. Matching of other scenes against new scene templates defined by customers is also included.
6. Audio and video de-noising algorithms.
7. An object extraction algorithm for speaker and virtual background mixing.
8. Integrated sound/video de-noising and high-definition video compression technology.
9. An embedded intelligent multimedia information processing platform that can be used in intelligent surveillance and multimedia communication, at a cost 30% lower than similar products on the market.
• Open and flexible software module design, ready for integration with existing traditional application systems such as surveillance systems, IPTV systems and digital film archiving systems.
1. Object tracking technology optimization.
2. Video de-noising technology: based on the de-noising and super-resolution algorithms, develop optimized video de-noising on the decoder side, and transcoding and multiple-view on the server side, to improve video quality over mixed-rate video networks.
3. Net-True virtual reality development: based on the object segmentation algorithm developed by ASTRI, implement Net-True conference technology and a virtual single round-table conference, with the aim of becoming the world leader in this domain.
4. Carrier-level surveillance platform development, for deployment across the country.
5. Wireless video surveillance supporting multiple standards such as TD-SCDMA, WCDMA, CDMA2000 and WiMAX.
6. Advanced streaming and storage technology, to achieve large-volume live video streaming and storage through a distributed architecture and traffic balancing.
7. A carrier-level intelligent surveillance server platform that can automatically detect and recognize intrusion or violence scenes in real time, raise alarms and request immediate handling; that analyzes the signals from a group of cameras deployed in one area so that a suspect can be tracked across the camera network; and its commercialization.
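The description mentions reusing the motion vector information from video compression to cut the cost of object tracking. A minimal sketch of that general idea follows; it is not the project's FDA tracker. The function name, the 16x16 macroblock grid, and the motion vector layout (a dictionary of per-block pixel displacements from the previous frame to the current one) are all assumptions made for illustration.

```python
from statistics import median

MB_SIZE = 16  # assumed macroblock size in pixels (H.264 macroblocks are 16x16)


def shift_box_by_motion_vectors(box, motion_vectors, mb_size=MB_SIZE):
    """Predict a tracked box's new position from block motion vectors.

    box: (x, y, w, h) in pixels, with w and h at least 1.
    motion_vectors: hypothetical layout, dict mapping (mb_col, mb_row) to
        (dx, dy), the displacement in pixels of that block's content from
        the previous frame to the current one. Real codecs store vectors
        pointing at the reference frame, so a sign flip may be needed.
    """
    x, y, w, h = box
    # Range of macroblocks overlapped by the box.
    col0, col1 = x // mb_size, (x + w - 1) // mb_size
    row0, row1 = y // mb_size, (y + h - 1) // mb_size

    dxs, dys = [], []
    for row in range(row0, row1 + 1):
        for col in range(col0, col1 + 1):
            mv = motion_vectors.get((col, row))
            if mv is not None:  # intra-coded blocks carry no vector
                dxs.append(mv[0])
                dys.append(mv[1])

    if not dxs:
        return box  # no motion information: keep the previous position

    # The median is robust to a few outlier vectors (e.g. background blocks).
    return (x + round(median(dxs)), y + round(median(dys)), w, h)


if __name__ == "__main__":
    vectors = {(2, 2): (4, 0), (3, 2): (4, 0), (2, 3): (4, 0), (3, 3): (40, 8)}
    print(shift_box_by_motion_vectors((32, 32, 32, 32), vectors))
```

Because the vectors are already produced by the encoder, the per-frame cost here is a lookup over a handful of blocks rather than a pixel-level search; a full tracker would still need periodic re-detection to correct drift.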
Shenzhen R&D center, Polycom Communication Technology (Beijing) Co.Ltd [Sponsor] The University of Hong Kong ZTE [Sponsor]
In this project, we plan to develop a platform of core intelligent multimedia processing technologies to enable intelligent information creation, scene analysis, presentation and enhancement, especially for security and multimedia communication applications. Existing surveillance systems that rely on human monitoring require substantial manpower to identify critical cases. The proposed intelligent embedded platform can detect critical cases automatically. It aims to reduce the manual overhead of existing surveillance systems and to facilitate a fast police response to critical cases, improving public safety and contributing to the effort of building a harmonious society. The core technologies we plan to develop include an embedded multimedia information processing platform and the technologies needed for surveillance and multimedia communication, such as H.264 high-resolution video encoding, de-noising, abnormal behavior analysis, fusion of video scene and sound analysis, sensitive keyword spotting, and video enhancement algorithms for video conferencing and telepresence. We will collaborate with local universities on the core algorithm research. This project will create substantial market opportunities by riding on the fast growth of the multimedia market in the Greater China area, especially in the security and multimedia communication industries.