Free and Latest article publishing for websites and ezines!

Research on Content-Based Audio Information Retrieval Technology

Rapid development of modern information technology, multimedia technology and network technology has resulted in large and ever-increasing stores of multimedia data. Therefore, content-based audio information retrieval (CBAIR) technology has been attracting more and more attention to the efforts to make full use of existing audio information. Audio data can be static or dynamic, and the audio retrieval can be at expression level or semantic level. Different audio form and different retrieval level requires different retrieval methods. Although much work has been done in relevant researches, there still exist many unsolved difficulties in CBAIR area. The major difficulties include the following: the performance of most present retrieval methods deteriorates dramatically under noise; it is very difficult to index audio data which is highly dimensional and of time sequence little work has been done on dynamic audio retrieval; research in the semantic-level retrieval for audio music progresses hard and slowly due to the serious difficulties in extracting semantic information from audio music). Generally speaking, CBAIR technology is still at experimental stage and lacks applicable technology and system.Centering around the problems existing in CBAIR, this dissertation studies the following problems:1. For the problem of expression-level retrieval of static audio, fuzzy histogram audio retrieval method based on principal loudness component is developed. In the design of histogram model, statistical distribution of loudness is used to optimize the histogram model. At the same time, fuzzy histogram is used to improve the robustness of histogram model to noise and small change in loudness value. Active search method is used in the histogram retrieval. Experimental results show that the method has better robustness to noise.2. For the problem of expression-level indexing of static audio, a novel indexing method based on fuzzy histogram of principal loudness component is presented. When audio data is expressed by fuzzy histogram of principal loudness component, the similarity between their histograms can correctly reflect

Recommended Articles from the IT Science Category:

Most Viewed ScienceArticles in the IT Science Category:

  1. Channel Model Simulation and Spread Spectrum OFDM for HF Communication
  2. Study on the Political Function of Mass Media
  3. Research on Algorithms of GPU-Based 3D Medical Image Processing
  4. Study on Radar Tracking and Discrimination for Ballistic Missiles
  5. Research on QoS Based Multicast Routing Protocols in Mobile Ad Hoc Networks
  6. Study on Robot Joint Based on Reversing Ball Screw Mechanism
  7. Research on Real Time Pulse Train Deinterleaving for Radar Intercept System
  8. Reaearch on Optimization Problem of Manufacturing Process in a Discrete Manufacturing Industry
  9. Study of Parallel FDTD Algorithm and EM Scattering in Layered Half-space
  10. Spatial Three Degree-of-Freedom Parallel Mechanisms: Configurations, Performances and Applications
  11. Channel Estimation in MIMO-OFDM Wireless Communication System
  12. Preparation and Investigation of p-ZnO Film and ZnO Light Emitting Device
  13. The Application and Study of Electrochemical Biosensors Based on Nanomaterials
  14. A Study of Space-Frequency Coding and Signal Detection in MIMO-OFDM Systems
  15. Research on Optical Fiber Sensor Based on Metal Nanoparticles


© 2004-2009 Latest-Science-Articles.com - All Rights Reserved Worldwide.