Method and article of manufacture for content-based analysis, storage, retrieval, and segmentation of audio information
Patent 5918223 Issued on June 29, 1999. Estimated Expiration Date: July 21, 2017. Estimated Expiration Date is calculated based on simple USPTO term provisions. It does not account for terminal disclaimers, term adjustments, failure to pay maintenance fees, or other factors which might affect the term of a patent.
A system that performs analysis and comparison of audio data files based upon the content of the data files is presented. The analysis of the audio data produces a set of numeric values (a feature vector) that can be used to classify and rank the similarity between individual audio files typically stored in a multimedia database or on the World Wide Web. The analysis also facilitates the description of user-defined classes of audio files, based on an analysis of a set of audio files that are members of a user-defined class. The system can find sounds within a longer sound, allowing an audio recording to be automatically segmented into a series of shorter audio segments.
Foote, J., "A Similarity Measure for Automatic Audio Classification," Institute of Systems Science, National University of Singapore, 1977, Singapore
Scheirer, E., Slaney, M., "Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator," pp. 1-4, Proceedings of ICASSP-97, Apr. 2-24, Munich, Germany
Scheirer, E.D., "Tempo and Beat Analysis of Acoustic Musical Signals," Machine Listening Group, E15-401D MIT Media Laboratory, pp. 1-21, Aug. 8, 1997, Cambridge, MA
Wold, E., Blum, T., Keislar, D., and Wheaton, J., "Content-Based Classification, Search, and Rerieval of Audio," IEEE Multimedia Magazine, Fall 1996
Blum, T., Keislar, D., Wheaton, J., and Wold, E., "Audio Databases with Content-Based Retrieval," Prodeedings of the 1995 International Joint Conference on Artificial Intelligence (IJCAI) Workshop on Intelligent Multimedia Information Retrieval, 1995
Keislar, D., Blum, T., Wheaton, J., and Wold, E., "Audio Analysis for Content-Based Retrieval" Proceedings of the 1995 International Computer Music Conference, No date
Feiten, B. and Gunzel, S., "Automatic Indexing of a Sound Database Using Self-Organizing Neural Nets," Computer Music Journal, 18:3, pp. 53-65, Fall 1994
Vertegaal, R. and Bonis, E., "ISEE: An Intuitive Sound Editing Environment," Computer Music Journal, 18:2, pp. 21-22, Summer 1994
Cosi, P., De Poli, G., Prandoni, P., "Timbre Characterization with Mel-Cepstrum and Neural Nets," Proceedings of the 1994 International Computer Music Conference, pp. 42-45, San Francisco, No date
Gonzalez, R. and Melih, K., "Content Based Retrieval of Audio," The Institute for Telecommunication Research, University of Wollongong, Australia, No date
Fischer, S., Lienhart, R., and Effelsberg, W., "Automatic Recognition of Film Genres," Reihe Informatik, Jun. 1995, Universitat Mannheim, Praktische Informatik IV, L15, 16, D-68131 Mannheim
Ken C. Pohlmann, "Principles of Digital Audio", SAMS/A Division of Prentice Hall Computer Publishing, no dat