A versatile pitch tracking algorithm : from human speech to killer whale vocalizations
In this article, a pitch tracking algorithm [named discrete logarithmic Fourier transformation-pitch detection algorithm (DLFT-PDA)], originally designed for human telephone speech, was modified for killer whale vocalizations. The multiple frequency components of some of these vocalizations demand a spectral (rather than temporal) approach to pitch tracking. The DLFT-PDA derives reliable estimates of pitch and of the temporal change of pitch from the harmonic structure of the vocal signal. Scores from both estimates are combined in a dynamic programming search to find a smooth pitch track. The algorithm is capable of tracking killer whale calls that contain simultaneous low- and high-frequency components, and it compares favorably across most signal-to-noise ratio ranges to the peak-picking and sidewinder algorithms that have previously been used for tracking killer whale vocalizations.
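The dynamic programming search mentioned in the abstract can be illustrated with a minimal sketch. This is not the DLFT-PDA itself: the abstract does not give the scoring functions, so the sketch assumes a hypothetical per-frame score for each candidate frequency bin and replaces the pitch-change score with a simple penalty proportional to the jump between bins. The function name `smooth_pitch_track` and the parameter `jump_penalty` are illustrative only.

```python
from typing import List


def smooth_pitch_track(scores: List[List[float]], jump_penalty: float = 1.0) -> List[int]:
    """Pick one candidate bin per frame with a Viterbi-style search.

    scores[t][j] is an assumed "pitch score" for bin j in frame t (higher
    is better). The path maximizes the sum of scores minus
    jump_penalty * |bin change| between consecutive frames. The linear
    penalty is a stand-in assumption, not the scoring used in the paper.
    """
    if not scores:
        return []
    n_bins = len(scores[0])
    best = list(scores[0])          # best cumulative score ending in each bin
    back: List[List[int]] = []      # back-pointers, one list per later frame

    for frame in scores[1:]:
        new_best = []
        pointers = []
        for j in range(n_bins):
            # Best predecessor bin for landing in bin j at this frame.
            prev = max(range(n_bins),
                       key=lambda i: best[i] - jump_penalty * abs(i - j))
            new_best.append(best[prev] - jump_penalty * abs(prev - j) + frame[j])
            pointers.append(prev)
        best = new_best
        back.append(pointers)

    # Trace back from the best final bin to recover the whole track.
    j = max(range(n_bins), key=lambda i: best[i])
    path = [j]
    for pointers in reversed(back):
        j = pointers[j]
        path.append(j)
    path.reverse()
    return path


# Toy input: bin 1 is strong in every frame except frame 2, where a
# spurious peak appears in bin 3.
toy_scores = [
    [0.0, 5.0, 0.0, 0.0],
    [0.0, 4.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 4.5],
    [0.0, 5.0, 0.0, 0.0],
]
greedy_track = smooth_pitch_track(toy_scores, jump_penalty=0.0)
smooth_track = smooth_pitch_track(toy_scores, jump_penalty=2.0)
```

With no penalty the search reduces to frame-by-frame peak picking and follows the spurious peak; with a sufficiently large penalty it stays on the consistent bin. This is the sense in which a dynamic programming search can favor a smooth track over isolated high-scoring outliers.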
Author Posting. © Acoustical Society of America, 2009. This article is posted here by permission of Acoustical Society of America for personal use, not for redistribution. The definitive version was published in Journal of the Acoustical Society of America 126 (2009): 451-459, doi:10.1121/1.3132525.
Suggested Citation
Article: Shapiro, Ari D., Wang, Chao, "A versatile pitch tracking algorithm : from human speech to killer whale vocalizations", Journal of the Acoustical Society of America 126 (2009): 451-459, DOI:10.1121/1.3132525, https://hdl.handle.net/1912/2894
Showing items related by title, author, creator and subject.
Tracking of time-evolving sound speed profiles in shallow water using an ensemble Kalman-particle filter Li, Jianlong; Zhou, Hui (Acoustical Society of America, 2013-03) This paper presents a tracking technique for performing sequential geoacoustic inversion monitoring range-independent environmental parameters in shallow water. The inverse problem is formulated in a state-space model with ...
Li, Weichang (Massachusetts Institute of Technology and Woods Hole Oceanographic Institution, 2006-02) This thesis develops methods for estimating wideband shallow-water acoustic communication channels. The very shallow water wideband channel has three distinct features: large dimension caused by extensive delay spread; ...
Severson, Jared (Massachusetts Institute of Technology and Woods Hole Oceanographic Institution, 2009-02) Marine mammal whistle calls present an attractive medium for covert underwater communications. High quality models of the whistle calls are needed in order to synthesize natural-sounding whistles with embedded information. ...