FIG 1 - uploaded by Raman Sukumar
Content may be subject to copyright.
Spectrograms of examples of the four call types. Sampling frequency for all calls was 48 kHz with 50% overlap. Window size is 100 ms for trumpet, roar, and chirp, and 200 ms for rumbles. Note the different Y -axis scale in G–I. 

Spectrograms of examples of the four call types. Sampling frequency for all calls was 48 kHz with 50% overlap. Window size is 100 ms for trumpet, roar, and chirp, and 200 ms for rumbles. Note the different Y -axis scale in G–I. 

Source publication
Article
Full-text available
Elephants use vocalizations for both long and short distance communication. Whereas the acoustic repertoire of the African elephant (Loxodonta africana) has been extensively studied in its savannah habitat, very little is known about the structure and social context of the vocalizations of the Asian elephant (Elephas maximus), which is mostly found...

Contexts in source publication

Context 1
... for most other calls. Spectral envelope analysis was carried out using linear predictive coding ͑ LPC ͒ . Calls other than rumbles were re-sampled at 16 000 Hz whereas rumbles were re- sampled at 600 Hz. LPC analysis was carried out using the autocorrelation method with a time step of 0.005 s and a window length of 0.005 s ͑ 0.05 s for rumbles ͒ . Harmonicity ͑ harmonics-to-noise ratio ͒ was analyzed using the cross- correlation algorithm with a time step of 0.05 s except for chirps, for which 0.005 s was used due to the short duration of the calls. Calls of different age-sex classes adult male, adult female, young male, and young female ͒ were analyzed sepa- rately. The measured features include the call frequency range, duration, harmonicity, as well as mean, minimum, and maximum values of the fundamental frequency ͑ F 0 ͒ ͑ Table I ͒ . In addition, the frequencies and amplitudes of the LPC peaks were measured to examine spectral patterning. The analyzed acoustic features were separated according to the four age-sex classes and compared statistically ͑ for all categories where the sample size was Ն 3 ͒ for each call type using a combination of one-way analysis of variance and pair-wise comparisons of means by unpaired t -tests. The fea- tures compared included mean F 0 , minimum F 0 , maximum A total of 371 calls were recorded from 154 individuals. F 0 , call duration, harmonicity, and frequency and amplitude Of these, 258 calls were analyzed from 109 individuals, con- of the peaks of the LPC spectrum. sisting of 14 males and 95 females ͑ Table II ͒ . The remaining For the rumbles, the start and end frequencies of F 0 , and calls were discarded on account of low signal-to-noise ratio percentage time from the start to the maximum and mini- or overlap with other calls due to simultaneously vocalizing mum frequencies were measured ͑ Table I ͒ . To characterize individuals. Based on structural characteristics, the calls the frequency modulation in finer detail two measures were could be classified into four types, namely, trumpets, chirps, used ͑ Table I ͒ . The fourth harmonic of all calls was used roars, and rumbles ͑ Fig. 1 ͒ . One of the call types, the rumble, since visual inspection of the spectrograms revealed that the could be distinguished by its unique frequency range ͑ 10– modulation could be measured at high resolution across al- 173 Hz, Fig. 1, G–I ͒ , which was much lower than that of the most all recordings whereas the fundamental was sometimes other calls. Chirps were distinguished by their unique tem- contaminated with noise. The higher harmonics may also poral structure ͑ Table III, Fig. 1, J–K ͒ : they were typically of have greater functional relevance in social recognition as ar- much shorter duration ͑ 0.2 s Ϯ 0.1 s ͒ than the other calls. gued by McComb et al. ͑ 2003 ͒ based on playback experi- Trumpets and roars differed from chirps in their duration ments on wild African elephants. The first measure was the ͑ mean= 1 s and 2 s, Table III ͒ . Although both trumpets and roars shared the same frequency range, as revealed by their number of observed peaks in the fourth harmonic of each spectral peaks ͑ Fig. 2 ͒ , the decrease in power with increasing frequency was steeper in roars. Roars also had significantly call, determined visually from the spectrogram with a mini- lower harmonicity than trumpets ͑ Table III, Mann–Whitney U test, U = 313, Z = −7.8, and P Ͻ 0.0001 ͒ . mum frequency modulation of 4 Hz as a cutoff. The second There were no significant differences between the different age-sex classes in all of the call features that were exam- measure was the frequency change per unit time for the ined in roars, chirps, and rumbles. The only significant differences were between adult male and female trumpets, fourth harmonic, measured by the cumulative change in fre- wherein females had significantly lower mean F 0 and mini- quency divided by the time of the call ͑ window length of 200 mum F 0 ͑ unpaired t -tests, P = 0.015, t = 2.45 and P = 0.043, ms ͒ . t = 2.78 ͒ than males ͑ Table III ͒ . To detect the presence of structural subtypes within the rumbles, 13 measured call features ͑ except harmonicity and LPC peaks ͒ were used to generate pair-wise Euclidean distances between calls. The distance matrix was then subjected to cluster analysis using the Unweighted Pair Group Method with Arithmetic mean ͑ UPGMA ͒ ͑ Sneath and Sokal, 1973; Manly, 1986 ͒ . All statistical analyses were performed using STATISTICA ͑ 1999, Statsoft Inc., Tulsa, USA ͒ . A total of 371 calls were recorded from 154 individuals. Of these, 258 calls were analyzed from 109 individuals, con- sisting of 14 males and 95 females ͑ Table II ͒ . The remaining calls were discarded on account of low signal-to-noise ratio or overlap with other calls due to simultaneously vocalizing individuals. Based on structural characteristics, the calls could be classified into four types, namely, trumpets, chirps, roars, and rumbles ͑ Fig. 1 ͒ . One of the call types, the rumble, could be distinguished by its unique frequency range ͑ 10– 173 Hz, Fig. 1, G–I ͒ , which was much lower than that of the other calls. Chirps were distinguished by their unique temporal structure ͑ Table III, Fig. 1, J–K ͒ : they were typically of much shorter duration ͑ 0.2 s Ϯ 0.1 s ͒ than the other calls. Trumpets and roars differed from chirps in their duration ͑ mean= 1 s and 2 s, Table III ͒ . Although both trumpets and roars shared the same frequency range, as revealed by their spectral peaks ͑ Fig. 2 ͒ , the decrease in power with increasing frequency was steeper in roars. Roars also had significantly lower harmonicity than trumpets ͑ Table III, Mann–Whitney U test, U = 313, Z = −7.8, and P Ͻ 0.0001 ͒ . There were no significant differences between the different age-sex classes in all of the call features that were examined in roars, chirps, and rumbles. The only significant differences were between adult male and female trumpets, wherein females had significantly lower mean F 0 and minimum F 0 ͑ unpaired t -tests, P = 0.015, t = 2.45 and P = 0.043, t = 2.78 ͒ than males ͑ Table III ͒ . The different call types were associated with characteristic body postures and behaviors of the vocalizing individual and other members of the herd. These are described in detail in Table IV. Trumpets were loud, conspicuous high-frequency calls. They were in the frequency range of 405–5879 Hz with a mean duration of about 1 s ͑ Table V ͒ . They had a rich harmonic structure with at least seven clearly visible harmonics ͑ Fig. 1, A–C ͒ . Spectral envelope analysis revealed the first frequency peak to be at 706 Hz ͑ Fig. 2 ͒ . The fourth frequency peak ͑ at 3078 Hz ͒ was about 13 dB lower in amplitude than the first frequency. Out of 73 trumpets where the age-sex class was unambiguous, 58 ͑ 79.5% ͒ were produced by adult or sub-adult females, nine ͑ 12.3% ͒ by juvenile females, five ͑ 6.8% ͒ by adult or sub-adult males, and one 1.4% by a juvenile male ͑ Table II ͒ . Out of the 71 trumpets where the context was clear, seven ͑ 9.9% ͒ were in the context of play, 17 ͑ 23.9% ͒ in the context of disturbance by humans or vehicles, 29 ͑ 40.8% ͒ in the context of disturbance by other non-human species, ten ͑ 14% ͒ in the context of inter-specific aggression, and eight ͑ 11.2% ͒ while running out of a waterhole ͑ Fig. 3 ͒ . Spe- cifically, trumpeting was observed while encountering other species such as deer ͑ Axis axis ͒ , gaur ͑ Bos gaurus ͒ , dhole ͑ Cuon alpinus ͒ , bears ͑ Melursus ursinus ͒ , tigers ͑ Panthera tigris ͒ , egrets ͑ Egretta garzetta ͒ , and humans. Roars were noisy, long calls that were in the frequency range of 305–6150 Hz and had a mean duration of about 2 s ͑ Table V, Fig. 1, D–F ͒ . They were in the same frequency range as trumpets and their spectral patterning was also simi- lar with seven frequency peaks Fig. 2 , the first frequency peak being at 656 Hz. The amplitude fell steeply with increasing frequency, the fourth frequency peak being 23 dB below the first ͑ Fig. 2 ͒ . This was in contrast to trumpets, where the fourth frequency peak was about 13 dB below the first ͑ Fig. 2 ͒ . Roars also showed significantly lower harmonicity than trumpets ͑ Table III ͒ . Both of the above are probably responsible for the unique perceptual quality of roars compared to trumpets. Out of 51 calls where the age-sex class was clear, 42 ͑ 82.3% ͒ were produced by adult or sub-adult females, six ͑ 11.7% ͒ by juvenile females, and three ͑ 5.8% ͒ by adult or sub-adult males ͑ Table II ͒ . Out of 51 cases where the context was clear, seven ͑ 13.7% ͒ were produced during play, four ͑ 7.8% ͒ due to disturbance by humans or vehicles, 28 ͑ 54.9% ͒ in the context of encounters with other non-human species, three ͑ 5.9% ͒ during inter-specific aggressive interactions, and nine ͑ 17.6% ͒ while facing another group or on entering a landscape ͑ Fig. 3 ͒ . Chirps were found to lie in the frequency range of 313– 3370 Hz ͑ Table V ͒ and were produced in a series ͑ Fig. 1, J–K ͒ ranging from 2 to 8 ͑ mean number= 5.2 Ϯ 2.6, n = 25 individuals ͒ in a single bout. The duration of a bout of chirp- ing ranged from 0.68 s to 3.8 s. Spectral envelope analysis revealed up to seven discernible frequency peaks Fig. 2 , with the first two peaks having equal amplitude. Although chirps had seven frequency peaks, the range over which these peaks were distributed was much narrower ͑ Fig. 2 ͒ than in the case of trumpets and roars. Chirps showed significantly lower harmonicity than trumpets and rumbles ͑ Table III, Mann–Whitney U -test, U = 995.5, Z = −4.38, P Ͻ 0.0001, and U = 786.5, Z = −3.8, P Ͻ 0.0001 ͒ . Out of 63 chirp bouts where the age-sex class was clear, 53 ͑ 84% ͒ were produced by adult or sub-adult females and ten ͑ 15.9% ͒ by adult or sub-adult males ͑ Table II ͒ . Out of 66 cases where the context was clear, 30 ͑ 45.5% ͒ were due to disturbance by humans or vehicles, 26 ͑ 39.3% ͒ due to ...
Context 2
... other than rumbles were re-sampled at 16 000 Hz whereas rumbles were re- sampled at 600 Hz. LPC analysis was carried out using the autocorrelation method with a time step of 0.005 s and a window length of 0.005 s ͑ 0.05 s for rumbles ͒ . Harmonicity ͑ harmonics-to-noise ratio ͒ was analyzed using the cross- correlation algorithm with a time step of 0.05 s except for chirps, for which 0.005 s was used due to the short duration of the calls. Calls of different age-sex classes adult male, adult female, young male, and young female ͒ were analyzed sepa- rately. The measured features include the call frequency range, duration, harmonicity, as well as mean, minimum, and maximum values of the fundamental frequency ͑ F 0 ͒ ͑ Table I ͒ . In addition, the frequencies and amplitudes of the LPC peaks were measured to examine spectral patterning. The analyzed acoustic features were separated according to the four age-sex classes and compared statistically ͑ for all categories where the sample size was Ն 3 ͒ for each call type using a combination of one-way analysis of variance and pair-wise comparisons of means by unpaired t -tests. The fea- tures compared included mean F 0 , minimum F 0 , maximum A total of 371 calls were recorded from 154 individuals. F 0 , call duration, harmonicity, and frequency and amplitude Of these, 258 calls were analyzed from 109 individuals, con- of the peaks of the LPC spectrum. sisting of 14 males and 95 females ͑ Table II ͒ . The remaining For the rumbles, the start and end frequencies of F 0 , and calls were discarded on account of low signal-to-noise ratio percentage time from the start to the maximum and mini- or overlap with other calls due to simultaneously vocalizing mum frequencies were measured ͑ Table I ͒ . To characterize individuals. Based on structural characteristics, the calls the frequency modulation in finer detail two measures were could be classified into four types, namely, trumpets, chirps, used ͑ Table I ͒ . The fourth harmonic of all calls was used roars, and rumbles ͑ Fig. 1 ͒ . One of the call types, the rumble, since visual inspection of the spectrograms revealed that the could be distinguished by its unique frequency range ͑ 10– modulation could be measured at high resolution across al- 173 Hz, Fig. 1, G–I ͒ , which was much lower than that of the most all recordings whereas the fundamental was sometimes other calls. Chirps were distinguished by their unique tem- contaminated with noise. The higher harmonics may also poral structure ͑ Table III, Fig. 1, J–K ͒ : they were typically of have greater functional relevance in social recognition as ar- much shorter duration ͑ 0.2 s Ϯ 0.1 s ͒ than the other calls. gued by McComb et al. ͑ 2003 ͒ based on playback experi- Trumpets and roars differed from chirps in their duration ments on wild African elephants. The first measure was the ͑ mean= 1 s and 2 s, Table III ͒ . Although both trumpets and roars shared the same frequency range, as revealed by their number of observed peaks in the fourth harmonic of each spectral peaks ͑ Fig. 2 ͒ , the decrease in power with increasing frequency was steeper in roars. Roars also had significantly call, determined visually from the spectrogram with a mini- lower harmonicity than trumpets ͑ Table III, Mann–Whitney U test, U = 313, Z = −7.8, and P Ͻ 0.0001 ͒ . mum frequency modulation of 4 Hz as a cutoff. The second There were no significant differences between the different age-sex classes in all of the call features that were exam- measure was the frequency change per unit time for the ined in roars, chirps, and rumbles. The only significant differences were between adult male and female trumpets, fourth harmonic, measured by the cumulative change in fre- wherein females had significantly lower mean F 0 and mini- quency divided by the time of the call ͑ window length of 200 mum F 0 ͑ unpaired t -tests, P = 0.015, t = 2.45 and P = 0.043, ms ͒ . t = 2.78 ͒ than males ͑ Table III ͒ . To detect the presence of structural subtypes within the rumbles, 13 measured call features ͑ except harmonicity and LPC peaks ͒ were used to generate pair-wise Euclidean distances between calls. The distance matrix was then subjected to cluster analysis using the Unweighted Pair Group Method with Arithmetic mean ͑ UPGMA ͒ ͑ Sneath and Sokal, 1973; Manly, 1986 ͒ . All statistical analyses were performed using STATISTICA ͑ 1999, Statsoft Inc., Tulsa, USA ͒ . A total of 371 calls were recorded from 154 individuals. Of these, 258 calls were analyzed from 109 individuals, con- sisting of 14 males and 95 females ͑ Table II ͒ . The remaining calls were discarded on account of low signal-to-noise ratio or overlap with other calls due to simultaneously vocalizing individuals. Based on structural characteristics, the calls could be classified into four types, namely, trumpets, chirps, roars, and rumbles ͑ Fig. 1 ͒ . One of the call types, the rumble, could be distinguished by its unique frequency range ͑ 10– 173 Hz, Fig. 1, G–I ͒ , which was much lower than that of the other calls. Chirps were distinguished by their unique temporal structure ͑ Table III, Fig. 1, J–K ͒ : they were typically of much shorter duration ͑ 0.2 s Ϯ 0.1 s ͒ than the other calls. Trumpets and roars differed from chirps in their duration ͑ mean= 1 s and 2 s, Table III ͒ . Although both trumpets and roars shared the same frequency range, as revealed by their spectral peaks ͑ Fig. 2 ͒ , the decrease in power with increasing frequency was steeper in roars. Roars also had significantly lower harmonicity than trumpets ͑ Table III, Mann–Whitney U test, U = 313, Z = −7.8, and P Ͻ 0.0001 ͒ . There were no significant differences between the different age-sex classes in all of the call features that were examined in roars, chirps, and rumbles. The only significant differences were between adult male and female trumpets, wherein females had significantly lower mean F 0 and minimum F 0 ͑ unpaired t -tests, P = 0.015, t = 2.45 and P = 0.043, t = 2.78 ͒ than males ͑ Table III ͒ . The different call types were associated with characteristic body postures and behaviors of the vocalizing individual and other members of the herd. These are described in detail in Table IV. Trumpets were loud, conspicuous high-frequency calls. They were in the frequency range of 405–5879 Hz with a mean duration of about 1 s ͑ Table V ͒ . They had a rich harmonic structure with at least seven clearly visible harmonics ͑ Fig. 1, A–C ͒ . Spectral envelope analysis revealed the first frequency peak to be at 706 Hz ͑ Fig. 2 ͒ . The fourth frequency peak ͑ at 3078 Hz ͒ was about 13 dB lower in amplitude than the first frequency. Out of 73 trumpets where the age-sex class was unambiguous, 58 ͑ 79.5% ͒ were produced by adult or sub-adult females, nine ͑ 12.3% ͒ by juvenile females, five ͑ 6.8% ͒ by adult or sub-adult males, and one 1.4% by a juvenile male ͑ Table II ͒ . Out of the 71 trumpets where the context was clear, seven ͑ 9.9% ͒ were in the context of play, 17 ͑ 23.9% ͒ in the context of disturbance by humans or vehicles, 29 ͑ 40.8% ͒ in the context of disturbance by other non-human species, ten ͑ 14% ͒ in the context of inter-specific aggression, and eight ͑ 11.2% ͒ while running out of a waterhole ͑ Fig. 3 ͒ . Spe- cifically, trumpeting was observed while encountering other species such as deer ͑ Axis axis ͒ , gaur ͑ Bos gaurus ͒ , dhole ͑ Cuon alpinus ͒ , bears ͑ Melursus ursinus ͒ , tigers ͑ Panthera tigris ͒ , egrets ͑ Egretta garzetta ͒ , and humans. Roars were noisy, long calls that were in the frequency range of 305–6150 Hz and had a mean duration of about 2 s ͑ Table V, Fig. 1, D–F ͒ . They were in the same frequency range as trumpets and their spectral patterning was also simi- lar with seven frequency peaks Fig. 2 , the first frequency peak being at 656 Hz. The amplitude fell steeply with increasing frequency, the fourth frequency peak being 23 dB below the first ͑ Fig. 2 ͒ . This was in contrast to trumpets, where the fourth frequency peak was about 13 dB below the first ͑ Fig. 2 ͒ . Roars also showed significantly lower harmonicity than trumpets ͑ Table III ͒ . Both of the above are probably responsible for the unique perceptual quality of roars compared to trumpets. Out of 51 calls where the age-sex class was clear, 42 ͑ 82.3% ͒ were produced by adult or sub-adult females, six ͑ 11.7% ͒ by juvenile females, and three ͑ 5.8% ͒ by adult or sub-adult males ͑ Table II ͒ . Out of 51 cases where the context was clear, seven ͑ 13.7% ͒ were produced during play, four ͑ 7.8% ͒ due to disturbance by humans or vehicles, 28 ͑ 54.9% ͒ in the context of encounters with other non-human species, three ͑ 5.9% ͒ during inter-specific aggressive interactions, and nine ͑ 17.6% ͒ while facing another group or on entering a landscape ͑ Fig. 3 ͒ . Chirps were found to lie in the frequency range of 313– 3370 Hz ͑ Table V ͒ and were produced in a series ͑ Fig. 1, J–K ͒ ranging from 2 to 8 ͑ mean number= 5.2 Ϯ 2.6, n = 25 individuals ͒ in a single bout. The duration of a bout of chirp- ing ranged from 0.68 s to 3.8 s. Spectral envelope analysis revealed up to seven discernible frequency peaks Fig. 2 , with the first two peaks having equal amplitude. Although chirps had seven frequency peaks, the range over which these peaks were distributed was much narrower ͑ Fig. 2 ͒ than in the case of trumpets and roars. Chirps showed significantly lower harmonicity than trumpets and rumbles ͑ Table III, Mann–Whitney U -test, U = 995.5, Z = −4.38, P Ͻ 0.0001, and U = 786.5, Z = −3.8, P Ͻ 0.0001 ͒ . Out of 63 chirp bouts where the age-sex class was clear, 53 ͑ 84% ͒ were produced by adult or sub-adult females and ten ͑ 15.9% ͒ by adult or sub-adult males ͑ Table II ͒ . Out of 66 cases where the context was clear, 30 ͑ 45.5% ͒ were due to disturbance by humans or vehicles, 26 ͑ 39.3% ͒ due to disturbance by other non-human species, four ͑ 6% ͒ in the context of separation of an individual from the herd, ...
Context 3
... grasslands, and within the forests. Surveys were carried out by vehicle and on foot to locate elephant herds during the day. There were a total of 214 encounters with herds during these sur- veys. The herds were classified into three major categories, namely, mixed herds ͑ both females and males in all age classes ͒ , female herds ͑ no adult or sub-adult males ͒ and male herds ͑ adult and sub-adult males only ͒ . Solitary individuals, male or female, were also occasionally encountered. The proportion of adult ͑ Ͼ 15 years ͒ and sub-adult ͑ 5–15 years ͒ males in this population is estimated to be 12.5% ͑ Arivazhagan and Sukumar, 2005 ͒ and approximately 11% from our encounters. Elephants were individually identified based on distinguishing characteristics such as ear-fold, tail characteristics, and pigmentation. The age class of individuals was also determined using methods described elsewhere A narrow-band spectrogram was computed and the minimum and maximum frequencies of the calls were determined using their power spectral densities. To estimate the essential bandwidth, the power spectral density ͑ PSD ͒ of the noisy signal was first computed using a Hamming window ͑ 20 ms ͒ . The PSD of the noise was then estimated using the initial 1 s of the recording. The estimated PSD of the noise was subtracted from that of the noisy signal to yield an esti- mate of the PSD of the signal. In general, the estimate had positive values at all discrete Fourier transform ͑ DFT ͒ bins; any negative values ͑ due to estimation errors ͒ were clamped to zero. From this estimate, the bandpass region containing about 80% of the total energy was considered as the frequency range of the signal. For rumbles, however, calls were first low-pass-filtered with a cutoff of 250 Hz ͑ this value was based on a preliminary inspection of the spectrograms ͒ . The signals thus obtained were re-sampled ͑ by the zero padding approach ͒ to obtain more points along the frequency axis ͑ Oppenheim and Schafer, 1989 ͒ . Spectrograms of these signals ͑ DFT with Hamming window size of 5 ms, 50% overlap ͒ were used for further analysis. Pitch, spectral envelope, and harmonicity analyses were carried out using PRAAT 5.1.07 ͑ www.praat.org, Paul Boersma and David Weenink ͒ . Pitch analysis was carried out using a pitch floor of 10 Hz for rumbles and 100 Hz for most other calls. Spectral envelope analysis was carried out using linear predictive coding ͑ LPC ͒ . Calls other than rumbles were re-sampled at 16 000 Hz whereas rumbles were re- sampled at 600 Hz. LPC analysis was carried out using the autocorrelation method with a time step of 0.005 s and a window length of 0.005 s ͑ 0.05 s for rumbles ͒ . Harmonicity ͑ harmonics-to-noise ratio ͒ was analyzed using the cross- correlation algorithm with a time step of 0.05 s except for chirps, for which 0.005 s was used due to the short duration of the calls. Calls of different age-sex classes adult male, adult female, young male, and young female ͒ were analyzed sepa- rately. The measured features include the call frequency range, duration, harmonicity, as well as mean, minimum, and maximum values of the fundamental frequency ͑ F 0 ͒ ͑ Table I ͒ . In addition, the frequencies and amplitudes of the LPC peaks were measured to examine spectral patterning. The analyzed acoustic features were separated according to the four age-sex classes and compared statistically ͑ for all categories where the sample size was Ն 3 ͒ for each call type using a combination of one-way analysis of variance and pair-wise comparisons of means by unpaired t -tests. The fea- tures compared included mean F 0 , minimum F 0 , maximum A total of 371 calls were recorded from 154 individuals. F 0 , call duration, harmonicity, and frequency and amplitude Of these, 258 calls were analyzed from 109 individuals, con- of the peaks of the LPC spectrum. sisting of 14 males and 95 females ͑ Table II ͒ . The remaining For the rumbles, the start and end frequencies of F 0 , and calls were discarded on account of low signal-to-noise ratio percentage time from the start to the maximum and mini- or overlap with other calls due to simultaneously vocalizing mum frequencies were measured ͑ Table I ͒ . To characterize individuals. Based on structural characteristics, the calls the frequency modulation in finer detail two measures were could be classified into four types, namely, trumpets, chirps, used ͑ Table I ͒ . The fourth harmonic of all calls was used roars, and rumbles ͑ Fig. 1 ͒ . One of the call types, the rumble, since visual inspection of the spectrograms revealed that the could be distinguished by its unique frequency range ͑ 10– modulation could be measured at high resolution across al- 173 Hz, Fig. 1, G–I ͒ , which was much lower than that of the most all recordings whereas the fundamental was sometimes other calls. Chirps were distinguished by their unique tem- contaminated with noise. The higher harmonics may also poral structure ͑ Table III, Fig. 1, J–K ͒ : they were typically of have greater functional relevance in social recognition as ar- much shorter duration ͑ 0.2 s Ϯ 0.1 s ͒ than the other calls. gued by McComb et al. ͑ 2003 ͒ based on playback experi- Trumpets and roars differed from chirps in their duration ments on wild African elephants. The first measure was the ͑ mean= 1 s and 2 s, Table III ͒ . Although both trumpets and roars shared the same frequency range, as revealed by their number of observed peaks in the fourth harmonic of each spectral peaks ͑ Fig. 2 ͒ , the decrease in power with increasing frequency was steeper in roars. Roars also had significantly call, determined visually from the spectrogram with a mini- lower harmonicity than trumpets ͑ Table III, Mann–Whitney U test, U = 313, Z = −7.8, and P Ͻ 0.0001 ͒ . mum frequency modulation of 4 Hz as a cutoff. The second There were no significant differences between the different age-sex classes in all of the call features that were exam- measure was the frequency change per unit time for the ined in roars, chirps, and rumbles. The only significant differences were between adult male and female trumpets, fourth harmonic, measured by the cumulative change in fre- wherein females had significantly lower mean F 0 and mini- quency divided by the time of the call ͑ window length of 200 mum F 0 ͑ unpaired t -tests, P = 0.015, t = 2.45 and P = 0.043, ms ͒ . t = 2.78 ͒ than males ͑ Table III ͒ . To detect the presence of structural subtypes within the rumbles, 13 measured call features ͑ except harmonicity and LPC peaks ͒ were used to generate pair-wise Euclidean distances between calls. The distance matrix was then subjected to cluster analysis using the Unweighted Pair Group Method with Arithmetic mean ͑ UPGMA ͒ ͑ Sneath and Sokal, 1973; Manly, 1986 ͒ . All statistical analyses were performed using STATISTICA ͑ 1999, Statsoft Inc., Tulsa, USA ͒ . A total of 371 calls were recorded from 154 individuals. Of these, 258 calls were analyzed from 109 individuals, con- sisting of 14 males and 95 females ͑ Table II ͒ . The remaining calls were discarded on account of low signal-to-noise ratio or overlap with other calls due to simultaneously vocalizing individuals. Based on structural characteristics, the calls could be classified into four types, namely, trumpets, chirps, roars, and rumbles ͑ Fig. 1 ͒ . One of the call types, the rumble, could be distinguished by its unique frequency range ͑ 10– 173 Hz, Fig. 1, G–I ͒ , which was much lower than that of the other calls. Chirps were distinguished by their unique temporal structure ͑ Table III, Fig. 1, J–K ͒ : they were typically of much shorter duration ͑ 0.2 s Ϯ 0.1 s ͒ than the other calls. Trumpets and roars differed from chirps in their duration ͑ mean= 1 s and 2 s, Table III ͒ . Although both trumpets and roars shared the same frequency range, as revealed by their spectral peaks ͑ Fig. 2 ͒ , the decrease in power with increasing frequency was steeper in roars. Roars also had significantly lower harmonicity than trumpets ͑ Table III, Mann–Whitney U test, U = 313, Z = −7.8, and P Ͻ 0.0001 ͒ . There were no significant differences between the different age-sex classes in all of the call features that were examined in roars, chirps, and rumbles. The only significant differences were between adult male and female trumpets, wherein females had significantly lower mean F 0 and minimum F 0 ͑ unpaired t -tests, P = 0.015, t = 2.45 and P = 0.043, t = 2.78 ͒ than males ͑ Table III ͒ . The different call types were associated with characteristic body postures and behaviors of the vocalizing individual and other members of the herd. These are described in detail in Table IV. Trumpets were loud, conspicuous high-frequency calls. They were in the frequency range of 405–5879 Hz with a mean duration of about 1 s ͑ Table V ͒ . They had a rich harmonic structure with at least seven clearly visible harmonics ͑ Fig. 1, A–C ͒ . Spectral envelope analysis revealed the first frequency peak to be at 706 Hz ͑ Fig. 2 ͒ . The fourth frequency peak ͑ at 3078 Hz ͒ was about 13 dB lower in amplitude than the first frequency. Out of 73 trumpets where the age-sex class was unambiguous, 58 ͑ 79.5% ͒ were produced by adult or sub-adult females, nine ͑ 12.3% ͒ by juvenile females, five ͑ 6.8% ͒ by adult or sub-adult males, and one 1.4% by a juvenile male ͑ Table II ͒ . Out of the 71 trumpets where the context was clear, seven ͑ 9.9% ͒ were in the context of play, 17 ͑ 23.9% ͒ in the context of disturbance by humans or vehicles, 29 ͑ 40.8% ͒ in the context of disturbance by other non-human species, ten ͑ 14% ͒ in the context of inter-specific aggression, and eight ͑ 11.2% ͒ while running out of a waterhole ͑ Fig. 3 ͒ . Spe- cifically, trumpeting was observed while encountering other species such as deer ͑ Axis axis ͒ , gaur ͑ Bos gaurus ͒ , dhole ͑ Cuon alpinus ͒ , bears ͑ Melursus ursinus ͒ , tigers ͑ Panthera tigris ͒ , egrets ͑ Egretta garzetta ͒ , and humans. ...
Context 4
... remaining For the rumbles, the start and end frequencies of F 0 , and calls were discarded on account of low signal-to-noise ratio percentage time from the start to the maximum and mini- or overlap with other calls due to simultaneously vocalizing mum frequencies were measured ͑ Table I ͒ . To characterize individuals. Based on structural characteristics, the calls the frequency modulation in finer detail two measures were could be classified into four types, namely, trumpets, chirps, used ͑ Table I ͒ . The fourth harmonic of all calls was used roars, and rumbles ͑ Fig. 1 ͒ . One of the call types, the rumble, since visual inspection of the spectrograms revealed that the could be distinguished by its unique frequency range ͑ 10– modulation could be measured at high resolution across al- 173 Hz, Fig. 1, G–I ͒ , which was much lower than that of the most all recordings whereas the fundamental was sometimes other calls. Chirps were distinguished by their unique tem- contaminated with noise. The higher harmonics may also poral structure ͑ Table III, Fig. 1, J–K ͒ : they were typically of have greater functional relevance in social recognition as ar- much shorter duration ͑ 0.2 s Ϯ 0.1 s ͒ than the other calls. gued by McComb et al. ͑ 2003 ͒ based on playback experi- Trumpets and roars differed from chirps in their duration ments on wild African elephants. The first measure was the ͑ mean= 1 s and 2 s, Table III ͒ . Although both trumpets and roars shared the same frequency range, as revealed by their number of observed peaks in the fourth harmonic of each spectral peaks ͑ Fig. 2 ͒ , the decrease in power with increasing frequency was steeper in roars. Roars also had significantly call, determined visually from the spectrogram with a mini- lower harmonicity than trumpets ͑ Table III, Mann–Whitney U test, U = 313, Z = −7.8, and P Ͻ 0.0001 ͒ . mum frequency modulation of 4 Hz as a cutoff. The second There were no significant differences between the different age-sex classes in all of the call features that were exam- measure was the frequency change per unit time for the ined in roars, chirps, and rumbles. The only significant differences were between adult male and female trumpets, fourth harmonic, measured by the cumulative change in fre- wherein females had significantly lower mean F 0 and mini- quency divided by the time of the call ͑ window length of 200 mum F 0 ͑ unpaired t -tests, P = 0.015, t = 2.45 and P = 0.043, ms ͒ . t = 2.78 ͒ than males ͑ Table III ͒ . To detect the presence of structural subtypes within the rumbles, 13 measured call features ͑ except harmonicity and LPC peaks ͒ were used to generate pair-wise Euclidean distances between calls. The distance matrix was then subjected to cluster analysis using the Unweighted Pair Group Method with Arithmetic mean ͑ UPGMA ͒ ͑ Sneath and Sokal, 1973; Manly, 1986 ͒ . All statistical analyses were performed using STATISTICA ͑ 1999, Statsoft Inc., Tulsa, USA ͒ . A total of 371 calls were recorded from 154 individuals. Of these, 258 calls were analyzed from 109 individuals, con- sisting of 14 males and 95 females ͑ Table II ͒ . The remaining calls were discarded on account of low signal-to-noise ratio or overlap with other calls due to simultaneously vocalizing individuals. Based on structural characteristics, the calls could be classified into four types, namely, trumpets, chirps, roars, and rumbles ͑ Fig. 1 ͒ . One of the call types, the rumble, could be distinguished by its unique frequency range ͑ 10– 173 Hz, Fig. 1, G–I ͒ , which was much lower than that of the other calls. Chirps were distinguished by their unique temporal structure ͑ Table III, Fig. 1, J–K ͒ : they were typically of much shorter duration ͑ 0.2 s Ϯ 0.1 s ͒ than the other calls. Trumpets and roars differed from chirps in their duration ͑ mean= 1 s and 2 s, Table III ͒ . Although both trumpets and roars shared the same frequency range, as revealed by their spectral peaks ͑ Fig. 2 ͒ , the decrease in power with increasing frequency was steeper in roars. Roars also had significantly lower harmonicity than trumpets ͑ Table III, Mann–Whitney U test, U = 313, Z = −7.8, and P Ͻ 0.0001 ͒ . There were no significant differences between the different age-sex classes in all of the call features that were examined in roars, chirps, and rumbles. The only significant differences were between adult male and female trumpets, wherein females had significantly lower mean F 0 and minimum F 0 ͑ unpaired t -tests, P = 0.015, t = 2.45 and P = 0.043, t = 2.78 ͒ than males ͑ Table III ͒ . The different call types were associated with characteristic body postures and behaviors of the vocalizing individual and other members of the herd. These are described in detail in Table IV. Trumpets were loud, conspicuous high-frequency calls. They were in the frequency range of 405–5879 Hz with a mean duration of about 1 s ͑ Table V ͒ . They had a rich harmonic structure with at least seven clearly visible harmonics ͑ Fig. 1, A–C ͒ . Spectral envelope analysis revealed the first frequency peak to be at 706 Hz ͑ Fig. 2 ͒ . The fourth frequency peak ͑ at 3078 Hz ͒ was about 13 dB lower in amplitude than the first frequency. Out of 73 trumpets where the age-sex class was unambiguous, 58 ͑ 79.5% ͒ were produced by adult or sub-adult females, nine ͑ 12.3% ͒ by juvenile females, five ͑ 6.8% ͒ by adult or sub-adult males, and one 1.4% by a juvenile male ͑ Table II ͒ . Out of the 71 trumpets where the context was clear, seven ͑ 9.9% ͒ were in the context of play, 17 ͑ 23.9% ͒ in the context of disturbance by humans or vehicles, 29 ͑ 40.8% ͒ in the context of disturbance by other non-human species, ten ͑ 14% ͒ in the context of inter-specific aggression, and eight ͑ 11.2% ͒ while running out of a waterhole ͑ Fig. 3 ͒ . Spe- cifically, trumpeting was observed while encountering other species such as deer ͑ Axis axis ͒ , gaur ͑ Bos gaurus ͒ , dhole ͑ Cuon alpinus ͒ , bears ͑ Melursus ursinus ͒ , tigers ͑ Panthera tigris ͒ , egrets ͑ Egretta garzetta ͒ , and humans. Roars were noisy, long calls that were in the frequency range of 305–6150 Hz and had a mean duration of about 2 s ͑ Table V, Fig. 1, D–F ͒ . They were in the same frequency range as trumpets and their spectral patterning was also simi- lar with seven frequency peaks Fig. 2 , the first frequency peak being at 656 Hz. The amplitude fell steeply with increasing frequency, the fourth frequency peak being 23 dB below the first ͑ Fig. 2 ͒ . This was in contrast to trumpets, where the fourth frequency peak was about 13 dB below the first ͑ Fig. 2 ͒ . Roars also showed significantly lower harmonicity than trumpets ͑ Table III ͒ . Both of the above are probably responsible for the unique perceptual quality of roars compared to trumpets. Out of 51 calls where the age-sex class was clear, 42 ͑ 82.3% ͒ were produced by adult or sub-adult females, six ͑ 11.7% ͒ by juvenile females, and three ͑ 5.8% ͒ by adult or sub-adult males ͑ Table II ͒ . Out of 51 cases where the context was clear, seven ͑ 13.7% ͒ were produced during play, four ͑ 7.8% ͒ due to disturbance by humans or vehicles, 28 ͑ 54.9% ͒ in the context of encounters with other non-human species, three ͑ 5.9% ͒ during inter-specific aggressive interactions, and nine ͑ 17.6% ͒ while facing another group or on entering a landscape ͑ Fig. 3 ͒ . Chirps were found to lie in the frequency range of 313– 3370 Hz ͑ Table V ͒ and were produced in a series ͑ Fig. 1, J–K ͒ ranging from 2 to 8 ͑ mean number= 5.2 Ϯ 2.6, n = 25 individuals ͒ in a single bout. The duration of a bout of chirp- ing ranged from 0.68 s to 3.8 s. Spectral envelope analysis revealed up to seven discernible frequency peaks Fig. 2 , with the first two peaks having equal amplitude. Although chirps had seven frequency peaks, the range over which these peaks were distributed was much narrower ͑ Fig. 2 ͒ than in the case of trumpets and roars. Chirps showed significantly lower harmonicity than trumpets and rumbles ͑ Table III, Mann–Whitney U -test, U = 995.5, Z = −4.38, P Ͻ 0.0001, and U = 786.5, Z = −3.8, P Ͻ 0.0001 ͒ . Out of 63 chirp bouts where the age-sex class was clear, 53 ͑ 84% ͒ were produced by adult or sub-adult females and ten ͑ 15.9% ͒ by adult or sub-adult males ͑ Table II ͒ . Out of 66 cases where the context was clear, 30 ͑ 45.5% ͒ were due to disturbance by humans or vehicles, 26 ͑ 39.3% ͒ due to disturbance by other non-human species, four ͑ 6% ͒ in the context of separation of an individual from the herd, and six ͑ 9% ͒ during intra-group aggression produced by an individual other than those directly involved in the aggression ͑ Fig. 3 ͒ . Rumbles were the only call type in the repertoire with infrasonic components. Rumbles were found to lie in the frequency range of 10–173 Hz, with a mean duration of 5.2 s ͑ Table V ͒ . They had a distinct harmonic structure ͑ Fig. 1, G–I, Table III ͒ , similar to trumpets ͑ U = 1508, Z = 0.616, and P = 0.54 ͒ . Spectral envelope analysis revealed three peaks, with the first peak at 37 Hz Fig. 2 . The power of the third peak was about 11 dB lower than that of the first frequency ͑ Fig. 2 ͒ . Of the 33 cases where the identity of the rumbling individual was unambiguous, 27 ͑ 81.8% ͒ were produced by adult or sub-adult females, two ͑ 6.1% ͒ by juvenile females, and four ͑ 12.1% ͒ by juvenile males ͑ Table II ͒ . Out of 56 cases where the context of rumbling was clear, 20 ͑ 35.7% ͒ were produced due to disturbance by humans or vehicles, six ͑ 10.7% ͒ due to disturbance by non-human species, three ͑ 5.4% ͒ by matriarchs to assemble the group ͑ “let’s go” rumble ͒ , three ͑ 5.4% ͒ in the context of contacting other herd members ͑ contact calls ͒ , and 24 ͑ 42.9% ͒ during intra- and inter-group interactions at close range ͑ Fig. 3 ͒ . Cluster analysis based on the distance matrix of pair- wise measures of overall similarity between calls did not reveal discrete structural groups, suggesting that the variation in call ...
Context 5
... 2.78 ͒ than males ͑ Table III ͒ . To detect the presence of structural subtypes within the rumbles, 13 measured call features ͑ except harmonicity and LPC peaks ͒ were used to generate pair-wise Euclidean distances between calls. The distance matrix was then subjected to cluster analysis using the Unweighted Pair Group Method with Arithmetic mean ͑ UPGMA ͒ ͑ Sneath and Sokal, 1973; Manly, 1986 ͒ . All statistical analyses were performed using STATISTICA ͑ 1999, Statsoft Inc., Tulsa, USA ͒ . A total of 371 calls were recorded from 154 individuals. Of these, 258 calls were analyzed from 109 individuals, con- sisting of 14 males and 95 females ͑ Table II ͒ . The remaining calls were discarded on account of low signal-to-noise ratio or overlap with other calls due to simultaneously vocalizing individuals. Based on structural characteristics, the calls could be classified into four types, namely, trumpets, chirps, roars, and rumbles ͑ Fig. 1 ͒ . One of the call types, the rumble, could be distinguished by its unique frequency range ͑ 10– 173 Hz, Fig. 1, G–I ͒ , which was much lower than that of the other calls. Chirps were distinguished by their unique temporal structure ͑ Table III, Fig. 1, J–K ͒ : they were typically of much shorter duration ͑ 0.2 s Ϯ 0.1 s ͒ than the other calls. Trumpets and roars differed from chirps in their duration ͑ mean= 1 s and 2 s, Table III ͒ . Although both trumpets and roars shared the same frequency range, as revealed by their spectral peaks ͑ Fig. 2 ͒ , the decrease in power with increasing frequency was steeper in roars. Roars also had significantly lower harmonicity than trumpets ͑ Table III, Mann–Whitney U test, U = 313, Z = −7.8, and P Ͻ 0.0001 ͒ . There were no significant differences between the different age-sex classes in all of the call features that were examined in roars, chirps, and rumbles. The only significant differences were between adult male and female trumpets, wherein females had significantly lower mean F 0 and minimum F 0 ͑ unpaired t -tests, P = 0.015, t = 2.45 and P = 0.043, t = 2.78 ͒ than males ͑ Table III ͒ . The different call types were associated with characteristic body postures and behaviors of the vocalizing individual and other members of the herd. These are described in detail in Table IV. Trumpets were loud, conspicuous high-frequency calls. They were in the frequency range of 405–5879 Hz with a mean duration of about 1 s ͑ Table V ͒ . They had a rich harmonic structure with at least seven clearly visible harmonics ͑ Fig. 1, A–C ͒ . Spectral envelope analysis revealed the first frequency peak to be at 706 Hz ͑ Fig. 2 ͒ . The fourth frequency peak ͑ at 3078 Hz ͒ was about 13 dB lower in amplitude than the first frequency. Out of 73 trumpets where the age-sex class was unambiguous, 58 ͑ 79.5% ͒ were produced by adult or sub-adult females, nine ͑ 12.3% ͒ by juvenile females, five ͑ 6.8% ͒ by adult or sub-adult males, and one 1.4% by a juvenile male ͑ Table II ͒ . Out of the 71 trumpets where the context was clear, seven ͑ 9.9% ͒ were in the context of play, 17 ͑ 23.9% ͒ in the context of disturbance by humans or vehicles, 29 ͑ 40.8% ͒ in the context of disturbance by other non-human species, ten ͑ 14% ͒ in the context of inter-specific aggression, and eight ͑ 11.2% ͒ while running out of a waterhole ͑ Fig. 3 ͒ . Spe- cifically, trumpeting was observed while encountering other species such as deer ͑ Axis axis ͒ , gaur ͑ Bos gaurus ͒ , dhole ͑ Cuon alpinus ͒ , bears ͑ Melursus ursinus ͒ , tigers ͑ Panthera tigris ͒ , egrets ͑ Egretta garzetta ͒ , and humans. Roars were noisy, long calls that were in the frequency range of 305–6150 Hz and had a mean duration of about 2 s ͑ Table V, Fig. 1, D–F ͒ . They were in the same frequency range as trumpets and their spectral patterning was also simi- lar with seven frequency peaks Fig. 2 , the first frequency peak being at 656 Hz. The amplitude fell steeply with increasing frequency, the fourth frequency peak being 23 dB below the first ͑ Fig. 2 ͒ . This was in contrast to trumpets, where the fourth frequency peak was about 13 dB below the first ͑ Fig. 2 ͒ . Roars also showed significantly lower harmonicity than trumpets ͑ Table III ͒ . Both of the above are probably responsible for the unique perceptual quality of roars compared to trumpets. Out of 51 calls where the age-sex class was clear, 42 ͑ 82.3% ͒ were produced by adult or sub-adult females, six ͑ 11.7% ͒ by juvenile females, and three ͑ 5.8% ͒ by adult or sub-adult males ͑ Table II ͒ . Out of 51 cases where the context was clear, seven ͑ 13.7% ͒ were produced during play, four ͑ 7.8% ͒ due to disturbance by humans or vehicles, 28 ͑ 54.9% ͒ in the context of encounters with other non-human species, three ͑ 5.9% ͒ during inter-specific aggressive interactions, and nine ͑ 17.6% ͒ while facing another group or on entering a landscape ͑ Fig. 3 ͒ . Chirps were found to lie in the frequency range of 313– 3370 Hz ͑ Table V ͒ and were produced in a series ͑ Fig. 1, J–K ͒ ranging from 2 to 8 ͑ mean number= 5.2 Ϯ 2.6, n = 25 individuals ͒ in a single bout. The duration of a bout of chirp- ing ranged from 0.68 s to 3.8 s. Spectral envelope analysis revealed up to seven discernible frequency peaks Fig. 2 , with the first two peaks having equal amplitude. Although chirps had seven frequency peaks, the range over which these peaks were distributed was much narrower ͑ Fig. 2 ͒ than in the case of trumpets and roars. Chirps showed significantly lower harmonicity than trumpets and rumbles ͑ Table III, Mann–Whitney U -test, U = 995.5, Z = −4.38, P Ͻ 0.0001, and U = 786.5, Z = −3.8, P Ͻ 0.0001 ͒ . Out of 63 chirp bouts where the age-sex class was clear, 53 ͑ 84% ͒ were produced by adult or sub-adult females and ten ͑ 15.9% ͒ by adult or sub-adult males ͑ Table II ͒ . Out of 66 cases where the context was clear, 30 ͑ 45.5% ͒ were due to disturbance by humans or vehicles, 26 ͑ 39.3% ͒ due to disturbance by other non-human species, four ͑ 6% ͒ in the context of separation of an individual from the herd, and six ͑ 9% ͒ during intra-group aggression produced by an individual other than those directly involved in the aggression ͑ Fig. 3 ͒ . Rumbles were the only call type in the repertoire with infrasonic components. Rumbles were found to lie in the frequency range of 10–173 Hz, with a mean duration of 5.2 s ͑ Table V ͒ . They had a distinct harmonic structure ͑ Fig. 1, G–I, Table III ͒ , similar to trumpets ͑ U = 1508, Z = 0.616, and P = 0.54 ͒ . Spectral envelope analysis revealed three peaks, with the first peak at 37 Hz Fig. 2 . The power of the third peak was about 11 dB lower than that of the first frequency ͑ Fig. 2 ͒ . Of the 33 cases where the identity of the rumbling individual was unambiguous, 27 ͑ 81.8% ͒ were produced by adult or sub-adult females, two ͑ 6.1% ͒ by juvenile females, and four ͑ 12.1% ͒ by juvenile males ͑ Table II ͒ . Out of 56 cases where the context of rumbling was clear, 20 ͑ 35.7% ͒ were produced due to disturbance by humans or vehicles, six ͑ 10.7% ͒ due to disturbance by non-human species, three ͑ 5.4% ͒ by matriarchs to assemble the group ͑ “let’s go” rumble ͒ , three ͑ 5.4% ͒ in the context of contacting other herd members ͑ contact calls ͒ , and 24 ͑ 42.9% ͒ during intra- and inter-group interactions at close range ͑ Fig. 3 ͒ . Cluster analysis based on the distance matrix of pair- wise measures of overall similarity between calls did not reveal discrete structural groups, suggesting that the variation in call structure is graded. Examination of the spectrograms, however, revealed differences between calls, particularly in the direction and extent of frequency modulation. Some calls showed an overall downward modulation of frequency ͓ Fig. 4 ͑ A ͔͒ , others showed little or no frequency modulation ͓ Fig. 4 ͑ B ͔͒ , and some showed an overall upward modulation in frequency ͓ Fig. 4 ͑ C ͔͒ . Yet another type con- tained extensive frequency modulation within the call ͓ Fig. 4 ͑ D ͔͒ . Preliminary comparisons did not reveal any particular correspondence between these features and either age-sex class or behavioral context, but the sample sizes for many of the groups are too small to permit meaningful conclusions at this stage. This study of the vocalizations of wild Asian elephants classifies them into four mutually exclusive categories based on structural features: trumpets, chirps, roars, and rumbles. Three of the four call types, namely, trumpets, chirps, and roars show extensive overlap in their frequency ranges. They are, however, clearly distinguishable from each other by their temporal and/or spectral structures. Trumpets show high harmonicity relative to chirps and roars, which are noisy. Chirps may be distinguished from roars by their characteristic temporal and spectral structures: short durations and frequency peaks over a narrower range. Roars exhibit low harmonicity and have no specific temporal structure. Rumbles, which constitute the fourth call type, do not overlap with any of the other call types in frequency and exhibit a distinct harmonic structure. Rumbles are also much longer in duration compared with the other call types. Our observations can be compared with previous studies on Asian and African elephants. On the basis of auditory assessments in the field and visual assessments of a few spectrograms, McKay ͑ 1973 ͒ classified the vocalizations of wild Asian elephants in Sri Lanka and zoo elephants into three “basic sounds” with eight “resulting sounds,” depending on their modification by change in amplitude, temporal patterning, and stressing of overtones, as well as non-vocal sounds produced in the trunk. However, the spectral and temporal characteristics were not defined. These “basic sounds” ͑ with “resulting sounds” ͒ were growl ͑ growl, rumble, roar, and “motor- cycle” ͒ , squeak ͑ chirp and trumpet ͒ , and snort ͑ “snort” and “boom” ͒ . It is now clear that the categories of resulting sounds such as rumbles ...
Context 6
... ͑ 0.2 s Ϯ 0.1 s ͒ than the other calls. gued by McComb et al. ͑ 2003 ͒ based on playback experi- Trumpets and roars differed from chirps in their duration ments on wild African elephants. The first measure was the ͑ mean= 1 s and 2 s, Table III ͒ . Although both trumpets and roars shared the same frequency range, as revealed by their number of observed peaks in the fourth harmonic of each spectral peaks ͑ Fig. 2 ͒ , the decrease in power with increasing frequency was steeper in roars. Roars also had significantly call, determined visually from the spectrogram with a mini- lower harmonicity than trumpets ͑ Table III, Mann–Whitney U test, U = 313, Z = −7.8, and P Ͻ 0.0001 ͒ . mum frequency modulation of 4 Hz as a cutoff. The second There were no significant differences between the different age-sex classes in all of the call features that were exam- measure was the frequency change per unit time for the ined in roars, chirps, and rumbles. The only significant differences were between adult male and female trumpets, fourth harmonic, measured by the cumulative change in fre- wherein females had significantly lower mean F 0 and mini- quency divided by the time of the call ͑ window length of 200 mum F 0 ͑ unpaired t -tests, P = 0.015, t = 2.45 and P = 0.043, ms ͒ . t = 2.78 ͒ than males ͑ Table III ͒ . To detect the presence of structural subtypes within the rumbles, 13 measured call features ͑ except harmonicity and LPC peaks ͒ were used to generate pair-wise Euclidean distances between calls. The distance matrix was then subjected to cluster analysis using the Unweighted Pair Group Method with Arithmetic mean ͑ UPGMA ͒ ͑ Sneath and Sokal, 1973; Manly, 1986 ͒ . All statistical analyses were performed using STATISTICA ͑ 1999, Statsoft Inc., Tulsa, USA ͒ . A total of 371 calls were recorded from 154 individuals. Of these, 258 calls were analyzed from 109 individuals, con- sisting of 14 males and 95 females ͑ Table II ͒ . The remaining calls were discarded on account of low signal-to-noise ratio or overlap with other calls due to simultaneously vocalizing individuals. Based on structural characteristics, the calls could be classified into four types, namely, trumpets, chirps, roars, and rumbles ͑ Fig. 1 ͒ . One of the call types, the rumble, could be distinguished by its unique frequency range ͑ 10– 173 Hz, Fig. 1, G–I ͒ , which was much lower than that of the other calls. Chirps were distinguished by their unique temporal structure ͑ Table III, Fig. 1, J–K ͒ : they were typically of much shorter duration ͑ 0.2 s Ϯ 0.1 s ͒ than the other calls. Trumpets and roars differed from chirps in their duration ͑ mean= 1 s and 2 s, Table III ͒ . Although both trumpets and roars shared the same frequency range, as revealed by their spectral peaks ͑ Fig. 2 ͒ , the decrease in power with increasing frequency was steeper in roars. Roars also had significantly lower harmonicity than trumpets ͑ Table III, Mann–Whitney U test, U = 313, Z = −7.8, and P Ͻ 0.0001 ͒ . There were no significant differences between the different age-sex classes in all of the call features that were examined in roars, chirps, and rumbles. The only significant differences were between adult male and female trumpets, wherein females had significantly lower mean F 0 and minimum F 0 ͑ unpaired t -tests, P = 0.015, t = 2.45 and P = 0.043, t = 2.78 ͒ than males ͑ Table III ͒ . The different call types were associated with characteristic body postures and behaviors of the vocalizing individual and other members of the herd. These are described in detail in Table IV. Trumpets were loud, conspicuous high-frequency calls. They were in the frequency range of 405–5879 Hz with a mean duration of about 1 s ͑ Table V ͒ . They had a rich harmonic structure with at least seven clearly visible harmonics ͑ Fig. 1, A–C ͒ . Spectral envelope analysis revealed the first frequency peak to be at 706 Hz ͑ Fig. 2 ͒ . The fourth frequency peak ͑ at 3078 Hz ͒ was about 13 dB lower in amplitude than the first frequency. Out of 73 trumpets where the age-sex class was unambiguous, 58 ͑ 79.5% ͒ were produced by adult or sub-adult females, nine ͑ 12.3% ͒ by juvenile females, five ͑ 6.8% ͒ by adult or sub-adult males, and one 1.4% by a juvenile male ͑ Table II ͒ . Out of the 71 trumpets where the context was clear, seven ͑ 9.9% ͒ were in the context of play, 17 ͑ 23.9% ͒ in the context of disturbance by humans or vehicles, 29 ͑ 40.8% ͒ in the context of disturbance by other non-human species, ten ͑ 14% ͒ in the context of inter-specific aggression, and eight ͑ 11.2% ͒ while running out of a waterhole ͑ Fig. 3 ͒ . Spe- cifically, trumpeting was observed while encountering other species such as deer ͑ Axis axis ͒ , gaur ͑ Bos gaurus ͒ , dhole ͑ Cuon alpinus ͒ , bears ͑ Melursus ursinus ͒ , tigers ͑ Panthera tigris ͒ , egrets ͑ Egretta garzetta ͒ , and humans. Roars were noisy, long calls that were in the frequency range of 305–6150 Hz and had a mean duration of about 2 s ͑ Table V, Fig. 1, D–F ͒ . They were in the same frequency range as trumpets and their spectral patterning was also simi- lar with seven frequency peaks Fig. 2 , the first frequency peak being at 656 Hz. The amplitude fell steeply with increasing frequency, the fourth frequency peak being 23 dB below the first ͑ Fig. 2 ͒ . This was in contrast to trumpets, where the fourth frequency peak was about 13 dB below the first ͑ Fig. 2 ͒ . Roars also showed significantly lower harmonicity than trumpets ͑ Table III ͒ . Both of the above are probably responsible for the unique perceptual quality of roars compared to trumpets. Out of 51 calls where the age-sex class was clear, 42 ͑ 82.3% ͒ were produced by adult or sub-adult females, six ͑ 11.7% ͒ by juvenile females, and three ͑ 5.8% ͒ by adult or sub-adult males ͑ Table II ͒ . Out of 51 cases where the context was clear, seven ͑ 13.7% ͒ were produced during play, four ͑ 7.8% ͒ due to disturbance by humans or vehicles, 28 ͑ 54.9% ͒ in the context of encounters with other non-human species, three ͑ 5.9% ͒ during inter-specific aggressive interactions, and nine ͑ 17.6% ͒ while facing another group or on entering a landscape ͑ Fig. 3 ͒ . Chirps were found to lie in the frequency range of 313– 3370 Hz ͑ Table V ͒ and were produced in a series ͑ Fig. 1, J–K ͒ ranging from 2 to 8 ͑ mean number= 5.2 Ϯ 2.6, n = 25 individuals ͒ in a single bout. The duration of a bout of chirp- ing ranged from 0.68 s to 3.8 s. Spectral envelope analysis revealed up to seven discernible frequency peaks Fig. 2 , with the first two peaks having equal amplitude. Although chirps had seven frequency peaks, the range over which these peaks were distributed was much narrower ͑ Fig. 2 ͒ than in the case of trumpets and roars. Chirps showed significantly lower harmonicity than trumpets and rumbles ͑ Table III, Mann–Whitney U -test, U = 995.5, Z = −4.38, P Ͻ 0.0001, and U = 786.5, Z = −3.8, P Ͻ 0.0001 ͒ . Out of 63 chirp bouts where the age-sex class was clear, 53 ͑ 84% ͒ were produced by adult or sub-adult females and ten ͑ 15.9% ͒ by adult or sub-adult males ͑ Table II ͒ . Out of 66 cases where the context was clear, 30 ͑ 45.5% ͒ were due to disturbance by humans or vehicles, 26 ͑ 39.3% ͒ due to disturbance by other non-human species, four ͑ 6% ͒ in the context of separation of an individual from the herd, and six ͑ 9% ͒ during intra-group aggression produced by an individual other than those directly involved in the aggression ͑ Fig. 3 ͒ . Rumbles were the only call type in the repertoire with infrasonic components. Rumbles were found to lie in the frequency range of 10–173 Hz, with a mean duration of 5.2 s ͑ Table V ͒ . They had a distinct harmonic structure ͑ Fig. 1, G–I, Table III ͒ , similar to trumpets ͑ U = 1508, Z = 0.616, and P = 0.54 ͒ . Spectral envelope analysis revealed three peaks, with the first peak at 37 Hz Fig. 2 . The power of the third peak was about 11 dB lower than that of the first frequency ͑ Fig. 2 ͒ . Of the 33 cases where the identity of the rumbling individual was unambiguous, 27 ͑ 81.8% ͒ were produced by adult or sub-adult females, two ͑ 6.1% ͒ by juvenile females, and four ͑ 12.1% ͒ by juvenile males ͑ Table II ͒ . Out of 56 cases where the context of rumbling was clear, 20 ͑ 35.7% ͒ were produced due to disturbance by humans or vehicles, six ͑ 10.7% ͒ due to disturbance by non-human species, three ͑ 5.4% ͒ by matriarchs to assemble the group ͑ “let’s go” rumble ͒ , three ͑ 5.4% ͒ in the context of contacting other herd members ͑ contact calls ͒ , and 24 ͑ 42.9% ͒ during intra- and inter-group interactions at close range ͑ Fig. 3 ͒ . Cluster analysis based on the distance matrix of pair- wise measures of overall similarity between calls did not reveal discrete structural groups, suggesting that the variation in call structure is graded. Examination of the spectrograms, however, revealed differences between calls, particularly in the direction and extent of frequency modulation. Some calls showed an overall downward modulation of frequency ͓ Fig. 4 ͑ A ͔͒ , others showed little or no frequency modulation ͓ Fig. 4 ͑ B ͔͒ , and some showed an overall upward modulation in frequency ͓ Fig. 4 ͑ C ͔͒ . Yet another type con- tained extensive frequency modulation within the call ͓ Fig. 4 ͑ D ͔͒ . Preliminary comparisons did not reveal any particular correspondence between these features and either age-sex class or behavioral context, but the sample sizes for many of the groups are too small to permit meaningful conclusions at this stage. This study of the vocalizations of wild Asian elephants classifies them into four mutually exclusive categories based on structural features: trumpets, chirps, roars, and rumbles. Three of the four call types, namely, trumpets, chirps, and roars show extensive overlap in their frequency ranges. They are, however, clearly distinguishable from each other by their temporal and/or spectral structures. Trumpets show high harmonicity relative to chirps and ...
Context 7
... and pigmentation. The age class of individuals was also determined using methods described elsewhere Fieldwork was conducted mainly in the dry months ͑ February to May ͒ of 2006 and 2007, in and around waterholes, salt licks, open areas such as swamps or grasslands, and within the forests. Surveys were carried out by vehicle and on foot to locate elephant herds during the day. There were a total of 214 encounters with herds during these sur- veys. The herds were classified into three major categories, namely, mixed herds ͑ both females and males in all age classes ͒ , female herds ͑ no adult or sub-adult males ͒ and male herds ͑ adult and sub-adult males only ͒ . Solitary individuals, male or female, were also occasionally encountered. The proportion of adult ͑ Ͼ 15 years ͒ and sub-adult ͑ 5–15 years ͒ males in this population is estimated to be 12.5% ͑ Arivazhagan and Sukumar, 2005 ͒ and approximately 11% from our encounters. Elephants were individually identified based on distinguishing characteristics such as ear-fold, tail characteristics, and pigmentation. The age class of individuals was also determined using methods described elsewhere A narrow-band spectrogram was computed and the minimum and maximum frequencies of the calls were determined using their power spectral densities. To estimate the essential bandwidth, the power spectral density ͑ PSD ͒ of the noisy signal was first computed using a Hamming window ͑ 20 ms ͒ . The PSD of the noise was then estimated using the initial 1 s of the recording. The estimated PSD of the noise was subtracted from that of the noisy signal to yield an esti- mate of the PSD of the signal. In general, the estimate had positive values at all discrete Fourier transform ͑ DFT ͒ bins; any negative values ͑ due to estimation errors ͒ were clamped to zero. From this estimate, the bandpass region containing about 80% of the total energy was considered as the frequency range of the signal. For rumbles, however, calls were first low-pass-filtered with a cutoff of 250 Hz ͑ this value was based on a preliminary inspection of the spectrograms ͒ . The signals thus obtained were re-sampled ͑ by the zero padding approach ͒ to obtain more points along the frequency axis ͑ Oppenheim and Schafer, 1989 ͒ . Spectrograms of these signals ͑ DFT with Hamming window size of 5 ms, 50% overlap ͒ were used for further analysis. Pitch, spectral envelope, and harmonicity analyses were carried out using PRAAT 5.1.07 ͑ www.praat.org, Paul Boersma and David Weenink ͒ . Pitch analysis was carried out using a pitch floor of 10 Hz for rumbles and 100 Hz for most other calls. Spectral envelope analysis was carried out using linear predictive coding ͑ LPC ͒ . Calls other than rumbles were re-sampled at 16 000 Hz whereas rumbles were re- sampled at 600 Hz. LPC analysis was carried out using the autocorrelation method with a time step of 0.005 s and a window length of 0.005 s ͑ 0.05 s for rumbles ͒ . Harmonicity ͑ harmonics-to-noise ratio ͒ was analyzed using the cross- correlation algorithm with a time step of 0.05 s except for chirps, for which 0.005 s was used due to the short duration of the calls. Calls of different age-sex classes adult male, adult female, young male, and young female ͒ were analyzed sepa- rately. The measured features include the call frequency range, duration, harmonicity, as well as mean, minimum, and maximum values of the fundamental frequency ͑ F 0 ͒ ͑ Table I ͒ . In addition, the frequencies and amplitudes of the LPC peaks were measured to examine spectral patterning. The analyzed acoustic features were separated according to the four age-sex classes and compared statistically ͑ for all categories where the sample size was Ն 3 ͒ for each call type using a combination of one-way analysis of variance and pair-wise comparisons of means by unpaired t -tests. The fea- tures compared included mean F 0 , minimum F 0 , maximum A total of 371 calls were recorded from 154 individuals. F 0 , call duration, harmonicity, and frequency and amplitude Of these, 258 calls were analyzed from 109 individuals, con- of the peaks of the LPC spectrum. sisting of 14 males and 95 females ͑ Table II ͒ . The remaining For the rumbles, the start and end frequencies of F 0 , and calls were discarded on account of low signal-to-noise ratio percentage time from the start to the maximum and mini- or overlap with other calls due to simultaneously vocalizing mum frequencies were measured ͑ Table I ͒ . To characterize individuals. Based on structural characteristics, the calls the frequency modulation in finer detail two measures were could be classified into four types, namely, trumpets, chirps, used ͑ Table I ͒ . The fourth harmonic of all calls was used roars, and rumbles ͑ Fig. 1 ͒ . One of the call types, the rumble, since visual inspection of the spectrograms revealed that the could be distinguished by its unique frequency range ͑ 10– modulation could be measured at high resolution across al- 173 Hz, Fig. 1, G–I ͒ , which was much lower than that of the most all recordings whereas the fundamental was sometimes other calls. Chirps were distinguished by their unique tem- contaminated with noise. The higher harmonics may also poral structure ͑ Table III, Fig. 1, J–K ͒ : they were typically of have greater functional relevance in social recognition as ar- much shorter duration ͑ 0.2 s Ϯ 0.1 s ͒ than the other calls. gued by McComb et al. ͑ 2003 ͒ based on playback experi- Trumpets and roars differed from chirps in their duration ments on wild African elephants. The first measure was the ͑ mean= 1 s and 2 s, Table III ͒ . Although both trumpets and roars shared the same frequency range, as revealed by their number of observed peaks in the fourth harmonic of each spectral peaks ͑ Fig. 2 ͒ , the decrease in power with increasing frequency was steeper in roars. Roars also had significantly call, determined visually from the spectrogram with a mini- lower harmonicity than trumpets ͑ Table III, Mann–Whitney U test, U = 313, Z = −7.8, and P Ͻ 0.0001 ͒ . mum frequency modulation of 4 Hz as a cutoff. The second There were no significant differences between the different age-sex classes in all of the call features that were exam- measure was the frequency change per unit time for the ined in roars, chirps, and rumbles. The only significant differences were between adult male and female trumpets, fourth harmonic, measured by the cumulative change in fre- wherein females had significantly lower mean F 0 and mini- quency divided by the time of the call ͑ window length of 200 mum F 0 ͑ unpaired t -tests, P = 0.015, t = 2.45 and P = 0.043, ms ͒ . t = 2.78 ͒ than males ͑ Table III ͒ . To detect the presence of structural subtypes within the rumbles, 13 measured call features ͑ except harmonicity and LPC peaks ͒ were used to generate pair-wise Euclidean distances between calls. The distance matrix was then subjected to cluster analysis using the Unweighted Pair Group Method with Arithmetic mean ͑ UPGMA ͒ ͑ Sneath and Sokal, 1973; Manly, 1986 ͒ . All statistical analyses were performed using STATISTICA ͑ 1999, Statsoft Inc., Tulsa, USA ͒ . A total of 371 calls were recorded from 154 individuals. Of these, 258 calls were analyzed from 109 individuals, con- sisting of 14 males and 95 females ͑ Table II ͒ . The remaining calls were discarded on account of low signal-to-noise ratio or overlap with other calls due to simultaneously vocalizing individuals. Based on structural characteristics, the calls could be classified into four types, namely, trumpets, chirps, roars, and rumbles ͑ Fig. 1 ͒ . One of the call types, the rumble, could be distinguished by its unique frequency range ͑ 10– 173 Hz, Fig. 1, G–I ͒ , which was much lower than that of the other calls. Chirps were distinguished by their unique temporal structure ͑ Table III, Fig. 1, J–K ͒ : they were typically of much shorter duration ͑ 0.2 s Ϯ 0.1 s ͒ than the other calls. Trumpets and roars differed from chirps in their duration ͑ mean= 1 s and 2 s, Table III ͒ . Although both trumpets and roars shared the same frequency range, as revealed by their spectral peaks ͑ Fig. 2 ͒ , the decrease in power with increasing frequency was steeper in roars. Roars also had significantly lower harmonicity than trumpets ͑ Table III, Mann–Whitney U test, U = 313, Z = −7.8, and P Ͻ 0.0001 ͒ . There were no significant differences between the different age-sex classes in all of the call features that were examined in roars, chirps, and rumbles. The only significant differences were between adult male and female trumpets, wherein females had significantly lower mean F 0 and minimum F 0 ͑ unpaired t -tests, P = 0.015, t = 2.45 and P = 0.043, t = 2.78 ͒ than males ͑ Table III ͒ . The different call types were associated with characteristic body postures and behaviors of the vocalizing individual and other members of the herd. These are described in detail in Table IV. Trumpets were loud, conspicuous high-frequency calls. They were in the frequency range of 405–5879 Hz with a mean duration of about 1 s ͑ Table V ͒ . They had a rich harmonic structure with at least seven clearly visible harmonics ͑ Fig. 1, A–C ͒ . Spectral envelope analysis revealed the first frequency peak to be at 706 Hz ͑ Fig. 2 ͒ . The fourth frequency peak ͑ at 3078 Hz ͒ was about 13 dB lower in amplitude than the first frequency. Out of 73 trumpets where the age-sex class was unambiguous, 58 ͑ 79.5% ͒ were produced by adult or sub-adult females, nine ͑ 12.3% ͒ by juvenile females, five ͑ 6.8% ͒ by adult or sub-adult males, and one 1.4% by a juvenile male ͑ Table II ͒ . Out of the 71 trumpets where the context was clear, seven ͑ 9.9% ͒ were in the context of play, 17 ͑ 23.9% ͒ in the context of disturbance by humans or vehicles, 29 ͑ 40.8% ͒ in the context of disturbance by other non-human species, ten ͑ 14% ͒ in the context of inter-specific aggression, and eight ͑ 11.2% ͒ while running out of a waterhole ͑ Fig. 3 ...
Context 8
... to zero. from our encounters. Elephants were individually identified The onsets and cessations of calls could be determined from based on distinguishing characteristics such as ear-fold, tail the thresholded envelope. characteristics, and pigmentation. The age class of individuals was also determined using methods described elsewhere Fieldwork was conducted mainly in the dry months ͑ February to May ͒ of 2006 and 2007, in and around waterholes, salt licks, open areas such as swamps or grasslands, and within the forests. Surveys were carried out by vehicle and on foot to locate elephant herds during the day. There were a total of 214 encounters with herds during these sur- veys. The herds were classified into three major categories, namely, mixed herds ͑ both females and males in all age classes ͒ , female herds ͑ no adult or sub-adult males ͒ and male herds ͑ adult and sub-adult males only ͒ . Solitary individuals, male or female, were also occasionally encountered. The proportion of adult ͑ Ͼ 15 years ͒ and sub-adult ͑ 5–15 years ͒ males in this population is estimated to be 12.5% ͑ Arivazhagan and Sukumar, 2005 ͒ and approximately 11% from our encounters. Elephants were individually identified based on distinguishing characteristics such as ear-fold, tail characteristics, and pigmentation. The age class of individuals was also determined using methods described elsewhere A narrow-band spectrogram was computed and the minimum and maximum frequencies of the calls were determined using their power spectral densities. To estimate the essential bandwidth, the power spectral density ͑ PSD ͒ of the noisy signal was first computed using a Hamming window ͑ 20 ms ͒ . The PSD of the noise was then estimated using the initial 1 s of the recording. The estimated PSD of the noise was subtracted from that of the noisy signal to yield an esti- mate of the PSD of the signal. In general, the estimate had positive values at all discrete Fourier transform ͑ DFT ͒ bins; any negative values ͑ due to estimation errors ͒ were clamped to zero. From this estimate, the bandpass region containing about 80% of the total energy was considered as the frequency range of the signal. For rumbles, however, calls were first low-pass-filtered with a cutoff of 250 Hz ͑ this value was based on a preliminary inspection of the spectrograms ͒ . The signals thus obtained were re-sampled ͑ by the zero padding approach ͒ to obtain more points along the frequency axis ͑ Oppenheim and Schafer, 1989 ͒ . Spectrograms of these signals ͑ DFT with Hamming window size of 5 ms, 50% overlap ͒ were used for further analysis. Pitch, spectral envelope, and harmonicity analyses were carried out using PRAAT 5.1.07 ͑ www.praat.org, Paul Boersma and David Weenink ͒ . Pitch analysis was carried out using a pitch floor of 10 Hz for rumbles and 100 Hz for most other calls. Spectral envelope analysis was carried out using linear predictive coding ͑ LPC ͒ . Calls other than rumbles were re-sampled at 16 000 Hz whereas rumbles were re- sampled at 600 Hz. LPC analysis was carried out using the autocorrelation method with a time step of 0.005 s and a window length of 0.005 s ͑ 0.05 s for rumbles ͒ . Harmonicity ͑ harmonics-to-noise ratio ͒ was analyzed using the cross- correlation algorithm with a time step of 0.05 s except for chirps, for which 0.005 s was used due to the short duration of the calls. Calls of different age-sex classes adult male, adult female, young male, and young female ͒ were analyzed sepa- rately. The measured features include the call frequency range, duration, harmonicity, as well as mean, minimum, and maximum values of the fundamental frequency ͑ F 0 ͒ ͑ Table I ͒ . In addition, the frequencies and amplitudes of the LPC peaks were measured to examine spectral patterning. The analyzed acoustic features were separated according to the four age-sex classes and compared statistically ͑ for all categories where the sample size was Ն 3 ͒ for each call type using a combination of one-way analysis of variance and pair-wise comparisons of means by unpaired t -tests. The fea- tures compared included mean F 0 , minimum F 0 , maximum A total of 371 calls were recorded from 154 individuals. F 0 , call duration, harmonicity, and frequency and amplitude Of these, 258 calls were analyzed from 109 individuals, con- of the peaks of the LPC spectrum. sisting of 14 males and 95 females ͑ Table II ͒ . The remaining For the rumbles, the start and end frequencies of F 0 , and calls were discarded on account of low signal-to-noise ratio percentage time from the start to the maximum and mini- or overlap with other calls due to simultaneously vocalizing mum frequencies were measured ͑ Table I ͒ . To characterize individuals. Based on structural characteristics, the calls the frequency modulation in finer detail two measures were could be classified into four types, namely, trumpets, chirps, used ͑ Table I ͒ . The fourth harmonic of all calls was used roars, and rumbles ͑ Fig. 1 ͒ . One of the call types, the rumble, since visual inspection of the spectrograms revealed that the could be distinguished by its unique frequency range ͑ 10– modulation could be measured at high resolution across al- 173 Hz, Fig. 1, G–I ͒ , which was much lower than that of the most all recordings whereas the fundamental was sometimes other calls. Chirps were distinguished by their unique tem- contaminated with noise. The higher harmonics may also poral structure ͑ Table III, Fig. 1, J–K ͒ : they were typically of have greater functional relevance in social recognition as ar- much shorter duration ͑ 0.2 s Ϯ 0.1 s ͒ than the other calls. gued by McComb et al. ͑ 2003 ͒ based on playback experi- Trumpets and roars differed from chirps in their duration ments on wild African elephants. The first measure was the ͑ mean= 1 s and 2 s, Table III ͒ . Although both trumpets and roars shared the same frequency range, as revealed by their number of observed peaks in the fourth harmonic of each spectral peaks ͑ Fig. 2 ͒ , the decrease in power with increasing frequency was steeper in roars. Roars also had significantly call, determined visually from the spectrogram with a mini- lower harmonicity than trumpets ͑ Table III, Mann–Whitney U test, U = 313, Z = −7.8, and P Ͻ 0.0001 ͒ . mum frequency modulation of 4 Hz as a cutoff. The second There were no significant differences between the different age-sex classes in all of the call features that were exam- measure was the frequency change per unit time for the ined in roars, chirps, and rumbles. The only significant differences were between adult male and female trumpets, fourth harmonic, measured by the cumulative change in fre- wherein females had significantly lower mean F 0 and mini- quency divided by the time of the call ͑ window length of 200 mum F 0 ͑ unpaired t -tests, P = 0.015, t = 2.45 and P = 0.043, ms ͒ . t = 2.78 ͒ than males ͑ Table III ͒ . To detect the presence of structural subtypes within the rumbles, 13 measured call features ͑ except harmonicity and LPC peaks ͒ were used to generate pair-wise Euclidean distances between calls. The distance matrix was then subjected to cluster analysis using the Unweighted Pair Group Method with Arithmetic mean ͑ UPGMA ͒ ͑ Sneath and Sokal, 1973; Manly, 1986 ͒ . All statistical analyses were performed using STATISTICA ͑ 1999, Statsoft Inc., Tulsa, USA ͒ . A total of 371 calls were recorded from 154 individuals. Of these, 258 calls were analyzed from 109 individuals, con- sisting of 14 males and 95 females ͑ Table II ͒ . The remaining calls were discarded on account of low signal-to-noise ratio or overlap with other calls due to simultaneously vocalizing individuals. Based on structural characteristics, the calls could be classified into four types, namely, trumpets, chirps, roars, and rumbles ͑ Fig. 1 ͒ . One of the call types, the rumble, could be distinguished by its unique frequency range ͑ 10– 173 Hz, Fig. 1, G–I ͒ , which was much lower than that of the other calls. Chirps were distinguished by their unique temporal structure ͑ Table III, Fig. 1, J–K ͒ : they were typically of much shorter duration ͑ 0.2 s Ϯ 0.1 s ͒ than the other calls. Trumpets and roars differed from chirps in their duration ͑ mean= 1 s and 2 s, Table III ͒ . Although both trumpets and roars shared the same frequency range, as revealed by their spectral peaks ͑ Fig. 2 ͒ , the decrease in power with increasing frequency was steeper in roars. Roars also had significantly lower harmonicity than trumpets ͑ Table III, Mann–Whitney U test, U = 313, Z = −7.8, and P Ͻ 0.0001 ͒ . There were no significant differences between the different age-sex classes in all of the call features that were examined in roars, chirps, and rumbles. The only significant differences were between adult male and female trumpets, wherein females had significantly lower mean F 0 and minimum F 0 ͑ unpaired t -tests, P = 0.015, t = 2.45 and P = 0.043, t = 2.78 ͒ than males ͑ Table III ͒ . The different call types were associated with characteristic body postures and behaviors of the vocalizing individual and other members of the herd. These are described in detail in Table IV. Trumpets were loud, conspicuous high-frequency calls. They were in the frequency range of 405–5879 Hz with a mean duration of about 1 s ͑ Table V ͒ . They had a rich harmonic structure with at least seven clearly visible harmonics ͑ Fig. 1, A–C ͒ . Spectral envelope analysis revealed the first frequency peak to be at 706 Hz ͑ Fig. 2 ͒ . The fourth frequency peak ͑ at 3078 Hz ͒ was about 13 dB lower in amplitude than the first frequency. Out of 73 trumpets where the age-sex class was unambiguous, 58 ͑ 79.5% ͒ were produced by adult or sub-adult females, nine ͑ 12.3% ͒ by juvenile females, five ͑ 6.8% ͒ by adult or sub-adult males, and one 1.4% by a juvenile male ͑ Table II ͒ . Out of the 71 trumpets where the context was clear, seven ͑ 9.9% ͒ were in the context of play, 17 ͑ 23.9% ͒ in the ...
Context 9
... roars shared the same frequency range, as revealed by their spectral peaks ͑ Fig. 2 ͒ , the decrease in power with increasing frequency was steeper in roars. Roars also had significantly lower harmonicity than trumpets ͑ Table III, Mann–Whitney U test, U = 313, Z = −7.8, and P Ͻ 0.0001 ͒ . There were no significant differences between the different age-sex classes in all of the call features that were examined in roars, chirps, and rumbles. The only significant differences were between adult male and female trumpets, wherein females had significantly lower mean F 0 and minimum F 0 ͑ unpaired t -tests, P = 0.015, t = 2.45 and P = 0.043, t = 2.78 ͒ than males ͑ Table III ͒ . The different call types were associated with characteristic body postures and behaviors of the vocalizing individual and other members of the herd. These are described in detail in Table IV. Trumpets were loud, conspicuous high-frequency calls. They were in the frequency range of 405–5879 Hz with a mean duration of about 1 s ͑ Table V ͒ . They had a rich harmonic structure with at least seven clearly visible harmonics ͑ Fig. 1, A–C ͒ . Spectral envelope analysis revealed the first frequency peak to be at 706 Hz ͑ Fig. 2 ͒ . The fourth frequency peak ͑ at 3078 Hz ͒ was about 13 dB lower in amplitude than the first frequency. Out of 73 trumpets where the age-sex class was unambiguous, 58 ͑ 79.5% ͒ were produced by adult or sub-adult females, nine ͑ 12.3% ͒ by juvenile females, five ͑ 6.8% ͒ by adult or sub-adult males, and one 1.4% by a juvenile male ͑ Table II ͒ . Out of the 71 trumpets where the context was clear, seven ͑ 9.9% ͒ were in the context of play, 17 ͑ 23.9% ͒ in the context of disturbance by humans or vehicles, 29 ͑ 40.8% ͒ in the context of disturbance by other non-human species, ten ͑ 14% ͒ in the context of inter-specific aggression, and eight ͑ 11.2% ͒ while running out of a waterhole ͑ Fig. 3 ͒ . Spe- cifically, trumpeting was observed while encountering other species such as deer ͑ Axis axis ͒ , gaur ͑ Bos gaurus ͒ , dhole ͑ Cuon alpinus ͒ , bears ͑ Melursus ursinus ͒ , tigers ͑ Panthera tigris ͒ , egrets ͑ Egretta garzetta ͒ , and humans. Roars were noisy, long calls that were in the frequency range of 305–6150 Hz and had a mean duration of about 2 s ͑ Table V, Fig. 1, D–F ͒ . They were in the same frequency range as trumpets and their spectral patterning was also simi- lar with seven frequency peaks Fig. 2 , the first frequency peak being at 656 Hz. The amplitude fell steeply with increasing frequency, the fourth frequency peak being 23 dB below the first ͑ Fig. 2 ͒ . This was in contrast to trumpets, where the fourth frequency peak was about 13 dB below the first ͑ Fig. 2 ͒ . Roars also showed significantly lower harmonicity than trumpets ͑ Table III ͒ . Both of the above are probably responsible for the unique perceptual quality of roars compared to trumpets. Out of 51 calls where the age-sex class was clear, 42 ͑ 82.3% ͒ were produced by adult or sub-adult females, six ͑ 11.7% ͒ by juvenile females, and three ͑ 5.8% ͒ by adult or sub-adult males ͑ Table II ͒ . Out of 51 cases where the context was clear, seven ͑ 13.7% ͒ were produced during play, four ͑ 7.8% ͒ due to disturbance by humans or vehicles, 28 ͑ 54.9% ͒ in the context of encounters with other non-human species, three ͑ 5.9% ͒ during inter-specific aggressive interactions, and nine ͑ 17.6% ͒ while facing another group or on entering a landscape ͑ Fig. 3 ͒ . Chirps were found to lie in the frequency range of 313– 3370 Hz ͑ Table V ͒ and were produced in a series ͑ Fig. 1, J–K ͒ ranging from 2 to 8 ͑ mean number= 5.2 Ϯ 2.6, n = 25 individuals ͒ in a single bout. The duration of a bout of chirp- ing ranged from 0.68 s to 3.8 s. Spectral envelope analysis revealed up to seven discernible frequency peaks Fig. 2 , with the first two peaks having equal amplitude. Although chirps had seven frequency peaks, the range over which these peaks were distributed was much narrower ͑ Fig. 2 ͒ than in the case of trumpets and roars. Chirps showed significantly lower harmonicity than trumpets and rumbles ͑ Table III, Mann–Whitney U -test, U = 995.5, Z = −4.38, P Ͻ 0.0001, and U = 786.5, Z = −3.8, P Ͻ 0.0001 ͒ . Out of 63 chirp bouts where the age-sex class was clear, 53 ͑ 84% ͒ were produced by adult or sub-adult females and ten ͑ 15.9% ͒ by adult or sub-adult males ͑ Table II ͒ . Out of 66 cases where the context was clear, 30 ͑ 45.5% ͒ were due to disturbance by humans or vehicles, 26 ͑ 39.3% ͒ due to disturbance by other non-human species, four ͑ 6% ͒ in the context of separation of an individual from the herd, and six ͑ 9% ͒ during intra-group aggression produced by an individual other than those directly involved in the aggression ͑ Fig. 3 ͒ . Rumbles were the only call type in the repertoire with infrasonic components. Rumbles were found to lie in the frequency range of 10–173 Hz, with a mean duration of 5.2 s ͑ Table V ͒ . They had a distinct harmonic structure ͑ Fig. 1, G–I, Table III ͒ , similar to trumpets ͑ U = 1508, Z = 0.616, and P = 0.54 ͒ . Spectral envelope analysis revealed three peaks, with the first peak at 37 Hz Fig. 2 . The power of the third peak was about 11 dB lower than that of the first frequency ͑ Fig. 2 ͒ . Of the 33 cases where the identity of the rumbling individual was unambiguous, 27 ͑ 81.8% ͒ were produced by adult or sub-adult females, two ͑ 6.1% ͒ by juvenile females, and four ͑ 12.1% ͒ by juvenile males ͑ Table II ͒ . Out of 56 cases where the context of rumbling was clear, 20 ͑ 35.7% ͒ were produced due to disturbance by humans or vehicles, six ͑ 10.7% ͒ due to disturbance by non-human species, three ͑ 5.4% ͒ by matriarchs to assemble the group ͑ “let’s go” rumble ͒ , three ͑ 5.4% ͒ in the context of contacting other herd members ͑ contact calls ͒ , and 24 ͑ 42.9% ͒ during intra- and inter-group interactions at close range ͑ Fig. 3 ͒ . Cluster analysis based on the distance matrix of pair- wise measures of overall similarity between calls did not reveal discrete structural groups, suggesting that the variation in call structure is graded. Examination of the spectrograms, however, revealed differences between calls, particularly in the direction and extent of frequency modulation. Some calls showed an overall downward modulation of frequency ͓ Fig. 4 ͑ A ͔͒ , others showed little or no frequency modulation ͓ Fig. 4 ͑ B ͔͒ , and some showed an overall upward modulation in frequency ͓ Fig. 4 ͑ C ͔͒ . Yet another type con- tained extensive frequency modulation within the call ͓ Fig. 4 ͑ D ͔͒ . Preliminary comparisons did not reveal any particular correspondence between these features and either age-sex class or behavioral context, but the sample sizes for many of the groups are too small to permit meaningful conclusions at this stage. This study of the vocalizations of wild Asian elephants classifies them into four mutually exclusive categories based on structural features: trumpets, chirps, roars, and rumbles. Three of the four call types, namely, trumpets, chirps, and roars show extensive overlap in their frequency ranges. They are, however, clearly distinguishable from each other by their temporal and/or spectral structures. Trumpets show high harmonicity relative to chirps and roars, which are noisy. Chirps may be distinguished from roars by their characteristic temporal and spectral structures: short durations and frequency peaks over a narrower range. Roars exhibit low harmonicity and have no specific temporal structure. Rumbles, which constitute the fourth call type, do not overlap with any of the other call types in frequency and exhibit a distinct harmonic structure. Rumbles are also much longer in duration compared with the other call types. Our observations can be compared with previous studies on Asian and African elephants. On the basis of auditory assessments in the field and visual assessments of a few spectrograms, McKay ͑ 1973 ͒ classified the vocalizations of wild Asian elephants in Sri Lanka and zoo elephants into three “basic sounds” with eight “resulting sounds,” depending on their modification by change in amplitude, temporal patterning, and stressing of overtones, as well as non-vocal sounds produced in the trunk. However, the spectral and temporal characteristics were not defined. These “basic sounds” ͑ with “resulting sounds” ͒ were growl ͑ growl, rumble, roar, and “motor- cycle” ͒ , squeak ͑ chirp and trumpet ͒ , and snort ͑ “snort” and “boom” ͒ . It is now clear that the categories of resulting sounds such as rumbles and motorcycle with infrasonic components are structurally different from roars, which are calls with low harmonicity and no infrasonic frequencies. Simi- larly, the chirp and the trumpet are sufficiently different in their spectrograms not to be placed together under the basic sound squeak. Non-vocal sounds such as snort and boom are not considered here since our study was confined to vocalizations. Several observers including Sanderson ͑ 1878 ͒ , Krishnan ͑ 1972 ͒ and McKay ͑ 1973 ͒ also described calls of Asian elephants that clearly indicate low frequency sounds. A study on captive female Asian elephants by Payne et al. ͑ 1986 ͒ later recorded infrasound with fundamental frequencies of 14–24 Hz and 10–15 s duration. There have been a number of attempts at classifying African elephant vocalizations, with most of the studies fo- cusing on infrasound. One of the earliest studies on the African species ͑ Berg, 1983 ͒ described the characteristics of the vocalizations of a group of captive elephants based on visual inspection of spectrograms and divided them into ten call types. Our recordings of Asian elephant vocalizations show correspondence with three of these ten types, namely, trumpets, roars, and barks ͓ which we refer to as chirps in accor- dance with McKay ͑ 1973 ͔͒ . The trumpets and roars recorded by Berg ͑ 1983 ͒ are longer in duration ͑ 2s and 3.8s ͒ but similar in terms of dominant ...
Context 10
... method with a time step of 0.005 s and a window length of 0.005 s ͑ 0.05 s for rumbles ͒ . Harmonicity ͑ harmonics-to-noise ratio ͒ was analyzed using the cross- correlation algorithm with a time step of 0.05 s except for chirps, for which 0.005 s was used due to the short duration of the calls. Calls of different age-sex classes adult male, adult female, young male, and young female ͒ were analyzed sepa- rately. The measured features include the call frequency range, duration, harmonicity, as well as mean, minimum, and maximum values of the fundamental frequency ͑ F 0 ͒ ͑ Table I ͒ . In addition, the frequencies and amplitudes of the LPC peaks were measured to examine spectral patterning. The analyzed acoustic features were separated according to the four age-sex classes and compared statistically ͑ for all categories where the sample size was Ն 3 ͒ for each call type using a combination of one-way analysis of variance and pair-wise comparisons of means by unpaired t -tests. The fea- tures compared included mean F 0 , minimum F 0 , maximum A total of 371 calls were recorded from 154 individuals. F 0 , call duration, harmonicity, and frequency and amplitude Of these, 258 calls were analyzed from 109 individuals, con- of the peaks of the LPC spectrum. sisting of 14 males and 95 females ͑ Table II ͒ . The remaining For the rumbles, the start and end frequencies of F 0 , and calls were discarded on account of low signal-to-noise ratio percentage time from the start to the maximum and mini- or overlap with other calls due to simultaneously vocalizing mum frequencies were measured ͑ Table I ͒ . To characterize individuals. Based on structural characteristics, the calls the frequency modulation in finer detail two measures were could be classified into four types, namely, trumpets, chirps, used ͑ Table I ͒ . The fourth harmonic of all calls was used roars, and rumbles ͑ Fig. 1 ͒ . One of the call types, the rumble, since visual inspection of the spectrograms revealed that the could be distinguished by its unique frequency range ͑ 10– modulation could be measured at high resolution across al- 173 Hz, Fig. 1, G–I ͒ , which was much lower than that of the most all recordings whereas the fundamental was sometimes other calls. Chirps were distinguished by their unique tem- contaminated with noise. The higher harmonics may also poral structure ͑ Table III, Fig. 1, J–K ͒ : they were typically of have greater functional relevance in social recognition as ar- much shorter duration ͑ 0.2 s Ϯ 0.1 s ͒ than the other calls. gued by McComb et al. ͑ 2003 ͒ based on playback experi- Trumpets and roars differed from chirps in their duration ments on wild African elephants. The first measure was the ͑ mean= 1 s and 2 s, Table III ͒ . Although both trumpets and roars shared the same frequency range, as revealed by their number of observed peaks in the fourth harmonic of each spectral peaks ͑ Fig. 2 ͒ , the decrease in power with increasing frequency was steeper in roars. Roars also had significantly call, determined visually from the spectrogram with a mini- lower harmonicity than trumpets ͑ Table III, Mann–Whitney U test, U = 313, Z = −7.8, and P Ͻ 0.0001 ͒ . mum frequency modulation of 4 Hz as a cutoff. The second There were no significant differences between the different age-sex classes in all of the call features that were exam- measure was the frequency change per unit time for the ined in roars, chirps, and rumbles. The only significant differences were between adult male and female trumpets, fourth harmonic, measured by the cumulative change in fre- wherein females had significantly lower mean F 0 and mini- quency divided by the time of the call ͑ window length of 200 mum F 0 ͑ unpaired t -tests, P = 0.015, t = 2.45 and P = 0.043, ms ͒ . t = 2.78 ͒ than males ͑ Table III ͒ . To detect the presence of structural subtypes within the rumbles, 13 measured call features ͑ except harmonicity and LPC peaks ͒ were used to generate pair-wise Euclidean distances between calls. The distance matrix was then subjected to cluster analysis using the Unweighted Pair Group Method with Arithmetic mean ͑ UPGMA ͒ ͑ Sneath and Sokal, 1973; Manly, 1986 ͒ . All statistical analyses were performed using STATISTICA ͑ 1999, Statsoft Inc., Tulsa, USA ͒ . A total of 371 calls were recorded from 154 individuals. Of these, 258 calls were analyzed from 109 individuals, con- sisting of 14 males and 95 females ͑ Table II ͒ . The remaining calls were discarded on account of low signal-to-noise ratio or overlap with other calls due to simultaneously vocalizing individuals. Based on structural characteristics, the calls could be classified into four types, namely, trumpets, chirps, roars, and rumbles ͑ Fig. 1 ͒ . One of the call types, the rumble, could be distinguished by its unique frequency range ͑ 10– 173 Hz, Fig. 1, G–I ͒ , which was much lower than that of the other calls. Chirps were distinguished by their unique temporal structure ͑ Table III, Fig. 1, J–K ͒ : they were typically of much shorter duration ͑ 0.2 s Ϯ 0.1 s ͒ than the other calls. Trumpets and roars differed from chirps in their duration ͑ mean= 1 s and 2 s, Table III ͒ . Although both trumpets and roars shared the same frequency range, as revealed by their spectral peaks ͑ Fig. 2 ͒ , the decrease in power with increasing frequency was steeper in roars. Roars also had significantly lower harmonicity than trumpets ͑ Table III, Mann–Whitney U test, U = 313, Z = −7.8, and P Ͻ 0.0001 ͒ . There were no significant differences between the different age-sex classes in all of the call features that were examined in roars, chirps, and rumbles. The only significant differences were between adult male and female trumpets, wherein females had significantly lower mean F 0 and minimum F 0 ͑ unpaired t -tests, P = 0.015, t = 2.45 and P = 0.043, t = 2.78 ͒ than males ͑ Table III ͒ . The different call types were associated with characteristic body postures and behaviors of the vocalizing individual and other members of the herd. These are described in detail in Table IV. Trumpets were loud, conspicuous high-frequency calls. They were in the frequency range of 405–5879 Hz with a mean duration of about 1 s ͑ Table V ͒ . They had a rich harmonic structure with at least seven clearly visible harmonics ͑ Fig. 1, A–C ͒ . Spectral envelope analysis revealed the first frequency peak to be at 706 Hz ͑ Fig. 2 ͒ . The fourth frequency peak ͑ at 3078 Hz ͒ was about 13 dB lower in amplitude than the first frequency. Out of 73 trumpets where the age-sex class was unambiguous, 58 ͑ 79.5% ͒ were produced by adult or sub-adult females, nine ͑ 12.3% ͒ by juvenile females, five ͑ 6.8% ͒ by adult or sub-adult males, and one 1.4% by a juvenile male ͑ Table II ͒ . Out of the 71 trumpets where the context was clear, seven ͑ 9.9% ͒ were in the context of play, 17 ͑ 23.9% ͒ in the context of disturbance by humans or vehicles, 29 ͑ 40.8% ͒ in the context of disturbance by other non-human species, ten ͑ 14% ͒ in the context of inter-specific aggression, and eight ͑ 11.2% ͒ while running out of a waterhole ͑ Fig. 3 ͒ . Spe- cifically, trumpeting was observed while encountering other species such as deer ͑ Axis axis ͒ , gaur ͑ Bos gaurus ͒ , dhole ͑ Cuon alpinus ͒ , bears ͑ Melursus ursinus ͒ , tigers ͑ Panthera tigris ͒ , egrets ͑ Egretta garzetta ͒ , and humans. Roars were noisy, long calls that were in the frequency range of 305–6150 Hz and had a mean duration of about 2 s ͑ Table V, Fig. 1, D–F ͒ . They were in the same frequency range as trumpets and their spectral patterning was also simi- lar with seven frequency peaks Fig. 2 , the first frequency peak being at 656 Hz. The amplitude fell steeply with increasing frequency, the fourth frequency peak being 23 dB below the first ͑ Fig. 2 ͒ . This was in contrast to trumpets, where the fourth frequency peak was about 13 dB below the first ͑ Fig. 2 ͒ . Roars also showed significantly lower harmonicity than trumpets ͑ Table III ͒ . Both of the above are probably responsible for the unique perceptual quality of roars compared to trumpets. Out of 51 calls where the age-sex class was clear, 42 ͑ 82.3% ͒ were produced by adult or sub-adult females, six ͑ 11.7% ͒ by juvenile females, and three ͑ 5.8% ͒ by adult or sub-adult males ͑ Table II ͒ . Out of 51 cases where the context was clear, seven ͑ 13.7% ͒ were produced during play, four ͑ 7.8% ͒ due to disturbance by humans or vehicles, 28 ͑ 54.9% ͒ in the context of encounters with other non-human species, three ͑ 5.9% ͒ during inter-specific aggressive interactions, and nine ͑ 17.6% ͒ while facing another group or on entering a landscape ͑ Fig. 3 ͒ . Chirps were found to lie in the frequency range of 313– 3370 Hz ͑ Table V ͒ and were produced in a series ͑ Fig. 1, J–K ͒ ranging from 2 to 8 ͑ mean number= 5.2 Ϯ 2.6, n = 25 individuals ͒ in a single bout. The duration of a bout of chirp- ing ranged from 0.68 s to 3.8 s. Spectral envelope analysis revealed up to seven discernible frequency peaks Fig. 2 , with the first two peaks having equal amplitude. Although chirps had seven frequency peaks, the range over which these peaks were distributed was much narrower ͑ Fig. 2 ͒ than in the case of trumpets and roars. Chirps showed significantly lower harmonicity than trumpets and rumbles ͑ Table III, Mann–Whitney U -test, U = 995.5, Z = −4.38, P Ͻ 0.0001, and U = 786.5, Z = −3.8, P Ͻ 0.0001 ͒ . Out of 63 chirp bouts where the age-sex class was clear, 53 ͑ 84% ͒ were produced by adult or sub-adult females and ten ͑ 15.9% ͒ by adult or sub-adult males ͑ Table II ͒ . Out of 66 cases where the context was clear, 30 ͑ 45.5% ͒ were due to disturbance by humans or vehicles, 26 ͑ 39.3% ͒ due to disturbance by other non-human species, four ͑ 6% ͒ in the context of separation of an individual from the herd, and six ͑ 9% ͒ during intra-group aggression produced by an individual other than those directly involved in the aggression ͑ Fig. 3 ͒ . ...

Similar publications

Article
Full-text available
The ultrasonic vocalizations of rats can transmit affective states to listeners. For example, rats typically produce shorter calls in a higher frequency range in social situations (pleasant call: PC), whereas they emit longer calls with lower frequency in distress situations (distress call: DC). Knowing what acoustical features contribute to audito...
Article
Full-text available
Numerous species possess cortical regions that are most sensitive to vocalizations produced by their own kind (conspecifics). In humans, the superior temporal sulci (STSs) putatively represent homologous voice-sensitive areas of cortex. However, superior temporal sulcus (STS) regions have recently been reported to represent auditory experience or "...
Article
Full-text available
The tseet contact call, common to black-capped (Poecile atricapillus) and mountain chickadees (P. gambeli), is the most frequently produced vocalization of each species. Previous work has characterized the tseet call of black-capped and mountain chickadees from different geographic locations in terms of nine acoustic features. In the current study,...
Article
Full-text available
Identifying the respective functions of distinct call types is an important step towards understanding the diversification of mammal vocal repertoires. Red deer (Cervus elaphus) stags give two distinct types of roars during the rut, termed 'common roars' and 'harsh roars'. This study tests the hypothesis that harsh roars function to raise and maint...
Article
Full-text available
Human song exhibits great structural diversity, yet certain aspects of melodic shape (how pitch is patterned over time) are widespread. These include a predominance of arch-shaped and descending melodic contours in musical phrases, a tendency for phrase-final notes to be relatively long, and a bias toward small pitch movements between adjacent note...

Citations

... Elephants communicate using low frequency vocalizations that propagate long distances, making it possible to maintain contact with extended family members [34,35], share information about the environment and potential danger [36][37][38], as well as advertising reproductive states [35,39,40]. The ability to hear these low frequency sounds enables the elephant to maintain complex social relationships across many different landscapes. ...
Article
Full-text available
Elephants have a unique auditory system that is larger than any other terrestrial mammal. To quantify the impact of larger middle ear (ME) structures, we measured 3D ossicular motion and ME sound transmission in cadaveric temporal bones from both African and Asian elephants in response to air-conducted (AC) tonal pressure stimuli presented in the ear canal (P EC). Results were compared to similar measurements in humans. Velocities of the umbo (V U) and stapes (V ST) were measured using a 3D laser Doppler vibrometer in the 7-13,000 Hz frequency range, stapes velocity serving as a measure of energy entering the cochlea-a proxy for hearing sensitivity. Below the elephant ME resonance frequency of about 300 Hz, the magnitude of V U /P EC was an order of magnitude greater than in human, and the magnitude of V ST /P EC was 5x greater. Phase of V ST /P EC above ME resonance indicated that the group delay in elephant was approximately double that of human, which may be related to the unexpectedly high magnitudes at high frequencies. A boost in sound transmission across the incus long process and stapes near 9 kHz was also observed. We discuss factors that contribute to differences in sound transmission between these two large mammals.
... Elephants communicate using low frequency vocalizations that propagate long distances, making it possible to maintain contact with extended family members [34,35], share information about the environment and potential danger [36][37][38], as well as advertising reproductive states [35,39,40]. The ability to hear these low frequency sounds enables the elephant to maintain complex social relationships across many different landscapes. ...
Article
Full-text available
Elephants have a unique auditory system that is larger than any other terrestrial mammal. To quantify the impact of larger middle ear (ME) structures, we measured 3D ossicular motion and ME sound transmission in cadaveric temporal bones from both African and Asian elephants in response to air-conducted (AC) tonal pressure stimuli presented in the ear canal (Pec). Results were compared to similar measurements in humans. Velocities of the umbo (Vu) and stapes (Vst) were measured using a 3D laser Doppler vibrometer in the 7-13,000 Hz frequency range, stapes velocity serving as a measure of energy entering the cochlea-a proxy for hearing sensitivity. Below the elephant ME resonance frequency of about 300 Hz, the magnitude of Vu/Pec was an order of magnitude greater than in human, and the magnitude of Vst/Pec was 5x greater. Phase of Vst/Pec above ME resonance indicated that the group delay in elephant was approximately double that of human, which may be related to the unexpectedly high magnitudes at high frequencies. A boost in sound transmission across the incus long process and stapes near 9 kHz was also observed. We discuss factors that contribute to differences in sound transmission between these two large mammals.
... Calls containing harsh components (such as squeals) may be emitted while approaching food, for example, in rhesus macaques (Macaca mulatta) or during social play in elephants (Elephas maximus; Nair et al., 2009). Harsher calls were also more frequently produced in alarm contexts, in response to predators in chacma baboons (Papio cynocephalus ursinus; Fischer et al., 2001). ...
Article
Full-text available
Exploratory behaviors describe the actions performed by an animal to obtain information on an object, environment, or individual by using its different senses. Exploration is described in some marine mammals, but not yet in manatees. Our study investigated behavioral and acoustic responses of two groups of Antillean manatees (N = 12 and N = 4) housed in zoological parks toward various stimuli involving three sensory modalities: visual, tactile, and auditory. Simultaneous audio and video recordings were collected during three periods of time (i.e., before, during, and after the presentation of all stimuli). Behaviors related to interest, social behaviors, the number and type of calls produced, and their frequency and duration were recorded and analyzed. Manatees reacted more to submerged stimuli than to out-of-water and sound stimuli, with an increase in approach, social contacts, and number of vocalizations. The proportion of squeaks and squeals call types also varied according to stimuli, and call entropy and F0 range varied according to periods. Our results suggest that manatees display sensory preferences when exploring stimuli, with more interest in manipulable stimuli, supporting the importance of their somatic perception. We highlight the need for particular enrichment programs (i.e., involving submerged objects) in zoological facilities. By displaying social contacts and by producing vocalizations, manatees communicate information such as their motivational state. The increase in call rate, harsh calls, and entropy values could be valid indicators of heightened arousal. We encourage further studies to associate acoustic recordings with ethological data collection to increase the understanding of manatees’ behaviors andperception.
... Moreover, these vocalisations can be classified into different call types based on their physical properties [6]. There are four main types of vocalisations for Asian elephants, namely, trumpet, roar, chirp, and rumble [7], [8]. Elephants use these calls in various contexts such as when being disturbed, playing with each other, moving in the presence of other species or vehicles, and communicating within the herd. ...
Article
Full-text available
Elephants generate infrasonic vocalisations that traverse through the air for long distances. Utilising this phenomenon , a previous work proposed a system, called Eloc, to localise and track elephants in the wild. The Eloc system has been demonstrated to be accurate in calculating the location of infrasonic sources. However, it still lacks the capability to accurately distinguish elephant infrasonic calls from various other infrasonic sources using limited computing power on board. Addressing this problem, the work presented in this paper introduces an approach to distinguish elephant infrasonic calls with a high accuracy on low-resourced hardware. Firstly, a sequence of operations are performed to reduce the effect of noise in the infrasonic signal captured by an Eloc node. Secondly, a wavelet-based signal reconstruction technique is applied to extract spectral features from the infrasonic signal. Finally, the extracted features are fed to a pre-trained machine learning classifier to distinguish the infrasonic vocalisations of elephants. The experimental evaluation using Asian elephant (Elephas Max-imus Maximus) infrasonic vocalisation datasets demonstrates that the proposed approach is capable of accurately distinguishing elephant infrasonic calls on low-resourced hardware platform of the Eloc system, with accuracy levels over 82% under varying environmental conditions.
... Furthermore, as Asian elephants are vocal learners [29], learned calls may vary beyond these categories [27]. Elephant calls can be detected from the infrasound range (<20 Hz) [28] to the human-audible range 5 kHz or 6 kHz, such as trumpets or roars [30]. The calls of elephants have distinctive features that can be used to differentiate them from other species and background noise. ...
... More training data that are labelled will allow for higher accuracy and more precise classification of diverse anthropogenic and environment sounds. For elephant sounds, there is room to expand the classification to different types of elephant vocalisations [27,30]. Additionally, with the availability of a curated dataset that is relevant to local scenarios, the complexity of the model can be increased to allow multi-class classifications based on various anthropogenic sounds and for examining elephant behaviours. ...
Conference Paper
Full-text available
Human-elephant conflict (HEC) in Malaysia occurs in areas where humans and Asian elephants (Elephas maximus) share landscapes, especially at forest edges and agricultural lands. Due to increased anthropogenic activities, such as logging, agriculture, and infrastructure developments, a large portion of natural habitat has been converted and fragmented, resulting in increased interactions between people and wild elephants. This presents a unique challenge of conserving elephants, safeguarding human livelihoods and ensuring habitat connectivity for endangered wildlife. The ability to detect and predict the presence of elephants in human-dominated landscapes can help increase safety for communities and provide insights for decision-makers to better manage conflict situations and promote a harmonious human-elephant coexistence. In the past decade, technologies used to study elephants, such as camera traps, drone imaging, and GPS tracking, have allowed us to understand their activity patterns, habitat preferences, and behaviour, but they often have limitations in terms of cost, coverage effort, and manpower needed. The increasing application of artificial intelligence (AI) in conservation offers an opportunity to develop real-time solutions to detect elephant presence in areas with a high risk of human-elephant encounters and HEC. In this study, we present a prototype of an early warning system, WildTechAlert, which combines bioacoustics and AI technology. Bioacoustics is the study of biological sounds; and the prototype device, being able to record sounds travelling from all angles, may potentially have higher detection rates from its surroundings (omnidirectional coverage) than sensors with fixed angles, such as camera traps. A sound stored in the raw waveform format can be converted into a spectrogram, which is a visual representation of the sound signature in the frequency, time, and amplitude domains. The spectrograms of elephant sounds were used as images for training a deep learning model using convolutional neural networks. WildTechAlert includes a device equipped with bioacoustic sensors which connect to a cloud-based deep learning algorithm, a user-friendly web interface, and mobile notification alert functions. The alerts serve as a tool to increase the safety of communities and plantation workers and promote preparedness for mitigation actions. We trained binary and multiclass classifiers to detect elephant sounds and achieved up to a 94% accuracy in detecting elephant sounds with the binary classifier. The performance of this system has yet to be tested in field conditions but shows potential in HEC management and to promote coexistence between humans and elephants.
... Elephant low-frequency rumbles are among the most common types of vocalizations produced by elephants, with fundamental frequency ranges of 14-35 Hz [1][2][3][4]. These rumbles are used for short- [5][6][7][8] and longdistance communication [1,6,[8][9][10][11][12][13] in a wide range of situations. There are several rumble vocalizations that have been identified and described in specific behavioral contexts, including the acoustically unique, "let's go" rumble sequence [1]. ...
... Elephant low-frequency rumbles are among the most common types of vocalizations produced by elephants, with fundamental frequency ranges of 14-35 Hz [1][2][3][4]. These rumbles are used for short- [5][6][7][8] and longdistance communication [1,6,[8][9][10][11][12][13] in a wide range of situations. There are several rumble vocalizations that have been identified and described in specific behavioral contexts, including the acoustically unique, "let's go" rumble sequence [1]. ...
Article
We propose a novel use of laser Doppler vibrometry (LDV) for elephant field bioacoustics and behavioral ecology, allowing investigators to determine who initiates and responds within a vocalization bout. LDV has been used in a variety of applications including engineering, biomedical research, and animal communication. We present LDV data collected from one captive African elephant in a group of three elephants within an open field during vocal exchanges. While rumble vocalizations were emitted within the group, we were able to identify the caller, as well as record the call structure parameters of that specific individual’s vocalization. The rumble vocalization had a duration of 5 s, with a fundamental frequency between 17 and 20 Hz and two harmonics at approximately 40 and 60 Hz. This LDV technique could be used to identify and record individual callers within a group where it is difficult to identify the caller, and where it is not feasible to use voice-activated collars. Specifically, LDV technology would make it possible to unobtrusively record an entire “let’s go” rumble sequence while being able to identify the ordering of callers within the sequence. LDV technology may also facilitate vocal communication studies in species where identifying the caller within a group is challenging. Finally, LDV technology could inform studies focused on the physics of signal propagation of coordinated signals within group-living animals.
... Asian elephants (Elephas maximus) are highly social but spatially dispersed species [3,23]. Therefore, acoustic communication over short and long distances is crucial for mating as well as group cohesion and coordination [8,11,16]. Asian elephants produce a wide range of calls, from low-frequency rumble and growl to high-frequency trumpet, chirp, roar, bark as well as a range of imitation and combination calls [8,10,16,21]. ...
... Therefore, acoustic communication over short and long distances is crucial for mating as well as group cohesion and coordination [8,11,16]. Asian elephants produce a wide range of calls, from low-frequency rumble and growl to high-frequency trumpet, chirp, roar, bark as well as a range of imitation and combination calls [8,10,16,21]. ...
... All vocalizations were identified using field notes, listening, and visual inspection of spectrograms. Based on the findings in these studies [1,8,16], we categorized vocalizations into four major call types and combination calls. Our fieldwork yielded a total of 401 elephant calls, spanning all age groups and sex. ...
Chapter
Full-text available
This paper explores the possibility of classifying elephant vocalizations as per their associated contexts. While elephants produce a variety of calls, trumpet calls only were explored to distinguish between two contexts: interaction with other elephants and interaction with their human caretakers (mahouts). For this study, we collected task-specific elephant vocalization data through fieldwork. A support vector machine based classifier is developed on openSMILE features for the said classification. The classification accuracy in categorizing elephant directed and human directed trumpet calls was found to be 81.43%. A detailed analysis of the employed acoustic features revealed that loudness, the spectral slope of 500–1500 Hz band, and spectral flux were found to be maximally contributing to the said categorization. KeywordsBioacousticsElephant acousticsTrumpet callsMahout interactionsSocial interactionsSound classification
... This is interpreted as an adaptation to lowfrequency hearing since the absence of a secondary bony lamina widens the basilar membrane, making it less stiff and therefore more sensitive to low frequencies (Court 1992;Ketten 1992;Meng et al. 1997). Elephants are known to have the lowest low-frequency hearing limit of all extant terrestrial mammals (17Hz at 60dB in Elephas, Manoussaki et al. 2008), which aligns well with the infrasound they can produce by both vocalization (20Hz) and foot-drumming (10 to 40Hz) (Payne et al. 1986;Poole et al. 1988Poole et al. , 2005O'Connell-Rodwell et al. 2001Günther et al. 2004;O'Connell-Rodwell 2007;Nair et al. 2009;Stoeger et al. 2011;Stoeger and Manger 2014). The radii ratio of the cochlear canal (the quotient between the radius of the basal turn over that of the apical turn) is between 5.35 and 8.85, which is consistent with low-frequency hearing (Manoussaki et al. 2008). ...
Chapter
Full-text available
The elephant brain is famous for its higher than average encephalization quotient, memory capacities, large cerebellum, large facial and trigeminal nerves, and the extensive repertoire of complex behaviors and social interactions it produces, the last of which being supported by infrasonic communication. The evolutionary history of Proboscidea is amongst the best-documented among mammals but knowledge of the group’s paleoneurological history remains comparatively fragmentary. Here, we summarize and build upon more than 150 years of research on the evolution of the proboscidean nervous system. We find that the morphology of the endocranial cast and bony labyrinth of the basal-most proboscideans is consistent with the generalized plesiomorphic conditions for placental mammals (e.g. linearly organized brain parts, low encephalization quotient, presence of a secondary common crus), whereas their conditions become essentially elephant-like in the Elephantimorpha around the Oligocene. This suggests that a higher encephalization quotient and adaptations to low-frequency hearing (e.g. loss of the secondary bony lamina) evolved in parallel with the formation and evolution of a trunk, adaptation to a drier environment, and a higher body mass. We hypothesize that these structures co-evolved as a response to the changing climate in the Oligocene.
... Compared to this extensive body of research on African elephants, the Asian elephant acoustic communication has been considerably less investigated [63][64][65]. Their low-frequency calls were found to vary among individuals (but the formants were not analyzed) [65], and among behavioral contexts, although here individual differences were not considered [66]. DeSilva, 2010, distinguished between growls, which have no spectral energy over 500 Hz and are emitted through a nearly closed mouth, and rumbles, which have a broader frequency spectrum and are uttered through the open mouth. ...
Article
Full-text available
Sound production mechanisms set the parameter space available for transmitting biologically relevant information in vocal signals. Low–frequency rumbles play a crucial role in coordinating social interactions in elephants’ complex fission–fusion societies. By emitting rumbles through either the oral or the three-times longer nasal vocal tract, African elephants alter their spectral shape significantly. In this study, we used an acoustic camera to visualize the sound emission of rumbles in Asian elephants, which have received far less research attention than African elephants. We recorded nine adult captive females and analyzed the spectral parameters of 203 calls, including vocal tract resonances (formants). We found that the majority of rumbles (64%) were nasally emitted, 21% orally, and 13% simultaneously through the mouth and trunk, demonstrating velopharyngeal coupling. Some of the rumbles were combined with orally emitted roars. The nasal rumbles concentrated most spectral energy in lower frequencies exhibiting two formants, whereas the oral and mixed rumbles contained higher formants, higher spectral energy concentrations and were louder. The roars were the loudest, highest and broadest in frequency. This study is the first to demonstrate velopharyngeal coupling in a non-human animal. Our findings provide a foundation for future research into the adaptive functions of the elephant acoustic variability for information coding, localizability or sound transmission, as well as vocal flexibility across species.
... Their meanings are that the tiger cry characteristic is to subdue the baseline by roaring with an intense, condensed roar to the opponent, frighten them with a dispersed, distracted roar, and immobilize them with a low, uniform growl. [20][21] [22] ...