Referenced by
Claims1. A method of characterizing a relationship between a first and a second audio samples, comprising the steps of:
2. The method according to claim 1 in which the relationship between the first and second audio samples is characterized as substantially matching if a statistically significant peak is found. 3. The method according to claim 1 or 2, further comprising the step of estimating a global relative value with a location of the peak on an axis of the histogram, the global relative value further characterizing the relationship between the first and second audio samples. 4. The method according to claim 3, further comprising the step of determining a hyperfine estimate of the global relative value, wherein the step of determining comprises:
5. The method according to claim 1 in which each fingerprint object has an invariant component, and the first and second fingerprint objects in each pair of matched fingerprint objects have matching invariant components. 6. The method according to claim 5 in which the invariant component is generated using at least one of:
7. The method according to claim 6 in which each local feature is a spectrogram peak and each frequency value is determined from a frequency coordinate of a corresponding spectrogram peak. 8. The method according to claim 1 or 5 in which each fingerprint object has a variant component, and the relative value of each pair of matched fingerprint objects is determined using respective variant components of the first and second fingerprint objects. 9. The method according to claim 8 in which the variant component is a frequency value determined from a local feature near the respective location of each fingerprint object such that the relative value of a pair of matched fingerprint objects being characterized as a ratio of respective frequency values of the first and second fingerprint objects and the peak in the histogram characterizing the relationship between the first and second audio samples being characterized as a relative pitch, or, in case of linear stretch, a relative playback speed. 10. The method according to claim 9, wherein the ratio of respective frequency values is characterized as being either a division or a difference of logarithms. 11. The method according to claim 9, in which each local feature is a spectrogram peak and each frequency value is determined from a frequency coordinate of a corresponding spectrogram peak. 12. The method according to claim 8, in which the variant component is a delta time value determined from a first and a second local features near the respective location of each fingerprint object such that the relative value of a pair of matched fingerprint objects being characterized as a ratio of respective variant delta time values and the peak in the histogram characterizing the relationship between the first and second audio samples being characterized as a relative playback speed, or, in case of linear stretch, a relative pitch. 13. The method according to claim 12, wherein the ratio of respective variant delta time values is characterized as being either a division or a difference of logarithms. 14. The method according to claim 12, in which each local feature is a spectrogram peak and each frequency value is determined from a frequency coordinate of a corresponding spectrogram peak. 15. The method according to claim 8, further comprising the steps of:
16. The method according to claim 1, wherein R is a relative playback speed value determined from the peak of the histogram of the relative values, further comprising the steps of:
17. A computer program product for performing a method according to any preceding claim. 18. A computer system for performing a method according to any one of claims 1 to 16, the computer system comprising a client for sending information necessary for the characterization of the relationship between the first and second audio samples to a server that performs the characterization. |